Chat with your PDFs and automate document workflows using AWS Bedrock (Claude + Titan), LangChain, FAISS, and Streamlit.
This project extracts knowledge from documents, stores embeddings, and enables interactive Q&A or structured document generation โ making it ideal for document automation and Proof of Concept (POC) creation.
- ๐ PDF ingestion: Upload and process multiple PDFs.
- ๐งฉ Chunking & embeddings: Split documents into chunks and embed them with Amazon Titan.
- ๐ Vector storage: Store embeddings locally using FAISS for fast retrieval.
- ๐ค Conversational AI: Query documents with Claude (via Bedrock) using LangChainโs RetrievalQA.
- ๐ Streamlit UI: Simple web interface to manage vector updates and ask questions.
- โก Automation-ready: Extendable for structured outputs like meeting minutes, POCs, or summaries.
- AWS Bedrock โ Claude 3 Sonnet & Titan Embeddings
- LangChain โ document loaders, embeddings, RetrievalQA
- FAISS โ vector store for similarity search
- Streamlit โ interactive frontend
- Clone the repo
git clone https://github.com/<your-username>/doc-automation-poc-maker.git cd doc-automation-poc-maker
python -m venv venv source venv/bin/activate # (Linux/Mac) venv\Scripts\activate # (Windows)
pip install -r requirements.txt aws configure streamlit run app.py