π RAG Document Assistant
A Retrieval-Augmented Generation (RAG) based AI system that answers questions from documents using semantic search and LLMs.
π Features π Load and process PDF documents π Semantic search using embeddings π§ Context-aware answer generation π Automatic summarization of documents
π Tech Stack Python LangChain ChromaDB (Vector Database) HuggingFace Transformers Sentence Transformers
βοΈ How it Works Load PDF documents Split into chunks Convert text into embeddings Store in vector database Retrieve relevant chunks Generate answer using LLM
π Project Structure rag-doc-assistant/ β βββ sample_docs/ β βββ rpa_blueprism.pdf β βββ rag_pipeline.ipynb βββ README.md
π‘ Example Queries What is RPA? Explain Blue Prism Summarize the document
π Future Improvements Add web interface (Streamlit) Support multiple documents Use advanced LLM APIs
π¨βπ» Author Anirvan Mohapatra