Multi-Modal RAG

An AI-powered pipeline for extracting, chunking, and summarizing content from PDF documents using advanced chunking strategies and generative models. Supports text, tables, and images, with vector search and retrieval via ChromaDB.

Features

PDF partitioning and intelligent chunking
Handles text, tables, and images
AI-generated summaries for mixed content
Embedding and vector search with ChromaDB

Quick Start

Clone the repository
Install dependencies
Run the notebook to process your PDF and create a searchable vector store

Technologies

Python
LangChain
Unstructured
ChromaDB
Google Generative AI

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
dbv1/chroma_db		dbv1/chroma_db
.env		.env
Docker CheatSheet ApnaCollege.pdf		Docker CheatSheet ApnaCollege.pdf
README.md		README.md
attention-is-all-you-need.pdf		attention-is-all-you-need.pdf
multi-modal.ipynb		multi-modal.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Modal RAG

Features

Quick Start

Technologies

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Multi-Modal RAG

Features

Quick Start

Technologies

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages