🐊 Snappy's unique approach unifies vision-language late interaction with structured OCR for region-level knowledge retrieval. Like the project? Drop a star! ⭐
-
Updated
Feb 9, 2026 - Python
🐊 Snappy's unique approach unifies vision-language late interaction with structured OCR for region-level knowledge retrieval. Like the project? Drop a star! ⭐
Vision-first document RAG for PDF, DOCX, and image QA/extraction with ColQwen2, Qwen2.5-VL, Qdrant, and FastAPI.
Multimodal RAG with per-query routing to text / visual / hybrid retrieval paths. Vision-LLM answers and regression-gated evals.
RAG Lab — Local visual document analysis workbench powered by ColPali embeddings and Ollama vision models
A Faster R-CNN object detection model for recognizing handwritten mathematical symbols on whiteboards. Built as the core detection engine for an end-to-end math expression visual retrieval pipeline.
Add a description, image, and links to the visual-retrieval topic page so that developers can more easily learn about it.
To associate your repository with the visual-retrieval topic, visit your repo's landing page and select "manage topics."