Skip to content
#

pdf-qa

Here are 35 public repositories matching this topic...

RAG implemented from scratch without using LangChain and LangGraph - designed specifically for processing and querying PDF documents with advanced support for visual content like tables, charts, and mathematical formulas.

  • Updated Oct 25, 2025
  • Python

Production-ready multilingual RAG system for scientific PDFs. Supports 10+ Indic languages with E5 embeddings, ChromaDB vector store, Gemini 2.5 Flash LLM, and NLLB-200 translation. Ask questions in any language, get accurate answers with citations

  • Updated Jun 5, 2026
  • Python

本地化医学文献 RAG 问答系统。覆盖毕业论文文献调研、科研综述撰写、临床指南查阅、药学信息查询等场景 ,同时具备小数据量下的生产级可用性。支持多篇中英文医学论文 PDF(糖尿病、心血管、BMI 等)的智能检索与生成式问答。基于 Faiss 语义向量 + BM25 关键词 + Cross-Encoder 重排序三重检索 pipeline,模块化设计可自由切换嵌入模型、生成模型与重排序模型。全程不依赖任何第三方 API,基于 Ollama 本地部署 bge-m3 嵌入模型与 qwen2:7b 生成模型,确保医学数据隐私安全。实现了多会话隔离、流式 SSE 输出、异步 PDF 上传、前端历史持久化等生产级特性。

  • Updated Jun 6, 2026
  • Python

Sleek Streamlit chat app for Google Gemini (Files API). Dark, gradient UI with model picker, usage dialog, file/image/audio/PDF attach & preview, chat history, image persistence, robust error handling, and token usage tracking. Supports streaming replies and modular backend via google-genai.

  • Updated Oct 4, 2025
  • Python

Improve this page

Add a description, image, and links to the pdf-qa topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pdf-qa topic, visit your repo's landing page and select "manage topics."

Learn more