基于 ColPali + MUVERA + Qdrant 的多模态视觉文档问答系统,支持 pdf、图片及 pptx 文档
-
Updated
May 28, 2026 - Python
基于 ColPali + MUVERA + Qdrant 的多模态视觉文档问答系统,支持 pdf、图片及 pptx 文档
Multi-agent AI platform for social media automation — research, content writing, media generation, scraping, and publishing across Instagram, LinkedIn, YouTube & Reddit. Built with FastAPI + Next.js.
Multimodal AI-powered e-commerce recommendation bot built on Azure — supports text, voice, and image inputs using Azure OpenAI (GPT), CLU, Computer Vision, AI Search (RAG), and Bot Framework SDK. Deployed on Azure App Service with Direct Line Web Chat integration.
Add a description, image, and links to the mutimodal-ai-agent topic page so that developers can more easily learn about it.
To associate your repository with the mutimodal-ai-agent topic, visit your repo's landing page and select "manage topics."