An AI Teaching Assistant built using Retrieval-Augmented Generation (RAG) to turn long video lectures and recordings into interactive, searchable knowledge bases.
Imagine sitting through hours of lecture or training videos just to find one specific answer. The traditional method of scrubbing through timelines is inefficient, frustrating, and a major barrier to effective learning. Key knowledge remains locked away inside lengthy recordings.
EduRAG solves this problem by ingesting video content and transforming it into a smart knowledge base. Users can ask questions in natural language and receive instant, accurate answers with context, eliminating the need for manual searching.
- π¬ Natural Language Queries: Ask questions in plain English, just like you would talk to a human teaching assistant.
- π Instant & Accurate Answers: Leverage the power of RAG and Large Language Models (LLMs) to get precise information without delay.
- π For Students: Instantly find answers and clarify concepts from hours of lecture recordings. Ace your exams without re-watching entire courses.
- π₯ For Creators & Trainers: Make your educational content highly accessible and searchable, increasing its value and engagement.
- π Private & Local: Build a secure, private knowledge base from your personal recordings, running entirely on your local machine with Ollama.
This project is built with a powerful, open-source stack designed for efficient AI-powered data processing:
- Video/Audio Processing:
FFmpeg,Whisper(OpenAI) - Data Handling & Numerics:
Pandas,NumPy - Machine Learning & Embeddings:
Scikit-learn,Joblib,BGE-M3 - Local LLM Inference:
Ollama - Language Models:
DeepSeek R1,Llama 3.2
To get a local copy up and running, follow these steps.
Make sure you have the following installed on your system:
-
Clone the Repository:
git clone [https://github.com/your-username/EduRAG.git](https://github.com/your-username/EduRAG.git) cd EduRAG -
Create a Virtual Environment:
python -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate
-
Install Dependencies:
pip install -r requirements.txt
-
Set up Ollama & Pull Models:
- Ensure the Ollama service is running.
- Pull the required models from the command line:
ollama pull llama3.2 ollama pull deepseek-coder-v2 ollama pull bge-m3 # For embeddings
-
Add Your Video: Place your video file (e.g.,
lecture.mp4) into the/videosdirectory. -
Process the Video: Run the main processing script to transcribe, chunk, and embed the content.
python process_video.py --file "lecture.mp4" -
Ask Questions: Launch the interactive Q&A interface.
python ask.py
You can now start asking questions about the video content!
This project is open for collaboration! If you're curious, have feedback, or want to extend this project, please feel free to connect.
- Fork the Project
- Create your Feature Branch (
git checkout -b feature/NewFeature) - Commit your Changes (
git commit -m 'Add some NewFeature') - Push to the Branch (
git push origin feature/NewFeature) - Open a Pull Request
Distributed under the MIT License. See LICENSE for more information.