🎙️ VoiceNote AI: Local Summarizer

VoiceNote AI is a completely free, privacy-first, local web application that transcribes audio and video files and generates concise bullet-point summaries.

It leverages OpenAI's Whisper for highly accurate speech-to-text and Meta's Llama 3 (via Ollama) for intelligent summarization. Designed with a "Lean Docker" architecture, it ensures stability by running the app logic in a lightweight container while offloading heavy AI processing to the host machine.
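As a rough illustration of the flow described above, the sketch below transcribes a file with Whisper's base model and builds a bullet-point summary prompt for Llama 3. The function names and the prompt wording are hypothetical and are not taken from app.py; `whisper.load_model` and `model.transcribe` are the calls provided by the openai-whisper package.

```python
def transcribe(path: str, model_name: str = "base") -> str:
    """Return the transcript of an audio/video file using Whisper."""
    # Imported lazily so the prompt helper below works without Whisper installed.
    import whisper

    model = whisper.load_model(model_name)  # downloads the weights on first use
    result = model.transcribe(path)         # Whisper decodes the file through FFmpeg
    return result["text"].strip()


def build_summary_prompt(transcript: str, max_bullets: int = 5) -> str:
    """Build the instruction sent to the LLM (wording is an assumption)."""
    return (
        f"Summarize the following transcript in at most {max_bullets} "
        "concise bullet points.\n\n"
        f"Transcript:\n{transcript.strip()}"
    )
```

The prompt string would then be sent to Llama 3 through Ollama; how app.py actually phrases the prompt may differ.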

✨ Features

100% Local & Private: No data is sent to the cloud. Everything runs on your hardware.
Multi-Format Support: Upload .mp3, .wav, .m4a, and .mp4 files.
High-Accuracy Transcription: Powered by Whisper (Base model).
Smart Summarization: Extracts key points using Llama 3.
Lean Docker Setup: Prevents "Out of Memory" crashes by keeping the Docker container small and connecting to the host machine's AI models.
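
The supported formats listed above could be enforced with a small extension check before any processing starts. This is a minimal sketch; the constant and function names are hypothetical and may not match app.py.

```python
from pathlib import Path

# Extensions listed in the Features section; the constant name is hypothetical.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".mp4"}


def is_supported(filename: str) -> bool:
    """Return True if the file's extension is one the app accepts."""
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS
```

In a Streamlit app, the same list can also be passed as the `type` argument of `st.file_uploader`, which filters files in the upload dialog.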

📸 Screenshots

[Screenshot: VoiceNote Dashboard]

[Screenshot: Transcription & Summary]

🛠️ Tech Stack

Frontend/Backend: Python 3.10, Streamlit
Audio Processing: FFmpeg, OpenAI Whisper
LLM Engine: Ollama, Llama 3
Containerization: Docker

🚀 Step-by-Step Local Setup & Run Instructions

Follow these instructions to get VoiceNote AI running on your local machine.

Step 1: Prerequisites

Before running the project, ensure you have the following installed on your computer:

Git: Needed to clone the repository.
Docker: Installed and running (Docker Desktop or Docker Engine).
Ollama: Download and install for your operating system from ollama.com.

Step 2: Clone the Repository

Open your terminal and clone this project to your local machine, then navigate into the folder:

git clone https://github.com/MalindaBotheju/VoiceNote-AI

cd VoiceNote-AI

Repository Structure

After cloning, ensure your project directory contains the following structure:

VoiceNote-AI/
├── screenshots/      (Contains images used in this README)
├── .dockerignore     (Crucial: Keeps Docker builds fast by ignoring venv, etc.)
├── .gitignore        (Tells Git which local files to ignore)
├── Dockerfile        (Instructions to build the lightweight Python image)
├── README.md         (This documentation file)
├── app.py            (The main Streamlit application code)
└── requirements.txt  (Python dependencies: streamlit, openai-whisper, ollama)

Step 3: Set Up the Local Virtual Environment (For Development)

Now that you are inside the folder, create and activate the virtual environment so your IDE has a clean workspace.

For Windows:

python -m venv venv

.\venv\Scripts\Activate

For Linux / macOS:

python3 -m venv venv

source venv/bin/activate

Step 4: Install the dependencies locally

pip install -r requirements.txt
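
To confirm the local environment is ready, a quick check like the one below can help. It is a sketch that assumes the import names `streamlit`, `whisper` (the import name of openai-whisper), and `ollama`, based on the packages listed in requirements.txt. It also checks for the ffmpeg binary, which Whisper calls to decode audio when you run the app outside Docker.

```python
import importlib.util
import shutil

# Import names for the packages in requirements.txt
# (openai-whisper is imported as "whisper").
REQUIRED_MODULES = ["streamlit", "whisper", "ollama"]


def missing_modules(names: list[str]) -> list[str]:
    """Return the module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def ffmpeg_on_path() -> bool:
    """Whisper shells out to the ffmpeg binary, so it must be on PATH."""
    return shutil.which("ffmpeg") is not None


if __name__ == "__main__":
    print("Missing modules:", missing_modules(REQUIRED_MODULES) or "none")
    print("ffmpeg found:", ffmpeg_on_path())
```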

Step 5: Prepare the AI Model (Ollama)

Open your terminal and run Llama 3 once. Ollama downloads the model automatically the first time it is run:

ollama run llama3

(Once it downloads and starts, you can type /bye to exit. Ollama will keep running in the background).
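
To verify that the model is available, you can ask the Ollama server which models it has installed. The sketch below uses Ollama's `/api/tags` endpoint and assumes the server listens on its default address, http://localhost:11434; the helper names are hypothetical.

```python
import json
import urllib.request


def model_names(tags_payload: dict) -> list[str]:
    """Extract model names from an Ollama /api/tags response."""
    return [m.get("name", "") for m in tags_payload.get("models", [])]


def has_model(tags_payload: dict, wanted: str = "llama3") -> bool:
    """True if `wanted` is installed under any tag (e.g. "llama3:latest")."""
    return any(
        name == wanted or name.startswith(wanted + ":")
        for name in model_names(tags_payload)
    )


def fetch_tags(base_url: str = "http://localhost:11434") -> dict:
    """Query the local Ollama server for its installed models."""
    with urllib.request.urlopen(f"{base_url}/api/tags", timeout=5) as resp:
        return json.load(resp)
```

With Ollama running, `has_model(fetch_tags())` should return True once the download has finished.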

Step 6: Build the Docker Image

Open your terminal inside the VoiceNote-AI project folder and build the container image.

docker build -t voicenote-app .

Note: This will download Python and FFmpeg. It may take a few minutes the first time.

Step 7: Run the App

Run the container with the following command. It publishes port 8501 and points the container at the Ollama server on your host machine (host.docker.internal), so the Llama 3 model is loaded on the host rather than inside the container's memory.

docker run -p 8501:8501 -e OLLAMA_BASE_URL=http://host.docker.internal:11434 voicenote-app

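(Optional: Add -d after run to run the container in the background. On Linux with Docker Engine, host.docker.internal is not defined by default; adding --add-host=host.docker.internal:host-gateway to the command maps it to the host.)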
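
For context, here is one way application code could consume the OLLAMA_BASE_URL variable passed with -e. This is a sketch under assumptions, not the code in app.py: it talks to Ollama's REST endpoint `/api/generate` directly with the standard library, whereas the app may use the ollama Python package instead. The function names and the localhost fallback are hypothetical.

```python
import json
import os
import urllib.request

DEFAULT_OLLAMA_URL = "http://localhost:11434"  # assumed fallback outside Docker


def ollama_base_url() -> str:
    """Read OLLAMA_BASE_URL (set via `docker run -e`), falling back to localhost."""
    return os.environ.get("OLLAMA_BASE_URL", DEFAULT_OLLAMA_URL).rstrip("/")


def build_generate_request(prompt: str, model: str = "llama3") -> urllib.request.Request:
    """Prepare a non-streaming POST to Ollama's /api/generate endpoint."""
    body = json.dumps({"model": model, "prompt": prompt, "stream": False})
    return urllib.request.Request(
        f"{ollama_base_url()}/api/generate",
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def summarize(prompt: str) -> str:
    """Send the request and return the generated text."""
    with urllib.request.urlopen(build_generate_request(prompt), timeout=300) as resp:
        return json.load(resp)["response"]
```

Reading the URL from the environment keeps the same image usable both inside Docker (host.docker.internal) and in the local virtual environment (localhost).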

Step 8: Use the App

Open your web browser.
Navigate to: http://localhost:8501
Upload an audio or video file, click "Transcribe & Summarize", and enjoy!
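
If the page does not load, a quick way to tell whether the container is listening is to probe the published port. The helper below is a generic TCP check, not part of the project; its name is hypothetical.

```python
import socket


def is_port_open(host: str = "localhost", port: int = 8501, timeout: float = 2.0) -> bool:
    """Return True if something accepts TCP connections on host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

A False result usually means the container is not running or the -p 8501:8501 mapping was omitted.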

(Note on first run: Whisper will automatically download its base model (~139MB) the first time you transcribe a file. Subsequent runs will be much faster.)

👨‍💻 Created By

Malinda Botheju
GitHub: @MalindaBotheju
