Faceless Video Generator

Generate MP4 videos from MP3 voice-overs — with real transcription and burned-in subtitles.

Architecture

┌──────────────────┐       ┌────────────────────┐
│   React + Vite   │──────▶│   FastAPI Backend   │
│   (port 5173)    │ proxy │   (port 8000)       │
└──────────────────┘       └────────┬───────────┘
                                    │
                           ┌────────▼───────────┐
                           │  Background Worker  │
                           │                     │
                           │  1. Whisper (STT)   │
                           │  2. FFmpeg (video)  │
                           │  3. Burn subtitles  │
                           └────────┬───────────┘
                                    │
                           ┌────────▼───────────┐
                           │  SQLite + Local FS  │
                           └────────────────────┘

Features

Drag & drop audio upload (MP3, WAV, M4A, OGG, FLAC, AAC)
Real-time job tracking with progress bar and pipeline logs
Local Whisper transcription — no API keys needed
Word-level SRT subtitles burned into video
Audio-reactive waveform video background
Fallback mode — static subtitles if transcription fails
SQLite persistence — jobs survive server restarts

Prerequisites

1. Python 3.10+

Download from https://www.python.org/downloads/

2. Node.js 18+

Download from https://nodejs.org/

3. FFmpeg (required)

Windows (recommended — using winget):

winget install Gyan.FFmpeg

Windows (alternative — using Chocolatey):

choco install ffmpeg

Windows (manual):

Download from https://www.gyan.dev/ffmpeg/builds/
Extract to C:\ffmpeg
Add C:\ffmpeg\bin to your system PATH

Verify installation:

ffmpeg -version

Quick Start

1. Backend Setup

cd backend
python -m venv venv
.\venv\Scripts\Activate.ps1
pip install -r requirements.txt

2. Frontend Setup

cd frontend
npm install

3. Run (two terminals)

Terminal 1 — Backend:

cd backend
.\venv\Scripts\Activate.ps1
uvicorn main:app --reload --port 8000

Terminal 2 — Frontend:

cd frontend
npm run dev

4. Open

Navigate to http://localhost:5173

Usage

Upload an MP3 file (max 50 MB, 2 minutes)
Click Generate Video
Watch the pipeline progress in real time
Download the final MP4 when complete

Configuration

Edit backend/config.py:

Setting	Default	Description
`WHISPER_MODEL`	`base`	Whisper model size (tiny/base/small)
`VIDEO_WIDTH`	`1280`	Output video width
`VIDEO_HEIGHT`	`720`	Output video height
`MAX_AUDIO_DURATION`	`120`	Max audio length in seconds
`FONT_SIZE`	`28`	Subtitle font size

You can also set the Whisper model via environment variable:

set WHISPER_MODEL=small

API Endpoints

Method	Path	Description
POST	`/api/jobs/upload`	Upload audio, create job
GET	`/api/jobs/{id}`	Get job status
GET	`/api/jobs/{id}/download`	Download generated MP4
GET	`/api/jobs/`	List all jobs
GET	`/api/health`	Health check

Tech Stack

Frontend: React 18 + Vite
Backend: Python FastAPI + Uvicorn
Transcription: OpenAI Whisper (local)
Video: FFmpeg (H.264 + AAC)
Database: SQLite (WAL mode)
Worker: Background thread (daemon)

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
backend		backend
frontend		frontend
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
README.md		README.md
setup.bat		setup.bat
start.bat		start.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Faceless Video Generator

Architecture

Features

Prerequisites

1. Python 3.10+

2. Node.js 18+

3. FFmpeg (required)

Quick Start

1. Backend Setup

2. Frontend Setup

3. Run (two terminals)

4. Open

Usage

Configuration

API Endpoints

Tech Stack

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Faceless Video Generator

Architecture

Features

Prerequisites

1. Python 3.10+

2. Node.js 18+

3. FFmpeg (required)

Quick Start

1. Backend Setup

2. Frontend Setup

3. Run (two terminals)

4. Open

Usage

Configuration

API Endpoints

Tech Stack

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages