Subtitle Player

Repository: https://github.com/archoor/subtitle-player

Open-source timed subtitle viewer for .srt, .vtt, and FunASR [hh:mm:ss] .txt transcripts — with auto-scroll, karaoke / MTV-style word highlight, bilingual EN↔ZH display, LLM subtitle translation (DashScope / Qwen), and a portable Windows .exe. No video required: practice language listening & reading from transcript files alone.

Load timestamped subtitle files and simulate timed playback: auto-scroll, MTV-style word-by-word highlight, bilingual display (English above Chinese in a narrow stacked column), eye-care themes, speed and font-size controls. Chinese translations are generated on demand via LLM and cached locally.

Two deployment modes share the same frontend and backend:

Web app — FastAPI local server, open in a browser (development and debugging).
Desktop client (Electron) — packaged as a single SubtitlePlayer.exe with an embedded Python backend; no separate install.

Overview

Subtitle Player is a local-first subtitle reader and karaoke-style lyric viewer. It is built for people who already have a transcript (from FunASR, Whisper, podcast STT, course videos, etc.) and want to read along on a timer without opening a video player.

Also known as / related searches: subtitle scroll player · timed transcript reader · bilingual subtitle tool · SRT/VTT viewer · karaoke lyrics highlighter · English podcast subtitle translator · FunASR transcript player · eye-care subtitle reading · language-learning subtitle app · Electron subtitle desktop app · portable subtitle exe.

Tech stack: Python · FastAPI · uvicorn · vanilla JavaScript · Electron · PyInstaller · SSE streaming · DashScope (OpenAI-compatible) · MIT license.

Use Cases

| Scenario | How Subtitle Player helps |
| --- | --- |
| Language learning | Read English podcasts / lectures with auto-scroll; generate Chinese side-by-side while you follow the timer. |
| FunASR / STT output | Open `[hh:mm:ss] text` `.txt` from speech recognition; no need to convert to SRT first. |
| Bilingual review | Switch Bilingual / English / Chinese; cache translations in `.bilingual.json` for offline reuse. |
| Long-screen reading | Warm cream & dark eye-care themes; adjustable font size; current line centered like teleprompter / lyrics apps. |
| No-video workflow | Timer-only playback when you only have subtitles, not the original media file. |
| Windows desktop | Single portable `SubtitlePlayer.exe`: double-click, pick a file, no Python install for end users. |
| Developer / self-host | Run `uv run python subtitle_player/run.py` locally; REST + SSE API for custom integrations. |

Scenario 1: YouTube & foreign-video subtitle study

Watch YouTube or other foreign-language videos with bilingual subtitles side by side for faster learning.

Download subtitles from the video (browser extensions, yt-dlp, or site export) as .srt / .vtt.
Open the subtitle file in Subtitle Player; switch to Bilingual view (English on top, Chinese below).
If Chinese is missing, click Generate Chinese translation or press Play — translation streams in while you read.
Play the video in another window; follow Subtitle Player’s auto-scroll and karaoke highlight to stay in sync by eye.
Click any line to jump; adjust speed (0.5×–2×) to match the speaker. Cached .bilingual.json lets you resume next time without re-translating.

Tip: Subtitle Player does not play video — keep the video in your browser and use this tool as a dedicated bilingual subtitle panel.

Scenario 2: Podcast / course transcript follow-along (no video)

You only have a transcript from STT (FunASR, Whisper, batch_transcribe, etc.) — no video file at all.

Open the [hh:mm:ss] text .txt or .srt output directly (no conversion needed for FunASR format).
Use simulated playback: current line centers on screen, words light up left-to-right like karaoke lyrics.
Switch to English only for dictation practice, or Bilingual after generating Chinese.
Slow down to 0.5×–0.75× when the speaker is fast; enlarge font for long reading sessions with eye-care themes.
Re-open the same file later — translation cache and scroll position make it easy to continue where you left off.

Ideal for: podcast notes, online course replays, interview transcripts, and any audio you have already transcribed.

Scenario 3: Conference talks & technical talks — line-by-line deep read

Dense English content (AI talks, product keynotes, academic lectures) where you need to pause, re-read, and compare wording.

Load the official or auto-generated .srt / .vtt subtitle.
Start in English view; use click-to-seek on any line instead of scrubbing a timeline.
Switch to Bilingual and generate Chinese for terms and long sentences; untranslated lines show a placeholder until ready.
Toggle dark eye-care theme and bump font size for 30–60 minute sessions.
Export-ready cache (*.bilingual.json) sits next to the source file — share or archive the bilingual version for team review.

Ideal for: TED-style talks, technical webinars, product launches, and any content where accuracy matters more than real-time speed.

FAQ

Does it play video or audio?
No. It is a subtitle-only simulated player driven by timestamps. Use it when you want scroll + highlight without syncing to media.

Which subtitle formats are supported?
.srt, .vtt, and FunASR-style .txt with [hh:mm:ss] or [mm:ss] prefixes.
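
To make the FunASR-style format concrete, here is a minimal Python sketch that parses lines carrying a `[hh:mm:ss]` or `[mm:ss]` prefix into start times in seconds. It is not the project's `parser.py`: the names `Line`, `parse_line`, and `parse_transcript` are hypothetical, and silently skipping lines without a stamp is an assumption.

```python
import re
from typing import NamedTuple, Optional

# Leading [hh:mm:ss] or [mm:ss] stamp, then the subtitle text.
_STAMP = re.compile(r"^\[(?:(\d{1,2}):)?(\d{1,2}):(\d{2})\]\s*(.*)$")


class Line(NamedTuple):
    start: float  # start time in seconds
    text: str


def parse_line(raw: str) -> Optional[Line]:
    """Return a Line for a stamped row, or None if the row has no stamp."""
    match = _STAMP.match(raw.strip())
    if match is None:
        return None
    hours, minutes, seconds, text = match.groups()
    start = int(hours or 0) * 3600 + int(minutes) * 60 + int(seconds)
    return Line(float(start), text)


def parse_transcript(content: str) -> list[Line]:
    """Parse a whole transcript, skipping rows without a stamp (assumption)."""
    parsed = (parse_line(row) for row in content.splitlines())
    return [line for line in parsed if line is not None]
```

Each parsed line only has a start time; end times have to be derived separately.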

Can it translate English subtitles to Chinese?
Yes. Configure DASHSCOPE_API_KEY (or compatible API in .env). Translation streams via SSE while you play; results cache next to the source file.

Does it work offline?
Playback and cached .bilingual.json work offline. New LLM translation needs network + API key.

FunASR timestamps only have start times — is karaoke accurate?
End times are inferred from the next line; highlight timing is character-linear within each segment — great visual karaoke, not word-level forced alignment.

Web vs Electron desktop?
Same UI. Web uploads files; Electron uses a native file picker. Desktop build embeds the Python backend.

Is this free and open source?
Yes — MIT License. Part of the xk-knowledge-base monorepo under subtitle_player/.

Features

| Feature | Description |
| --- | --- |
| Load subtitles | FunASR `[hh:mm:ss] text` `.txt`, standard `.srt` / `.vtt` |
| View modes | Bilingual (EN on top, ZH below), English only, Chinese only; desktop window fixed at 450px width |
| Simulated playback | Timer-driven (no audio); current line centered with smooth auto-scroll |
| Karaoke highlight | Current line lights up word-by-word left to right |
| Speed | 0.5× – 2× |
| Font size | 16 – 40 px, live update |
| Eye-care themes | Default warm cream; one-click dark eye-care mode (no pure white/black) |
| Translation | Streaming SSE: translate while playing; untranslated segments show a placeholder; incremental cache for resume |

Timestamps: FunASR .txt lines only have start times. This tool fills each end time from the next line’s start; the last line uses the “audio duration” from stats. Karaoke timing is linearly interpolated by character count within each segment, which is good for the marquee effect but does not give word-level precision.
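
The timing rules above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the project's code: `Segment`, `fill_end_times`, and `highlighted_chars` are hypothetical names, and `total_duration` stands in for the "audio duration" value mentioned above.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float      # seconds
    text: str
    end: float = 0.0  # filled in by fill_end_times


def fill_end_times(segments: list[Segment], total_duration: float) -> None:
    """Each segment ends where the next one starts; the last ends at total_duration."""
    for current, following in zip(segments, segments[1:]):
        current.end = following.start
    if segments:
        # Guard against a duration shorter than the last start time.
        segments[-1].end = max(total_duration, segments[-1].start)


def highlighted_chars(segment: Segment, now: float) -> int:
    """Number of characters to highlight at time `now`, linear in character count."""
    span = segment.end - segment.start
    if span <= 0 or now >= segment.end:
        return len(segment.text)
    if now <= segment.start:
        return 0
    fraction = (now - segment.start) / span
    return int(fraction * len(segment.text))
```

Because the interpolation is per character, a long word takes proportionally longer to light up than a short one, regardless of how it was actually spoken.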

Quick Start

# From the repository root (deps in root pyproject.toml: fastapi / uvicorn)
uv run python subtitle_player/run.py

The browser opens at http://127.0.0.1:8800.

Desktop shortcut (Windows)

Double-click the Subtitle Player desktop shortcut. To create the shortcut, run:

powershell -ExecutionPolicy Bypass -File subtitle_player/create_desktop_shortcut.ps1

Or double-click subtitle_player/launch.bat.

CLI flags:

| Flag | Default | Description |
| --- | --- | --- |
| `--host` | `127.0.0.1` | Bind address |
| `--port` | `8800` | Port |
| `--no-open` | off | Do not open the browser |

uv run python subtitle_player/run.py --port 9000 --no-open

Usage

Click Choose file… → pick a .txt / .srt / .vtt → click Load.
Press ▶ or Space to play/pause.
Click any subtitle line to seek; drag or click the progress bar.
Top bar: switch Bilingual / English / Chinese; ⚙ translation settings; 🌗 theme; EN/中 UI language.
For English-only subtitles: click Generate Chinese translation, or press Play (translation starts automatically). Untranslated lines show a placeholder until ready.
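
Timer-driven playback with play/pause, seek, and a speed multiplier can be modeled as a small clock. The project's playback logic lives in `frontend/app.js`; the Python sketch below only illustrates the idea, and `PlaybackClock` with all of its methods is hypothetical. Clamping speed to 0.5×–2× mirrors the documented range but is an assumption about behavior.

```python
import time


class PlaybackClock:
    """Maps wall-clock time to a subtitle position in seconds."""

    def __init__(self, speed: float = 1.0, now=time.monotonic):
        self._now = now           # injectable time source (useful for tests)
        self._speed = speed
        self._position = 0.0      # position accumulated while paused
        self._started_at = None   # wall-clock anchor while playing

    def position(self) -> float:
        if self._started_at is None:
            return self._position
        return self._position + (self._now() - self._started_at) * self._speed

    def play(self) -> None:
        if self._started_at is None:
            self._started_at = self._now()

    def pause(self) -> None:
        self._position = self.position()
        self._started_at = None

    def seek(self, seconds: float) -> None:
        self._position = max(0.0, seconds)
        if self._started_at is not None:
            self._started_at = self._now()  # re-anchor while playing

    def set_speed(self, speed: float) -> None:
        playing = self._started_at is not None
        self._position = self.position()    # fold in time at the old speed
        if playing:
            self._started_at = self._now()
        self._speed = min(2.0, max(0.5, speed))
```

Clicking a subtitle line would correspond to `seek(segment_start)`, and Space to toggling `play()` / `pause()`.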

Shortcuts

| Action | Key |
| --- | --- |
| Play / Pause | `Space` |

Translation

Panel-first, .env optional fallback. Open ⚙ Settings to configure Base URL, API Key, and model — saved to the user data directory and effective immediately. If panel fields are empty, the app falls back to .env (see .env.example).

Default endpoint: Alibaba Cloud DashScope (https://dashscope.aliyuncs.com/compatible-mode/v1), model qwen3.6-flash.
Also supports any OpenAI-compatible API.

| Panel / `.env` variable | Description |
| --- | --- |
| Base URL | OpenAI-compatible endpoint |
| API Key | Service API key |
| Model | e.g. `qwen3.6-flash`; optional in `.env` (set in ⚙ Settings or use built-in default) |

Additional panel parameters: temperature (0–2), max_tokens (256–16384), batch_size (1–32).
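
A range check for these parameters might look like the sketch below. The limits come from the line above; the function name `validate_params` is hypothetical, and rejecting out-of-range values (rather than clamping them) is an assumption, since the actual behavior is not documented here.

```python
# Inclusive ranges taken from the documented panel limits.
LIMITS = {
    "temperature": (0.0, 2.0),
    "max_tokens": (256, 16384),
    "batch_size": (1, 32),
}


def validate_params(params: dict) -> dict:
    """Return params unchanged if every known field is within range.

    Raises ValueError for an out-of-range value; unknown keys pass through.
    """
    for name, value in params.items():
        if name not in LIMITS:
            continue
        low, high = LIMITS[name]
        if not (low <= value <= high):
            raise ValueError(f"{name}={value} is outside [{low}, {high}]")
    return params
```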

Priority per field: panel translate_config.json → .env (TRANSLATE_* / DASHSCOPE_* / BASE_URL+API_KEY) → built-in defaults.
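
The per-field fallback chain can be illustrated as follows. This is a sketch, not the code in `translate_config.py`: `resolve_field` is a hypothetical helper, and the order in which several `.env` names are tried is an assumption.

```python
from typing import Mapping, Optional, Sequence


def resolve_field(
    panel_value: Optional[str],
    env: Mapping[str, str],
    env_names: Sequence[str],
    default: str,
) -> str:
    """Pick the first non-empty value: panel setting, then env names, then default."""
    if panel_value and panel_value.strip():
        return panel_value.strip()
    for name in env_names:
        value = env.get(name, "").strip()
        if value:
            return value
    return default
```

For example, an API key could be resolved with `resolve_field(panel.get("api_key"), os.environ, ("DASHSCOPE_API_KEY", "API_KEY"), "")`, where `panel` is the parsed `translate_config.json` and the key name `"api_key"` is hypothetical.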

Requests to DashScope bypass system proxy settings so that they connect directly, which suits access from within mainland China.

Without API credentials, playback still works; translation is disabled (banner points to ⚙ Settings).

Translations cache as xxx.bilingual.json next to the subtitle file. Delete to force re-translation.
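
A sidecar cache with incremental resume could work roughly like the sketch below. Everything about the file layout is an assumption: the mapping from `talk.srt` to `talk.bilingual.json`, the JSON schema (segment index as string key, Chinese text as value), and the helper names are all hypothetical and may differ from what `translator.py` actually writes.

```python
import json
from pathlib import Path


def cache_path_for(subtitle: Path) -> Path:
    """Assumed naming: talk.srt -> talk.bilingual.json in the same folder."""
    return subtitle.with_name(subtitle.stem + ".bilingual.json")


def load_cache(subtitle: Path) -> dict[str, str]:
    """Load cached translations, or an empty dict if no cache exists yet."""
    path = cache_path_for(subtitle)
    if not path.exists():
        return {}
    return json.loads(path.read_text(encoding="utf-8"))


def pending_batches(
    texts: list[str], cache: dict[str, str], batch_size: int
) -> list[list[int]]:
    """Group indices of not-yet-translated segments into batches for the LLM."""
    missing = [i for i in range(len(texts)) if str(i) not in cache]
    return [missing[i:i + batch_size] for i in range(0, len(missing), batch_size)]
```

Deleting the cache file makes `load_cache` return an empty dict, so every segment becomes pending again.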

Desktop Client & Packaging (Electron + PyInstaller → single exe)

The desktop build wraps the web app in Electron: the main process picks a free port, starts the embedded Python backend, waits for /api/health, then loads the local URL. In Electron, file picking uses a native dialog (web upload otherwise). electron-builder produces SubtitlePlayer.exe.
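
The startup sequence (pick a free port, start the backend, wait for `/api/health`) is implemented in the Electron main process. The Python sketch below only illustrates the same pattern; `find_free_port` and `wait_for_health` are hypothetical helpers, and treating any HTTP 200 response as healthy is an assumption.

```python
import socket
import time
import urllib.error
import urllib.request


def find_free_port(host: str = "127.0.0.1") -> int:
    """Ask the OS for an unused TCP port by binding to port 0."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.bind((host, 0))
        return sock.getsockname()[1]


def wait_for_health(base_url: str, timeout: float = 30.0, interval: float = 0.25) -> bool:
    """Poll GET {base_url}/api/health until it returns 200 or the timeout expires."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(base_url + "/api/health", timeout=2) as resp:
                if resp.status == 200:
                    return True
        except (urllib.error.URLError, OSError):
            pass  # backend not listening yet; retry
        time.sleep(interval)
    return False
```

A launcher would call `find_free_port()`, start the backend with `--port <port> --no-open`, and load the UI only once `wait_for_health(f"http://127.0.0.1:{port}")` returns `True`. There is a small window between releasing the probed port and the backend binding it, so a robust launcher should retry on failure.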

One-time setup

# 1) Python packaging tools (dev deps in root pyproject.toml)
uv sync

# 2) Node / Electron (first run is slow; mirrors recommended in China)
cd subtitle_player/desktop
$env:ELECTRON_MIRROR="https://npmmirror.com/mirrors/electron/"
$env:ELECTRON_BUILDER_BINARIES_MIRROR="https://npmmirror.com/mirrors/electron-builder-binaries/"
npm install --registry=https://registry.npmmirror.com

Development (no packaging)

cd subtitle_player/desktop
npm start

Build portable exe

# Step 1 — from repo root: PyInstaller backend → dist/SubtitlePlayer-backend/
uv run pyinstaller --noconfirm subtitle_player/packaging/backend.spec

# Step 2 — electron-builder single-file portable exe
cd subtitle_player/desktop
$env:ELECTRON_MIRROR="https://npmmirror.com/mirrors/electron/"
$env:ELECTRON_BUILDER_BINARIES_MIRROR="https://npmmirror.com/mirrors/electron-builder-binaries/"
npm run dist

Output: subtitle_player/desktop/release/SubtitlePlayer.exe (~345 MB).

Build uses compression: store (no compression). Re-enabling 7z compression can hang on large apps; store is faster and more reliable.

`.env` optional fallback

No .env required if you configure everything in ⚙ Settings. For team defaults or development, optional .env lookup order:

Same directory as the exe
%APPDATA%\SubtitlePlayer\.env
Project root .env (development)

Copy-Item .\.env "$env:APPDATA\SubtitlePlayer\.env"   # optional

Uploads and translate_config.json live under %APPDATA%\SubtitlePlayer\.
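
The lookup order above can be expressed as an ordered candidate list. This sketch is not the code in `paths.py`: the helper names are hypothetical, and it assumes the first file found wins rather than several files being merged.

```python
from pathlib import Path
from typing import Iterable, Optional


def env_candidates(exe_dir: Path, appdata: Path, project_root: Path) -> list[Path]:
    """Candidate .env locations in the documented lookup order."""
    return [
        exe_dir / ".env",                    # same directory as the exe
        appdata / "SubtitlePlayer" / ".env", # %APPDATA%\SubtitlePlayer\.env
        project_root / ".env",               # project root (development)
    ]


def first_existing(paths: Iterable[Path]) -> Optional[Path]:
    """Return the first candidate that exists as a file, else None."""
    for path in paths:
        if path.is_file():
            return path
    return None
```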

Project Structure

subtitle_player/
├── run.py                  # Entry (imports app for uvicorn + PyInstaller)
├── backend/
│   ├── app.py              # FastAPI: static files + API routes
│   ├── parser.py           # funasr .txt / .srt / .vtt → Segment
│   ├── translator.py       # LLM batch translate + .bilingual.json cache (SSE)
│   ├── translate_config.py # Model params (UI + JSON in user data dir)
│   ├── paths.py            # Resource / user data / .env (dev + frozen)
│   └── models.py           # Segment / Subtitle types
├── frontend/
│   ├── index.html
│   ├── style.css
│   ├── i18n.js             # UI strings (en default, zh)
│   └── app.js              # Playback, scroll, highlight, translation
├── desktop/                # Electron shell
├── packaging/backend.spec  # PyInstaller spec
├── mockup.html             # Offline UI prototype
├── .env.example
├── LICENSE
├── CONTRIBUTING.md
└── CHANGELOG.md

API

| Endpoint | Description |
| --- | --- |
| `GET /` | Frontend page |
| `GET /api/health` | Health check; `llm` availability |
| `POST /api/subtitle/upload` | Upload subtitle (multipart) |
| `GET /api/subtitle?path=` | Parse subtitle (includes cache if present) |
| `GET /api/translate-config` | Read translation settings |
| `PUT /api/translate-config` | Save translation settings |
| `POST /api/translate-config/reset` | Reset to `.env` defaults |
| `GET /api/translate?path=` | SSE streaming translation |

See docs/API.md for request/response details.
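
For custom integrations, the SSE endpoint can be consumed with the standard library alone. The sketch below parses generic Server-Sent Events `data:` fields; the payload format of each event is not specified here (see docs/API.md), so payloads are returned as raw strings. `iter_sse_data` and `stream_translation` are hypothetical helper names.

```python
from typing import Iterable, Iterator
from urllib.parse import quote
from urllib.request import urlopen


def iter_sse_data(lines: Iterable[str]) -> Iterator[str]:
    """Yield the data payload of each event in a Server-Sent Events stream."""
    buffer: list[str] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if buffer:
                yield "\n".join(buffer)
                buffer = []
        elif line.startswith("data:"):
            value = line[5:]
            buffer.append(value[1:] if value.startswith(" ") else value)
        # Comment lines (":...") and other fields are ignored in this sketch.
    # An unterminated trailing event is dropped, as the SSE format prescribes.


def stream_translation(base_url: str, subtitle_path: str) -> Iterator[str]:
    """Stream raw event payloads from GET /api/translate?path=..."""
    url = f"{base_url}/api/translate?path={quote(subtitle_path, safe='')}"
    with urlopen(url) as response:
        yield from iter_sse_data(raw.decode("utf-8") for raw in response)
```

With the server running on the default port, `for payload in stream_translation("http://127.0.0.1:8800", path): print(payload)` would print each event as it arrives.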

Eye-care Color Palette

| Token | Warm cream | Dark eye-care |
| --- | --- | --- |
| Background | `#ECE6D6` | `#20242A` |
| Highlight | `#2E7D6F` | `#6FD3BE` |
| Chinese text | `#8A6D3B` | `#D7B783` |

Avoids pure #fff / #000; warm tones and moderate contrast for long reading sessions.
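
One way to quantify "moderate contrast" is the WCAG 2.x contrast ratio, which ranges from 1 (identical colors) to 21 (black on white). The sketch below computes it for any two hex colors from the table; using this particular metric is an assumption, not something the project states.

```python
def _linear(channel: int) -> float:
    """Convert an 8-bit sRGB channel to linear light (WCAG 2.x formula)."""
    c = channel / 255
    return c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4


def relative_luminance(hex_color: str) -> float:
    """Relative luminance of a color given as '#RRGGBB'."""
    digits = hex_color.lstrip("#")
    r, g, b = (int(digits[i:i + 2], 16) for i in (0, 2, 4))
    return 0.2126 * _linear(r) + 0.7152 * _linear(g) + 0.0722 * _linear(b)


def contrast_ratio(color_a: str, color_b: str) -> float:
    """WCAG contrast ratio between two colors (order does not matter)."""
    la, lb = relative_luminance(color_a), relative_luminance(color_b)
    lighter, darker = max(la, lb), min(la, lb)
    return (lighter + 0.05) / (darker + 0.05)
```

For example, `contrast_ratio("#2E7D6F", "#ECE6D6")` gives the ratio for the warm-cream highlight color against its background.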

Contributing

See CONTRIBUTING.md (Chinese version: CONTRIBUTING.zh-CN.md).

License

MIT © xk

Keywords

subtitle-player subtitle-viewer srt-player vtt-player bilingual-subtitles karaoke-lyrics auto-scroll-subtitles timed-transcript funasr speech-to-text english-chinese-translation llm-translation dashscope qwen language-learning podcast-transcript eye-care-reading electron-app fastapi python open-source windows-portable-exe subtitle-translation lyric-highlight teleprompter sse-streaming
