whisper-alternative

Here are 13 public repositories matching this topic...

modelscope / FunASR

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

Updated Jun 8, 2026
Python

FunAudioLLM / SenseVoice

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

multilingual python pytorch audio-analysis speech-recognition speech-to-text asr emotion-detection cross-lingual speech-emotion-recognition voice-ai llm audio-event-detection whisper-alternative

Updated May 25, 2026
Python

FunAudioLLM / Fun-ASR

End-to-end speech recognition large model: 31 languages, dialects, accents, lyrics, hotwords, timestamps, speaker diarization. Trained on tens of millions of hours.

pytorch speech-recognition speech-to-text transcription asr speaker-diarization chinese-dialects real-time-asr audio-language-model multilingual-asr fun-asr whisper-alternative 31-languages llm-asr

Updated Jun 8, 2026
Python

felixfu824 / HushType

Local voice-to-text for macOS and iOS. Multilingual (EN/ZH/JP) with Traditional Chinese output. Runs Qwen3-ASR on Apple Silicon via MLX. No cloud, no subscription.

Updated May 27, 2026
Swift

Vincent-WenZX / CWX-Transcribe

Production pipeline around OpenAI gpt-4o-transcribe-diarize for long-form 2-speaker interviews. Cross-chunk speaker consistency · diarization hallucination fix · async GPT-5.5 domain-term correction. WER 6.05% / DER 4.28% on 2h26m benchmark. Beats raw OpenAI API by +11.5 Q.

Updated May 6, 2026
Python

bykcyc / Cadence

Private, local-first meeting recorder + transcription, diarization, AI notes, voice dictation & read-aloud for Windows — runs on your own GPU.

Updated Jun 7, 2026
TypeScript

199-biotechnologies / textstream-asr

Live speech-to-text streaming on Apple Silicon — Qwen3-ASR + Silero VAD + MLX

python macos speech-recognition server-sent-events speech-to-text transcription asr mlx voice-activity-detection live-captions on-device-ai apple-silicon silero-vad real-time-transcription local-ai offline-asr qwen3-asr streaming-transcription whisper-alternative

Updated Mar 30, 2026
Python

tristan-mcinnis / local-dictation

Free, private, on-device dictation for macOS (Apple Silicon). Push-to-talk speech-to-text with on-device LLM cleanup — an offline, local alternative to cloud & Whisper dictation. Parakeet TDT v3 ASR + Qwen 2.5 1.5B cleanup + macOS Accessibility injection. Pure Rust, ~300–400 ms per utterance, nothing leaves your Mac.

Updated Jun 4, 2026
Rust

briancaffey / nemotron-asr-server

OpenAI-compatible speech-to-text server for nvidia/nemotron-3.5-asr-streaming-0.6b (NeMo). Runs on the DGX Spark / GB10.

nvidia speech-to-text transcription nemo asr fastapi openai-api nemotron dgx-spark whisper-alternative

Updated Jun 8, 2026
Python

Trust-1-eng / transcription-studio

Local FastAPI transcription studio: AssemblyAI Universal-2 (99 lang), FFmpeg, yt-dlp, Word/PDF/ZIP export

python ai speech-to-text transcription fastapi yt-dlp assemblyai whisper-alternative

Updated Jun 6, 2026
Python

josuebustosn / gemini-transcribe

Claude Code skill that transcribes audio and video to structured Markdown with timestamps and diarization, using Google Gemini. A free replacement for ElevenLabs Scribe / Whisper for anyone already paying for Gemini.

python gemini spanish speech-to-text transcription diarization google-gemini claude-code claude-skill elevenlabs-alternative whisper-alternative

Updated May 26, 2026
Python

karandeepbhardwaj / Yapper

Voice-to-text desktop app that captures speech, refines transcripts with AI, and auto-pastes at your cursor

desktop-app windows macos rust productivity ai speech-to-text transcription dictation voice-to-text tauri whisper-alternative

Updated Jun 2, 2026
TypeScript

DENE-ctrl / mlx-live

Build a local, real-time voice assistant for Apple Silicon using MLX for speech recognition, vision-language model responses, and text-to-speech synthesis.

chat crypto local offline speech-recognition server-sent-events muon speech-to-text transcription voice-activity-detection on-device on-device-ai silero-vad local-ai streaming-transcription whisper-alternative

Updated Jun 8, 2026
HTML
