Extract phoneme-level timestamps from speeh audio.
-
Updated
May 12, 2026 - Python
Extract phoneme-level timestamps from speeh audio.
pytorch model for contexless-phoneme prediction from speech audio
A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.
Speech Assessment API in FastAPI with HuggingFace 🤗
End-to-end IPA-based phoneme recognition pipeline using Wav2Vec2, featuring preprocessing, vocabulary generation, model training, and evaluation.
My bachelor thesis on Phoneme recognition and alignment on the TIMIT dataset
Natural Language Processing Projects
Official Python SDK for the Vocametrix voice analysis API — AVQI, DSI, jitter/shimmer, pronunciation assessment, speech-to-text, prosody similarity, and 40+ more clinical and acoustic measures.
2-People-Spanish-Average-Tone-Speech-Synthesis-Corpus
Context-aware neural decoding for speech BCI: extending DCoND with skip-diphone auxiliary supervision and temporal smoothness regularization on the Brain-to-Text '24 Benchmark.
Add a description, image, and links to the phoneme-recognition topic page so that developers can more easily learn about it.
To associate your repository with the phoneme-recognition topic, visit your repo's landing page and select "manage topics."