AirPlay and AirPlay 2 audio player
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
PyTorch implementation of convolutional neural network-based text-to-speech synthesis models
100M parameter lightweight conversational text-to-speech model with breaths, laughter, multi-speaker dialogue, voice cloning, and streaming. Llama-based, on-device.
Chinese (Mandarin) text-to-speech synthesis, built on FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder and trained on the Biaobei and AISHELL-3 datasets
VoxNovel: generate audiobooks in which each character gets a different voice actor.
A non-autoregressive Transformer-based text-to-speech system, supporting a family of SOTA Transformers with supervised and unsupervised duration modeling. This project grows with the research community, aiming to achieve the ultimate TTS.
Two-talker speech separation with LSTM/BLSTM using the permutation invariant training (PIT) method.
A non-autoregressive end-to-end text-to-speech system (text-to-wav), supporting a family of SOTA unsupervised duration modeling approaches. This project grows with the research community, aiming to achieve the ultimate E2E-TTS.
This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.
Adaptive and focusing neural layers for the multi-speaker separation problem
Draft to Take beta: local-first AI audio production studio powered by IndexTTS2, Docker, Qwen, OmniVoice, SFX, ambience, and music sidecars.
PyTorch implementation of Google's "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions." This implementation supports both single- and multi-speaker TTS, along with several techniques to improve the robustness and efficiency of the model.
Multi-speaker FastSpeech 2 for Korean, with detailed descriptions of training and synthesis.
🎵 Complete offline audio transcription system with speaker diarization using OpenAI Whisper and PyAnnote. Features automatic audio cleaning, precise timestamps, multiple output formats (JSON/TXT/Markdown), and support for 20+ audio formats. No external APIs required - works entirely offline.
An Algorithm for Speaker Recognition in a Multi-Speaker Environment
Urdu speech recognition using Kaldi ASR, trained with triphone acoustic GMMs on the PRUS dataset.
A professional CLI for Google Gemini 2.5's native TTS. Generate multi-speaker podcasts ("Deep Dive"), audio summaries, and expressive speech from text or files.
AI voice cloning panel that generates multi-speaker discussions between famous personalities on any topic, powered by Qwen3-TTS
Grid-style audio router