multi-speaker
Here are 15 public repositories matching this topic...
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
- Updated Aug 13, 2024 - Python
AirPlay and AirPlay 2 audio player
- Updated Mar 16, 2025 - C
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
- Updated Dec 19, 2023 - Python
Chinese Mandarin text-to-speech (TTS) based on FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder, with the Biaobei and AISHELL-3 datasets
- Updated May 28, 2022 - Python
A non-autoregressive Transformer-based text-to-speech system, supporting a family of SOTA transformers with supervised and unsupervised duration modeling. This project grows with the research community, aiming to achieve the ultimate TTS
- Updated Sep 24, 2022 - Python
Two-talker speech separation with LSTM/BLSTM using the permutation invariant training method.
- Updated Jan 6, 2022 - Jupyter Notebook
VoxNovel: generate audiobooks giving each character a different voice actor.
- Updated Jan 17, 2025 - Python
A non-autoregressive end-to-end text-to-speech system (text-to-wav), supporting a family of SOTA unsupervised duration modeling methods. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
- Updated Jun 6, 2022 - Python
Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem
- Updated Jul 7, 2018 - Jupyter Notebook
PyTorch implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS, along with several techniques to improve the robustness and efficiency of the model.
- Updated Jul 31, 2023 - Python
This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.
- Updated Mar 17, 2025 - Python
Multi-speaker FastSpeech2 applicable to Korean, with a detailed description of training and synthesis.
- Updated Jul 19, 2022 - Python
An Algorithm for Speaker Recognition in a Multi-Speaker Environment
- Updated Aug 14, 2020 - Python
Urdu speech recognition using Kaldi ASR, with triphone acoustic GMMs trained on the PRUS dataset.
- Updated Sep 24, 2021 - Shell
- Updated Nov 1, 2022 - MATLAB