# multi-speaker

Here are 15 public repositories matching this topic...

netease-youdao/EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

python text-to-speech ai deep-learning style prompt speech emotion pytorch tts speech-synthesis multi-speaker emotivoice

Updated Aug 13, 2024
Python

mikebrady/shairport-sync

AirPlay and AirPlay 2 audio player

audio audio-player embedded-systems audio-streaming multi-room-audio airplay multi-speaker synchronized-audio airplay-2

Updated Mar 16, 2025
C

r9y9/deepvoice3_pytorch

PyTorch implementation of convolutional neural network-based text-to-speech synthesis models

python machine-learning end-to-end pytorch tts speech-synthesis speech-processing multi-speaker

Updated Dec 19, 2023
Python

ranchlai/mandarin-tts

Chinese Mandarin text-to-speech (中文普通话语音合成) based on FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder, with the Biaobei and AISHELL-3 datasets

pytorch tts multi-speaker tacotron fastspeech2 tts-chinese tts-hanzi aishell3

Updated May 28, 2022
Python

keonlee9420/Comprehensive-Transformer-TTS

A non-autoregressive Transformer-based text-to-speech system, supporting a family of SOTA transformers with supervised and unsupervised duration modeling. This project grows with the research community, aiming to achieve the ultimate TTS.

text-to-speech deep-learning unsupervised pytorch tts speech-synthesis transformer supervised multi-speaker sota comprehensive single-speaker neural-tts non-autoregressive fastspeech fastspeech2 hifi-gan non-ar mel-gan ultimate-tts

Updated Sep 24, 2022
Python

aishoot/LSTM_PIT_Speech_Separation

Two-talker speech separation with LSTM/BLSTM using the Permutation Invariant Training method.

multi-speaker audio-separation speech-separation speech-enhancement permutation-invariant-training robust-speech-recognition

Updated Jan 6, 2022
Jupyter Notebook

DrewThomasson/VoxNovel

VoxNovel: generate audiobooks giving each character a different voice actor.

windows linux mac torch tts epub audiobooks multi-speaker m4b torchaudio voice-cloning audiobook-creator booknlp generative-ai styletts2

Updated Jan 17, 2025
Python

keonlee9420/Comprehensive-E2E-TTS

A non-autoregressive end-to-end text-to-speech system (text-to-wav), supporting a family of SOTA unsupervised duration modeling methods. This project grows with the research community, aiming to achieve the ultimate E2E-TTS.

text-to-speech deep-learning unsupervised end-to-end pytorch tts speech-synthesis jets multi-speaker sota single-speaker neural-tts non-autoregressive fastspeech2 hifi-gan non-ar ultimate-tts text-to-wav

Updated Jun 6, 2022
Python

Totoketchup/Adaptive-MultiSpeaker-Separation

Adaptive and Focusing Neural Layers for the Multi-Speaker Separation Problem

tensorflow adaptive-learning deeplearning multi-speaker source-separation audio-separation speech-separation deep-learning-architectures

Updated Jul 7, 2018
Jupyter Notebook

keonlee9420/Comprehensive-Tacotron2

PyTorch implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS, along with several techniques to improve the robustness and efficiency of the model.

text-to-speech deep-learning efficiency pytorch tts speech-synthesis autoregressive multi-speaker robustness comprehensive tacotron single-speaker neural-tts tacotron2 reduction-factor hifi-gan mel-gan diagonal-guided-attention

Updated Jul 31, 2023
Python

anton-jeran/MULTI-AUDIODEC

This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.

overlap codec rir spatial-audio multi-speaker room-impulse-response binaural speech-separation speech-enhancement overlapping-speech neural-coding audio-codecs room-impulse-responses

Updated Mar 17, 2025
Python

hwRG/FastSpeech2-Pytorch-Korean-Multi-Speaker

Multi-speaker FastSpeech2 adapted for Korean, with detailed descriptions of training and synthesis.

pytorch tts korean transfer-learning multi-speaker fastspeech2

Updated Jul 19, 2022
Python

nikitashvarts/CocktailPartySpeakerRecognition

An Algorithm for Speaker Recognition in a Multi-Speaker Environment

deep-learning lstm speaker-recognition multi-speaker cocktail-party-problem

Updated Aug 14, 2020
Python

ZoraizQ/urdu-speech-recognition

Urdu Speech Recognition using Kaldi ASR, by training Triphone Acoustic GMMs using the PRUS dataset.

speech-recognition multi-speaker urdu kaldi-asr prus

Updated Sep 24, 2021
Shell

parisimaa/multi_speaker

deep-learning multi-speaker audio-processing

Updated Nov 1, 2022
MATLAB
