- Notifications
You must be signed in to change notification settings - Fork83
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)
License
PaddlePaddle/Parakeet
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Parakeet has moved toPaddleSpeech, this repo will not update anymore, you can open issues of Parakeet in PaddleSpeech
Parakeet aims to provide a flexible, efficient and state-of-the-art text-to-speech toolkit for the open-source community. It is built on PaddlePaddle dynamic graph and includes many influential TTS models.
- Oct-12-2021, Refector examples code.
- Oct-12-2021, Parallel WaveGAN with LJSpeech. Checkexamples/GANVocoder/parallelwave_gan/ljspeech.
- Oct-12-2021, FastSpeech2/FastPitch with LJSpeech. Checkexamples/fastspeech2/ljspeech.
- Sep-14-2021, Reconstruction of TransformerTTS. Checkexamples/transformer_tts/ljspeech.
- Aug-31-2021, Chinese Text Frontend. Checkexamples/text_frontend.
- Aug-23-2021, FastSpeech2/FastPitch with AISHELL-3. Checkexamples/fastspeech2/aishell3.
- Aug-03-2021, FastSpeech2/FastPitch with CSMSC. Checkexamples/fastspeech2/baker.
- Jul-19-2021, SpeedySpeech with CSMSC. Checkexamples/speedyspeech/baker.
- Jul-01-2021, Parallel WaveGAN with CSMSC. Checkexamples/GANVocoder/parallelwave_gan/baker.
- Jul-01-2021, Montreal-Forced-Aligner. Checkexamples/use_mfa.
- May-07-2021, Voice Cloning in Chinese. Checkexamples/tacotron2_aishell3.
In order to facilitate exploiting the existing TTS models directly and developing the new ones, Parakeet selects typical models and provides their reference implementations in PaddlePaddle. Further more, Parakeet abstracts the TTS pipeline and standardizes the procedure of data preprocessing, common modules sharing, model configuration, and the process of training and synthesis. The models supported here include Text FrontEnd, end-to-end Acoustic models and Vocoders:
Text FrontEnd
- Rule based Chinese frontend.
Acoustic Models
Vocoders
Voice Cloning
It's difficult to install some dependent libraries for this repo in Windows system, we recommend that youDO NOT use Windows system, please useLinux
.
Make sure the librarylibsndfile1
is installed, e.g., on Ubuntu.
sudo apt-get install libsndfile1
Seeinstall for more details. This repo requires PaddlePaddle2.1.2 or above.
git clone https://github.com/PaddlePaddle/Parakeetcd Parakeetpip install -e.
If some python dependent packages cannot be installed successfully, you can run the following script first.(replacepython3.6
with your own python version)
sudo apt install -y python3.6-dev
Seeinstall for more details.
Entries to the introduction, and the launch of training and synthsis for different example models:
- >>> Chinese Text Frontend
- >>> FastSpeech2/FastPitch
- >>> Montreal-Forced-Aligner
- >>> Parallel WaveGAN
- >>> SpeedySpeech
- >>> Tacotron2_AISHELL3
- >>> GE2E
- >>> WaveFlow
- >>> TransformerTTS
- >>> Tacotron2
Check ourwebsite for audio sampels.
- fastspeech2_nosil_baker_ckpt_0.4.zip
- fastspeech2_nosil_aishell3_ckpt_0.4.zip
- fastspeech2_nosil_ljspeech_ckpt_0.5.zip
Parakeet is provided under theApache-2.0 license.
About
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)