Multispeaker Community Vocoder model for DiffSinger
This is the code used to train the "HiFiPLN" vocoder.
A trained model for use with OpenUtau is available for download on the official release page.
Why "HiFiPLN"? Because a lot of PLN was spent training this thing.
Python 3.10 or greater is required.
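A minimal environment setup sketch, assuming you work in a virtual environment and that the repository ships a requirements.txt listing its dependencies (check the repository for the actual install instructions):
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt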
To prepare the training data, put your audio files in a directory and split them into short clips resampled to 44.1 kHz:
python dataset-utils/split.py --length 1 -sr 44100 -o "dataset/train" PATH_TO_DATASET
You will also need to provide some validation audio files and save them to dataset/valid.
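For example, assuming your held-out clips live in a hypothetical PATH_TO_VALIDATION directory of 44.1 kHz WAV files, you could copy a handful of them over:
mkdir -p dataset/valid
cp PATH_TO_VALIDATION/*.wav dataset/valid/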
Once the dataset and validation files are in place, run the preprocessing step:
python preproc.py --path dataset --config "configs/hifipln.yaml"
To start training, run:
python train.py --config "configs/hifipln.yaml"
- If you see an error saying "Total length of `Data Loader` across ranks is zero" then you do not have enough validation files.
- You may want to edit configs/hifipln.yaml and change train: batch_size: 12 to a value that better fits your available VRAM (see the sketch after this list).
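A minimal sketch of the relevant part of configs/hifipln.yaml, assuming the batch size sits under a train: key as described above (the comment and any surrounding keys are illustrative, not the full config):
train:
  batch_size: 12  # lower this value if you run out of VRAM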
To resume an interrupted run, run:
python train.py --config "configs/hifipln.yaml" --resume CKPT_PATH
You may set CKPT_PATH to a log directory (e.g. logs/HiFiPLN), and it will find the last checkpoint of the last run.
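For example, to resume from the most recent checkpoint in the log directory mentioned above:
python train.py --config "configs/hifipln.yaml" --resume logs/HiFiPLN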
To finetune the model on your own data, download a checkpoint from https://utau.pl/hifipln/#checkpoints-for-finetuning
Save the checkpoint as ckpt/HiFiPLN.ckpt, then run:
python train.py --config "configs/hifipln-finetune.yaml"
- Finetuning should not be run for too long, especially on small datasets; 2-3 epochs or roughly 20000 steps is usually enough.
To export a trained model, run:
python export.py --config configs/hifipln.yaml --output out/hifipln --model CKPT_PATH
As with resuming, you may set CKPT_PATH to a log directory (e.g. logs/HiFiPLN), and it will find the last checkpoint of the last run.
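For example, a hypothetical export of the latest checkpoint from that log directory would be:
python export.py --config configs/hifipln.yaml --output out/hifipln --model logs/HiFiPLN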