NotificationsYou must be signed in to change notification settings
Fork3
Star11

Sum-Product Networks (SPNs) for Robust Automatic Speaker Identification.

You must be signed in to change notification settings

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
results		results
spn_asi		spn_asi
.gitignore		.gitignore
README.md		README.md
config.sh		config.sh
main.py		main.py
requirements.txt		requirements.txt
run.sh		run.sh
spk_model.jpg		spk_model.jpg

Repository files navigation

SPNs for robust ASI

Sum-product networks (SPNs) with Gaussian leaves are used as speaker models for automatic speaker identification (ASI)[1]. An example of an SPN with univariate Gaussian leaves is shown in Figure 1. Marginalisation and bounded marginalisation, as proposed byCooket al., are used to significantly increase the robustness of the SPN speaker models to noise. To identify the reliable spectral components for marginalisation, ana priori SNR estimator is used.


Figure 1: SPN speaker model with univariate Gaussian leaves.

Installation

git clone https://github.com/anicolson/SPN-ASI.git
cd SPN-ASI
virtualenv --system-site-packages -p python3 ~/venv/SPN-ASI
source ~/venv/SPN-Spk-Rec/bin/activate
pip install -r requirements.txt

Dataset

TIMIT corpus for clean speech:

The clean speech of the speakers included in the TIMIT corpus are used to train the ASI system. The path to the TIMIT corpus is set inconfig.sh. If there is an issue with SPHERE .wav files, please use the followingscript.

Noisy speech anda priori SNR estimates:

The noisy speech anda priori SNR estimates used for testing the ASI system can be obtained from:http://dx.doi.org/10.21227/nbec-q510. Please see theDeep Xi repository if you require outputs from ana priori SNR estimator. The paths to the noisy speech anda priori SNR estimates are set inconfig.sh.

How to use the ASI system

Training:

./run.sh TRAIN 1

Identification:

./run.sh IDENTIFICATION 1 MARG 1 BOUNDS 1

MARG is the flag for the marginalisation, andBOUNDS specifies whether bounds are to be used with marginalisation.

References

Please cite the following:

[1] Nicolson, A., & Paliwal, K. K. (2020). Sum-Product Networks for Robust Automatic Speaker Identification. Proc. Interspeech 2020.

About

Sum-Product Networks (SPNs) for Robust Automatic Speaker Identification.

Releases

No releases published

Packages

No packages published

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

SPNs for robust ASI

Installation

Dataset

How to use the ASI system

References

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Languages

Movatterモバイル変換

anicolson/SPN-ASI

Folders and files

Latest commit

History

Repository files navigation

SPNs for robust ASI

Installation

Dataset

How to use the ASI system

References

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Uh oh!

Languages

Packages