Published November 4, 2023 | Version v1
Conference paper Open
Timbre Transfer Using Image-to-Image Denoising Diffusion Implicit Models
Description
Timbre transfer techniques aim to convert the sound of a musical piece played by one instrument into the sound it would have if played by another, while preserving as much of the musical content as possible, such as melody and dynamics. Following their recent breakthroughs in deep learning-based generation, we apply Denoising Diffusion Models (DDMs) to timbre transfer. Specifically, we use the recently proposed Denoising Diffusion Implicit Models (DDIMs), which accelerate the sampling procedure. Inspired by the recent application of DDMs to image translation problems, we formulate the timbre transfer task similarly: we first convert the audio tracks into log mel spectrograms, then condition the generation of the desired timbre spectrogram on the input timbre spectrogram. We perform both one-to-one and many-to-many timbre transfer, converting audio waveforms containing only single instruments and multiple instruments, respectively. We compare the proposed technique with existing state-of-the-art methods through both listening tests and objective measures to demonstrate its effectiveness.
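The description names two building blocks: generation conditioned on the input timbre spectrogram, and DDIM sampling over a shortened sequence of timesteps. A minimal pure-Python sketch of the deterministic DDIM update (η = 0) follows. Everything specific in it is an assumption made for illustration and is not taken from the paper: the linear noise schedule, the 20 sampling steps, and the names `make_alpha_bars`, `ddim_step`, `ddim_sample`, and `predict_noise`. Flat lists of floats stand in for log mel spectrograms, and an oracle that knows the target replaces the trained network, so the example checks only the update rule, not timbre-transfer quality.

```python
import math
import random


def make_alpha_bars(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative products of (1 - beta_t) for a linear beta schedule (assumed)."""
    alpha_bars, prod = [], 1.0
    for i in range(num_steps):
        beta = beta_start + (beta_end - beta_start) * i / (num_steps - 1)
        prod *= 1.0 - beta
        alpha_bars.append(prod)
    return alpha_bars


def ddim_step(x_t, eps, ab_t, ab_prev):
    """One deterministic DDIM update (eta = 0), elementwise on lists of floats."""
    out = []
    for x, e in zip(x_t, eps):
        # Estimate the clean sample implied by the predicted noise.
        x0_pred = (x - math.sqrt(1.0 - ab_t) * e) / math.sqrt(ab_t)
        # Re-noise that estimate to the (lower) noise level of the previous step.
        out.append(math.sqrt(ab_prev) * x0_pred + math.sqrt(1.0 - ab_prev) * e)
    return out


def ddim_sample(predict_noise, cond, x_T, alpha_bars, num_sampling_steps=20):
    """Run DDIM over an evenly spaced subsequence of the training timesteps.

    predict_noise(x_t, cond, t) is a hypothetical stand-in for a trained
    network that sees the noisy target and the conditioning spectrogram.
    """
    last = len(alpha_bars) - 1
    steps = [round(i * last / (num_sampling_steps - 1))
             for i in range(num_sampling_steps)]
    steps.reverse()  # from the noisiest timestep down to 0
    x = list(x_T)
    for i, t in enumerate(steps):
        # After the final step the sample is treated as noise-free.
        ab_prev = alpha_bars[steps[i + 1]] if i + 1 < len(steps) else 1.0
        eps = predict_noise(x, cond, t)
        x = ddim_step(x, eps, alpha_bars[t], ab_prev)
    return x


# Toy check: 8 values stand in for the input and desired timbre spectrograms.
random.seed(0)
alpha_bars = make_alpha_bars()
source = [random.uniform(-1.0, 1.0) for _ in range(8)]
target = [0.5 * s + 0.1 for s in source]  # made-up "timbre mapping"
noise = [random.gauss(0.0, 1.0) for _ in range(8)]
ab_T = alpha_bars[-1]
x_T = [math.sqrt(ab_T) * y + math.sqrt(1.0 - ab_T) * n
       for y, n in zip(target, noise)]


def oracle_noise(x_t, cond, t):
    """Oracle predictor: derives the target from cond, then solves for the noise."""
    ab = alpha_bars[t]
    return [(x - math.sqrt(ab) * (0.5 * c + 0.1)) / math.sqrt(1.0 - ab)
            for x, c in zip(x_t, cond)]


recovered = ddim_sample(oracle_noise, source, x_T, alpha_bars)
max_error = max(abs(r - y) for r, y in zip(recovered, target))
```

With a perfect noise predictor, the 20-step trajectory lands on the target up to floating-point rounding, which is why DDIM can skip most of the 1000 training timesteps. In a real system, `predict_noise` would be a trained network receiving both the noisy target spectrogram and the input timbre spectrogram; how the two are combined (for example, channel-wise concatenation) is not stated in the description and would be a further assumption.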
Files
Name | Size | MD5 |
---|---|---|
000029.pdf | 1.1 MB | 5489faf3b0aa0a290aae493918fe2a0d |
Metric | All versions | This version |
---|---|---|
Total views | 148 | 148 |
Total downloads | 162 | 162 |
Total data volume | 202.7 MB | 202.7 MB |
Details
- DOI
- 10.5281/zenodo.10265271
- Resource type
- Conference paper
- Publisher
- ISMIR
- Imprint
- Proceedings of the 24th International Society for Music Information Retrieval Conference, 257-263. Milan, Italy.
- Conference
- International Society for Music Information Retrieval Conference (ISMIR 2023), Milan, Italy, November 5-9, 2023
Rights
Creative Commons Attribution 4.0 International
The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.
Technical metadata
- Created
- December 5, 2023
- Modified
- July 10, 2024