Movatterモバイル変換


[0]ホーム

URL:


Skip to main
Published November 4, 2023 | Version v1
Conference paper Open

Timbre Transfer Using Image-to-Image Denoising Diffusion Implicit Models

Description

Timbre transfer techniques aim at converting the sound of a musical piece generated by one instrument into the same one as if it was played by another instrument, while maintaining as much as possible the content in terms of musical characteristics such as melody and dynamics. Following their recent breakthroughs in deep learning-based generation, we apply Denoising Diffusion Models (DDMs) to perform timbre transfer. Specifically, we apply the recently proposed Denoising Diffusion Implicit Models (DDIMs) that enable to accelerate the sampling procedure. Inspired by the recent application of DDMs to image translation problems we formulate the timbre transfer task similarly, by first converting the audio tracks into log mel spectrograms and by conditioning the generation of the desired timbre spectrogram through the input timbre spectrogram. We perform both one-to-one and many-to-many timbre transfer, by converting audio waveforms containing only single instruments and multiple instruments, respectively.We compare the proposed technique with existing state-of-the-art methods both through listening tests and objective measures in order to demonstrate the effectiveness of the proposed model.

Files

000029.pdf

Files (1.1 MB)

NameSize Download all
md5:5489faf3b0aa0a290aae493918fe2a0d
1.1 MBPreviewDownload
148
Views
162
Downloads

Versions

External resources

Indexed in

Communities

Details

DOI
10.5281/zenodo.10265271
DOI Badge

DOI

10.5281/zenodo.10265271

Markdown

[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.10265271.svg)](https://doi.org/10.5281/zenodo.10265271)

reStructuredText

.. image:: https://zenodo.org/badge/DOI/10.5281/zenodo.10265271.svg  :target: https://doi.org/10.5281/zenodo.10265271

HTML

<a href="https://doi.org/10.5281/zenodo.10265271"><img src="https://zenodo.org/badge/DOI/10.5281/zenodo.10265271.svg" alt="DOI"></a>

Image URL

https://zenodo.org/badge/DOI/10.5281/zenodo.10265271.svg

Target URL

https://doi.org/10.5281/zenodo.10265271
Resource type
Conference paper
Publisher
ISMIR
Imprint
Proceedings of the 24th International Society for Music Information Retrieval Conference, 257-263. Milan, Italy.
Conference
International Society for Music Information Retrieval Conference (ISMIR 2023), Milan, Italy, November 5-9, 2023

Rights

  • The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.Read more

Citation

Export

Technical metadata

Created
December 5, 2023
Modified
July 10, 2024

This site uses cookies. Find out more onhow we use cookies


[8]ページ先頭

©2009-2025 Movatter.jp