Song generation method, song generation device, electronic equipment and storage medium

Technical Field

The disclosure relates to the technical field of computers, and in particular to a song generating method, a song generating device, electronic equipment and a storage medium.
Background

Song synthesis refers to the generation of corresponding singing audio from lyrics and a musical score. The corresponding synthesis algorithms have evolved from the early unit-splicing (concatenative) technology, through statistical parametric synthesis, to the current deep-learning-based synthesis. Song synthesis technology enables a machine to sing, which further increases the interest of human-machine interaction and therefore has high commercial value.
In the related art, song synthesis generally places high requirements on the quantity and quality of the training corpus, so the song generation process is complicated and the song generation effect cannot be guaranteed.
Disclosure of Invention
The embodiments of the disclosure provide a song generation method, a song generation device, electronic equipment and a storage medium, which can be applied to the technical field of data processing. In the song generation process, the real Mel spectrum characteristics of a target user are effectively combined with the song template corresponding to the target song, which effectively reduces the dependence on the amount of user voice data, thereby improving the convenience of song generation while effectively improving the song generation effect.
In a first aspect, an embodiment of the present disclosure provides a song generating method, including:
acquiring voice audio input by a target user and a unique identification number of a target song;
extracting the Mel spectrum characteristics of the voice audio to obtain the real Mel spectrum characteristics of the target user;
obtaining a song template corresponding to the unique identification number according to the unique identification number of the target song;
inputting the real Mel spectrum characteristics of the target user and the song template into a preset song generation model to obtain target Mel spectrum characteristics output by the song generation model, wherein the song generation model is obtained through machine learning training by using a training set, the training set is from a plurality of sampling users, the training set comprises a plurality of samples, each sampling user corresponds to at least one sample, and each sample comprises singing audio picked up when the sampling user sings a certain song and lyric text corresponding to the singing audio;
and generating a target song according to the target Mel spectrum characteristics.
In a second aspect, an embodiment of the present disclosure provides a training method of a song generating model, including:
acquiring a training set from a plurality of sampling users, wherein the training set comprises a plurality of samples, each sampling user corresponds to at least one sample, and each sample comprises singing audio picked up when the sampling user sings a certain song and lyric text corresponding to the singing audio;
acquiring a pre-built initial neural network model, wherein the initial neural network model comprises initial weight parameters and a loss function;
acquiring a first sample from the training set, inputting the first sample into the initial neural network model to obtain real Mel spectrum characteristics and predicted Mel spectrum characteristics, wherein the real Mel spectrum characteristics represent Mel spectrum characteristics of singing audio in the first sample, and the predicted Mel spectrum characteristics represent Mel spectrum characteristics predicted by the initial neural network model;
calculating an error between the predicted Mel spectrum characteristics and the real Mel spectrum characteristics according to the loss function;
adjusting the initial weight parameters of the initial neural network model according to the error to obtain an updated neural network model;
and acquiring subsequent samples one by one from the training set and repeatedly inputting them into the latest neural network model until the loss function converges, so as to obtain a trained song generation model.
In a third aspect, an embodiment of the present disclosure provides a song generating apparatus, including a first obtaining module, configured to obtain voice audio input by a target user and a unique identification number of the target song;
The first processing module is used for extracting the Mel spectrum characteristics of the voice audio to obtain the real Mel spectrum characteristics of the target user;
The second acquisition module is used for acquiring a song template corresponding to the unique identification number according to the unique identification number of the target song;
The second processing module is used for inputting the real Mel spectrum characteristics of the target user and the song template into a preset song generation model to obtain target Mel spectrum characteristics output by the song generation model, wherein the song generation model is obtained through machine learning training by using a training set, the training set is from a plurality of sampling users, the training set comprises a plurality of samples, each sampling user corresponds to at least one sample, and each sample comprises singing audio picked up when the sampling user sings a certain song and lyric text corresponding to the singing audio;
And the generating module is used for generating target songs according to the target Mel spectrum characteristics.
In a fourth aspect, an embodiment of the present disclosure provides a training apparatus for a song generating model, including:
The third acquisition module is used for acquiring a training set from a plurality of sampling users, wherein the training set comprises a plurality of samples, each sampling user corresponds to at least one sample, and each sample comprises singing audio picked up when the sampling user sings a certain song and lyric text corresponding to the singing audio;
The fourth acquisition module is used for acquiring a pre-built initial neural network model, wherein the initial neural network model comprises initial weight parameters and a loss function;
A fifth obtaining module, configured to obtain a first sample from the training set, and input the first sample into the initial neural network model, to obtain a real mel spectrum feature and a predicted mel spectrum feature, where the real mel spectrum feature represents a mel spectrum feature of singing audio in the first sample, and the predicted mel spectrum feature represents a mel spectrum feature predicted by the initial neural network model;
The third processing module is used for calculating errors between the predicted Mel-spectrum characteristics and the real Mel-spectrum characteristics according to the loss function;
the fourth processing module is used for adjusting the initial weight parameters of the initial neural network model according to the errors to obtain an updated neural network model;
And a sixth acquisition module, configured to acquire subsequent samples from the training set one by one, and repeatedly input the subsequent samples to the latest neural network model until the loss function converges, so as to obtain a song generation model after training is completed.
In a fifth aspect, embodiments of the present disclosure provide an electronic device, including a memory, a processor, and a computer program stored on the memory and executable on the processor, where the processor, when executing the program, implements the song generation method set forth in the embodiments of the first aspect of the present disclosure, or implements the training method of a song generation model set forth in the embodiments of the second aspect of the present disclosure.
In a sixth aspect, embodiments of the present disclosure provide a non-transitory computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements a song generating method as set forth in the embodiments of the first aspect of the present disclosure, or implements a training method of a song generating model as set forth in the embodiments of the second aspect of the present disclosure.
In a seventh aspect, embodiments of the present disclosure provide a computer program product, which when executed by a processor, performs a song generation method as set forth in the embodiments of the first aspect of the present disclosure, or performs a training method of a song generation model as set forth in the embodiments of the second aspect of the present disclosure.
In summary, the song generating method, apparatus, electronic device, storage medium, computer program and computer program product provided in the embodiments of the present disclosure may implement the following technical effects:
Voice audio input by a target user and the unique identification number of the target song are obtained; Mel spectrum feature extraction is performed on the voice audio to obtain the real Mel spectrum characteristics of the target user; a song template corresponding to the unique identification number is obtained according to the unique identification number of the target song; the real Mel spectrum characteristics of the target user and the song template are input into a preset song generation model to obtain target Mel spectrum characteristics output by the song generation model; and the target song is generated according to the target Mel spectrum characteristics. In the song generation process, the real Mel spectrum characteristics of the target user and the song template corresponding to the target song can thus be effectively combined, so that the dependence on the amount of user voice data is effectively reduced, thereby improving the convenience of song generation while effectively improving the song generation effect.
Drawings

In order to more clearly illustrate the technical solutions in the embodiments or the background of the present disclosure, the following description will explain the drawings that are required to be used in the embodiments or the background of the present disclosure.
FIG. 1 is a flow chart of a song generation method according to an embodiment of the present disclosure;
FIG. 2 is a flow chart of a song generation method according to another embodiment of the present disclosure;
FIG. 3 is a schematic diagram of a song template generation process according to an embodiment of the present disclosure;
FIG. 4 is a flow chart of a song generation method according to another embodiment of the present disclosure;
FIG. 5 is a schematic diagram of a timbre coding submodel according to an embodiment of the present disclosure;
FIG. 6 is a schematic diagram of a song generation process according to an embodiment of the present disclosure;
FIG. 7 is a flow chart of a training method of a song generation model according to an embodiment of the present disclosure;
FIG. 8 is a training flow diagram of an initial neural network model according to an embodiment of the present disclosure;
Fig. 9 is a schematic structural diagram of a song generating apparatus according to an embodiment of the present disclosure;
Fig. 10 is a schematic structural diagram of a song generating apparatus according to another embodiment of the present disclosure;
FIG. 11 is a schematic diagram of a training device for a song-generation model according to an embodiment of the present disclosure;
FIG. 12 is a schematic diagram of a training device for song-generation models according to another embodiment of the present disclosure;
Fig. 13 illustrates a block diagram of an exemplary electronic device suitable for use in implementing embodiments of the present disclosure.
Detailed Description

Reference will now be made in detail to exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numbers in different drawings refer to the same or similar elements, unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with the embodiments of the present disclosure. Rather, they are merely examples of apparatus and methods consistent with aspects of embodiments of the present disclosure as detailed in the accompanying claims.
The terminology used in the embodiments of the disclosure is for the purpose of describing particular embodiments only and is not intended to be limiting of the embodiments of the disclosure. As used in this disclosure of embodiments and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It should also be understood that the term "and/or" as used herein refers to and encompasses any or all possible combinations of one or more of the associated listed items.
It should be understood that although the terms first, second, third, etc. may be used in embodiments of the present disclosure to describe various information, the information should not be limited to these terms. These terms are only used to distinguish one type of information from another. For example, first information may also be referred to as second information, and similarly, second information may also be referred to as first information, without departing from the scope of embodiments of the present disclosure. The word "if" as used herein may be interpreted as "at the time of", "when" or "in response to determining", depending on the context.
For ease of understanding, the terms referred to in this disclosure are first introduced.
1. Mel spectrum
The Mel spectrum is a feature commonly used in deep learning for speech. An ordinary spectrogram has a linear frequency axis, whereas the Mel spectrum converts the frequency of the ordinary spectrogram from the linear scale to the Mel scale based on the characteristics of human hearing (relatively sensitive to low-frequency sounds and with relatively poor resolving power for high-frequency sounds); the Mel scale is a logarithmic scale that more closely matches human perception of frequency.
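As a purely illustrative sketch (not part of the disclosure), the conversion from frequency in hertz to the mel scale is commonly computed with the logarithmic formula below; the function name and the example frequencies are chosen only for illustration.

```python
import math

def hz_to_mel(frequency_hz: float) -> float:
    # Common HTK-style mel-scale formula: compresses high frequencies
    # logarithmically, mirroring the ear's reduced resolution there.
    return 2595.0 * math.log10(1.0 + frequency_hz / 700.0)

# 100 Hz vs 200 Hz are much farther apart on the mel scale than
# 7000 Hz vs 7100 Hz, reflecting higher sensitivity at low frequencies.
print(hz_to_mel(200) - hz_to_mel(100))    # roughly 133 mel
print(hz_to_mel(7100) - hz_to_mel(7000))  # roughly 15 mel
```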
2. Phonemes
A phoneme is the smallest phonetic unit divided according to the natural attributes of speech; it can be analyzed according to the articulatory actions within a syllable, where one action forms one phoneme. Phonemes are divided into two major classes: vowels and consonants.
3. Tone color
Tone color (timbre) means that different sounds always have distinctive characteristics in terms of waveform, because different objects vibrate in different ways.
Fig. 1 is a flowchart of a song generating method according to an embodiment of the present disclosure.
It should be noted that, the main execution body of the song generating method in this embodiment is a song generating device, and the device may be implemented in a software and/or hardware manner, and the device may be configured in an electronic device, where the electronic device may include, but is not limited to, a terminal, a server, and the like, and the terminal may be, for example, a smart phone, a smart television, a smart watch, a smart car, and the like.
As shown in fig. 1, the song generating method may include, but is not limited to, the following steps:
Step S101, acquiring voice audio input by a target user and a unique identification number of the target song.
The target user refers to a user who uses the song generating method. The voice audio refers to audio data input by the target user, and the voice audio may be audio data of the target user or audio data of other users, which is not limited. The target song refers to the song to be generated by the song generating method. The unique identification number refers to identification information, such as a number or a name, corresponding to the target song.
It will be appreciated that the number of candidate target songs may be plural, and accurate positioning of the target song during song generation can be achieved once the unique identification number of the target song is obtained.
In the embodiment of the present disclosure, when the voice audio input by the target user is acquired, an audio acquisition device may be configured in advance in the execution body of the embodiment of the present disclosure, and the voice audio of the target user is then acquired by the audio acquisition device; alternatively, a data interface may be configured in advance in the execution body of the embodiment of the present disclosure, a song generation request is received via the data interface, and the voice audio is then obtained by parsing the song generation request, which is not limited.
In the embodiment of the disclosure, when the unique identification number of the target song is obtained, a relationship table may be adopted, in which the unique identification number corresponding to the target song may be recorded, or a database may be established in advance based on the mapping relationship between a plurality of target songs and the unique identification number of the target song, and then the corresponding unique identification number is obtained from the database based on the target song, which is not limited.
Step S102, extracting the Mel spectrum characteristics of the voice audio to obtain the real Mel spectrum characteristics of the target user.
The Mel spectrum is a spectrogram extracted from audio data and belongs to a logarithmic spectrum. The Mel spectrum characteristics refer to the feature information corresponding to the Mel spectrum. It will be appreciated that the pitch heard by the human ear is not linearly related to the actual frequency in Hz, and the Mel spectrum characteristics are more consistent with the auditory characteristics of the human ear. The real Mel spectrum characteristics refer to the Mel spectrum characteristics obtained from the above-mentioned voice audio.
In the embodiment of the disclosure, when the Mel spectrum feature extraction is performed on the voice audio to obtain the real Mel spectrum feature of the target user, the feature extraction of the voice audio can be realized, so that reliable reference data is provided for the song generation process.
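A minimal sketch of this extraction step, assuming the librosa library is used and assuming illustrative parameter values (sampling rate, frame length, hop length, number of mel bands); the disclosure itself does not prescribe a particular library or parameter set.

```python
import librosa
import numpy as np

def extract_mel_features(audio_path: str,
                         sample_rate: int = 22050,
                         n_fft: int = 1024,
                         hop_length: int = 256,
                         n_mels: int = 80) -> np.ndarray:
    """Return a (n_mels, n_frames) log-mel spectrogram for one voice clip."""
    waveform, sr = librosa.load(audio_path, sr=sample_rate)
    mel = librosa.feature.melspectrogram(
        y=waveform, sr=sr, n_fft=n_fft, hop_length=hop_length, n_mels=n_mels)
    # Log compression is typically applied before feeding the features to a model.
    return librosa.power_to_db(mel, ref=np.max)
```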
Step S103, obtaining a song template corresponding to the unique identification number according to the unique identification number of the target song.
The song template refers to a template describing relevant information of a target song.
Optionally, in some embodiments, the song template may include lyric text information and song melody information, where the lyric text information includes a phoneme sequence and a phoneme duration, and the song melody information includes a song note sequence and a song energy sequence, so that the characterizing content in the song template may be enriched to a greater extent, so as to provide comprehensive reference information of the target song for the song generating model, so as to effectively improve the applicability of the song template in the song generating process.
The lyric text information refers to the text information of the lyrics corresponding to the target song. The song melody information may be used to describe the related information corresponding to the melody of the target song. A phoneme refers to the smallest phonetic unit divided according to the natural attributes of speech, and a phoneme sequence refers to a sequence composed of a plurality of phonemes. The phoneme duration refers to the duration information corresponding to a phoneme. A note is a symbol used to record tones of different lengths, and the song note sequence refers to a sequence formed by the notes corresponding to the song audio.
The energy may refer to energy contained in song audio, such as sound intensity, and a song energy sequence may be used to describe energy changes corresponding to song audio corresponding to different time points.
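To make the composition of the song template concrete, the following sketch models the fields described above as a simple data structure; the class name and field names are illustrative assumptions rather than names used by the disclosure.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class SongTemplate:
    song_id: str                  # unique identification number of the target song
    phonemes: List[str]           # phoneme sequence parsed from the song lyrics
    phoneme_durations: List[int]  # number of audio frames occupied by each phoneme
    note_sequence: List[int]      # note number per frame (song melody information)
    energy_sequence: List[int]    # quantized energy code per frame
```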
In the embodiment of the disclosure, when the song template corresponding to the unique identification number of the target song is obtained according to the unique identification number of the target song, a plurality of song templates may be obtained in advance, and then matching processing is performed based on the unique identification number and the plurality of song templates, so as to obtain the song template corresponding to the unique identification number, or the song template corresponding to the unique identification number may be obtained by the third party search device according to the unique identification number of the target song, which is not limited.
Step S104, inputting the real Mel spectrum characteristics of the target user and a song template into a preset song generation model to obtain target Mel spectrum characteristics output by the song generation model, wherein the song generation model is obtained by machine learning training by using a training set, the training set is from a plurality of sampling users, the training set comprises a plurality of samples, each sampling user corresponds to at least one sample, and each sample comprises singing audio picked up when the sampling user sings a certain song and lyric text corresponding to the singing audio.
The song generation model is a model used to process the real Mel spectrum characteristics and the song template and to output the target Mel spectrum characteristics. The song generation model may be a neural network model. The target Mel spectrum characteristics refer to the Mel spectrum characteristics obtained after the song generation model processes the real Mel spectrum characteristics of the target user and the song template. The training set refers to the sample set used in the training process of the song generation model.
Wherein, the sampling user refers to a user who provides a sample for the training process of the song generating model. And a sample may refer to singing audio and lyrics text used for model training. And singing audio refers to audio picked up by sampling when a user sings a certain song.
Optionally, in some embodiments, the song generating model includes a tone color coding sub-model, a text coding sub-model and an acoustic decoding sub-model, and the song generating model is obtained by jointly training the tone color coding sub-model, the text coding sub-model and the acoustic decoding sub-model by using the same training set, so that the structural rationality of the song generating model can be effectively improved, and when the tone color coding sub-model, the text coding sub-model and the acoustic decoding sub-model are jointly trained by using the same training set, the consistency among all the sub-models can be effectively improved, thereby effectively improving the output accuracy of the obtained song generating model.
The tone color refers to different characteristics of different sounds in terms of waveforms, and different object vibrations have different characteristics, that is, different tone colors of different users are different. The timbre coding submodel refers to a model used for processing the real mel spectrum characteristics to obtain timbre characteristic vectors of a target user.
The text coding sub-model refers to a model used for processing a phoneme sequence to obtain a text feature vector corresponding to a target song.
The acoustic decoding submodel refers to a model used for processing a plurality of feature information to obtain a target mel spectrum feature, and the acoustic decoding submodel can be a decoder of a fast end-to-end non-autoregressive synthesis system.
In the embodiment of the disclosure, when the real Mel-spectrum characteristics of the target user and the song template are input into the preset song generation model to obtain the target Mel-spectrum characteristics output by the song generation model, the real Mel-spectrum characteristics and the related information of the song template can be fused rapidly and accurately based on the song generation model, so that the model generation efficiency is effectively improved.
Step S105, generating a target song according to the target Mel spectrum characteristics.
When the target song is generated according to the target Mel spectrum characteristics, the target Mel spectrum characteristics are input into a vocoder, and the vocoder analyzes and processes the target Mel spectrum characteristics to obtain the target song.
For example, when generating a target song according to the target mel spectrum feature, the target mel spectrum feature may be input into a preset vocoder model to obtain a target linear spectrum feature, and the linear spectrum feature is subjected to inverse fourier transform to obtain audio data of the target song.
The vocoder model is a neural network model, and the vocoder model is also obtained by machine learning training, using a training set different from that of the song generation model. The vocoder model may be based on a generative adversarial network (GAN), a distillation-free adversarial generative network, or the like, and the training set may be a training set commonly used in the art.
The training process of the vocoder model may be as follows: input the real Mel spectrum characteristics in one sample into the built initial model to obtain predicted linear spectrum characteristics, calculate the error between the predicted linear spectrum characteristics and the real linear spectrum characteristics in the sample through a loss function, modify the weights of the initial model according to the error, and repeat this process with further samples until the loss function converges, so as to obtain the trained vocoder model.
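A minimal sketch of the mel-to-waveform step, assuming a PyTorch-style neural vocoder; the vocoder interface shown here is a hypothetical stand-in, since the disclosure does not fix a particular vocoder implementation.

```python
import numpy as np
import torch

def mel_to_waveform(target_mel: np.ndarray, vocoder: torch.nn.Module) -> np.ndarray:
    """Convert target mel features of shape (n_mels, n_frames) to audio samples.

    `vocoder` stands for any trained neural vocoder mapping mel spectra to
    waveforms (e.g. a GAN-based model); its exact interface will differ per
    implementation, so this call is only illustrative.
    """
    with torch.no_grad():
        mel = torch.from_numpy(target_mel).float().unsqueeze(0)  # add batch dimension
        audio = vocoder(mel)                                     # assumed shape (1, n_samples)
    return audio.squeeze(0).cpu().numpy()
```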
In the embodiments of the disclosure, voice audio input by a target user and the unique identification number of the target song are obtained; Mel spectrum feature extraction is performed on the voice audio to obtain the real Mel spectrum characteristics of the target user; a song template corresponding to the unique identification number is obtained according to the unique identification number of the target song; the real Mel spectrum characteristics of the target user and the song template are input into a preset song generation model to obtain target Mel spectrum characteristics output by the song generation model; and the target song is generated according to the target Mel spectrum characteristics. The real Mel spectrum characteristics of the target user and the song template corresponding to the target song can thus be effectively combined in the song generation process, so that the dependence on the amount of user voice data is effectively reduced, and song generation convenience is improved while the song generation effect is effectively improved.
Fig. 2 is a flow chart illustrating a song generating method according to another embodiment of the present disclosure.
As shown in fig. 2, the song generating method may include, but is not limited to, the following steps:
Step S201, obtaining the voice audio input by the target user and the unique identification number of the target song.
Step S202, extracting the Mel spectrum characteristics of the voice audio to obtain the real Mel spectrum characteristics of the target user.
Step S203, obtaining a song template corresponding to the unique identification number according to the unique identification number of the target song.
The descriptions of step S201 to step S203 may be specifically referred to the above embodiments, and are not repeated herein.
Step S204, inputting the real Mel spectrum characteristics of the target user into a tone coding submodel to obtain tone characteristic vectors of the target user.
The tone color feature vector refers to a vector used for representing the tone color feature corresponding to the target user.
Step S205, inputting the phoneme sequence into a text coding submodel to obtain text feature vectors of the lyric text in the song template.
The lyric text refers to text data describing corresponding lyric information of a target song in a song template.
The text feature vector refers to a vector used for representing corresponding text features of the lyric text.
Optionally, in some embodiments, the song template is configured by a phoneme sequence, a phoneme duration, a song note sequence, a song energy sequence of the target song, and a unique identification number of the target song, where the phoneme sequence and the phoneme duration of the target song are determined by song audio and song lyrics of the target song, and the song note sequence and the song energy sequence of the target song are determined by song audio, so that quick positioning of the target song can be achieved based on the unique identification number, so as to effectively improve the practicability of the obtained song template, and meanwhile, effectively improve the characterization accuracy of the song template on relevant information of the target lyrics.
The song audio refers to singing audio corresponding to the target song. And song lyrics refer to lyric information corresponding to a target song.
Optionally, in some embodiments, the phoneme sequence includes a plurality of phonemes obtained by parsing song lyrics, and the phoneme duration includes a first frame number occupied by each phoneme in song audio, so that suitability between the obtained phoneme sequence and the song lyrics can be effectively improved, and meanwhile accuracy of the obtained first frame number on the corresponding phonemes can be effectively improved.
The first frame number refers to the number of audio frames corresponding to a phoneme in the song audio.
Optionally, in some embodiments, the song energy sequence is obtained by quantization processing of a song energy feature of the song audio, and the song note sequence is obtained by quantization processing of a song fundamental frequency feature of the song audio, so that the clarity of characterization of the obtained song energy sequence and the song note sequence on the song energy feature and the song fundamental frequency feature can be effectively improved based on quantization processing, and meanwhile, the obtained song energy sequence and the song note sequence can provide reliable reference data for a subsequent calculation process.
Wherein the song energy characteristics may be used to describe the relevant characteristics corresponding to the song energy. And the fundamental frequency characteristics of the song may be used to describe the corresponding relevant characteristics of the fundamental frequency of the song.
Optionally, in some embodiments, the song energy features include a plurality of energy values, and the song energy sequence is formed from a plurality of range coding values, where a range coding value is obtained by performing one-hot encoding on the energy range corresponding to an energy value. The song energy features can be effectively expanded based on the one-hot encoding, so that the plurality of energy values are distinguished based on the obtained plurality of range coding values, and when the song energy sequence is formed from the plurality of range coding values, the characterization effect of the obtained song energy sequence on the song energy features can be effectively improved.
The energy value may refer to a value corresponding to energy of the song. The energy range refers to the value range corresponding to the energy value, such as 0-10.
One-hot encoding, which may also be referred to as one-bit-effective encoding, may use an N-bit state register to encode N states; each state has its own register bit, and at any time only one of the bits is valid.
The range coding value refers to the code value obtained by applying one-hot encoding to the energy range.
For example, suppose six states are to be encoded, and their natural binary codes are 000, 001, 010, 011, 100 and 101.
The corresponding one-hot codes may be configured as 000001, 000010, 000100, 001000, 010000 and 100000.
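A small sketch of this range-based one-hot encoding; the number of ranges and the maximum energy value are illustrative assumptions, since the disclosure only requires that each energy value be mapped to an energy range and that the range be one-hot encoded.

```python
import numpy as np

def one_hot_energy_code(energy_value: float, n_ranges: int = 6,
                        max_energy: float = 60.0) -> np.ndarray:
    """Map an energy value to the one-hot code of the range it falls into.

    The number of ranges and the maximum energy are illustrative assumptions;
    values above max_energy are clipped into the last range.
    """
    width = max_energy / n_ranges
    index = min(int(energy_value // width), n_ranges - 1)
    code = np.zeros(n_ranges, dtype=np.int64)
    code[index] = 1
    return code

# With 6 ranges of width 10, an energy value of 23.5 falls into the third range.
print(one_hot_energy_code(23.5))  # [0 0 1 0 0 0]
```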
Optionally, in some embodiments, the song fundamental frequency feature includes a plurality of fundamental frequency values, and the song note sequence includes a note number corresponding to each fundamental frequency value, so that the song note sequence can effectively combine the corresponding relationship between the fundamental frequency values and the note numbers to adapt to the personalized application scenario, thereby effectively improving the applicability of the obtained song note sequence in the song generation process.
The fundamental frequency value refers to the value corresponding to the fundamental frequency of the song. The note number is a number corresponding to a note and can be obtained from a related database in the music domain.
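A sketch of quantizing a fundamental frequency value to a note number, assuming the common MIDI numbering convention (A4 = 440 Hz maps to 69); the disclosure does not mandate this particular numbering, so the formula and the rest code are illustrative assumptions.

```python
import math

def f0_to_note_number(f0_hz: float) -> int:
    """Quantize a fundamental-frequency value (Hz) to a note number.

    MIDI numbering (A4 = 440 Hz -> 69) is used here as one common convention;
    an unvoiced frame (f0 == 0) is mapped to a reserved rest code 0.
    """
    if f0_hz <= 0.0:
        return 0
    return int(round(69 + 12 * math.log2(f0_hz / 440.0)))

print(f0_to_note_number(440.0))  # 69 (A4)
print(f0_to_note_number(261.6))  # 60 (approximately C4)
```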
For example, as shown in fig. 3, fig. 3 is a schematic diagram of a song template generation process according to an embodiment of the present disclosure. The initial data of the song template may include the song audio and the song lyrics corresponding to the target song, and the song template generation process may include: (1) processing the song lyrics based on a text transcription method to obtain the phoneme sequence corresponding to the target song; (2) processing the obtained phoneme sequence and the song audio based on a forced alignment method to obtain the phoneme duration of the target song, where the phoneme duration may be obtained by the forced alignment method alone, or may be manually calibrated after the forced alignment operation to improve the accuracy of the obtained phoneme duration; (3) processing the song audio based on an acoustic feature extraction method to obtain the song energy features and the song fundamental frequency features corresponding to the target song, and then changing the values of the song energy and pitch based on energy track translation and fundamental frequency track translation to improve the flexibility of the song template; (4) quantizing the song energy features and the song fundamental frequency features to obtain the song energy sequence and the song note sequence; and (5) generating the song template based on the phoneme sequence, the phoneme duration, the song energy sequence, the song note sequence and the unique identification number of the target song.
Step S206, performing duration normalization on the text feature vector and the tone feature vector according to the phoneme duration to obtain a frame-level text feature vector and a frame-level tone feature vector.
The frame-level text feature vector refers to a vector describing text features corresponding to a plurality of audio frames. And a frame-level tone color feature vector refers to a vector of tone color features corresponding to a plurality of audio frames.
It can be understood that the same phoneme may span a plurality of audio frames, and the plurality of audio frames corresponding to the same phoneme have a high similarity. When the text feature vector and the tone feature vector are duration-normalized according to the phoneme duration to obtain the frame-level text feature vector and the frame-level tone feature vector, the phoneme-level text feature vector and the tone feature vector can be converted into the frame-level text feature vector and the frame-level tone feature vector by duplication, so that the frame-level text feature vector and the frame-level tone feature vector can subsequently be added together.
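A minimal sketch of this duplication-based duration normalization, together with the frame-level copying of the tone feature vector described later; the function names and tensor shapes are illustrative assumptions.

```python
import torch

def regulate_length(phoneme_vectors: torch.Tensor,
                    phoneme_durations: torch.Tensor) -> torch.Tensor:
    """Expand phoneme-level vectors to frame level by duplication.

    phoneme_vectors:   (n_phonemes, dim) text feature vectors
    phoneme_durations: (n_phonemes,) number of frames per phoneme
    returns:           (n_frames, dim) frame-level text feature vectors
    """
    return torch.repeat_interleave(phoneme_vectors, phoneme_durations, dim=0)

def broadcast_timbre(timbre_vector: torch.Tensor, n_frames: int) -> torch.Tensor:
    """Copy one utterance-level tone feature vector of shape (dim,) to every frame."""
    return timbre_vector.unsqueeze(0).expand(n_frames, -1)
```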
Step S207, adding the frame-level text feature vector, the frame-level tone feature vector and the song melody information, and inputting the added result into an acoustic decoding submodel to obtain the target Mel spectrum feature.
The addition refers to an element-wise addition over dimensions; assuming the frame-level text feature vector, the frame-level tone feature vector, the song note sequence and the song energy sequence are all 10-dimensional, the values of the corresponding dimensions are added together.
That is, in the embodiment of the disclosure, after the song template corresponding to the unique identification number of the target song is obtained, the real Mel spectrum characteristics of the target user may be input into the tone coding submodel to obtain the tone feature vector of the target user, and the phoneme sequence is input into the text coding submodel to obtain the text feature vector of the lyric text in the song template. The text feature vector and the tone feature vector are then duration-normalized according to the phoneme durations to obtain the frame-level text feature vector and the frame-level tone feature vector, and the frame-level text feature vector, the frame-level tone feature vector and the song melody information are added and input into the acoustic decoding submodel to obtain the target Mel spectrum characteristics. In this way, feature extraction of the real Mel spectrum characteristics and the phoneme sequence can be realized quickly based on the tone coding submodel and the text coding submodel, the corresponding tone features and text features are quantized in the form of vectors, duration normalization is performed based on the phoneme durations, and the consistency between the obtained frame-level text feature vector and frame-level tone feature vector can be effectively improved, so that the processing effect of the acoustic decoding submodel on the frame-level text feature vector and the frame-level tone feature vector is effectively improved.
Step S208, generating a target song according to the target Mel spectrum characteristics.
The description of step S208 may be specifically referred to the above embodiments, and will not be repeated here.
In this embodiment, the real Mel spectrum characteristics of the target user are input into the tone coding submodel to obtain the tone feature vector of the target user, and the phoneme sequence is input into the text coding submodel to obtain the text feature vector of the lyric text in the song template. The text feature vector and the tone feature vector are duration-normalized according to the phoneme durations to obtain the frame-level text feature vector and the frame-level tone feature vector, and the frame-level text feature vector, the frame-level tone feature vector and the song melody information are added and input into the acoustic decoding submodel to obtain the target Mel spectrum characteristics. Thereby, feature extraction of the real Mel spectrum characteristics and the phoneme sequence can be realized quickly based on the tone coding submodel and the text coding submodel, the corresponding tone features and text features are quantized in the form of vectors and then duration-normalized based on the phoneme durations, and the consistency between the obtained frame-level text feature vector and frame-level tone feature vector can be effectively improved, so that the processing effect of the acoustic decoding submodel on the frame-level text feature vector and the frame-level tone feature vector is effectively improved.
Fig. 4 is a flowchart illustrating a song generating method according to another embodiment of the present disclosure.
As shown in fig. 4, the song generating method may include, but is not limited to, the following steps:
Step S401, acquiring voice audio input by a target user and a unique identification number of the target song.
Step S402, extracting the Mel spectrum characteristics of the voice audio to obtain the real Mel spectrum characteristics of the target user.
Step S403, obtaining the song template corresponding to the unique identification number according to the unique identification number of the target song.
The descriptions of step S401 to step S403 may be specifically referred to the above embodiments, and are not repeated herein.
Step S404, inputting the real Mel spectrum characteristics of the target user into a reference encoder to obtain the tone color hidden space distribution vector of the target user.
The reference encoder may be an encoder that is used to process the real mel spectrum feature to obtain a timbre hidden space distribution vector, and the timbre hidden space distribution vector output by the reference encoder may be a hidden layer variable corresponding to the real mel spectrum feature.
It can be understood that the tone color hidden space distribution vector follows a spherical gaussian distribution, and in the embodiment of the disclosure, the reference encoder can output the mean value and the variance corresponding to the spherical gaussian distribution while outputting the tone color hidden space distribution vector of the target user.
Step S405, inputting the tone color hidden space distribution vector into an autoregressive encoder to obtain a tone color distribution vector of a target user, wherein the tone color distribution vector is obtained by sampling the tone color hidden space distribution vector by the autoregressive encoder.
The autoregressive encoder is used for processing the tone color hidden space distribution vector to obtain the tone color distribution vector.
It will be appreciated that the above-described reference encoder and autoregressive encoder structures may be multi-layered linear layers or convolutional layers, without limitation.
Step S406, taking the tone color distribution vector as a tone color feature vector of the target user.
The tone color coding submodel comprises the reference encoder and the autoregressive encoder. After the song template corresponding to the unique identification number of the target song is acquired according to the unique identification number, the real Mel spectrum characteristics of the target user can be input into the reference encoder to obtain the tone color hidden space distribution vector of the target user, and the tone color hidden space distribution vector is input into the autoregressive encoder to obtain the tone color distribution vector of the target user, where the tone color distribution vector is obtained by sampling the tone color hidden space distribution vector by the autoregressive encoder, and the tone color distribution vector is used as the tone color feature vector of the target user. Thereby, redundant information in the obtained tone color feature vector can be effectively reduced, and the relatively complex real Mel spectrum characteristics are converted into vector form, so that the practicability of the obtained tone color feature vector is effectively improved.
For example, as shown in fig. 5, fig. 5 is a schematic diagram of a timbre coding submodel according to an embodiment of the disclosure, where the random sampling point ε refers to a random sampling point drawn from a standard Gaussian distribution, which may be represented as ε ~ N(0, I).
After receiving the real Mel spectrum characteristics, the tone color coding submodel can obtain a tone color hidden space distribution vector h and two parameters through the processing of the reference encoder; the two parameters can be used as the mean a1 and the variance b1 of a Gaussian distribution. The random sampling point ε is combined with the mean a1 and the variance b1 to obtain a random sampling point z of the approximate posterior distribution, z = b1 · ε + a1, where · denotes element-wise multiplication. The random sampling point z and the tone color hidden space distribution vector h are then processed by the autoregressive encoder to obtain a mean a2 and a variance b2 corresponding to the random sampling point z, and the tone color feature vector s is obtained from the random sampling point z, the mean a2 and the variance b2 as s = b2 · z + a2.
It will be appreciated that the timbre coding submodel may perform a sampling process based on an inverse autoregressive flow (IAF), which belongs to the normalizing flows. A normalizing flow can produce a distribution that is easy to sample from: it transforms a complex input distribution into an easy-to-handle probability distribution through a series of reversible transformation operations, and the output distribution is typically chosen to be an isotropic unit Gaussian distribution, i.e. a spherical unit Gaussian distribution, which allows smooth interpolation and efficient sampling. The tone feature vector is learned by means of the inverse autoregressive flow, and the generated tone color hidden space distribution vector h can be made to obey a spherical Gaussian distribution, so that the tone feature vector can be obtained by sampling from this distribution, and a more accurate vector distribution can still be obtained through learning for users not seen during training. In both the training and inference stages, sampling from the spherical Gaussian distribution is used to represent the tone feature vector, which ensures consistency between training and inference and better adapts to users outside the training set. At the same time, the tone color hidden space distribution vector h corresponding to a user is sampled rather than averaged, which further increases the convergence of the user space and allows smoother interpolation between tone feature vectors; that is, the tone feature vector of a user outside the training set can be learned from a single utterance of that user.
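A highly simplified sketch of the sampling described above (reparameterization followed by a single IAF-style affine step), assuming utterance-level pooling and linear layers as stand-ins for the reference encoder and autoregressive encoder; the module structure and all dimensions are illustrative assumptions.

```python
import torch
import torch.nn as nn

class TimbreEncoderSketch(nn.Module):
    """Illustrative sketch: reference encoder plus one IAF-style affine step."""

    def __init__(self, n_mels: int = 80, hidden_dim: int = 128, timbre_dim: int = 64):
        super().__init__()
        self.reference_encoder = nn.Sequential(       # stands in for conv/linear layers
            nn.Linear(n_mels, hidden_dim), nn.ReLU())
        self.to_mean_logvar = nn.Linear(hidden_dim, 2 * timbre_dim)
        self.autoregressive = nn.Linear(hidden_dim + timbre_dim, 2 * timbre_dim)

    def forward(self, mel: torch.Tensor) -> torch.Tensor:
        # mel: (n_frames, n_mels) real mel spectrum features of one utterance
        h = self.reference_encoder(mel).mean(dim=0)           # utterance-level hidden vector
        a1, log_b1 = self.to_mean_logvar(h).chunk(2, dim=-1)  # mean and log-variance
        eps = torch.randn_like(a1)                            # eps ~ N(0, I)
        z = torch.exp(log_b1) * eps + a1                      # z = b1 * eps + a1
        a2, log_b2 = self.autoregressive(torch.cat([h, z], dim=-1)).chunk(2, dim=-1)
        s = torch.exp(log_b2) * z + a2                        # s = b2 * z + a2, tone feature vector
        return s
```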
Step S407, inputting the phoneme sequence into a text coding submodel to obtain text feature vectors of the lyric text in the song template.
The description of step S407 may be specifically referred to the above embodiments, and will not be repeated here.
Step S408, determining an initial text code corresponding to each phoneme in the sequence of phonemes from the text feature vector.
The initial text code refers to a text code contained in the text feature vector.
In the embodiment of the disclosure, when determining the initial text codes corresponding to each phoneme in the phoneme sequence from the text feature vector, reliable reference data can be provided for the subsequent determination of the target text codes.
Step S409, determining a first frame number corresponding to the phonemes according to the phoneme duration.
The first frame number refers to the number of audio frames corresponding to each phoneme. For example, the phoneme duration of one phoneme may be 25 ms; if one audio frame is set to 5 ms, that phoneme corresponds to 5 frames of information.
And step S410, copying the initial text codes, and performing splicing processing on the initial text codes of the first frame number obtained by copying to obtain target text codes.
The target text code refers to a text code obtained by splicing initial text codes of a first frame number.
It can be understood that the phoneme duration corresponding to one phoneme may be smaller, so that more redundant information may exist between a plurality of audio frames corresponding to the same phoneme, and when the initial text codes are copied and the initial text codes of the first frame number obtained by the copying are spliced to obtain the target text codes, the practicability of the obtained target text codes can be effectively improved.
Step S411, forming a frame-level text feature vector according to a plurality of target text codes.
That is, in the embodiment of the disclosure, after inputting a phoneme sequence into a text coding submodel to obtain text feature vectors of lyrics text in a song template, an initial text code corresponding to each phoneme in the phoneme sequence may be determined from the text feature vectors, a first frame number corresponding to the phoneme is determined according to a phoneme duration, the initial text codes are copied, the initial text codes of the first frame number obtained by copying are spliced to obtain a target text code, and a frame-level text feature vector is formed according to a plurality of target text codes.
Step S412, determining a second frame number of the voice audio according to the phoneme duration.
The second frame number refers to the frame number of the voice audio determined based on the phoneme duration.
Step S413, copying the tone characteristic vector, and performing splicing processing on the tone characteristic vector of the second frame number obtained by copying to obtain a frame-level tone characteristic vector.
That is, after the frame-level text feature vector is formed according to the multiple target text codes, the embodiment of the disclosure may determine the second frame number of the voice audio according to the phoneme duration, copy the tone feature vector, and splice the tone feature vector of the second frame number obtained by copying to obtain the frame-level tone feature vector, so that the obtained frame-level tone feature vector may effectively represent relevant feature information of the voice to be processed from the magnitude of the audio frame, so as to effectively improve the suitability between the obtained frame-level tone feature vector and the frame-level text feature vector, and simultaneously effectively improve the characterization effect of the obtained frame-level tone feature vector.
Step S414, adding the frame-level text feature vector, the frame-level tone feature vector and the song melody information, and inputting the added result into an acoustic decoding submodel to obtain the target Mel spectrum feature.
Step S415, generating a target song according to the target Mel spectrum characteristics.
The descriptions of step S414 to step S415 may be specifically referred to the above embodiments, and are not repeated herein.
For example, as shown in fig. 6, fig. 6 is a schematic song generation flow chart according to an embodiment of the present disclosure. After a new user provides a piece of voice audio, the operation flow corresponding to the song generation model may include: (1) processing the voice audio based on an acoustic feature extraction method to obtain the real Mel spectrum characteristics; (2) inputting the real Mel spectrum characteristics into the timbre coding submodel to obtain the timbre feature vector; (3) inputting the phoneme sequence in the song template into the text coding submodel to obtain the text feature vector of the target song; (4) inputting the phoneme duration, the timbre feature vector and the text feature vector into the duration normalization submodel to obtain the frame-level text feature vector and the frame-level timbre feature vector; (5) adding the frame-level text feature vector, the frame-level timbre feature vector, the song note sequence and the song energy sequence, and inputting the result into the acoustic decoding submodel to obtain the target Mel spectrum characteristics; and (6) inputting the obtained target Mel spectrum characteristics into a vocoder to obtain the target song. The vocoder may be a neural network vocoder.
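A pseudocode-level sketch that chains the illustrative helpers defined earlier in this description (extract_mel_features, regulate_length, broadcast_timbre, mel_to_waveform); the `template` and `models` objects and their attribute names are assumptions made only to show how the submodels might fit together, not an interface defined by the disclosure.

```python
import torch

def generate_song_sketch(voice_audio_path: str, template, models):
    """Illustrative end-to-end flow mirroring fig. 6.

    Assumptions: all frame-level features share one dimensionality, the total
    phoneme duration equals the length of the melody sequences, and `models`
    bundles trained timbre_encoder, text_encoder, embed_melody,
    acoustic_decoder and vocoder modules.
    """
    real_mel = torch.from_numpy(extract_mel_features(voice_audio_path)).T.float()  # (n_frames, n_mels)
    timbre_vec = models.timbre_encoder(real_mel)                       # (dim,)
    text_vecs = models.text_encoder(template.phonemes)                 # (n_phonemes, dim)
    durations = torch.tensor(template.phoneme_durations)
    frame_text = regulate_length(text_vecs, durations)                 # (n_frames, dim)
    frame_timbre = broadcast_timbre(timbre_vec, frame_text.shape[0])   # (n_frames, dim)
    melody = models.embed_melody(template.note_sequence,
                                 template.energy_sequence)             # (n_frames, dim)
    target_mel = models.acoustic_decoder(frame_text + frame_timbre + melody)  # (n_frames, n_mels)
    return mel_to_waveform(target_mel.T.detach().numpy(), models.vocoder)
```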
That is, in the song generation process of the embodiment of the disclosure, a plurality of users can share the pre-trained song generation model, and a song performed in a user's own voice can be obtained based on only a short piece of that user's audio, so that convenience in the song generation process is effectively improved, computing resources are effectively reduced, and storage cost is reduced.
In this embodiment, the real Mel spectrum characteristics of the target user are input into the reference encoder to obtain the tone color hidden space distribution vector of the target user, and the tone color hidden space distribution vector is input into the autoregressive encoder to obtain the tone color distribution vector of the target user, where the tone color distribution vector is obtained by sampling the tone color hidden space distribution vector by the autoregressive encoder, and the tone color distribution vector is used as the tone color feature vector of the target user. Thereby, redundant information in the obtained tone color feature vector is effectively reduced, and the relatively complex real Mel spectrum characteristics are converted into vector form, which effectively improves the practicability of the obtained tone color feature vector. The initial text code corresponding to each phoneme in the phoneme sequence is determined from the text feature vector, the first frame number corresponding to the phoneme is determined according to the phoneme duration, the initial text code is copied, and the copied initial text codes of the first frame number are spliced to obtain the target text code; the frame-level text feature vector is then formed from a plurality of target text codes. Since the time range corresponding to each phoneme is small and the contents represented by different audio frames within the same phoneme have a high similarity, copying the initial text code and splicing the copied initial text codes of the first frame number to obtain the target text code can reduce the computation cost to a large extent, thereby effectively improving the efficiency of determining the frame-level text feature vector. The second frame number of the voice audio is determined according to the phoneme durations, the tone feature vector is copied, and the copied tone feature vectors of the second frame number are spliced to obtain the frame-level tone feature vector. Thereby, the obtained frame-level tone feature vector can effectively represent the relevant feature information of the voice to be processed at the granularity of audio frames, so as to effectively improve the suitability between the obtained frame-level tone feature vector and the frame-level text feature vector, and effectively improve the characterization effect of the obtained frame-level tone feature vector.
Fig. 7 is a flowchart of a training method of a song generating model according to an embodiment of the present disclosure.
It should be noted that, the execution body of the training method of the song generating model in this embodiment is a training device of the song generating model, and the device may be implemented in a software and/or hardware manner, and the device may be configured in an electronic device, where the electronic device may include, but is not limited to, a terminal, a server, and the like, and the terminal may be, for example, a smart phone, a smart television, a smart watch, a smart car, and the like.
As shown in fig. 7, the training method of the song generating model may include, but is not limited to, the following steps:
Step S701, a training set is obtained, wherein the training set is from a plurality of sampling users, the training set comprises a plurality of samples, each sampling user corresponds to at least one sample, and each sample comprises singing audio picked up when the sampling user sings a certain song and lyric text corresponding to the singing audio.
In the embodiment of the present disclosure, when the training set is acquired, the communication link between the execution body and the big data server in the embodiment of the present disclosure may be pre-established, and then the training set is acquired from the big data server, or the training set may be acquired from a plurality of sampling users based on the sample collection device, which is not limited.
Step S702, acquiring a pre-built initial neural network model, wherein the initial neural network model comprises initial weight parameters and a loss function.
Here, a neural network model is a complex network system formed by a large number of simple processing units (called neurons) that are widely interconnected, and it reflects many basic features of human brain function. The initial neural network model refers to the neural network model to be trained. The initial weight parameters refer to the weight parameters to be iteratively updated during model training. The loss function may be used to describe the error information between the predicted Mel spectrum characteristics output by the initial neural network model and the real Mel spectrum characteristics during the training process.
In the embodiment of the disclosure, the performance of the model can be evaluated in real time in the model training process based on the loss function, and whether the model converges or not can be timely judged.
Step S703, obtaining a first sample from the training set, inputting the first sample into the initial neural network model to obtain a real Mel spectrum feature and a predicted Mel spectrum feature, wherein the real Mel spectrum feature represents the Mel spectrum feature of singing audio in the first sample, and the predicted Mel spectrum feature represents the Mel spectrum feature predicted by the initial neural network model.
Wherein, the first sample refers to a sample used for model training among a plurality of samples in the training set.
When acquiring the first sample from the training set, the embodiment of the disclosure may randomly select one sample from the training set as the first sample, or may acquire the first sample from the training set based on the numbering information of the plurality of samples in the training set, which is not limited.
Optionally, in some embodiments, when the first sample is input into the initial neural network model to obtain the real mel spectrum feature and the predicted mel spectrum feature, text transcription may be performed on the lyric text in the first sample to obtain a phoneme sequence, and the singing audio in the first sample is aligned according to the phoneme sequence to obtain a phoneme duration. Acoustic feature extraction is performed on the singing audio in the first sample to obtain the real mel spectrum feature, the audio energy and the fundamental frequency track of the first sample. The phoneme sequence is input into an initial text coding sub-model to obtain a text feature vector of the first sample, and the real mel spectrum feature of the first sample is input into an initial timbre coding sub-model to obtain a timbre feature vector of the first sample. Duration regulation is performed on the text feature vector and the timbre feature vector according to the phoneme duration to obtain a frame-level text feature vector and a frame-level timbre feature vector. The frame-level text feature vector, the frame-level timbre feature vector, the audio energy and the fundamental frequency track are added and then input into the initial acoustic decoding sub-model to obtain the predicted mel spectrum feature of the first sample, so that the multiple features of the first sample can be accurately converted into the predicted mel spectrum feature and the multi-feature conversion process is realized at the same time.
The initial text coding sub-model refers to a text coding sub-model to be subjected to model training. The initial timbre coding submodel refers to a timbre coding submodel to be subjected to model training. The initial acoustic decoding sub-model refers to an acoustic decoding sub-model to be model trained.
The audio energy refers to energy information corresponding to singing audio in a first sample.
The track of the fundamental frequency refers to track information of the singing audio corresponding to the fundamental frequency in the first sample.
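As a rough illustration of the acoustic feature extraction step, the sketch below computes a mel spectrum, a frame-level energy contour and a fundamental frequency track from a singing audio file with librosa; the sampling rate, frame parameters and the choice of librosa itself are assumptions made for illustration, not parameters fixed by this disclosure.

    import librosa
    import numpy as np

    def extract_acoustic_features(wav_path, sr=22050, n_fft=1024, hop=256, n_mels=80):
        y, sr = librosa.load(wav_path, sr=sr)
        # Real mel spectrum feature: log-scaled mel spectrogram, shape (n_mels, frames).
        mel = librosa.feature.melspectrogram(y=y, sr=sr, n_fft=n_fft,
                                             hop_length=hop, n_mels=n_mels)
        log_mel = np.log(mel + 1e-6)
        # Audio energy: root-mean-square energy per frame.
        energy = librosa.feature.rms(y=y, frame_length=n_fft, hop_length=hop)[0]
        # Fundamental frequency track: per-frame F0 estimated with pYIN.
        f0, _, _ = librosa.pyin(y, fmin=librosa.note_to_hz('C2'),
                                fmax=librosa.note_to_hz('C6'),
                                frame_length=n_fft, hop_length=hop)
        return log_mel, energy, f0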
Step S704, calculating the error between the predicted Mel-spectrum characteristic and the real Mel-spectrum characteristic according to the loss function.
Wherein the error may be used to describe difference information between the predicted mel-spectrum feature and the true mel-spectrum feature.
In the embodiment of the disclosure, when calculating the error between the predicted mel-spectrum characteristic and the real mel-spectrum characteristic according to the loss function, the output accuracy of the initial neural network model can be evaluated in real time to determine the model performance, and the obtained error can provide reliable reference data for determining the model optimization direction.
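A common way to realize such a loss function is a frame-wise distance between the predicted and real mel spectrum features; the L1 form used in the sketch below is only one plausible choice and is not mandated by this disclosure, and the tensor shapes are illustrative.

    import torch
    import torch.nn.functional as F

    def mel_loss(predicted_mel, real_mel):
        # Both tensors are assumed to have shape (batch, frames, n_mels)
        # and to already be aligned in time.
        return F.l1_loss(predicted_mel, real_mel)

    pred = torch.randn(2, 200, 80)
    real = torch.randn(2, 200, 80)
    error = mel_loss(pred, real)   # scalar tensor used to drive back-propagation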
Step S705, the initial weight parameters of the initial neural network model are adjusted according to the error to obtain an updated neural network model.
In the embodiment of the disclosure, when the initial weight parameters of the initial neural network model are adjusted according to the errors, the accurate adjustment of the initial weight parameters can be realized based on the errors, so that the training effect of the neural network model is effectively improved.
Step S706, obtaining subsequent samples one by one from the training set, and repeatedly inputting the subsequent samples into the latest neural network model until the loss function converges, so as to obtain a song generating model after training.
The subsequent samples refer to samples in the training set except for the first sample.
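Steps S703 to S706 can be pictured as the gradient-descent loop sketched below. The model, optimizer and data loader names are placeholders, and testing whether the loss has stopped decreasing is a simplified stand-in for whatever convergence criterion an actual implementation uses.

    import torch

    def train_song_model(model, train_loader, num_epochs=100, lr=1e-4, tol=1e-4):
        optimizer = torch.optim.Adam(model.parameters(), lr=lr)
        previous_loss = float('inf')
        for epoch in range(num_epochs):
            for sample in train_loader:                      # first sample, then subsequent samples
                predicted_mel, real_mel = model(sample)      # forward pass of the latest model
                loss = torch.nn.functional.l1_loss(predicted_mel, real_mel)
                optimizer.zero_grad()
                loss.backward()                              # gradient back-propagation of the error
                optimizer.step()                             # adjust the weight parameters
            if abs(previous_loss - loss.item()) < tol:       # treat a stalled loss as convergence
                break
            previous_loss = loss.item()
        return model                                         # trained song generation model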
For example, as shown in fig. 8, fig. 8 is a training flowchart of an initial neural network model according to an embodiment of the present disclosure, where the initial neural network model may include an initial timbre coding sub-model, an initial text coding sub-model, and an initial acoustic decoding sub-model, and the training flow may include:
(1) processing the song lyrics via text transcription to obtain a corresponding phoneme sequence, and processing the obtained phoneme sequence via the initial text coding sub-model to obtain a corresponding text feature vector;
(2) processing the text feature vector and the song phoneme duration based on forced alignment to obtain an initial text code;
(3) processing the song audio based on acoustic feature extraction to obtain a real mel spectrum feature, a song energy feature, and a song fundamental frequency feature;
(4) processing the real mel spectrum feature via the initial timbre coding sub-model to obtain a timbre feature vector;
(5) dividing the plurality of energy values in the song energy feature into different energy ranges (for example, the energy values may be divided into 10 energy ranges or 20 energy ranges depending on the application environment), and performing one-hot encoding on the energy range corresponding to each energy value to obtain a song energy sequence;
(6) quantizing each fundamental frequency value in the song fundamental frequency feature into a note number to obtain a song note sequence; for example, the note number corresponding to the fundamental frequency 261.63 Hz is 60, and the note number corresponding to the fundamental frequency 277.18 Hz is 61;
(7) processing the initial text code based on the song phoneme duration by a duration regulation method to obtain a frame-level text feature vector;
(8) processing the timbre feature vector based on the song phoneme duration by a duration regulation sub-model to obtain a frame-level timbre feature vector;
(9) inputting the frame-level text feature vector, the frame-level timbre feature vector, the song energy sequence and the song note sequence into the initial acoustic decoding sub-model to obtain a predicted mel spectrum feature; and
(10) determining the loss function of the song generation model based on the real mel spectrum feature and the predicted mel spectrum feature. Through the loss function, each weight parameter in the song generation model can be iteratively updated by gradient back-propagation, so that the loss function tends to converge.
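The quantization in steps (5) and (6) can be sketched as follows. The 10-band energy split and the MIDI-style note mapping (440 Hz taken as note 69) reproduce the 261.63 Hz → 60 example given above, but the exact band count and rounding rule are illustrative assumptions rather than values fixed by the disclosure.

    import numpy as np

    def energy_to_onehot(energy_values, num_bands=10):
        """Quantize frame energies into bands and one-hot encode each band index."""
        edges = np.linspace(energy_values.min(), energy_values.max(), num_bands + 1)
        bands = np.clip(np.digitize(energy_values, edges) - 1, 0, num_bands - 1)
        return np.eye(num_bands)[bands]          # (frames, num_bands) song energy sequence

    def f0_to_note(f0_hz):
        """Quantize fundamental frequency values (Hz) to note numbers."""
        f0_hz = np.asarray(f0_hz, dtype=float)
        notes = np.zeros_like(f0_hz, dtype=int)  # keep 0 for unvoiced / zero-F0 frames
        voiced = f0_hz > 0
        notes[voiced] = np.round(69 + 12 * np.log2(f0_hz[voiced] / 440.0)).astype(int)
        return notes

    print(f0_to_note([261.63, 277.18]))          # [60 61]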
In the embodiment of the disclosure, a training set is obtained, where the training set is from a plurality of sampling users, the training set comprises a plurality of samples, one sampling user corresponds to at least one sample, and each sample comprises singing audio picked up when the sampling user sings a certain song and lyric text corresponding to the singing audio; a pre-built initial neural network model is obtained, where the initial neural network model comprises initial weight parameters and a loss function; a first sample is obtained from the training set and input into the initial neural network model to obtain real mel spectrum features and predicted mel spectrum features, where the real mel spectrum features represent the mel spectrum features of the singing audio in the first sample and the predicted mel spectrum features represent the mel spectrum features predicted by the initial neural network model; the error between the predicted mel spectrum features and the real mel spectrum features is calculated according to the loss function; the initial weight parameters of the initial neural network model are adjusted according to the error to obtain an updated neural network model; and subsequent samples are obtained one by one from the training set and repeatedly input into the latest neural network model until the loss function converges, thereby obtaining a trained song generation model. In this way, the error between the predicted mel spectrum features and the real mel spectrum features can be accurately calculated according to the loss function, the model performance can be judged in real time during training, and the weight parameters can be adjusted accordingly, thereby effectively improving the training effect of the song generation model.
Fig. 9 is a schematic structural diagram of a song generating apparatus according to an embodiment of the present disclosure.
As shown in fig. 9, the song generating apparatus 90 includes:
a first obtaining module 901, configured to obtain voice audio input by a target user and a unique identifier of a target song;
The first processing module 902 is configured to extract mel spectrum features of the voice audio to obtain real mel spectrum features of the target user;
A second obtaining module 903, configured to obtain a song template corresponding to the unique identifier according to the unique identifier of the target song;
The second processing module 904 is configured to input a real mel spectrum feature of a target user and a song template into a preset song generating model to obtain a target mel spectrum feature output by the song generating model, where the song generating model is obtained by machine learning training using a training set, the training set is from a plurality of sampling users, the training set includes a plurality of samples, and one sampling user corresponds to at least one sample, and each sample includes singing audio picked up when the sampling user sings a certain song and lyric text corresponding to the singing audio;
A generating module 905 is configured to generate a target song according to the target mel spectrum feature.
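The final step performed by the generating module, turning the target mel spectrum feature into an audible target song, is not detailed above. As a rough stand-in, the sketch below inverts a mel spectrogram with the Griffin-Lim based inversion in librosa; a learned neural vocoder would normally replace this, so the inversion method and its parameters are purely illustrative assumptions.

    import librosa
    import soundfile as sf

    def mel_to_song(target_mel, sr=22050, n_fft=1024, hop=256, out_path="target_song.wav"):
        """Convert a (n_mels, frames) power mel spectrogram into a waveform and save it."""
        waveform = librosa.feature.inverse.mel_to_audio(
            target_mel, sr=sr, n_fft=n_fft, hop_length=hop)   # Griffin-Lim based inversion
        sf.write(out_path, waveform, sr)
        return waveform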
In some embodiments of the present disclosure, as shown in fig. 10, fig. 10 is a schematic structural diagram of a song generating apparatus according to another embodiment of the present disclosure, where a song generating model includes a timbre coding sub-model, a text coding sub-model, and an acoustic decoding sub-model, and the song generating model is obtained by jointly training the timbre coding sub-model, the text coding sub-model, and the acoustic decoding sub-model using the same training set.
In some embodiments of the present disclosure, the song template includes lyric text information including a phoneme sequence and a phoneme duration, and song melody information including a song note sequence and a song energy sequence.
In some embodiments of the disclosure, the second processing module 904 includes: a first processing sub-module 9041 configured to input the real mel spectrum feature of the target user into the timbre coding sub-model to obtain a timbre feature vector of the target user; a second processing sub-module 9042 configured to input the phoneme sequence into the text coding sub-model to obtain a text feature vector of the lyric text in the song template; a third processing sub-module 9043 configured to perform duration regulation on the text feature vector and the timbre feature vector according to the phoneme duration to obtain a frame-level text feature vector and a frame-level timbre feature vector; and a fourth processing sub-module 9044 configured to add the frame-level text feature vector, the frame-level timbre feature vector and the song melody information and input the result into the acoustic decoding sub-model to obtain the target mel spectrum feature.
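A minimal composition of these sub-modules at inference time might look like the sketch below; the sub-model classes, their constructors, and the add-then-decode combination rule are placeholders standing in for whatever concrete networks an implementation actually uses.

    import torch
    import torch.nn as nn

    class SongGenerationModel(nn.Module):
        """Sketch: timbre coding + text coding + acoustic decoding sub-models."""
        def __init__(self, timbre_encoder, text_encoder, acoustic_decoder):
            super().__init__()
            self.timbre_encoder = timbre_encoder
            self.text_encoder = text_encoder
            self.acoustic_decoder = acoustic_decoder

        def forward(self, real_mel, phonemes, durations, melody):
            # durations: 1-D long tensor of per-phoneme frame counts;
            # melody: (batch, frames, dim) song melody information projected to the same width.
            timbre = self.timbre_encoder(real_mel)              # (batch, dim) timbre feature vector
            text = self.text_encoder(phonemes)                  # (batch, phonemes, dim)
            # Duration regulation: repeat each phoneme code for the frames it spans.
            frame_text = torch.repeat_interleave(text, durations, dim=1)
            frame_timbre = timbre.unsqueeze(1).expand_as(frame_text)
            # Add frame-level text, frame-level timbre and melody information, then decode.
            return self.acoustic_decoder(frame_text + frame_timbre + melody)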
In some embodiments of the disclosure, the first processing sub-module 9041 is specifically configured to input the real mel spectrum feature of the target user into a reference encoder to obtain a timbre hidden space distribution vector of the target user, input the timbre hidden space distribution vector into an autoregressive encoder to obtain a timbre distribution vector of the target user, where the timbre distribution vector is obtained by sampling the timbre hidden space distribution vector by the autoregressive encoder, and use the timbre distribution vector as the timbre feature vector of the target user.
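One simple way to realize "sampling the timbre hidden space distribution vector" is the reparameterized Gaussian sampling sketched below; treating the hidden-space vector as a mean/log-variance pair is an assumption of this sketch, not a detail fixed by the disclosure.

    import torch

    def sample_timbre_vector(hidden_mean, hidden_log_var):
        """Draw a timbre distribution vector from the timbre hidden space distribution.

        hidden_mean, hidden_log_var: (batch, dim) tensors produced by the reference encoder.
        """
        std = torch.exp(0.5 * hidden_log_var)
        eps = torch.randn_like(std)      # reparameterization keeps the sampling differentiable
        return hidden_mean + eps * std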
In some embodiments of the present disclosure, the third processing sub-module 9043 is specifically configured to determine an initial text code corresponding to each phoneme in the phoneme sequence from the text feature vectors, determine a first frame number corresponding to the phoneme according to the phoneme duration, copy the initial text code, and splice the copied first frame number of the initial text code to obtain a target text code, and form a frame-level text feature vector according to the plurality of target text codes.
In some embodiments of the present disclosure, the third processing sub-module 9043 is further configured to determine a second frame number of the voice audio according to the phoneme duration, copy the timbre feature vector, and splice the timbre feature vectors of the second frame number obtained by copying to obtain a frame-level timbre feature vector.
In some embodiments of the present disclosure, the song template is constructed from a phoneme sequence, a phoneme duration, a song note sequence, a song energy sequence, and the unique identification number of the target song, wherein the phoneme sequence and the phoneme duration of the target song are determined from the song audio and song lyrics of the target song, and the song note sequence and the song energy sequence of the target song are determined from the song audio.
In some embodiments of the present disclosure, the phoneme sequence comprises a plurality of phonemes obtained by parsing lyrics of a song, and the phoneme duration comprises a first number of frames each phoneme occupies in the song audio.
In some embodiments of the present disclosure, the song energy sequence is derived from a quantization process of song energy characteristics of the song audio, and the song note sequence is derived from a quantization process of song fundamental frequency characteristics of the song audio.
In some embodiments of the present disclosure, the song energy characteristics include a plurality of energy values, and the song energy sequence is formed from a plurality of range encoding values, where the range encoding values are obtained by one-hot encoding the energy ranges corresponding to the energy values.
In some embodiments of the present disclosure, the song fundamental frequency feature comprises a plurality of fundamental frequency values, and the song note sequence comprises a note number corresponding to each fundamental frequency value.
It should be noted that the foregoing explanation of the song generation method is also applicable to the song generating apparatus of this embodiment, and will not be repeated here.
In this embodiment, by acquiring the voice audio input by the target user and the unique identification number of the target song, extracting the mel spectrum feature of the voice audio to obtain the real mel spectrum feature of the target user, acquiring the song template corresponding to the unique identification number according to the unique identification number of the target song, inputting the real mel spectrum feature of the target user and the song template into the preset song generation model to obtain the target mel spectrum feature output by the song generation model, and generating the target song according to the target mel spectrum feature, the real mel spectrum feature of the target user and the song template corresponding to the target song can be effectively combined in the song generation process, so that the degree of dependence on the data volume of the voice data of the user can be effectively reduced, and the song generation convenience can be improved.
Fig. 11 is a schematic structural diagram of a training device for a song generating model according to an embodiment of the present disclosure.
As shown in FIG. 11, the training device 110 of the song generating model includes: a third obtaining module 1101 configured to obtain a training set from a plurality of sampling users, where the training set includes a plurality of samples, one sampling user corresponds to at least one sample, and each sample includes singing audio picked up when the sampling user sings a certain song and lyric text corresponding to the singing audio; a fourth obtaining module 1102 configured to obtain a pre-built initial neural network model, where the initial neural network model includes initial weight parameters and a loss function; a fifth obtaining module 1103 configured to obtain a first sample from the training set and input the first sample into the initial neural network model to obtain real mel spectrum features and predicted mel spectrum features, where the real mel spectrum features represent the mel spectrum features of the singing audio in the first sample and the predicted mel spectrum features represent the mel spectrum features predicted by the initial neural network model; a third processing module 1104 configured to calculate the error between the predicted mel spectrum features and the real mel spectrum features according to the loss function; a fourth processing module 1105 configured to adjust the initial weight parameters of the initial neural network model according to the error to obtain an updated neural network model; and a sixth obtaining module 1106 configured to obtain subsequent samples one by one from the training set and repeatedly input them into the latest neural network model until the loss function converges, so as to obtain a trained song generation model.
In some embodiments of the disclosure, as shown in fig. 12, fig. 12 is a schematic structural diagram of a training device of a song generating model according to another embodiment of the disclosure, where the initial neural network model includes an initial timbre coding sub-model, an initial text coding sub-model, and an initial acoustic decoding sub-model, and the fifth obtaining module 1103 includes: a fifth processing sub-module 11031 configured to perform text transcription on the lyric text in the first sample to obtain a phoneme sequence, and align the singing audio in the first sample according to the phoneme sequence to obtain a phoneme duration; a sixth processing sub-module 11032 configured to perform acoustic feature extraction on the singing audio in the first sample to obtain the real mel spectrum feature, the audio energy, and the fundamental frequency track of the first sample; a seventh processing sub-module 11033 configured to input the phoneme sequence into the initial text coding sub-model to obtain a text feature vector of the first sample; an eighth processing sub-module 11034 configured to input the real mel spectrum feature of the first sample into the initial timbre coding sub-model to obtain a timbre feature vector of the first sample; and a ninth processing sub-module configured to perform duration regulation on the text feature vector and the timbre feature vector according to the phoneme duration to obtain a frame-level text feature vector and a frame-level timbre feature vector, add the frame-level text feature vector, the frame-level timbre feature vector, the audio energy and the fundamental frequency track, and input the result into the initial acoustic decoding sub-model to obtain the predicted mel spectrum feature of the first sample.
It should be noted that the explanation of the foregoing method for training the song-generation model is also applicable to the training device for the song-generation model in this embodiment, and will not be repeated here.
In this embodiment, a training set is obtained, where the training set comprises a plurality of samples, one sampling user corresponds to at least one sample, and each sample comprises singing audio picked up when the sampling user sings a certain song and lyric text corresponding to the singing audio; a pre-built initial neural network model is obtained, where the initial neural network model comprises initial weight parameters and a loss function; a first sample is obtained from the training set and input into the initial neural network model to obtain real mel spectrum features and predicted mel spectrum features, where the real mel spectrum features represent the mel spectrum features of the singing audio in the first sample and the predicted mel spectrum features represent the mel spectrum features predicted by the initial neural network model; the error between the predicted mel spectrum features and the real mel spectrum features is calculated according to the loss function; the initial weight parameters of the initial neural network model are adjusted according to the error to obtain an updated neural network model; and subsequent samples are obtained one by one from the training set and repeatedly input into the latest neural network model until the loss function converges. In this way, the error between the predicted mel spectrum features and the real mel spectrum features can be accurately calculated, the model performance can be judged in real time during training, and a trained song generation model with a good training effect can be obtained.
Fig. 13 illustrates a block diagram of an exemplary electronic device suitable for use in implementing embodiments of the present disclosure. The electronic device 12 shown in fig. 13 is merely an example and should not be construed as limiting the functionality and scope of use of the disclosed embodiments.
As shown in fig. 13, the electronic device 12 is in the form of a general purpose computing device. The components of the electronic device 12 may include, but are not limited to, one or more processors or processing units 16, a system memory 28, and a bus 18 that connects the various system components, including the system memory 28 and the processing units 16.
Bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, a processor, or a local bus using any of a variety of bus architectures. By way of example, and not limitation, such architectures include an Industry Standard Architecture (hereinafter: ISA) bus, a Micro Channel Architecture (hereinafter: MCA) bus, an Enhanced ISA bus, a Video Electronics Standards Association (hereinafter: VESA) local bus, and a Peripheral Component Interconnect (hereinafter: PCI) bus.
Electronic device 12 typically includes a variety of computer system readable media. Such media can be any available media that is accessible by electronic device 12 and includes both volatile and nonvolatile media, removable and non-removable media.
Memory 28 may include computer system readable media in the form of volatile memory, such as random access memory (Random Access Memory; hereinafter: RAM) 30 and/or cache memory 32. The electronic device 12 may further include other removable/non-removable, volatile/nonvolatile computer system storage media. By way of example only, storage system 34 may be used to read from or write to non-removable, nonvolatile magnetic media (not shown in FIG. 13, commonly referred to as a "hard disk drive").
Although not shown in fig. 13, a disk drive for reading from and writing to a removable nonvolatile magnetic disk (e.g., a "floppy disk"), and an optical disk drive for reading from or writing to a removable nonvolatile optical disk (e.g., a compact disk read only memory (Compact Disc Read Only Memory; hereinafter CD-ROM), digital versatile read only optical disk (Digital Video Disc Read Only Memory; hereinafter DVD-ROM), or other optical media) may be provided. In such cases, each drive may be coupled to bus 18 through one or more data medium interfaces. Memory 28 may include at least one program product having a set (e.g., at least one) of program modules configured to carry out the functions of the various embodiments of the disclosure.
A program/utility 40 having a set (at least one) of program modules 42 may be stored in, for example, memory 28, such program modules 42 including, but not limited to, an operating system, one or more application programs, other program modules, and program data, each or some combination of which may include an implementation of a network environment. Program modules 42 generally perform the functions and/or methods in the embodiments described in this disclosure.
The electronic device 12 may also communicate with one or more external devices 14 (e.g., keyboard, pointing device, display 24, etc.), one or more devices that enable a person to interact with the electronic device 12, and/or any devices (e.g., network card, modem, etc.) that enable the electronic device 12 to communicate with one or more other computing devices. Such communication may occur through an input/output (I/O) interface 22. Also, the electronic device 12 may communicate with one or more networks, such as a local area network (Local Area Network; hereinafter: LAN), a wide area network (Wide Area Network; hereinafter: WAN), and/or a public network, such as the Internet, through the network adapter 20. As shown, the network adapter 20 communicates with other modules of the electronic device 12 over the bus 18. It should be appreciated that although not shown, other hardware and/or software modules may be used in connection with electronic device 12, including, but not limited to, microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, data backup storage systems, and the like.
The processing unit 16 executes various functional applications and data processing by running programs stored in the system memory 28, for example, implementing the song-generation method and the training method of the song-generation model mentioned in the foregoing embodiments.
To achieve the above-described embodiments, the present disclosure also proposes a non-transitory computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements a song-generation method and a training method of a song-generation model as proposed by the foregoing embodiments of the present disclosure.
To achieve the above-described embodiments, the present disclosure also proposes a computer program product which, when executed by an instruction processor in the computer program product, performs a song-generation method and a training method of a song-generation model as proposed by the foregoing embodiments of the present disclosure.
In the above embodiments, it may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented in software, it may be implemented in whole or in part in the form of a computer program product. The computer program product comprises one or more computer programs. When the computer program is loaded and executed on a computer, the flow or functions described in accordance with the embodiments of the present disclosure are produced in whole or in part. The computer may be a general purpose computer, a special purpose computer, a computer network, or other programmable apparatus. The computer program may be stored in a computer readable storage medium or transmitted from one computer readable storage medium to another, for example, by wired (e.g., coaxial cable, optical fiber, digital subscriber line (digital subscriber line, DSL)) or wireless (e.g., infrared, wireless, microwave, etc.) means from one website, computer, server, or data center to another website, computer, server, or data center. The computer readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server or data center that contains an integration of one or more available media. The usable medium may be a magnetic medium (e.g., a floppy disk, a hard disk, a magnetic tape), an optical medium (e.g., a high-density digital video disc (digital video disc, DVD)), or a semiconductor medium (e.g., a solid-state drive (solid-state drive, SSD)), or the like.
It will be appreciated by those of ordinary skill in the art that the various numbers of first, second, etc. referred to in this disclosure are for ease of description only and are not intended to limit the scope of the disclosed embodiments, nor to indicate sequencing.
In the present disclosure, "at least one" may also be described as one or more, and "a plurality" may be two, three, four or more, which is not limited by the present disclosure. In the embodiments of the disclosure, for a type of technical feature, individual technical features of that type are distinguished by "first", "second", "third", "A", "B", "C", and "D", and the technical features described by "first", "second", "third", "A", "B", "C", and "D" carry no order of precedence or order of magnitude.
The correspondence relationships shown in the tables in the present disclosure may be configured or predefined. The values of the information in each table are merely examples and may be configured as other values, which is not limited by the present disclosure. When configuring the correspondence between the configuration information and each parameter, it is not necessarily required to configure all of the correspondences shown in each table. For example, the correspondences shown by some rows in the tables in the present disclosure may not be configured. For another example, appropriate adjustments, such as splitting or merging, may be made based on the tables described above. The names of the parameters indicated in the tables may be other names understood by the communication device, and the values or expressions of the parameters may be other values or expressions understood by the communication device. When the tables are implemented, other data structures may also be used, for example, an array, a queue, a container, a stack, a linear table, a pointer, a linked list, a tree, a graph, a structure, a class, a heap, or a hash table.
Predefined in this disclosure may be understood as defining, predefining, storing, pre-negotiating, pre-configuring, curing, or pre-sintering.
Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.
It will be clear to those skilled in the art that, for convenience and brevity of description, specific working procedures of the above-described systems, apparatuses and units may refer to corresponding procedures in the foregoing method embodiments, and are not repeated herein.
The foregoing is merely specific embodiments of the disclosure, but the protection scope of the disclosure is not limited thereto. Any person skilled in the art can readily conceive of changes or substitutions within the technical scope of the disclosure, and such changes or substitutions are intended to be covered by the protection scope of the disclosure. Therefore, the protection scope of the present disclosure shall be subject to the protection scope of the claims.