Disclosure of Invention
The invention aims to provide an unknown audio event recognition algorithm based on a pulse neural network, which can effectively recognize and distinguish unknown sound events without depending on pre-labeled unknown-category information, improves the overall recognition accuracy of the system, and provides support for subsequent analysis and processing of unknown sound events.
The invention adopts the technical scheme that the unknown audio event recognition algorithm based on the impulse neural network comprises the following steps:
S1, constructing an audio data set, and splitting the audio data set into a training set, a verification set and a test set;
S2, preprocessing each section of audio data in the audio data set to generate a 3D log-mel spectrogram;
S3, constructing a pulse neural network model, and inputting the 3D log-mel spectrograms corresponding to the training set into the pulse neural network model for classification training, wherein the pulse neural network model comprises a convolution layer, a plurality of pulse neural units, a multi-layer perceptron MLP, a reshaping layer and a long short-term memory network LSTM;
S4, jointly training the impulse neural network model by using cross entropy loss and contrast loss;
S5, inputting the audio data of the known classes in the verification set into the pulse neural network and a self-encoder to obtain the average mean-square-error loss $L_{avg}$ inferred by the self-encoder, and setting the decision threshold $\theta$ according to $L_{avg}$, wherein the self-encoder comprises an encoder and a decoder and consists of an input layer, at least two hidden layers and an output layer;
And S6, identifying the acquired audio data by using the trained impulse neural network model, inputting the probability values output by the impulse neural network model into the self-encoder, and judging through the self-encoder whether the input data belongs to a known class or an unknown class: if the loss obtained by inference of the self-encoder is higher than the threshold $\theta$, the input is judged to be of an unknown class; otherwise, the specific known class to which the audio belongs is determined according to the probability values output by the impulse neural network model.
Further, the specific steps of the step S2 are as follows:
S201, converting the audio data into normal distribution through z-standardization, so that the audio data with different characteristics have the same dimension and distribution, and the impulse neural network model is convenient to learn, and the specific formula is as follows:
$z = \frac{x - \mu}{\sigma}$;
where x represents the original audio data, $\mu$ represents the mean of the original audio data, $\sigma$ represents the standard deviation of the original audio data, and z represents the z-normalized audio data;
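For illustration only (the function name and the use of NumPy are assumptions, not part of the claimed method), z-standardization can be sketched as:

```python
import numpy as np

def z_normalize(x: np.ndarray) -> np.ndarray:
    """Map a raw audio waveform to zero mean and unit variance."""
    mu = x.mean()      # mean of the original audio data
    sigma = x.std()    # standard deviation of the original audio data
    return (x - mu) / sigma
```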
S202, generating a log-mel spectrogram corresponding to z-standardized original audio data by using a Mel filter bank, and taking a log-mel spectral feature value in the log-mel spectrogram as an original static feature of the audio data;
S203, calculating the difference between the previous frame and the next frame for each frame log-mel spectrum to obtain a corresponding first-order time derivative, namely a first-order Delta differential feature, and capturing the change of the feature along with time by using the first-order Delta differential feature;
The specific process for solving the first-order Delta differential characteristics is as follows:
Selecting a window size N1 for calculating the first-order Delta differential feature; for each time frame t, calculating the weighted differences between the log-mel spectral feature values of the n-th frames before and after t (n = 1, 2, …, N1), and normalizing the weighted sum with the normalization factor $2\sum_{n=1}^{N_1} n^2$, wherein the specific formula is as follows:
$d_t = \frac{\sum_{n=1}^{N_1} n \left( C_{t+n} - C_{t-n} \right)}{2 \sum_{n=1}^{N_1} n^2}$;
where $d_t$ represents the first-order Delta differential feature of frame t, $C_{t+n}$ represents the log-mel spectral feature value of the n-th frame after time frame t, $C_{t-n}$ represents the log-mel spectral feature value of the n-th frame before time frame t, and n = 1, 2, …, N1.
S204, calculating the difference between the previous frame and the next frame for the first-order Delta differential feature to obtain a corresponding second-order Delta differential feature, capturing acceleration information of the feature changing along with time by using the second-order Delta differential feature, and describing the dynamic change characteristic of the signal;
the specific process for solving the second-order Delta differential characteristics is as follows:
Selecting a window size N2 for calculating the second-order Delta differential feature; for each frame, calculating the weighted differences between the first-order Delta differential features of the n-th frames before and after it (n = 1, 2, …, N2), and normalizing the weighted sum with the normalization factor $2\sum_{n=1}^{N_2} n^2$, wherein the specific formula is as follows:
$d''_t = \frac{\sum_{n=1}^{N_2} n \left( d_{t+n} - d_{t-n} \right)}{2 \sum_{n=1}^{N_2} n^2}$;
where $d''_t$ represents the second-order Delta differential feature of the t-th frame, $d_{t+n}$ represents the first-order Delta differential feature of frame t+n, $d_{t-n}$ represents the first-order Delta differential feature of frame t−n, and n = 1, 2, …, N2.
S205, stacking the log-mel spectrum characteristic value obtained in the step S202, the first-order Delta differential characteristic obtained in the step S203 and the second-order Delta differential characteristic obtained in the step S204 according to characteristic dimensions to obtain a 3D log-mel spectrum signal.
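As a minimal sketch of steps S202–S205 (assuming the librosa library, a 44.1 kHz sample rate, the 128 mel filters of the embodiment, and delta windows N1 = N2 = 2; all of these are assumptions where the text does not fix them):

```python
import numpy as np
import librosa

def make_3d_logmel(z: np.ndarray, sr: int = 44100, n_mels: int = 128) -> np.ndarray:
    """Stack static log-mel, first-order Delta and second-order Delta
    features along a new leading dimension (step S205)."""
    mel = librosa.feature.melspectrogram(y=z, sr=sr, n_mels=n_mels)
    logmel = librosa.power_to_db(mel)                      # static feature (S202)
    d1 = librosa.feature.delta(logmel, width=5)            # first-order Delta, N1 = 2 (S203)
    d2 = librosa.feature.delta(logmel, order=2, width=5)   # second-order Delta, N2 = 2 (S204)
    return np.stack([logmel, d1, d2], axis=0)              # shape (3, n_mels, frames)
```

Adding a singleton channel axis to each of the three maps would give the 3 × 1 × 128 × 320 shape quoted in the embodiment.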
Further, the specific steps of the step S202 are as follows:
S2021, dividing the original audio data after z-normalization into overlapped frames, wherein each frame comprises N3 samples, reducing spectrum leakage by using a Hamming window, and obtaining a windowed signal, wherein the specific formula is as follows:
$w[n] = 0.54 - 0.46 \cos\left( \frac{2 \pi n}{N_3 - 1} \right)$;
$x_w[n] = x[n] \, w[n]$;
where w[n] represents the Hamming window, $x_w[n]$ represents the signal obtained after windowing, x[n] represents the signal of each frame, and $0 \le n \le N_3 - 1$;
S2022, applying the discrete Fourier transform to the windowed signal of each frame, and calculating the power spectrum of each frame, wherein the specific formulas are as follows:
$X[k] = \sum_{n=0}^{N_3 - 1} x_w[n] \, e^{-j 2 \pi k n / N_3}$;
$P[k] = \frac{\left| X[k] \right|^2}{N_3}$;
where X[k] represents the frequency spectrum, P[k] represents the power spectrum of each frame, k represents the frequency index, and j represents the imaginary unit;
S2023, converting the frequency scale to the Mel scale, filtering the power spectrum of each frame with the Mel filter bank, and taking the logarithm of the resulting Mel spectral energies to obtain the log-Mel spectrogram, wherein the specific formulas are as follows:
$f_{mel} = 2595 \log_{10}\left( 1 + \frac{f}{700} \right)$;
$H_m[k] = \begin{cases} 0, & k < f(m-1) \\ \frac{k - f(m-1)}{f(m) - f(m-1)}, & f(m-1) \le k \le f(m) \\ \frac{f(m+1) - k}{f(m+1) - f(m)}, & f(m) < k \le f(m+1) \\ 0, & k > f(m+1) \end{cases}$;
$S_m = \sum_{k} H_m[k] \, P[k], \qquad \text{log-mel}_m = \log S_m$;
where $f_{mel}$ represents the Mel frequency corresponding to the linear frequency f, $H_m[k]$ denotes the filter response of the Mel filter bank, f(m) denotes the center frequency of the m-th Mel filter, and $S_m$ denotes the Mel spectral energy calculated by the Mel filter bank.
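A minimal per-frame sketch of S2021–S2023, assuming NumPy, a precomputed triangular filter bank over all N3 FFT bins, and a small floor constant to keep the logarithm finite (all assumptions):

```python
import numpy as np

def hz_to_mel(f):
    """Mel scale conversion: f_mel = 2595 * log10(1 + f / 700)."""
    return 2595.0 * np.log10(1.0 + f / 700.0)

def logmel_frame(frame: np.ndarray, mel_fb: np.ndarray) -> np.ndarray:
    """Hamming window -> DFT -> power spectrum -> mel energies -> log.
    `mel_fb` is an (n_mels, N3) matrix holding the responses H_m[k]."""
    N3 = len(frame)
    n = np.arange(N3)
    w = 0.54 - 0.46 * np.cos(2 * np.pi * n / (N3 - 1))  # Hamming window w[n]
    xw = frame * w                                       # windowed signal x_w[n]
    X = np.fft.fft(xw)                                   # spectrum X[k]
    P = np.abs(X) ** 2 / N3                              # power spectrum P[k]
    S = mel_fb @ P                                       # mel energies S_m
    return np.log(S + 1e-10)                             # log-mel values
```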
Further, the specific steps of the step S3 are as follows:
S301, inputting the 3D log-mel spectrum signal to the convolution layer, and reshaping it according to batch size and channel number;
S302, processing the spectrum signal output by the convolution layer through the pulse neural unit, which provides preceding and succeeding temporal information through its pulse units, with the following specific steps:
S3021, processing the spectrum signal through the pulse neurons and capturing its temporal characteristics, the specific process being as follows:
$U[t] = V[t-1] + \frac{1}{\tau} \left( I[t] - \left( V[t-1] - V_{reset} \right) \right)$;
$S[t] = \Theta\left( U[t] - V_{th} \right)$;
$V[t] = U[t] \left( 1 - S[t] \right) + V_{reset} \, S[t]$;
where U[t] represents the membrane potential before reset at time t; S[t] represents the output spike at time t, equal to 1 when there is a spike and 0 otherwise; $\tau$ represents the time constant, which affects the decay rate of the membrane potential; V[t-1] represents the membrane potential after the triggered spike at time t−1; I[t] represents the input current at time t; $\Theta$ represents the Heaviside step function; $V_{th}$ represents the threshold of the membrane potential, such that when U[t] exceeds the threshold the neuron triggers a spike; V[t] represents the membrane potential after the triggered spike at time t; and $V_{reset}$ represents the value to which the membrane potential is reset after a spike;
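A sketch of one leaky integrate-and-fire update with hard reset, matching the three equations above; the reset-to-zero behavior follows the embodiment's "hard reset" description, while the values of tau and v_th are assumptions:

```python
import numpy as np

def lif_step(v_prev, i_t, tau=2.0, v_th=1.0, v_reset=0.0):
    """One LIF neuron update: charge, fire, hard-reset."""
    u = v_prev + (i_t - (v_prev - v_reset)) / tau   # U[t]: potential before reset
    s = np.where(u >= v_th, 1.0, 0.0)               # S[t]: Heaviside spike output
    v = u * (1.0 - s) + v_reset * s                 # V[t]: back to v_reset after a spike
    return s, v
```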
S3022, extracting scale-invariant information from the spectrum signal by using the convolution layer in the pulse unit; for each frame the input is I and the convolution kernel is K, and the convolution operation is performed according to the following formula:
$S(u, v) = \sum_{a=0}^{U-1} \sum_{b=0}^{V-1} I(u+a, \, v+b) \, K(a, b)$;
where S(u, v) represents the value in row u, column v of the convolution result matrix, U represents the height of the convolution kernel K, V represents the width of the convolution kernel K, I(u+a, v+b) represents the value in row u+a, column v+b of the input matrix I, and K(a, b) represents the value in row a, column b of the convolution kernel K;
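A direct (unoptimized) rendering of this formula, for illustration; valid-mode padding is an assumption:

```python
import numpy as np

def conv2d_valid(I: np.ndarray, K: np.ndarray) -> np.ndarray:
    """S(u, v) = sum_a sum_b I(u + a, v + b) * K(a, b)."""
    U, V = K.shape                        # kernel height and width
    H, W = I.shape
    S = np.zeros((H - U + 1, W - V + 1))
    for u in range(S.shape[0]):
        for v in range(S.shape[1]):
            S[u, v] = np.sum(I[u:u + U, v:v + V] * K)
    return S
```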
S3023, standardizing the output of the convolution layer in the pulse unit with a batch normalization layer to accelerate the training of the pulse neural network model and improve its stability, wherein the specific formulas are as follows:
$\mu_B = \frac{1}{Q} \sum_{i=1}^{Q} x_i, \qquad \sigma_B^2 = \frac{1}{Q} \sum_{i=1}^{Q} \left( x_i - \mu_B \right)^2$;
$\hat{x}_i = \frac{x_i - \mu_B}{\sqrt{\sigma_B^2 + \epsilon}}$;
$y_i = \gamma \, \hat{x}_i + \beta$;
where $y_i$ represents the output after batch normalization, $\gamma$ represents a first learnable parameter used to scale the normalized value, $x_i$ represents the input feature value of the i-th sample in the batch, $\mu_B$ represents the feature mean of each batch, $\sigma_B^2$ represents the feature variance of each batch, $\epsilon$ is a small constant for numerical stability, $\beta$ represents a second learnable parameter used to shift the normalized value, and Q represents the number of samples in the batch;
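The formulas above in a few lines of NumPy (training-mode statistics only; the epsilon value is an assumption):

```python
import numpy as np

def batch_norm(x, gamma, beta, eps=1e-5):
    """Normalize a batch of Q samples (axis 0), then scale and shift."""
    mu_b = x.mean(axis=0)                       # batch mean
    var_b = x.var(axis=0)                       # batch variance
    x_hat = (x - mu_b) / np.sqrt(var_b + eps)   # normalized value
    return gamma * x_hat + beta                 # learnable scale and shift
```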
S3024, after the pulse units extract the specific features of the input spectrum signal, the features are fused with the input spectrum signal through residual convolution, which improves the propagation efficiency of the information flow; the output expression of the pulse neural unit is:
Output = F(w) + Conv(w);
where Output represents the output of the pulse neural unit, w represents the input features, F represents the output of the two pulse units, and Conv represents the convolution operation;
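A hedged PyTorch sketch of one pulse neural unit in the Output = F(w) + Conv(w) form; the spike threshold, the straight-through surrogate gradient, and the 1×1 residual convolution are simplifying assumptions, not details fixed by the text:

```python
import torch
import torch.nn as nn

class PulseUnit(nn.Module):
    """Conv -> batch norm -> spiking activation (one pulse unit, simplified)."""
    def __init__(self, ch: int):
        super().__init__()
        self.conv = nn.Conv2d(ch, ch, kernel_size=3, padding=1)
        self.bn = nn.BatchNorm2d(ch)

    def forward(self, x):
        u = self.bn(self.conv(x))
        # Heaviside spike; (u - u.detach()) passes gradients straight through.
        return (u >= 1.0).float() + (u - u.detach())

class PulseNeuralUnit(nn.Module):
    """Output = F(w) + Conv(w): two pulse units plus a residual convolution."""
    def __init__(self, ch: int):
        super().__init__()
        self.f = nn.Sequential(PulseUnit(ch), PulseUnit(ch))
        self.res_conv = nn.Conv2d(ch, ch, kernel_size=1)

    def forward(self, w):
        return self.f(w) + self.res_conv(w)
```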
S303, obtaining the scale-invariant features of the spectrum signal through the pulse neural units, and processing the feature map output by the pulse neural units with spatial average pooling to obtain feature vectors;
s304, sequentially inputting the feature vectors into a multi-layer perceptron MLP, a remodelling layer and a long-term memory network LSTM for processing, wherein the long-term memory network LSTM comprises an input gateForgetful doorAnd an output doorWherein sig represents an activation function sigmoid for mapping an input value between 0 and 1, Wi represents a weight matrix of an input gate, Wf represents a weight matrix of a forgetting gate, Wo represents a weight matrix of an output gate, ht-1 represents a hidden state at a previous time, xt represents an input vector at a current time, bi represents a bias vector of an input gate, bf represents a bias vector of a forgetting gate, bo represents a bias vector of an output gate, candidate states generated from the current input and the previous hidden stateWherein, tanh represents a hyperbolic tangent function for mapping an input value between-1 and 1, Wc represents a weight matrix of candidate states, bc represents a bias vector of the candidate states, and the cell state is obtained by combining a previous cell state ct-1 and the candidate state update through the regulation of a forgetting gate and an input gate;
S305, sequentially inputting the features output by the LSTM into a reshaping layer and a multi-layer perceptron MLP for processing. The hidden layer computes the input vector $x_1$ as $h_1 = \mathrm{sig}\left( W_1 x_1 + b_1 \right)$, where $h_1$ represents the feature vector output by the hidden layer, $W_1$ represents the weight matrix from the input layer to the hidden layer, and $b_1$ represents the bias vector of the hidden layer; the hidden feature vector output by the hidden layer is then sent to the output layer, which outputs the feature vector $y = \mathrm{sig}\left( W_2 h_1 + b_2 \right)$, where $W_2$ represents the weight matrix from the hidden layer to the output layer and $b_2$ represents the bias vector of the output layer.
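A sketch of the MLP → reshape → LSTM → reshape → MLP head of steps S304–S305 in PyTorch; the 512-dimensional input and the 10-class output come from the embodiment, while the hidden width and sequence length are assumptions:

```python
import torch
import torch.nn as nn

class ClassifierHead(nn.Module):
    """MLP -> reshape -> LSTM -> reshape -> MLP (steps S304-S305)."""
    def __init__(self, feat_dim=512, hidden=128, seq_len=8, n_classes=10):
        super().__init__()
        self.seq_len = seq_len
        self.mlp_in = nn.Linear(feat_dim, seq_len * hidden)
        self.lstm = nn.LSTM(hidden, hidden, batch_first=True)
        self.mlp_out = nn.Sequential(
            nn.Linear(seq_len * hidden, hidden), nn.Sigmoid(),  # hidden layer h1
            nn.Linear(hidden, n_classes), nn.Sigmoid(),         # output layer y
        )

    def forward(self, x):                                  # x: (batch, 512)
        b = x.size(0)
        z = self.mlp_in(x).view(b, self.seq_len, -1)       # reshaping layer
        z, _ = self.lstm(z)                                # gated temporal modelling
        return self.mlp_out(z.reshape(b, -1))              # reshape + output MLP
```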
Further, the calculation formulas of the cross entropy loss and the contrast loss are as follows:
$L_{ce} = -\frac{1}{BS} \sum_{b=1}^{BS} \log \frac{\exp\left( f\left( x_b, y_b \right) \right)}{\sum_{d=1}^{D} \exp\left( f\left( x_{b,d} \right) \right)}$;
$L_{con} = -\frac{1}{\left| P_1 \right|} \sum_{x_1^+ \in P_1} \log \frac{\exp\left( \mathrm{sim}\left( x_1, x_1^+ \right) / \tau_c \right)}{\sum_{x_1^- \in N_{neg}} \exp\left( \mathrm{sim}\left( x_1, x_1^- \right) / \tau_c \right)}$;
where $L_{ce}$ denotes the cross entropy loss, BS denotes the size of the batch, $x_b$ denotes the b-th sample, $y_b$ denotes the label of the b-th sample, $f(x_b, y_b)$ denotes the output of the model on the real label $y_b$ of sample $x_b$, D denotes the total number of categories, d denotes the category index, $x_{b,d}$ denotes the input of the b-th sample on the d-th category, $f(x_{b,d})$ denotes the output value of the d-th category for the input data, $L_{con}$ represents the contrast loss, $P_1$ represents the set of positive samples in the batch, $x_1$ represents the current sample, $x_1^+$ represents a positive sample similar to the current sample, i.e. a sample belonging to the same class as the current sample, $\tau_c$ represents the temperature parameter, sim represents cosine similarity, $x_1^-$ represents a negative sample dissimilar to the current sample, i.e. a sample belonging to a different class from the current sample, and $N_{neg}$ represents the set of negative samples in the batch.
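A joint-loss sketch in PyTorch under the assumption that the contrast loss takes the usual supervised-contrastive (InfoNCE-style) form implied by the formula above; the weighting factor lam between the two losses is also an assumption:

```python
import torch
import torch.nn.functional as F

def joint_loss(logits, labels, features, tau=0.1, lam=1.0):
    """Cross entropy plus a supervised contrastive loss over one batch."""
    l_ce = F.cross_entropy(logits, labels)
    z = F.normalize(features, dim=1)               # unit vectors for cosine similarity
    sim = z @ z.t() / tau                          # pairwise similarities / temperature
    pos = labels.unsqueeze(0) == labels.unsqueeze(1)
    pos.fill_diagonal_(False)                      # positives: same class, not self
    self_mask = torch.eye(len(z), dtype=torch.bool, device=z.device)
    log_prob = sim - torch.logsumexp(sim.masked_fill(self_mask, float("-inf")),
                                     dim=1, keepdim=True)
    l_con = -(log_prob * pos).sum(1) / pos.sum(1).clamp(min=1)
    return l_ce + lam * l_con.mean()
```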
Further, the number of nodes in the input layer of the encoder is determined by average pooling of the feature map output by the convolution layer of the pulse unit, and the self-encoder is trained by using the mean square error, wherein the specific formula is as follows:
$MSE = \frac{1}{S} \sum_{s=1}^{S} \left( y_{ae}^{(s)} - \hat{y}_{ae}^{(s)} \right)^2$;
where MSE represents the mean square error, S represents the number of samples, $y_{ae}$ represents the true target value, and $\hat{y}_{ae}$ represents the self-encoder prediction value.
The invention has the beneficial effects that:
When preprocessing data, the method differs from traditional methods that use a 2D-mel spectrogram: it combines dynamic and original static features into a three-dimensional log-mel feature so as to capture more detailed low-frequency speech signals and better understand the characteristics of sound events. The method combines pulse neurons and a residual neural network to construct the pulse neural network model, which can effectively process the temporal correlation characteristics in the audio while extracting feature information. When training the model, cross entropy loss is used for classification and is combined with contrast loss; the contrast loss reduces the feature distance between samples of the same category and enlarges the feature distance between samples of different categories, so that the pulse neural network model learns more compact features and the recognition accuracy for both unknown and known classes is improved. In distinguishing unknown classes from known classes, the method differs from the traditional approach of judging by the maximum probability value logic in the neural network output layer: instead, a threshold is set according to the reconstruction loss of the known classes through the self-encoder.
Detailed Description
In order that the above-recited objects, features and advantages of the present invention will be more clearly understood, a more particular description of the invention will be rendered by reference to the appended drawings and appended detailed description. In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, but the present invention may be practiced in other ways than as described herein, and therefore the present invention is not limited to the specific embodiments disclosed below.
The embodiment of the invention provides an unknown audio event recognition algorithm based on a pulse neural network, which comprises the following steps:
S1, constructing an audio data set, and splitting the audio data set into a training set, a verification set and a test set. In the embodiment of the present invention, the DCASE2019 Subtask C Open-set Acoustic Scene Classification dataset is selected as the audio dataset; it is described in the 2018 publication by Annamaria Mesaros, Toni Heittola and Tuomas Virtanen, "A multi-device dataset for urban acoustic scene classification," in Proceedings of the Detection and Classification of Acoustic Scenes and Events 2018 Workshop (DCASE2018).
S2, preprocessing each section of audio data in the audio data set to generate a 3D log-mel spectrogram, wherein the specific steps are as follows:
S201, converting the audio data into normal distribution through z-standardization, so that the audio data with different characteristics have the same dimension and distribution, and the impulse neural network model is convenient to learn, and the specific formula is as follows:
$z = \frac{x - \mu}{\sigma}$;
where x represents the original audio data, $\mu$ represents the mean of the original audio data, $\sigma$ represents the standard deviation of the original audio data, and z represents the z-normalized audio data;
S202, generating a log-mel spectrogram corresponding to the z-normalized original audio data by using a Mel filter bank, and taking the log-mel spectral feature values in the log-mel spectrogram as the original static features of the audio data. In the embodiment of the invention, the number of Mel filters is set to 128, and a log-mel spectrogram of size 1 × 128 × 320 is generated. The specific steps are as follows:
S2021, dividing the original audio data after z-normalization into overlapped frames, wherein each frame comprises N3 samples, reducing spectrum leakage by using a Hamming window, and obtaining a windowed signal, wherein the specific formula is as follows:
$w[n] = 0.54 - 0.46 \cos\left( \frac{2 \pi n}{N_3 - 1} \right)$;
$x_w[n] = x[n] \, w[n]$;
where w[n] represents the Hamming window, $x_w[n]$ represents the signal obtained after windowing, x[n] represents the signal of each frame, and $0 \le n \le N_3 - 1$.
S2022, applying the discrete Fourier transform to the windowed signal of each frame, and calculating the power spectrum of each frame, wherein the specific formulas are as follows:
$X[k] = \sum_{n=0}^{N_3 - 1} x_w[n] \, e^{-j 2 \pi k n / N_3}$;
$P[k] = \frac{\left| X[k] \right|^2}{N_3}$;
where X[k] represents the frequency spectrum, P[k] represents the power spectrum of each frame, k represents the frequency index, and j represents the imaginary unit.
S2023, converting the frequency scale to the Mel scale, filtering the power spectrum of each frame with the Mel filter bank, and taking the logarithm of the resulting Mel spectral energies to obtain the log-Mel spectrogram, wherein the specific formulas are as follows:
$f_{mel} = 2595 \log_{10}\left( 1 + \frac{f}{700} \right)$;
$H_m[k] = \begin{cases} 0, & k < f(m-1) \\ \frac{k - f(m-1)}{f(m) - f(m-1)}, & f(m-1) \le k \le f(m) \\ \frac{f(m+1) - k}{f(m+1) - f(m)}, & f(m) < k \le f(m+1) \\ 0, & k > f(m+1) \end{cases}$;
$S_m = \sum_{k} H_m[k] \, P[k], \qquad \text{log-mel}_m = \log S_m$;
where $f_{mel}$ represents the Mel frequency corresponding to the linear frequency f, $H_m[k]$ denotes the filter response of the Mel filter bank, f(m) denotes the center frequency of the m-th Mel filter, and $S_m$ denotes the Mel spectral energy calculated by the Mel filter bank.
S203, calculating the difference between the previous frame and the next frame for each frame log-mel spectrum to obtain a corresponding first-order time derivative, namely a first-order Delta differential feature, and capturing the change of the feature along with time by using the first-order Delta differential feature;
The specific process for solving the first-order Delta differential characteristics is as follows:
Selecting a window size N1 for calculating the first-order Delta differential feature; for each time frame t, calculating the weighted differences between the log-mel spectral feature values of the n-th frames before and after t (n = 1, 2, …, N1), and normalizing the weighted sum with the normalization factor $2\sum_{n=1}^{N_1} n^2$, wherein the specific formula is as follows:
$d_t = \frac{\sum_{n=1}^{N_1} n \left( C_{t+n} - C_{t-n} \right)}{2 \sum_{n=1}^{N_1} n^2}$;
where $d_t$ represents the first-order Delta differential feature of frame t, $C_{t+n}$ represents the log-mel spectral feature value of the n-th frame after time frame t, $C_{t-n}$ represents the log-mel spectral feature value of the n-th frame before time frame t, and n = 1, 2, …, N1. In the embodiment of the invention, the value of N1 is 2.
S204, calculating the difference between the previous frame and the next frame for the first-order Delta differential feature to obtain a corresponding second-order Delta differential feature, capturing acceleration information of the feature changing along with time by using the second-order Delta differential feature, and describing the dynamic change characteristic of the signal.
The specific process for solving the second-order Delta differential characteristics is as follows:
Selecting a window size N2 for calculating the second-order Delta differential feature; for each frame, calculating the weighted differences between the first-order Delta differential features of the n-th frames before and after it (n = 1, 2, …, N2), and normalizing the weighted sum with the normalization factor $2\sum_{n=1}^{N_2} n^2$, wherein the specific formula is as follows:
$d''_t = \frac{\sum_{n=1}^{N_2} n \left( d_{t+n} - d_{t-n} \right)}{2 \sum_{n=1}^{N_2} n^2}$;
where $d''_t$ represents the second-order Delta differential feature of the t-th frame, $d_{t+n}$ represents the first-order Delta differential feature of frame t+n, $d_{t-n}$ represents the first-order Delta differential feature of frame t−n, and n = 1, 2, …, N2. In the embodiment of the invention, the value of N2 is 2.
S205, stacking the log-mel spectral feature values obtained in step S202, the first-order Delta differential features obtained in step S203 and the second-order Delta differential features obtained in step S204 along the feature dimension to obtain a 3D log-mel spectrum signal of size 3 × 1 × 128 × 320.
S3, constructing a pulse neural network model, and inputting the 3D log-mel spectrograms corresponding to the training set into the pulse neural network model for classification training. As shown in FIG. 1, the pulse neural network model comprises a convolution layer, a plurality of pulse neural units, a multi-layer perceptron MLP, a reshaping layer and a long short-term memory network LSTM; each pulse neural unit comprises two pulse units and a residual convolution, and each pulse unit consists of a convolution layer, a batch normalization layer and at least two pulse neurons. Since the audio signal is a continuous time sequence, the pulse neural network can encode temporal information through the intervals and order of the pulses, adequately capturing timing characteristics in the audio such as tempo and pitch variations. The specific training process of the pulse neural network model is as follows:
S301, inputting the 3D log-mel spectrum signal to the convolution layer, and reshaping it according to batch size and channel number.
S302, processing the spectrum signal output by the convolution layer through the pulse neural unit, which provides preceding and succeeding temporal information through its pulse units, with the following specific steps:
S3021, processing the spectrum signal through the pulse neurons and capturing its temporal characteristics, the specific process being as follows:
$U[t] = V[t-1] + \frac{1}{\tau} \left( I[t] - \left( V[t-1] - V_{reset} \right) \right)$;
$S[t] = \Theta\left( U[t] - V_{th} \right)$;
$V[t] = U[t] \left( 1 - S[t] \right) + V_{reset} \, S[t]$;
where U[t] represents the membrane potential before reset at time t; S[t] represents the output spike at time t, equal to 1 when there is a spike and 0 otherwise; $\tau$ represents the time constant, which affects the decay rate of the membrane potential; V[t-1] represents the membrane potential after the triggered spike at time t−1; I[t] represents the input current at time t; $\Theta$ represents the Heaviside step function; $V_{th}$ represents the threshold of the membrane potential, such that when U[t] exceeds the threshold the neuron triggers a spike; V[t] represents the membrane potential after the triggered spike at time t; and $V_{reset}$ represents the value to which the membrane potential is reset after a spike. The pulse neuron is designed from the perspective of biological plausibility: because it transmits sparse spikes instead of continuous representations, it ensures low energy consumption and high robustness while capturing more temporal characteristics. In the embodiment of the present invention, the "hard reset" method is used to reset the membrane potential in V[t], ensuring that after a spike is triggered (S[t] = 1) the membrane potential V[t] returns to $V_{reset} = 0$. Compared with a traditional resnet-18-based classification network, the pulse neural network used in the embodiment of the invention effectively improves the recognition rate of unknown classes by 2%.
S3022, extracting scale-invariant information from the spectrum signal by using the convolution layer in the pulse unit; for each frame the input is I and the convolution kernel is K, and the convolution operation is performed according to the following formula:
$S(u, v) = \sum_{a=0}^{U-1} \sum_{b=0}^{V-1} I(u+a, \, v+b) \, K(a, b)$;
where S(u, v) represents the value in row u, column v of the convolution result matrix, U represents the height of the convolution kernel K, V represents the width of the convolution kernel K, I(u+a, v+b) represents the value in row u+a, column v+b of the input matrix I, and K(a, b) represents the value in row a, column b of the convolution kernel K.
S3023, standardizing the output of the convolution layer in the pulse unit with a batch normalization layer to accelerate the training of the pulse neural network model and improve its stability, wherein the specific formulas are as follows:
$\mu_B = \frac{1}{Q} \sum_{i=1}^{Q} x_i, \qquad \sigma_B^2 = \frac{1}{Q} \sum_{i=1}^{Q} \left( x_i - \mu_B \right)^2$;
$\hat{x}_i = \frac{x_i - \mu_B}{\sqrt{\sigma_B^2 + \epsilon}}$;
$y_i = \gamma \, \hat{x}_i + \beta$;
where $y_i$ represents the output after batch normalization, $\gamma$ represents a first learnable parameter used to scale the normalized value, $x_i$ represents the input feature value of the i-th sample in the batch, $\mu_B$ represents the feature mean of each batch, $\sigma_B^2$ represents the feature variance of each batch, $\epsilon$ is a small constant for numerical stability, $\beta$ represents a second learnable parameter used to shift the normalized value, and Q represents the number of samples in the batch.
S3024, as shown in FIG. 1, in the embodiment of the present invention a pulse neural unit comprises two pulse units and a residual convolution. After the pulse units extract the specific features of the input spectrum signal, the features are fused with the input spectrum signal through residual convolution, which improves the propagation efficiency of the information flow; the output expression of the pulse neural unit is:
Output = F(w) + Conv(w);
where Output represents the output of the pulse neural unit, w represents the input features, F represents the output of the two pulse units, and Conv represents the convolution operation.
S303, obtaining the scale-invariant features of the spectrum signal through the pulse neural units, and processing the feature map output by the pulse neural units with spatial average pooling to obtain feature vectors of size 512.
S304, sequentially inputting the feature vectors into a multi-layer perceptron MLP, a reshaping layer and a long short-term memory network LSTM for processing. The LSTM comprises an input gate $i_t = \mathrm{sig}\left( W_i \left[ h_{t-1}, x_t \right] + b_i \right)$, a forget gate $f_t = \mathrm{sig}\left( W_f \left[ h_{t-1}, x_t \right] + b_f \right)$ and an output gate $o_t = \mathrm{sig}\left( W_o \left[ h_{t-1}, x_t \right] + b_o \right)$, where sig represents the sigmoid activation function, which maps an input value to between 0 and 1, $W_i$ represents the weight matrix of the input gate, $W_f$ represents the weight matrix of the forget gate, $W_o$ represents the weight matrix of the output gate, $h_{t-1}$ represents the hidden state at the previous time, $x_t$ represents the input vector at the current time, $b_i$ represents the bias vector of the input gate, $b_f$ represents the bias vector of the forget gate, and $b_o$ represents the bias vector of the output gate. The candidate state generated from the current input and the previous hidden state is $\tilde{c}_t = \tanh\left( W_c \left[ h_{t-1}, x_t \right] + b_c \right)$, where tanh represents the hyperbolic tangent function, which maps an input value to between −1 and 1, $W_c$ represents the weight matrix of the candidate state, and $b_c$ represents the bias vector of the candidate state. The cell state is obtained by combining the previous cell state $c_{t-1}$ with the candidate state update under the regulation of the forget gate and the input gate: $c_t = f_t \odot c_{t-1} + i_t \odot \tilde{c}_t$.
S305, sequentially inputting the features output by the LSTM into a reshaping layer and a multi-layer perceptron MLP for processing. The hidden layer computes the input vector $x_1$ as $h_1 = \mathrm{sig}\left( W_1 x_1 + b_1 \right)$, where $h_1$ represents the feature vector output by the hidden layer, $W_1$ represents the weight matrix from the input layer to the hidden layer, and $b_1$ represents the bias vector of the hidden layer; the hidden feature vector output by the hidden layer is then sent to the output layer, which outputs the feature vector $y = \mathrm{sig}\left( W_2 h_1 + b_2 \right)$, where $W_2$ represents the weight matrix from the hidden layer to the output layer and $b_2$ represents the bias vector of the output layer. In the embodiment of the invention, sig is the Sigmoid activation function, and the output feature vector has 10 entries, representing the probabilities of the 10 categories.
S4, jointly training the impulse neural network model by using cross entropy loss and contrast loss, wherein the calculation formula of the cross entropy loss and the contrast loss is as follows:
$L_{ce} = -\frac{1}{BS} \sum_{b=1}^{BS} \log \frac{\exp\left( f\left( x_b, y_b \right) \right)}{\sum_{d=1}^{D} \exp\left( f\left( x_{b,d} \right) \right)}$;
$L_{con} = -\frac{1}{\left| P_1 \right|} \sum_{x_1^+ \in P_1} \log \frac{\exp\left( \mathrm{sim}\left( x_1, x_1^+ \right) / \tau_c \right)}{\sum_{x_1^- \in N_{neg}} \exp\left( \mathrm{sim}\left( x_1, x_1^- \right) / \tau_c \right)}$;
where $L_{ce}$ denotes the cross entropy loss, BS denotes the size of the batch, $x_b$ denotes the b-th sample, $y_b$ denotes the label of the b-th sample, $f(x_b, y_b)$ denotes the output of the model on the real label $y_b$ of sample $x_b$, D denotes the total number of categories, d denotes the category index, $x_{b,d}$ denotes the input of the b-th sample on the d-th category, $f(x_{b,d})$ denotes the output value of the d-th category for the input data, $L_{con}$ represents the contrast loss, $P_1$ represents the set of positive samples in the batch, $x_1$ represents the current sample, $x_1^+$ represents a positive sample similar to the current sample, i.e. a sample belonging to the same class as the current sample, $\tau_c$ represents the temperature parameter, sim represents cosine similarity, $x_1^-$ represents a negative sample dissimilar to the current sample, i.e. a sample belonging to a different class from the current sample, and $N_{neg}$ represents the set of negative samples in the batch.
S5, inputting the audio data of the known classes in the verification set into the pulse neural network and the self-encoder to obtain the average mean-square-error loss $L_{avg}$ inferred by the self-encoder, and setting the decision threshold $\theta$ according to $L_{avg}$. The self-encoder comprises an encoder and a decoder, and consists of an input layer, at least two hidden layers and an output layer.
In the embodiment of the invention, the number of nodes in the input layer of the encoder is determined by average pooling of the feature map output by the convolution layer of the pulse unit. The encoder and the decoder each have three hidden layers; experimental tests show that the preferred numbers of neurons for the input layer, hidden layers and output layer of the encoder are 256, [128, 64] and 8, and the neuron numbers of the decoder are consistent with those of the encoder. When training the self-encoder, the mean square error is used, with the specific formula as follows:
$MSE = \frac{1}{S} \sum_{s=1}^{S} \left( y_{ae}^{(s)} - \hat{y}_{ae}^{(s)} \right)^2$;
where MSE represents the mean square error, S represents the number of samples, $y_{ae}$ represents the true target value, and $\hat{y}_{ae}$ represents the self-encoder prediction value.
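A sketch of the self-encoder with the layer widths stated in the embodiment (encoder 256 → 128 → 64 → 8, decoder mirrored back to 256); the ReLU activations and the mirrored decoder ordering are assumptions:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ThresholdAE(nn.Module):
    """Self-encoder used to derive the open-set decision threshold (step S5)."""
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Linear(256, 128), nn.ReLU(),
            nn.Linear(128, 64), nn.ReLU(),
            nn.Linear(64, 8),
        )
        self.decoder = nn.Sequential(
            nn.Linear(8, 64), nn.ReLU(),
            nn.Linear(64, 128), nn.ReLU(),
            nn.Linear(128, 256),
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

# Threshold from the known-class verification set (hypothetical variable names):
# losses = [F.mse_loss(ae(v), v).item() for v in known_val_features]
# theta = sum(losses) / len(losses)    # L_avg used as the threshold
```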
And S6, identifying the acquired audio data by using the trained impulse neural network model, inputting the probability values output by the impulse neural network model into the self-encoder, and judging through the self-encoder whether the input data belongs to a known class or an unknown class: if the loss obtained by inference of the self-encoder is higher than the threshold $\theta$, the input is judged to be of an unknown class; otherwise, the specific known class to which the audio belongs is determined according to the probability values output by the impulse neural network model.
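The decision rule of step S6 as a short sketch; following the text, the probability vector produced by the trained network is fed to the self-encoder, and its reconstruction loss is compared with the threshold theta obtained in step S5 (the model objects snn and ae are hypothetical):

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def classify_open_set(snn, ae, x, theta: float):
    """Return 'unknown' if the self-encoder loss exceeds theta,
    otherwise the index of the most probable known class."""
    probs = snn(x)                              # class probabilities from the network
    loss = F.mse_loss(ae(probs), probs).item()  # self-encoder reconstruction loss
    if loss > theta:
        return "unknown"
    return int(probs.argmax(dim=-1).item())
```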
When preprocessing data, the embodiment of the invention differs from traditional methods that use a 2D-mel spectrogram: it combines dynamic and original static features into a three-dimensional log-mel feature so as to capture more detailed low-frequency speech signals together with their temporal context and dynamic changes, which allows the characteristics of sound events to be better understood and the dependency between speech and high-frequency dynamic range information to be extracted. The pulse neural network model constructed from pulse neurons and a residual neural network can effectively process the temporal correlation characteristics in the audio while extracting feature information. Pulse neural network activity is typically sparse, meaning that at any point in time only a small number of neurons are active; this sparsity not only reduces the computational burden but also preserves the ability to extract important features. When training the model, cross entropy loss is used for classification and is combined with contrast loss; the contrast loss reduces the feature distance between samples of the same category and enlarges the feature distance between samples of different categories, so that the impulse neural network model learns more compact features and the recognition accuracy for both unknown and known classes is improved. In distinguishing unknown classes from known classes, the embodiment of the invention differs from the traditional method of judging by the maximum probability value logic in the neural network output layer: instead, the threshold is set according to the reconstruction loss of the known classes through the self-encoder, which avoids the performance loss that occurs when the model is poorly calibrated.
The above description is only of the preferred embodiments of the present invention and is not intended to limit the present invention, but various modifications and variations can be made to the present invention by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.