Disclosure of Invention
The embodiments of the present application aim to provide a speech recognition method, a speech recognition device, a computer device, and a storage medium, so as to solve the problems of low recognition accuracy and low recognition efficiency in the prior art.
To solve the above technical problems, an embodiment of the present application provides a speech recognition method that adopts the following technical solution:
matching a service decoding graph and a static decoding graph corresponding to speech recognition according to a service scenario, wherein the static decoding graph is constructed from the service decoding graph and a base decoding graph;
acquiring a speech to be recognized and a client hotword list corresponding to the speech to be recognized;
decoding the speech to be recognized through the service decoding graph to obtain a preliminary decoding result;
if the client hotword list contains client hotwords, constructing a client decoding graph from the client hotwords in the list, constructing a fusion decoding graph from the client decoding graph and the static decoding graph, and taking the fusion decoding graph as a target decoding graph;
if the client hotword list does not contain client hotwords, taking the static decoding graph as the target decoding graph; and
decoding the preliminary decoding result through the target decoding graph to obtain a target decoding result.
Further, before the step of matching the service decoding graph and the static decoding graph corresponding to speech recognition according to the service scenario, the method further includes:
acquiring the fusion type of the service decoding graph and the base decoding graph; and
determining the specific expression of the static decoding graph according to the fusion type.
Further, the step of determining the specific expression of the static decoding graph according to the fusion type includes:
if the fusion type is a linear fusion type, the specific expression of the static decoding graph is C(s_G(w|H), s_B(w|H)) = α1*s_G(w|H) + β1*s_B(w|H);
if the fusion type is an exponential linear fusion type, the specific expression of the static decoding graph is C(s_G(w|H), s_B(w|H)) = -log(α1*exp(-s_G(w|H)) + β1*exp(-s_B(w|H)));
wherein α1 and β1 are variables in the specific expression of the static decoding graph, s_G(w|H) is the score output by the service decoding graph based on the historical decoding state, and s_B(w|H) is the score output by the base decoding graph based on the historical decoding state.
Further, the step of decoding the preliminary decoding result through the target decoding graph includes:
if the target decoding graph is the fusion decoding graph, decoding the preliminary decoding result through a first formula of the target decoding graph, s(w|H) = -log(α2*exp(-C(s_G(w|H), s_B(w|H))) + β2*exp(-s_S(w|H))), wherein α2 and β2 are variables and s_S(w|H) is the score output by the client decoding graph based on the historical decoding state;
if the target decoding graph is the static decoding graph, decoding the preliminary decoding result through a second formula of the target decoding graph, s(w|H) = s_G(w|H) if (w|H) ∉ B and s(w|H) = C(s_G(w|H), s_B(w|H)) if (w|H) ∈ B, wherein s_G(w|H) is the score output by the service decoding graph based on the historical decoding state and B is the dictionary of the language model;
in the first formula and the second formula of the target decoding graph, s(w|H) is the target decoding result, and C(s_G(w|H), s_B(w|H)) is the score output by the static decoding graph based on the historical decoding state.
Further, the step of decoding the speech to be recognized through the service decoding graph includes:
extracting audio features from the speech to be recognized;
converting the audio features into a sequence of phonemes by an acoustic model;
and decoding the phoneme sequence through the service decoding graph.
Further, after the step of obtaining the target decoding result, the method further includes:
extracting a new client hotword from the target decoding result, and updating the new client hotword into the client hotword list.
Further, the step of updating the new client hotword into the client hotword list includes:
if no client hotword matching the new client hotword is found in the client hotword list, adding the new client hotword to the client hotword list; and
if a client hotword matching the new client hotword is found in the client hotword list, leaving that client hotword in the list unmodified.
To solve the above technical problems, an embodiment of the present application further provides a speech recognition device that adopts the following technical solution:
a decoding graph matching module, configured to match a service decoding graph and a static decoding graph corresponding to speech recognition according to a service scenario, wherein the static decoding graph is constructed from the service decoding graph and a base decoding graph;
an acquisition module, configured to acquire a speech to be recognized and a client hotword list corresponding to the speech to be recognized;
a preliminary decoding module, configured to decode the speech to be recognized through the service decoding graph to obtain a preliminary decoding result;
a first determining module, configured to, if the client hotword list contains client hotwords, construct a client decoding graph from the client hotwords in the list, construct a fusion decoding graph from the client decoding graph and the static decoding graph, and take the fusion decoding graph as a target decoding graph;
a second determining module, configured to take the static decoding graph as the target decoding graph if the client hotword list does not contain client hotwords; and
a target decoding module, configured to decode the preliminary decoding result through the target decoding graph to obtain a target decoding result.
To solve the above technical problems, an embodiment of the present application further provides a computer device that adopts the following technical solution:
the computer device comprises a memory having stored therein computer-readable instructions which, when executed by a processor, implement the steps of the speech recognition method described above.
To solve the above technical problems, an embodiment of the present application further provides a computer-readable storage medium that adopts the following technical solution:
the computer-readable storage medium has stored thereon computer-readable instructions which, when executed by a processor, implement the steps of the speech recognition method described above.
Compared with the prior art, the embodiments of the present application match a service decoding graph and a static decoding graph corresponding to speech recognition according to the service scenario, the static decoding graph being constructed from the service decoding graph and a base decoding graph; acquire a speech to be recognized and the client hotword list corresponding to it; decode the speech through the service decoding graph to obtain a preliminary decoding result; if the client hotword list contains client hotwords, construct a client decoding graph from those hotwords, construct a fusion decoding graph from the client decoding graph and the static decoding graph, and take the fusion decoding graph as the target decoding graph; if the list contains no client hotwords, take the static decoding graph as the target decoding graph; and decode the preliminary decoding result through the target decoding graph to obtain the target decoding result. In the present application, the speech to be recognized is first decoded through the service decoding graph, so that the preliminary decoding result matches the client corpus of the current service scenario, improving recognition accuracy and efficiency. A target decoding graph is then determined according to whether the client hotword list contains client hotwords, so that the final target decoding result matches the client's speaking habits, further improving accuracy. Meanwhile, thanks to the base decoding graph, the client hotword list remains a lightweight word list, and whether a fusion decoding graph is built is decided by whether the list contains client hotwords, which flexibly adapts to the usage scenario and reduces the impact on recognition efficiency.
Detailed Description
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs. The terms used in the description are for the purpose of describing particular embodiments only and are not intended to limit the application. The terms "comprising" and "having", and any variations thereof, in the description, the claims, and the above description of the drawings are intended to cover non-exclusive inclusions. The terms "first", "second", and the like in the description, the claims, and the figures are used to distinguish between different objects, not to describe a particular order.
Reference herein to "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment may be included in at least one embodiment of the application. The appearances of this phrase in various places in the specification do not necessarily all refer to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. Those skilled in the art will appreciate, explicitly and implicitly, that the embodiments described herein may be combined with other embodiments.
To enable those skilled in the art to better understand the solutions of the present application, the technical solutions in the embodiments of the present application are described clearly and completely below with reference to the accompanying drawings.
As shown in fig. 1, a system architecture 100 may include terminal devices 101, 102, 103, a network 104, and a server 105. The network 104 is used as a medium to provide communication links between the terminal devices 101, 102, 103 and the server 105. The network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, among others.
The user may interact with the server 105 via the network 104 using the terminal devices 101, 102, 103 to receive or send messages and the like. Various communication client applications, such as web browser applications, shopping applications, search applications, instant messaging tools, mailbox clients, and social platform software, may be installed on the terminal devices 101, 102, 103.
The terminal devices 101, 102, 103 may be various electronic devices having a display screen and supporting web browsing, including but not limited to smartphones, tablet computers, e-book readers, MP3 (Moving Picture Experts Group Audio Layer III) players, MP4 (Moving Picture Experts Group Audio Layer IV) players, laptop computers, desktop computers, and the like.
The server 105 may be a server providing various services, such as a background server providing support for pages displayed on the terminal devices 101, 102, 103.
It should be noted that the speech recognition method provided by the embodiments of the present application is generally executed by the server/terminal device, and accordingly, the speech recognition device is generally disposed in the server/terminal device.
It should be understood that the number of terminal devices, networks and servers in fig. 1 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.
With continued reference to FIG. 2, a flow chart of one embodiment of a speech recognition method according to the present application is shown. The speech recognition method comprises the following steps:
Step S201, matching a service decoding graph and a static decoding graph corresponding to speech recognition according to a service scenario, wherein the static decoding graph is constructed from the service decoding graph and a base decoding graph.
In practical applications, a plurality of service decoding graphs may be pre-trained, with each service decoding graph corresponding to one service scenario.
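As an illustration of this matching step, the following minimal sketch assumes that the pre-trained decoding graphs are kept in a registry keyed by service scenario; the registry layout, scenario names, and file names are hypothetical and not specified by the application.

```python
# Hypothetical registry: one (service graph, static graph) pair per scenario.
DECODING_GRAPHS = {
    "banking":  {"service": "hclg_banking.fst",  "static": "static_banking.fst"},
    "shopping": {"service": "hclg_shopping.fst", "static": "static_shopping.fst"},
}

def match_graphs(scene):
    """Step S201: look up the decoding graphs matching the service scenario."""
    entry = DECODING_GRAPHS[scene]
    return entry["service"], entry["static"]
```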
The base decoding graph is built by taking the characters and words of a language model as the dictionary, segmenting the basic hotwords of each scene in a scene basic hotword list, constructing an Aho-Corasick (AC) automaton from the segments, and then converting the automaton into a decoding graph according to a preset weight proportion; compared with the service decoding graph, the base decoding graph covers the largest set of characters and words.
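The construction just described can be sketched as follows. It is a simplified illustration in which the AC automaton is reduced to a weighted trie; the `segment` function and the uniform weight are placeholder assumptions standing in for segmentation against the language-model dictionary and the preset weight proportion.

```python
def segment(hotword):
    # Placeholder segmentation: split into single characters; a real system
    # would segment against the characters and words of the language model.
    return list(hotword)

def build_base_graph(scene_hotwords, weight=1.0):
    """Insert segmented hotwords into a weighted trie (the core of an AC
    automaton); each root-to-leaf path models one hotword, and the preset
    weight is stored at its final node."""
    root = {"children": {}, "weight": 0.0}
    for word in scene_hotwords:
        node = root
        for token in segment(word):
            node = node["children"].setdefault(
                token, {"children": {}, "weight": 0.0})
        node["weight"] = weight
    return root
```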
Matching the service decoding graph and the static decoding graph of the current speech recognition according to the service scenario improves both the accuracy and the efficiency of speech recognition.
Step S202, acquiring a speech to be recognized and a client hotword list corresponding to the speech to be recognized.
Specifically, the electronic device (such as the server/terminal device shown in FIG. 1) on which the speech recognition method runs may receive, through a wired or wireless connection, the speech to be recognized and the corresponding client hotword list sent by the client/service terminal. It should be noted that the wireless connection may include, but is not limited to, 3G/4G/5G, Wi-Fi, Bluetooth, WiMAX, ZigBee, UWB (ultra-wideband), and other wireless connections now known or later developed.
The client hotword list corresponds to the client of the speech to be recognized and contains N client hotwords, where N ≥ 0 and N is an integer; a client hotword characterizes the personalized corpus of the client.
Step S203, decoding the speech to be recognized through the service decoding graph to obtain a preliminary decoding result.
Specifically, the service decoding graph is an operator-defined FST based on the HCLG structure. The speech to be recognized is decoded through the service decoding graph, and the target text corresponding to the speech is recalled to form a preliminary decoded text (the preliminary decoding result).
Step S204, if the client hotword list contains client hotwords, constructing a client decoding graph from the client hotwords in the client hotword list, constructing a fusion decoding graph from the client decoding graph and the static decoding graph, and taking the fusion decoding graph as the target decoding graph.
Specifically, each client hotword in the client hotword list is decomposed into characters and words through a language model (such as an N-gram language model), an AC automaton is constructed from the decomposed characters and words of each client hotword, and the AC automaton is then converted into the client decoding graph according to a preset weight relationship, where the preset weight relationship characterizes the composition weights of the characters and words within the client hotwords.
When the client hotword list is not empty, it contains at least one client hotword. Constructing the fusion decoding graph from the client decoding graph and the static decoding graph allows the actual decoding process to be personalized to the client, improving the accuracy of speech recognition.
Step S205, if the client hotword list does not contain client hotwords, taking the static decoding graph as the target decoding graph.
When the client hotword list is empty, it contains no client hotwords, so there is no need to construct a client decoding graph or fuse it with the static decoding graph; the preliminary decoding result is decoded through the static decoding graph alone. This flexibly adapts to the corresponding usage scenario and reduces the impact on speech recognition efficiency, as the selection sketch below illustrates.
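A compact sketch of the selection logic of steps S204 and S205, under the assumption that graph construction and fusion are provided as callables; `build_graph_from_hotwords` and `fuse` are illustrative stand-ins, not APIs defined by the application.

```python
def choose_target_graph(client_hotwords, static_graph,
                        build_graph_from_hotwords, fuse):
    """Return the target decoding graph for the current request."""
    if client_hotwords:  # step S204: list contains at least one client hotword
        client_graph = build_graph_from_hotwords(client_hotwords)
        return fuse(client_graph, static_graph)  # fusion decoding graph
    return static_graph  # step S205: fall back to the static decoding graph
```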
Step S206, decoding the preliminary decoding result through the target decoding graph to obtain a target decoding result.
Specifically, in practical applications, the preliminary decoding result is decoded through the target decoding graph based on a weighted finite-state transducer (WFST); the weight ratios of the decoding graphs are combined to obtain the highest-scoring text, which forms the target decoding result, and the text information is then determined from the target decoding result.
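A toy sketch of this rescoring step under stated assumptions: the preliminary decoding result is treated as a list of candidate word sequences, each candidate is scored word by word through a `score_fn` that returns s(w|H), and the best candidate is returned. Since the formulas below use negative-log scores, "highest-scoring" corresponds to the minimum combined score here.

```python
def decode_with_target_graph(candidates, score_fn):
    """candidates: list of word sequences; score_fn(w, history) -> s(w|H)."""
    def total(words):
        history, score = [], 0.0
        for w in words:
            score += score_fn(w, tuple(history))  # per-word target-graph score
            history.append(w)
        return score
    return min(candidates, key=total)  # lowest negative-log score wins
```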
In the present application, the speech to be recognized is first decoded through the service decoding graph, so that the preliminary decoding result matches the client corpus of the current service scenario, improving recognition accuracy and efficiency. A target decoding graph is then determined according to whether the client hotword list contains client hotwords, so that the final target decoding result matches the client's speaking habits, further improving accuracy. Meanwhile, thanks to the base decoding graph, the client hotword list remains a lightweight word list, and whether a fusion decoding graph is built is decided by whether the list contains client hotwords, which flexibly adapts to the usage scenario and reduces the impact on recognition efficiency.
In some optional implementations of this embodiment, before step S201 of matching the service decoding graph and the static decoding graph corresponding to speech recognition according to the service scenario, the method further includes:
acquiring the fusion type of the service decoding graph and the base decoding graph; and
determining the specific expression of the static decoding graph according to the fusion type.
Specifically, the fusion types include a linear fusion type (LL) and an exponential linear (log-linear) fusion type (LIN); computing C(s_G(w|H), s_B(w|H)) with the LIN type generally yields a more accurate result than computing it with the LL type.
In some optional implementations of this embodiment, the step of determining the specific expression of the static decoding graph according to the fusion type includes:
if the fusion type is the linear fusion type, the specific expression of the static decoding graph is C(s_G(w|H), s_B(w|H)) = α1*s_G(w|H) + β1*s_B(w|H);
if the fusion type is the exponential linear fusion type, the specific expression of the static decoding graph is C(s_G(w|H), s_B(w|H)) = -log(α1*exp(-s_G(w|H)) + β1*exp(-s_B(w|H)));
wherein α1 and β1 are variables in the specific expression of the static decoding graph, s_G(w|H) is the score output by the service decoding graph based on the historical decoding state, and s_B(w|H) is the score output by the base decoding graph based on the historical decoding state.
Specifically, α1 and β1 are both variables and their sum is 1, so the weight ratio of s_G(w|H) to s_B(w|H) in the expression for C(s_G(w|H), s_B(w|H)) can be adjusted to the actual situation through the magnitudes of α1 and β1; if α1 is greater than β1, the final result of the expression leans toward the speaking habits of the current service scenario.
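As a minimal sketch, the two expressions can be computed as below, assuming the scores follow the negative-log convention used throughout; the function name and the default fusion type are illustrative assumptions.

```python
import math

def fuse_static(s_g, s_b, alpha1, beta1, fusion_type="LIN"):
    """C(s_G(w|H), s_B(w|H)) for the two fusion types; alpha1 + beta1 = 1."""
    if fusion_type == "LL":  # linear fusion
        return alpha1 * s_g + beta1 * s_b
    # exponential linear (log-linear) fusion of negative-log scores
    return -math.log(alpha1 * math.exp(-s_g) + beta1 * math.exp(-s_b))
```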
In some optional implementations of this embodiment, step S206, the step of decoding the preliminary decoding result through the target decoding graph, includes:
if the target decoding graph is the fusion decoding graph, decoding the preliminary decoding result through the first formula of the target decoding graph, s(w|H) = -log(α2*exp(-C(s_G(w|H), s_B(w|H))) + β2*exp(-s_S(w|H))), wherein α2 and β2 are variables and s_S(w|H) is the score output by the client decoding graph based on the historical decoding state;
if the target decoding graph is the static decoding graph, decoding the preliminary decoding result through the second formula of the target decoding graph, s(w|H) = s_G(w|H) if (w|H) ∉ B and s(w|H) = C(s_G(w|H), s_B(w|H)) if (w|H) ∈ B, wherein s_G(w|H) is the score output by the service decoding graph based on the historical decoding state;
in the first formula and the second formula of the target decoding graph, s(w|H) is the target decoding result, and C(s_G(w|H), s_B(w|H)) is the score output by the static decoding graph based on the historical decoding state.
Specifically, in the first formula of the target decoding graph, α2 and β2 are both variables and their sum is 1, so the weight ratio of C(s_G(w|H), s_B(w|H)) to s_S(w|H) in the first formula can be adjusted to the actual situation through the magnitudes of α2 and β2; if α2 is smaller than β2, the s(w|H) finally calculated through the first formula conforms more closely to the speaking habits of the client.
In the second formula of the target decoding graph, B denotes the dictionary of the language model, from which the base decoding graph is constructed. If (w|H) ∉ B, the dictionary of the language model contains no phrase (w|H), and s(w|H) = s_G(w|H); conversely, if (w|H) ∈ B, the phrase (w|H) exists, and s(w|H) = C(s_G(w|H), s_B(w|H)).
For example, for a given word w and decoding history H, it is judged whether the phrase formed by (w|H) is included in the dictionary of the language model; if not, (w|H) ∉ B and s(w|H) = s_G(w|H); if so, (w|H) ∈ B and s(w|H) = C(s_G(w|H), s_B(w|H)).
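The two formulas and the dictionary test can be combined into one scoring routine. This is a hedged sketch in which the dictionary B is modelled as a set of (history, word) phrases and all parameter defaults are illustrative assumptions.

```python
import math

def target_score(w, H, s_g, s_b, s_s, dictionary, fused,
                 alpha1=0.5, beta1=0.5, alpha2=0.5, beta2=0.5):
    """s(w|H) from the first/second formulas of the target decoding graph."""
    # Second formula: use only the service score when (w|H) is not in B,
    # otherwise the fused static score C(s_G(w|H), s_B(w|H)).
    if (tuple(H), w) in dictionary:
        c = -math.log(alpha1 * math.exp(-s_g) + beta1 * math.exp(-s_b))
    else:
        c = s_g
    if not fused:  # the static decoding graph is the target decoding graph
        return c
    # First formula: log-linear fusion with the client decoding graph score.
    return -math.log(alpha2 * math.exp(-c) + beta2 * math.exp(-s_s))
```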
In some optional implementations of this embodiment, step S203, the step of decoding the speech to be recognized through the service decoding graph includes:
extracting audio features from the speech to be recognized;
converting the audio features into a sequence of phonemes by an acoustic model;
and decoding the phoneme sequence through the service decoding graph.
Specifically, after the speech to be recognized is obtained, at least one audio feature (e.g., Mel-frequency cepstral coefficients, MFCCs) is extracted from it, each audio feature is converted into a state sequence/phoneme sequence through a pre-trained acoustic model, and the phoneme sequence is then decoded through the service decoding graph.
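A front-end sketch of this step, assuming librosa for MFCC extraction and a placeholder `acoustic_model` callable that maps feature frames to a phoneme sequence; the application does not specify the actual feature extractor or model interface.

```python
import librosa

def speech_to_phonemes(wav_path, acoustic_model, sr=16000, n_mfcc=13):
    """Extract MFCC features and convert them into a phoneme sequence."""
    audio, sr = librosa.load(wav_path, sr=sr)  # the speech to be recognized
    feats = librosa.feature.mfcc(y=audio, sr=sr, n_mfcc=n_mfcc)  # audio features
    return acoustic_model(feats.T)  # frames -> phoneme sequence for decoding
```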
In some optional implementations of this embodiment, after step S206 of obtaining the target decoding result, the method further includes:
extracting a new client hotword from the target decoding result, and updating the new client hotword into the client hotword list.
Specifically, after each target decoding result is obtained, word segmentation is performed on the text information in the target decoding result and new client hotwords are extracted, so that the client hotword list is updated and refined; this effectively improves the accuracy of subsequent speech recognition and the client experience.
It should be noted that, after the text information is segmented, keyword determination can be performed on the new client hotwords obtained from the segmentation: the score of each new client hotword is determined according to a preset mapping relationship, the new client hotwords that qualify as keywords are selected according to their scores, and only those are added to the client hotword list. This prevents the client hotword list from becoming overly redundant in subsequent speech recognition, improving recognition efficiency while preserving accuracy.
In some optional implementations of this embodiment, the step of updating the new client hotword into the client hotword list includes:
if no client hotword matching the new client hotword is found in the client hotword list, adding the new client hotword to the client hotword list; and
if a client hotword matching the new client hotword is found in the client hotword list, leaving that client hotword in the list unmodified.
Specifically, when no client hotword matching the new client hotword is found in the client hotword list, the list does not yet contain the new client hotword; adding it further refines the client hotword list, improves its adaptability, and effectively safeguards the accuracy of speech recognition.
When a matching client hotword is found in the client hotword list, the list already contains the new client hotword, and the list is left unmodified, as in the sketch below.
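The update policy of this subsection can be sketched as follows; `extract_keywords` is a hypothetical stand-in for the segmentation and keyword-scoring step described earlier.

```python
def update_hotword_list(hotword_list, target_text, extract_keywords):
    """Add new client hotwords that are absent; leave matched ones untouched."""
    for word in extract_keywords(target_text):
        if word not in hotword_list:   # no matching client hotword found
            hotword_list.append(word)  # add the new client hotword
        # a matched hotword is deliberately left unmodified
    return hotword_list
```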
It should be emphasized that, to further ensure the privacy and security of the static and client decoding graphs, the static and client decoding graph information may also be stored in nodes of a blockchain.
Blockchain is a novel application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanisms, and encryption algorithms. A blockchain is essentially a decentralized database: a chain of data blocks generated in association using cryptographic methods, each block containing a batch of network transaction information used to verify the validity (anti-counterfeiting) of its information and to generate the next block. A blockchain may include a blockchain underlying platform, a platform product service layer, an application service layer, and the like.
The embodiments of the present application may acquire and process the related data based on artificial intelligence technology. Artificial intelligence (AI) is the theory, method, technology, and application system that uses a digital computer, or a machine controlled by a digital computer, to simulate, extend, and expand human intelligence, perceive the environment, acquire knowledge, and use knowledge to obtain optimal results.
Artificial intelligence infrastructure technologies generally include sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing, operation/interaction systems, and mechatronics. Artificial intelligence software technologies mainly include computer vision, robotics, biometric recognition, speech processing, natural language processing, and machine learning/deep learning.
Those skilled in the art will appreciate that implementing all or part of the methods described above may be accomplished by computer-readable instructions stored in a computer-readable storage medium which, when executed, may include the steps of the method embodiments described above. The storage medium may be a non-volatile storage medium such as a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).
It should be understood that, although the steps in the flowcharts of the figures are shown in the order indicated by the arrows, they are not necessarily performed in that order. Unless explicitly stated herein, the order of these steps is not strictly limited, and they may be performed in other orders. Moreover, at least some of the steps in the flowcharts may include a plurality of sub-steps or stages that are not necessarily performed at the same moment but may be performed at different moments, and their order of execution is not necessarily sequential; they may be performed in turn or alternately with other steps or with at least some of the sub-steps or stages of other steps.
With further reference to FIG. 3, as an implementation of the method shown in FIG. 2, the present application provides an embodiment of a speech recognition device, which corresponds to the method embodiment shown in FIG. 2 and is applicable to various electronic devices.
As shown in FIG. 3, the speech recognition device 300 according to this embodiment includes a decoding graph matching module 301, an acquisition module 302, a preliminary decoding module 303, a first determining module 304, a second determining module 305, and a target decoding module 306. Wherein:
The decoding graph matching module 301 is configured to match a service decoding graph and a static decoding graph corresponding to speech recognition according to a service scenario, where the static decoding graph is constructed from the service decoding graph and a base decoding graph;
the acquisition module 302 is configured to acquire a speech to be recognized and a client hotword list corresponding to the speech to be recognized;
the preliminary decoding module 303 is configured to decode the speech to be recognized through the service decoding graph to obtain a preliminary decoding result;
the first determining module 304 is configured to, when the client hotword list contains client hotwords, construct a client decoding graph from the client hotwords in the list, construct a fusion decoding graph from the client decoding graph and the static decoding graph, and take the fusion decoding graph as the target decoding graph;
the second determining module 305 is configured to take the static decoding graph as the target decoding graph if the client hotword list does not contain client hotwords; and
the target decoding module 306 is configured to decode the preliminary decoding result through the target decoding graph to obtain a target decoding result.
In the present application, the speech to be recognized is first decoded through the service decoding graph, so that the preliminary decoding result matches the client corpus of the current service scenario, improving recognition accuracy and efficiency. A target decoding graph is then determined according to whether the client hotword list contains client hotwords, so that the final target decoding result matches the client's speaking habits, further improving accuracy. Meanwhile, thanks to the base decoding graph, the client hotword list remains a lightweight word list, and whether a fusion decoding graph is built is decided by whether the list contains client hotwords, which flexibly adapts to the usage scenario and reduces the impact on recognition efficiency.
In some optional implementations of this embodiment, the device further includes a type acquisition module and a third determining module. Wherein:
the type acquisition module is configured to acquire the fusion type of the service decoding graph and the base decoding graph; and
the third determining module is configured to determine the specific expression of the static decoding graph according to the fusion type.
In some optional implementations of this embodiment, the third determining module includes a first determining sub-module and a second determining sub-module, where:
the first determining sub-module is configured to, if the fusion type is the linear fusion type, set the specific expression of the static decoding graph to C(s_G(w|H), s_B(w|H)) = α1*s_G(w|H) + β1*s_B(w|H); and
the second determining sub-module is configured to, if the fusion type is the exponential linear (log-linear) fusion type, set the specific expression of the static decoding graph to C(s_G(w|H), s_B(w|H)) = -log(α1*exp(-s_G(w|H)) + β1*exp(-s_B(w|H))).
In the specific expression of the static decoding graph, α1 and β1 are variables, s_G(w|H) is the score output by the service decoding graph based on the historical decoding state, and s_B(w|H) is the score output by the base decoding graph based on the historical decoding state.
In some alternative implementations of this embodiment, the target decoding module 306 includes a first decoding sub-module and a second decoding sub-module. Wherein:
the first decoding sub-module is configured to, if the target decoding graph is the fusion decoding graph, decode the preliminary decoding result through the first formula of the target decoding graph, s(w|H) = -log(α2*exp(-C(s_G(w|H), s_B(w|H))) + β2*exp(-s_S(w|H))), where α2 and β2 are variables and s_S(w|H) is the score output by the client decoding graph based on the historical decoding state; and
the second decoding sub-module is configured to, if the target decoding graph is the static decoding graph, decode the preliminary decoding result through the second formula of the target decoding graph, s(w|H) = s_G(w|H) if (w|H) ∉ B and s(w|H) = C(s_G(w|H), s_B(w|H)) if (w|H) ∈ B, where s_G(w|H) is the score output by the service decoding graph based on the historical decoding state.
In the first formula and the second formula of the target decoding graph, s(w|H) is the target decoding result, and C(s_G(w|H), s_B(w|H)) is the score output by the static decoding graph based on the historical decoding state.
In some optional implementations of this embodiment, the preliminary decoding module 303 includes a feature extraction sub-module, a sequence conversion sub-module, and a sequence decoding sub-module. Wherein:
the feature extraction submodule is used for extracting audio features from the voice to be recognized;
a sequence conversion sub-module for converting the audio features into a sequence of phonemes by an acoustic model;
And the sequence decoding submodule is used for decoding the phoneme sequence through the service decoding graph.
In some optional implementations of this embodiment, the device further includes a hotword updating module. Wherein:
the hotword updating module is configured to extract new client hotwords from the target decoding result and update the new client hotwords into the client hotword list.
In some optional implementations of this embodiment, the hotword updating module includes a first updating sub-module and a second updating sub-module. Wherein:
the first updating sub-module is configured to add the new client hotword to the client hotword list if no client hotword matching the new client hotword is found in the list; and
the second updating sub-module is configured to leave the matching client hotword in the client hotword list unmodified if a client hotword matching the new client hotword is found in the list.
To solve the above technical problems, an embodiment of the present application further provides a computer device. Referring specifically to FIG. 4, FIG. 4 is a basic structural block diagram of the computer device according to this embodiment.
The computer device 4 comprises a memory 41, a processor 42, and a network interface 43 that are communicatively connected to each other via a system bus. It should be noted that only the computer device 4 with components 41-43 is shown in the figure, but it should be understood that not all of the illustrated components are required; more or fewer components may be implemented instead. Those skilled in the art will appreciate that the computer device here is a device capable of automatically performing numerical calculation and/or information processing according to preset or stored instructions, and its hardware includes, but is not limited to, a microprocessor, an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a digital signal processor (DSP), an embedded device, and the like.
The computer equipment can be a desktop computer, a notebook computer, a palm computer, a cloud server and other computing equipment. The computer equipment can perform man-machine interaction with a user through a keyboard, a mouse, a remote controller, a touch pad or voice control equipment and the like.
The memory 41 includes at least one type of readable storage medium, including flash memory, hard disk, multimedia card, card-type memory (e.g., SD or DX memory), random access memory (RAM), static random access memory (SRAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), programmable read-only memory (PROM), magnetic memory, magnetic disk, optical disk, and the like. In some embodiments, the memory 41 may be an internal storage unit of the computer device 4, such as a hard disk or memory of the computer device 4. In other embodiments, the memory 41 may also be an external storage device of the computer device 4, such as a plug-in hard disk, a smart media card (SMC), a secure digital (SD) card, or a flash card provided on the computer device 4. Of course, the memory 41 may also comprise both an internal storage unit and an external storage device of the computer device 4. In this embodiment, the memory 41 is typically used to store the operating system and the various kinds of application software installed on the computer device 4, such as the computer-readable instructions of the speech recognition method. Furthermore, the memory 41 may be used to temporarily store various types of data that have been output or are to be output.
The processor 42 may, in some embodiments, be a central processing unit (CPU), a controller, a microcontroller, a microprocessor, or another data processing chip. The processor 42 is typically used to control the overall operation of the computer device 4. In this embodiment, the processor 42 is configured to run the computer-readable instructions stored in the memory 41 or to process data, for example to run the computer-readable instructions of the speech recognition method.
The network interface 43 may comprise a wireless network interface or a wired network interface, which network interface 43 is typically used for establishing a communication connection between the computer device 4 and other electronic devices.
In the present application, the speech to be recognized is first decoded through the service decoding graph, so that the preliminary decoding result matches the client corpus of the current service scenario, improving recognition accuracy and efficiency. A target decoding graph is then determined according to whether the client hotword list contains client hotwords, so that the final target decoding result matches the client's speaking habits, further improving accuracy. Meanwhile, thanks to the base decoding graph, the client hotword list remains a lightweight word list, and whether a fusion decoding graph is built is decided by whether the list contains client hotwords, which flexibly adapts to the usage scenario and reduces the impact on recognition efficiency.
The present application also provides another embodiment, namely, a computer-readable storage medium storing computer-readable instructions executable by at least one processor to cause the at least one processor to perform the steps of the speech recognition method as described above.
In the present application, the speech to be recognized is first decoded through the service decoding graph, so that the preliminary decoding result matches the client corpus of the current service scenario, improving recognition accuracy and efficiency. A target decoding graph is then determined according to whether the client hotword list contains client hotwords, so that the final target decoding result matches the client's speaking habits, further improving accuracy. Meanwhile, thanks to the base decoding graph, the client hotword list remains a lightweight word list, and whether a fusion decoding graph is built is decided by whether the list contains client hotwords, which flexibly adapts to the usage scenario and reduces the impact on recognition efficiency.
From the above description of the embodiments, it will be clear to those skilled in the art that the methods of the above embodiments may be implemented by means of software plus a necessary general-purpose hardware platform, or of course by hardware, although in many cases the former is preferred. Based on such an understanding, the technical solution of the present application, in essence or in the part contributing to the prior art, may be embodied in the form of a software product stored in a storage medium (e.g., ROM/RAM, magnetic disk, optical disk) that includes instructions for causing a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, a network device, or the like) to perform the methods of the embodiments of the present application.
It is apparent that the embodiments described above are only some of the embodiments of the present application, not all of them, and the preferred embodiments shown in the drawings do not limit the scope of the claims. The present application may be embodied in many different forms; these embodiments are provided so that the disclosure of the present application will be thorough and complete. Although the application has been described in detail with reference to the foregoing embodiments, those skilled in the art may still modify the technical solutions described in the foregoing embodiments or substitute equivalents for some of their features. Any equivalent structure made using the contents of the specification and drawings of the present application, whether applied directly or indirectly in other related technical fields, likewise falls within the scope of protection of the present application.