Movatterモバイル変換


[0]ホーム

URL:


CN112669838A - Intelligent sound box audio playing method and device, electronic equipment and storage medium - Google Patents

Intelligent sound box audio playing method and device, electronic equipment and storage medium
Download PDF

Info

Publication number
CN112669838A
CN112669838ACN202011495657.1ACN202011495657ACN112669838ACN 112669838 ACN112669838 ACN 112669838ACN 202011495657 ACN202011495657 ACN 202011495657ACN 112669838 ACN112669838 ACN 112669838A
Authority
CN
China
Prior art keywords
playback
smart speaker
audio
audio stream
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011495657.1A
Other languages
Chinese (zh)
Inventor
彭媛
操灿
方律
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hefei Feier Intelligent Technology Co ltd
Original Assignee
Hefei Feier Intelligent Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hefei Feier Intelligent Technology Co ltdfiledCriticalHefei Feier Intelligent Technology Co ltd
Priority to CN202011495657.1ApriorityCriticalpatent/CN112669838A/en
Publication of CN112669838ApublicationCriticalpatent/CN112669838A/en
Pendinglegal-statusCriticalCurrent

Links

Images

Landscapes

Abstract

Translated fromChinese

本发明提供了一种智能音箱音频播放方法,包括A:智能音箱接收到唤醒语音,切换到唤醒状态,唤醒状态下,如果智能音箱处于播放状态,则降低智能音箱输出音量;B:智能音箱接收播放控制语音,基于播放控制语音提取播放关键词,生成播放请求;C:基于播放请求访问音频流服务器,依次从音频流服务器获取对应的音频流数据;D:智能音箱播放收到的音频流数据。本发明的优点在于:基于唤醒语音调控播放音量,主动降低播放音量以方便接收用户后续语音指令,解决了自身播放振动对接收控制指令的影响;通过统一端口访问不同服务器,防止因版权问题影响用户使用,提高满意度;只需要对关键词进行检测,从音频流服务器层面滤除错误数据。

Figure 202011495657

The present invention provides an audio playback method of a smart speaker, including A: the smart speaker receives a wake-up voice, switches to a wake-up state, and in the wake-up state, if the smart speaker is in a playback state, the output volume of the smart speaker is reduced; B: the smart speaker receives Play control voice, extract play keywords based on the play control voice, and generate a play request; C: Access the audio stream server based on the play request, and sequentially obtain the corresponding audio stream data from the audio stream server; D: The smart speaker plays the received audio stream data . The advantages of the present invention are: based on the wake-up voice to control the playback volume, actively reduce the playback volume to facilitate the reception of the user's subsequent voice commands, and solve the influence of self-play vibration on receiving control commands; access different servers through a unified port to prevent copyright issues from affecting users Use, improve satisfaction; only need to detect keywords, filter out erroneous data from the audio streaming server level.

Figure 202011495657

Description

Intelligent sound box audio playing method and device, electronic equipment and storage medium
Technical Field
The invention relates to the technical field of intelligent sound boxes, in particular to an intelligent sound box audio playing method and device, electronic equipment and a storage medium.
Background
The intelligent sound box is a novel intelligent electronic product, has a voice input function and a remote voice acquisition function, is fixed on a PCB (printed circuit board) inside the sound box by arranging MEMS microphones with small models according to a certain array mode, and enables the sound box to perform fine acquisition on voices from all angles and directions. However, in the actual use process, the speaker of the sound box makes a sound, so that the whole sound box and the PCB inside the sound box are in a vibration state, the invention patent application with publication number CN107134286A discloses a wireless audio playing method based on voice interaction, a music player and a storage medium, the intelligent music player receives the control voice of a user, the high-power sound box is connected through wireless communication for playing, the receiving and playing devices are separately arranged, the influence of the playing audio on the voice instruction recognition effect is reduced, but this method does not solve the problem of how to eliminate the influence of the vibration of the PCB panel on the received voice signal when the audio player is used to play audio, in this case, the low-power audio player can only be used as a controller, and cannot be used independently as a player, which limits the use of the device.
Disclosure of Invention
The invention aims to solve the technical problem of providing an audio playing method of an intelligent sound box, which can eliminate the influence of playing audio on the audio receiving effect.
The invention solves the technical problems through the following technical scheme: an audio playing method for an intelligent sound box comprises the following steps:
step A: the intelligent sound box receives the awakening voice and switches to an awakening state, and in the awakening state, if the intelligent sound box is in a playing state, the output volume of the intelligent sound box is reduced;
and B: the intelligent sound box receives the playing control voice, extracts playing keywords based on the playing control voice and generates a playing request;
and C: accessing an audio stream server based on the playing request, and sequentially acquiring corresponding audio stream data from the audio stream server;
step D: and the intelligent sound box plays the received audio stream data.
According to the method, the intelligent sound box is awakened based on the awakening voice, when the intelligent sound box plays audio, the playing volume can be actively reduced after the intelligent sound box is awakened so as to conveniently receive a subsequent voice instruction of a user, the recognition precision is improved, and the influence of self playing vibration on receiving a control instruction is solved, so that the intelligent sound box can effectively serve as a controller and a player, and the user experience is improved; the control command is analyzed in the intelligent loudspeaker box to obtain a playing request, the corresponding server is accessed, different audio stream servers can be accessed based on the audio information specified by the user, the problem that the single audio stream server cannot meet the user requirements due to copyright and other problems is solved, control is performed through a unified port, use is convenient, and user experience is better.
Preferably, the awakening voice content is an identification name of the intelligent sound box, and the awakening voice can switch the intelligent sound box from a dormant state to an awakening state; the user does not perform any operation or input any effective voice command within the preset time period, and the intelligent sound box enters a dormant state.
Preferably, in step B, after receiving the play control voice, the smart speaker obtains text information corresponding to the play control voice through parsing, obtains the play keyword according to the text information parsing, and obtains a corresponding audio stream server address according to the play keyword, where the play request includes the play keyword, the audio stream server address, and a play sequence.
Preferably, the playing sequence includes sequential playing, random playing, circular playing, and single-track playing; when the text information does not include the playing sequence, the playing is sequentially performed based on the total playing times, sequentially performed based on the preference of the user, or randomly performed.
Preferably, the method for analyzing the playing keywords is to analyze and match the text information by using a preset playing keyword library, wherein the playing keyword library comprises one or more combinations of audio stream file names, singers, song writers, categories, regions, years and genders.
Preferably, the smart speaker analyzes the relationship between the preset playing keyword and the audio stream server address to obtain an audio stream server address corresponding to the playing keyword, when the smart speaker plays the audio stream data, the smart speaker further caches the next audio stream data to be played from the audio stream server, and after the current audio stream data is played, the smart speaker plays the cached data.
Preferably, if the playing keyword cannot be analyzed from the text information, the playing keyword is randomly generated based on the historical playing data of the user, or the playing keywords are sequentially generated from high to low according to the playing times based on the historical playing data.
Preferably, before the smart sound box receives the play control voice, the smart sound box further comprises a step of communicating with a user terminal to perform networking, the user terminal is connected with the smart sound box through a Bluetooth, the user terminal selects to connect with a wifi network, an account is input by the user terminal or an account of the smart sound box device is used to log in an audio streaming server, and historical play data of the account is obtained, wherein the historical play data comprises audio streaming information corresponding to keywords in a play keyword library, and the address of the audio streaming server and the playing frequency of the audio streaming.
Preferably, if the smart sound box receives the operation control request voice after the step a, analyzing the operation control request to obtain a control keyword, and executing a corresponding command; the control keywords comprise pause, start, previous, next, volume increase and volume decrease.
Preferably, the smart sound box is a double-loudspeaker or multi-loudspeaker Bluetooth sound box.
The invention also provides an audio playing method of the intelligent sound box, which comprises the steps of
Step i: a user speaks a wake-up voice of the intelligent sound box;
step ii: the intelligent sound box receives the awakening voice, switches to an awakening state and sends prompt information, and in the awakening state, if the intelligent sound box is in a playing state, the output volume of the intelligent sound box is reduced;
step iii: after the intelligent sound box sends out the prompt message, the user speaks a play control voice which the intelligent sound box wants to execute;
step iv: the intelligent sound box receives the playing control voice, extracts playing keywords based on the playing control voice, analyzes an address of the audio streaming server based on the playing keywords and generates a playing request;
step v: the intelligent sound box accesses a corresponding audio streaming server based on the playing request;
step vi: the audio streaming server responds to the playing request and sequentially returns the searched audio streaming data to the intelligent sound box based on the playing sequence in the playing request;
step vii: and the intelligent sound box plays the received audio stream data.
The invention also provides an intelligent sound box audio playing device, which comprises
A wake-up module: used for receiving the awakening voice and switching to an awakening state, and in the awakening state, if the intelligent sound box is in a playing state, the output volume of the intelligent sound box is reduced
A play request generation module: the system is used for receiving the playing control voice and analyzing to generate a playing request;
a play request sending module: the audio streaming server is used for sending the playing request to the audio streaming server obtained by analysis;
an audio stream data receiving module: the system comprises a server, a server and a server, wherein the server is used for receiving audio stream data returned by the audio stream server;
a playing module: and playing the audio stream data through the intelligent sound box.
The invention also provides an electronic device comprising a memory and a processor, wherein the memory is used for storing one or more computer instructions, and the one or more computer instructions are executed by the processor to realize the playing method.
The invention also provides a readable storage medium, which stores computer instructions, and the computer instructions can realize the audio playing method when being executed by a processor.
The intelligent sound box audio playing method, the intelligent sound box audio playing device, the electronic equipment and the storage medium have the advantages that: the intelligent sound box is awakened based on the awakening voice, when the intelligent sound box plays audio, the play volume can be actively reduced after the intelligent sound box is awakened so as to conveniently receive a subsequent voice instruction of a user, the recognition precision is improved, and the influence of self-play vibration on receiving a control instruction is solved, so that the intelligent sound box can effectively serve as a controller and a player, and the user experience is improved; the control command is analyzed in the intelligent loudspeaker box to obtain a playing request, the corresponding server is accessed, different audio stream servers can be accessed based on the audio information specified by the user, the problem that the single audio stream server cannot meet the user requirements due to copyright and other problems is solved, control is performed through a unified port, use is convenient, and user experience is better. All playing data can be stored on the account of each audio streaming server and can be stored on the intelligent sound box in a unified mode, so that unified management and control over different music players can be achieved, great convenience is brought to users, user satisfaction is improved, historical data of the users on different audio streaming servers can be fused, user preference is better analyzed, user instructions can be understood more intelligently, and user satisfaction is improved. The playing keyword is extracted from the playing control voice to obtain the playing request, so that only the keyword needs to be detected, the voice recognition difficulty is reduced, the corresponding audio streaming server is accessed to obtain data, the error data is filtered from the audio streaming server, the user experience is improved, and the audio is played based on the data fed back by the audio streaming server.
Drawings
Fig. 1 is a flowchart of an audio playing method for a smart sound box according to an embodiment of the present invention.
Fig. 2 is a flowchart of generating a play request by an audio playing method for a smart sound box according to an embodiment of the present invention;
fig. 3 is a flowchart illustrating a method for playing an audio of an intelligent sound box according to an embodiment of the present invention;
fig. 4 is a flowchart of an audio streaming server corresponding to the audio playing method for an intelligent sound box according to an embodiment of the present invention;
fig. 5 is a flowchart illustrating a method for playing an audio of an intelligent sound box according to an embodiment of the present invention to analyze a control keyword;
fig. 6 is a flowchart illustrating a method for playing audio of a smart sound box according to an embodiment of the present invention;
fig. 7 is a block diagram of an audio playing apparatus of a smart speaker according to a third embodiment of the present invention;
fig. 8 is a block diagram of a playing request generating module of an audio playing device of an intelligent speaker according to a third embodiment of the present invention;
fig. 9 is a structural diagram of an audio playing device of an intelligent sound box according to a third embodiment of the present invention.
Detailed Description
To make the objects, technical solutions and advantages of the present invention more apparent, the technical solutions of the present invention are described below in detail and completely with reference to the accompanying drawings, and it is apparent that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example one
As shown in fig. 1, the present embodiment provides an audio playing method for a smart sound box, including the following steps:
step A: the intelligent sound box receives the awakening voice and switches to an awakening state, and in the awakening state, if the intelligent sound box is in a playing state, the output volume of the intelligent sound box is reduced;
this embodiment awakens up intelligent audio amplifier based on awakening up pronunciation, when intelligent audio amplifier broadcast audio frequency, can initiatively reduce the broadcast volume after awakening up in order to conveniently receive the follow-up voice command of user, improves the discernment precision, has solved self broadcast vibration to the influence of receiving control command to make intelligent audio amplifier can effectually act as controller and player, promote user experience.
The content of the awakening voice is the identification name of the intelligent sound box and can be defined by a user, the awakening voice can switch the intelligent sound box from a dormant state to an awakening state, the intelligent sound box can receive the control voice of the user in the awakening state, when the intelligent sound box is in the awakening state, the user does not perform any operation or input any effective voice instruction in a preset time period, and the intelligent sound box automatically enters the dormant state; in order to distinguish from most of the names or nicknames, the content of the wake-up voice is generally a phrase of four or more words.
And B: the intelligent sound box receives the playing control voice, extracts playing keywords based on the playing control voice and generates a playing request;
referring to fig. 2, after receiving the play control voice, the smart speaker sends the play control voice to the smart speaker, the smart speaker obtains corresponding text information through parsing, obtains the play keyword according to the text information, the smart speaker obtains a corresponding audio streaming server address according to the play keyword, and the play request includes the play keyword, the audio streaming server address and a play sequence;
referring to fig. 3, the method for parsing the broadcast keyword is to parse and match the text information by using a preset broadcast keyword library, where the broadcast keyword library includes one or more combinations of audio stream file name, singer, song writer, genre, region, year, and gender.
For example, if the user says "i want to listen to the song of zhou jilun", the play keyword is "play-zhou jilun"; when the user says 'Guo Dege', the playing keyword is analyzed to be 'playing-Guo Dege'; the playing keywords can be further refined to reduce the search noise, for example, for "Zhou Jilun", the resolution is "playing-Zhou Jilun-popular music"; for "Guo Dege", it is analyzed as "Play-Guo Dege-Xiang"; therefore, key information can be extracted quickly, phrases which cannot be identified can be filtered automatically under the condition that a user expresses relatively complicated words or habitual slogan, and finally subsequent steps are executed only on the basis of analyzed playing keywords.
And the intelligent sound box analyzes the relation between the preset playing keyword and the audio streaming server address to obtain the audio streaming server address corresponding to the playing keyword.
For example, for the playing keywords "play-zhou jieren-popular music" and "play-guo german-phase", the smart speaker searches for "zhou jieren" using a single or multiple audio streaming servers such as QQ music or internet cloud music based on the classification result of the audio streams, and for the phase sound category, the smart speaker accesses himalaya or other audio streaming servers for searching; when the type of the audio streaming server is determined, information such as copyright can be considered according to professional categories of different audio streaming servers, for example, when the copyright of the zhou jieren music belongs to the QQ music, and when the keyword for analyzing the playing is "play-zhou jieren-popular music", the address of the audio streaming server determined by the smart speaker is the server address of the QQ music.
The playing sequence comprises sequential playing, random playing, circular playing and single-track playing; the intelligent sound box preferentially extracts a playing sequence based on the playing control voice of the user, and if the user says ' random playing of Zhou Jie Lun ' song ', the analyzed playing key word is ' playing-Zhou Jie Lun-QQ music-random playing '; and when the text information does not comprise the playing sequence, playing the text information in sequence based on the total playing times, playing the text information in sequence based on the preference of the user or randomly playing the text information, wherein the preference of the user is obtained by arranging all audio stream data listened to by different audio stream servers according to the descending order of the playing times of the user, and the more the playing times, the higher the preference of the user is.
If the playing keywords can not be analyzed from the text information, the playing keywords are randomly generated based on the historical playing data of the user, or the playing keywords are sequentially generated from high to low according to the historical playing data. For example, if the user says "i want to listen to a song", no specific song, author, or genre information is found, and the songs played by the user with the highest frequency are played in turn based on the user's historical preference.
This embodiment accomplishes the analysis of broadcast keyword in intelligent audio amplifier to further analysis confirms audio stream server address and broadcast order, can confirm corresponding music provider based on the music copyright is automatic from this, and search on the audio stream server that corresponds, thereby solve the problem that single server part audio stream data copyright lacks, can satisfy user's demand, and the simple operation, the user only need be through unified interface, promptly the intelligent audio amplifier control can, all broadcast data except can keeping on the account number of each audio stream server, can also unified save on intelligent audio amplifier, can realize carrying out unified management and control to different music players from this, the very big user that has facilitated, promote user satisfaction.
And C: accessing an audio stream server based on the playing request, and sequentially acquiring corresponding audio stream data from the audio stream server; the intelligent sound box accesses and searches the corresponding audio streaming server based on the analyzed playing request, and returns the search results to the intelligent sound box, and the intelligent sound box sequentially returns the playing sequence obtained based on the analysis to the intelligent sound box under the condition that a plurality of search results exist;
for the audio streaming server, referring to fig. 4, the working method is as follows: the audio streaming server responds to the playing request, and sends corresponding audio streaming data to the intelligent sound box, wherein the playing request at least comprises: audio streaming server address, playing keyword and playing sequence; after the audio streaming server is connected, searching corresponding audio streaming data on the audio streaming server according to the playing keywords; sequentially sending the searched audio stream data to the intelligent sound boxes based on the playing sequence contained in the playing request;
when the intelligent sound box plays the searched audio stream data, the intelligent sound box also caches the next audio stream data to be played from the audio stream server, and after the current audio stream data is played, the intelligent sound box plays the cached data; and if the intelligent sound box receives other playing control voices again before the current audio stream data is played, executing the steps based on the latest playing control voice.
Step D: through intelligent audio amplifier broadcast audio stream data, in order to improve broadcast tone quality, obtain abundanter high bass effect, intelligent audio amplifier chooses for use two loudspeaker or many loudspeaker bluetooth speaker.
Further, referring to fig. 5, if the smart speaker receives the play control voice in step B and then parses out a control keyword, corresponding control operations are executed, where the control keyword includes pause, start, previous, next, volume up, and volume down.
If the intelligent sound box is awakened when in a playing state, in the step B, the subsequent steps are executed according to the playing control voice, the audio stream server is accessed to obtain new audio stream data, the new audio stream data is played, the audio stream data cached in the previous task is discarded, and the audio stream data to be played is cached in the current task again.
Referring to fig. 6, the above playing method requires accessing an audio streaming server based on a network, where the smart speaker further includes a step of communicating with a user terminal to perform networking before receiving a playing control voice, where the user terminal is connected to the smart speaker through bluetooth, and selects to connect to a wifi network through the user terminal, and inputs an account number using the user terminal or logs in the audio streaming server using an account number of the smart speaker device, so as to obtain historical playing data of the account number, where the historical playing data includes audio streaming information corresponding to keywords in a playing keyword library, an address of the audio streaming server, and playing times of the audio streaming; therefore, historical data of users in different audio streaming servers can be centralized in the intelligent loudspeaker box, and unified control is convenient to carry out.
Example two
The audio playing method provided by the embodiment comprises the following steps:
an intelligent sound box audio playing method comprises
Step i: a user speaks a wake-up voice of the intelligent sound box;
step ii: the intelligent sound box receives the awakening voice, switches to an awakening state and sends prompt information, and in the awakening state, if the intelligent sound box is in a playing state, the output volume of the intelligent sound box is reduced;
step iii: after the intelligent sound box sends out the prompt message, the user speaks a play control voice which the intelligent sound box wants to execute;
step iv: the intelligent sound box receives the playing control voice, extracts playing keywords based on the playing control voice, analyzes an address of the audio streaming server based on the playing keywords and generates a playing request;
step v: the intelligent sound box accesses a corresponding audio streaming server based on the playing request;
step vi: the audio streaming server responds to the playing request and sequentially returns the searched audio streaming data to the intelligent sound box based on the playing sequence in the playing request;
step vii: and the intelligent sound box plays the received audio stream data.
EXAMPLE III
Referring to fig. 7, based on the above audio playing method, this embodiment further provides an audio playing apparatus for an intelligent speaker, including:
a wake-up module: the intelligent sound box is used for receiving the awakening voice and switching to an awakening state, and in the awakening state, if the intelligent sound box is in a playing state, the output volume of the intelligent sound box is reduced;
a play request generation module: the system is used for receiving the playing control voice and analyzing to generate a playing request;
a play request sending module: the audio streaming server is used for sending the playing request to the audio streaming server obtained by analysis;
an audio stream data receiving module: the system comprises a server, a server and a server, wherein the server is used for receiving audio stream data returned by the audio stream server;
a playing module: and playing the audio stream data through the intelligent sound box.
Referring to fig. 8, the play request generation module includes
A voice receiving and processing unit: the system comprises a voice processing module, a voice processing module and a voice processing module, wherein the voice processing module is used for receiving voice information in a user environment and performing noise reduction processing and echo cancellation processing on the received voice information; the echo cancellation aims to remove background sound played by the intelligent sound box, so that the identification capability of the control instruction is further improved, and the user experience is improved;
an offline speech recognition unit: the voice recognition device is used for performing offline voice recognition on the voice information processed by the voice receiving and processing unit, wherein the range of the offline voice recognition comprises recognition of playing control keywords, and the control keywords comprise pause, start, previous, next, volume plus and volume minus;
a voice transmission and analysis unit: the voice cloud interface is used for sending the processed voice information to the server through the voice cloud interface of the intelligent sound box, analyzing text information corresponding to the playing control voice through the server, and analyzing playing keywords, an audio streaming server address and an audio playing sequence according to the text information;
a play request generation unit: the method is used for fusing the playing keywords, the audio streaming server address and the audio playing sequence to generate a playing request.
The wake-up module may obtain the wake-up voice based on the voice receiving and processing unit of the play request generation module.
Referring to fig. 9, the audio playing device of the smart speaker further includes
A networking module: the system is used for connecting a wifi network;
a feedback module: the system is used for acquiring and feeding back historical data of a user in the intelligent loudspeaker box and each audio streaming server;
a buffer module: and the audio stream data is stored in the buffer area.
Example four
The present embodiments also provide an electronic device comprising a memory and a processor, the memory for storing one or more computer instructions executable by the processor for performing a method comprising:
step A: the intelligent sound box receives the awakening voice and switches to an awakening state, and in the awakening state, if the intelligent sound box is in a playing state, the output volume of the intelligent sound box is reduced;
and B: the intelligent sound box receives the playing control voice, extracts playing keywords based on the playing control voice and generates a playing request;
and C: accessing an audio stream server based on the playing request, and sequentially acquiring corresponding audio stream data from the audio stream server;
step D: and the intelligent sound box plays the received audio stream data.
EXAMPLE five
The present embodiments also provide a readable storage medium storing computer instructions that, when executed by a processor, are capable of performing the following method:
step A: the intelligent sound box receives the awakening voice and switches to an awakening state, and in the awakening state, if the intelligent sound box is in a playing state, the output volume of the intelligent sound box is reduced;
and B: the intelligent sound box receives the playing control voice, extracts playing keywords based on the playing control voice and generates a playing request;
and C: accessing an audio stream server based on the playing request, and sequentially acquiring corresponding audio stream data from the audio stream server;
step D: and the intelligent sound box plays the received audio stream data.
The above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (14)

Translated fromChinese
1.一种智能音箱音频播放方法,其特征在于:包括以下步骤:1. a smart speaker audio playback method, characterized in that: comprising the following steps:步骤A:智能音箱接收到唤醒语音,切换到唤醒状态,唤醒状态下,如果智能音箱处于播放状态,则降低智能音箱输出音量;Step A: The smart speaker receives the wake-up voice and switches to the wake-up state. In the wake-up state, if the smart speaker is in the playback state, reduce the output volume of the smart speaker;步骤B:智能音箱接收播放控制语音,基于播放控制语音提取播放关键词,生成播放请求;Step B: the smart speaker receives the playback control voice, extracts playback keywords based on the playback control voice, and generates a playback request;步骤C:基于播放请求访问音频流服务器,依次从音频流服务器获取对应的音频流数据;Step C: access the audio stream server based on the playback request, and sequentially obtain corresponding audio stream data from the audio stream server;步骤D:智能音箱播放收到的音频流数据。Step D: The smart speaker plays the received audio stream data.2.根据权利要求1所述的一种智能音箱音频播放方法,其特征在于:所述唤醒语音内容为智能音箱的标识名,唤醒语音能够将智能音箱从休眠状态切换至唤醒状态;用户在预设时间段内未进行任何操作或输入任何有效语音指令,智能音箱进入休眠状态。2. The audio playback method of a smart speaker according to claim 1, wherein the content of the wake-up voice is the identification name of the smart speaker, and the wake-up voice can switch the smart speaker from the dormant state to the wake-up state; If no operation is performed or any valid voice command is input within the set time period, the smart speaker will enter the sleep state.3.根据权利要求1所述的一种智能音箱音频播放方法,其特征在于:步骤B中智能音箱接收到播放控制语音后,通过解析得到播放控制语音对应的文本信息,根据文本信息解析得到所述播放关键词,智能音箱根据播放关键词解析出对应的音频流服务器地址,所述播放请求包括播放关键词、音频流服务器地址和播放顺序。3. The audio playback method of a smart speaker according to claim 1, characterized in that: after the smart speaker receives the playback control voice in step B, the text information corresponding to the playback control voice is obtained by parsing, and the text information is parsed and obtained according to the text information. The playback keyword is described, and the smart speaker parses the corresponding audio stream server address according to the playback keyword, and the playback request includes the playback keyword, the audio stream server address, and the playback sequence.4.根据权利要求3所述的一种智能音箱音频播放方法,其特征在于:所述播放顺序包括顺序播放、随机播放、循环播放、单曲播放;在文本信息未包括播放顺序时,基于总播放次数顺序播放、基于用户喜好顺序播放或随机播放。4. The audio playback method of a smart speaker according to claim 3, wherein the playback order includes sequential playback, random playback, loop playback, and single playback; when the text information does not include the playback order, based on total Play in order by the number of plays, in order based on user preference, or in random order.5.根据权利要求3所述的一种智能音箱音频播放方法,其特征在于:解析播放关键词的方法为利用预设的播放关键词库对文本信息进行解析和匹配,所述播放关键词库包括音频流文件名、演唱者、曲作者、类别、地区、年代、性别中的一种或多种组合。5. The audio playback method of a smart speaker according to claim 3, wherein the method for analyzing playback keywords is to parse and match text information by using a preset playback keyword library, and the playback keyword library Include one or more combinations of audio stream file name, artist, composer, category, region, era, and gender.6.根据权利要求3所述的一种智能音箱音频播放方法,其特征在于:所述智能音箱基于预设的播放关键词与音频流服务器地址的关系解析得到播放关键词对应的音频流服务器地址,智能音箱播放音频流数据时,所述智能音箱还从音频流服务器缓存下一首待播放音频流数据,当前音频流数据播放完成后,智能音箱播放缓存数据。6. The audio playback method of a smart speaker according to claim 3, wherein the smart speaker obtains the audio stream server address corresponding to the playback keyword based on the relationship between the preset playback keyword and the audio stream server address. , when the smart speaker plays the audio stream data, the smart speaker also caches the next audio stream data to be played from the audio stream server. After the current audio stream data is played, the smart speaker plays the cached data.7.根据权利要求5所述的一种智能音箱音频播放方法,其特征在于:如果无法从文本信息中解析出播放关键词,则基于用户的历史播放数据随机生成播放关键词,或者基于历史播放数据根据播放次数从高到低依次生成播放关键词。7. a kind of smart speaker audio playback method according to claim 5, is characterized in that: if can't be parsed from the text information to play the key word, then randomly generate the play key word based on the user's historical play data, or based on the historical play key The data generates playback keywords in sequence from high to low according to the number of playbacks.8.根据权利要求1所述的一种智能音箱音频播放方法,其特征在于:智能音箱在接收播放控制语音之前,还包括与用户终端通信进行联网的步骤,用户终端通过蓝牙连接智能音箱,通过用户终端选择连接wifi网络,使用用户终端输入账号或者使用智能音箱设备账号登录音频流服务器,获取账号的历史播放数据,所述历史播放数据包括与播放关键词库中的关键词对应的音频流信息,以及音频流服务器地址和音频流播放次数。8. The audio playback method of a smart speaker according to claim 1, wherein the smart speaker further comprises a step of communicating with a user terminal for networking before receiving the playback control voice, the user terminal is connected to the smart speaker through Bluetooth, and the user terminal is connected to the smart speaker through Bluetooth. The user terminal chooses to connect to the wifi network, uses the user terminal to input an account or uses the smart speaker device account to log in to the audio streaming server, and obtains historical playback data of the account, where the historical playback data includes audio stream information corresponding to the keywords in the playback keyword database , and the address of the audio stream server and the number of times the audio stream is played.9.根据权利要求1所述的一种智能音箱音频播放方法,其特征在于:如果步骤B中的播放控制语音解析出控制关键词,则执行对应的控制操作,所述控制关键词包括暂停、开始、上一个、下一个、上一首、下一首、音量加、音量减。9. The audio playback method of a smart speaker according to claim 1, wherein: if the playback control voice in step B parses out control keywords, then execute corresponding control operations, and the control keywords include pause, Start, Previous, Next, Previous, Next, Volume Up, Volume Down.10.根据权利要求1所述的一种智能音箱音频播放方法,其特征在于:所述智能音箱为双喇叭或多喇叭蓝牙音箱。10 . The audio playback method of a smart speaker according to claim 1 , wherein the smart speaker is a dual-speaker or multi-speaker Bluetooth speaker. 11 .11.一种智能音箱音频播放方法,其特征在于:包括11. A smart speaker audio playback method, characterized in that: comprising:步骤i:用户说出智能音箱的唤醒语音;Step i: The user speaks the wake-up voice of the smart speaker;步骤ii:智能音箱接收到唤醒语音,切换到唤醒状态,并发出提示信息,唤醒状态下,如果智能音箱处于播放状态,则降低智能音箱输出音量;Step ii: The smart speaker receives the wake-up voice, switches to the wake-up state, and sends out a prompt message. In the wake-up state, if the smart speaker is in the playback state, reduce the output volume of the smart speaker;步骤iii:智能音箱发出提示信息后,用户说出希望智能音箱执行的播放控制语音;Step iii: After the smart speaker sends a prompt message, the user speaks the playback control voice that the smart speaker wants to perform;步骤iv:智能音箱接收播放控制语音,基于播放控制语音提取播放关键词,并基于播放关键词解析音频流服务器地址,生成播放请求;Step iv: the smart speaker receives the playback control voice, extracts playback keywords based on the playback control voice, and parses the audio stream server address based on the playback keywords to generate a playback request;步骤v:智能音箱基于播放请求访问对应的音频流服务器;Step v: The smart speaker accesses the corresponding audio streaming server based on the playback request;步骤vi:所述音频流服务器响应于播放请求,基于播放请求中的播放顺序,依次将查找到的音频流数据返回智能音箱;Step vi: in response to the playback request, the audio stream server returns the found audio stream data to the smart speaker in turn based on the playback order in the playback request;步骤vii:智能音箱播放收到的音频流数据。Step vii: The smart speaker plays the received audio stream data.12.一种智能音箱音频播放装置,其特征在于:包括12. A smart speaker audio playback device, characterized in that: comprising:唤醒模块:用于接收唤醒语音,切换到唤醒状态,唤醒状态下,如果智能音箱处于播放状态,则降低智能音箱输出音量Wake-up module: used to receive the wake-up voice, switch to the wake-up state, in the wake-up state, if the smart speaker is in the playback state, reduce the output volume of the smart speaker播放请求生成模块:用于接收播放控制语音,解析生成播放请求;Playing request generating module: used to receive playback control voice, parse and generate playback request;播放请求发送模块:用于将播放请求发送给解析得到的音频流服务器;Playing request sending module: used to send the playing request to the parsed audio stream server;音频流数据接收模块:用于接收音频流服务器返回的音频流数据;Audio stream data receiving module: used to receive the audio stream data returned by the audio stream server;播放模块:通过智能音箱播放音频流数据。Playback module: Play audio stream data through smart speakers.13.一种电子设备,包括存储器和处理器,所述存储器用于存储一条或多条计算机指令,其特征在于:所述一条或多条计算机指令被所述处理器执行以实现如权利要求1~10任一项所述的方法。13. An electronic device comprising a memory and a processor, wherein the memory is used to store one or more computer instructions, wherein the one or more computer instructions are executed by the processor to implement the method of claim 1 The method of any one of ~10.14.一种可读存储介质,存储有计算机指令,其特征在于:所述计算机指令被处理器执行时能够实现如权利要求1-10任一项所述的方法。14. A readable storage medium storing computer instructions, wherein when the computer instructions are executed by a processor, the method according to any one of claims 1-10 can be implemented.
CN202011495657.1A2020-12-172020-12-17Intelligent sound box audio playing method and device, electronic equipment and storage mediumPendingCN112669838A (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN202011495657.1ACN112669838A (en)2020-12-172020-12-17Intelligent sound box audio playing method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN202011495657.1ACN112669838A (en)2020-12-172020-12-17Intelligent sound box audio playing method and device, electronic equipment and storage medium

Publications (1)

Publication NumberPublication Date
CN112669838Atrue CN112669838A (en)2021-04-16

Family

ID=75404805

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN202011495657.1APendingCN112669838A (en)2020-12-172020-12-17Intelligent sound box audio playing method and device, electronic equipment and storage medium

Country Status (1)

CountryLink
CN (1)CN112669838A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN113207042A (en)*2021-04-302021-08-03海信视像科技股份有限公司Media asset playing method and display equipment
CN114049877A (en)*2021-11-042022-02-15北京奇天大胜网络科技有限公司Voice digital human-television information interaction method and system based on Internet of things
CN114297471A (en)*2021-11-222022-04-08北京声智科技有限公司Audio playing method and device
CN114613364A (en)*2022-03-282022-06-10东莞中之科技股份有限公司 Sound control method and system based on voice control
CN115454375A (en)*2022-09-222022-12-09星河智联汽车科技有限公司Volume adjusting method, device, equipment and vehicle
CN116684393A (en)*2023-06-012023-09-01中国联合网络通信集团有限公司Audio processing method, terminal, home gateway, computer device and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20090062949A1 (en)*2007-09-032009-03-05Heo U-BeomAudio data player and method of creating playback list thereof
CN108520742A (en)*2018-01-242018-09-11联发科技(新加坡)私人有限公司Improve method, speech recognition equipment and the playing device of phonetic recognization rate
CN110139127A (en)*2019-05-212019-08-16北京声智科技有限公司Audio file play method, server, intelligent sound box and play system
CN111083678A (en)*2018-10-222020-04-28深圳市冠旭电子股份有限公司 Bluetooth speaker playback control method, system and smart device
CN111654782A (en)*2020-06-052020-09-11百度在线网络技术(北京)有限公司Intelligent sound box and signal processing method
CN111949240A (en)*2019-05-162020-11-17阿里巴巴集团控股有限公司Interaction method, storage medium, service program, and device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20090062949A1 (en)*2007-09-032009-03-05Heo U-BeomAudio data player and method of creating playback list thereof
CN108520742A (en)*2018-01-242018-09-11联发科技(新加坡)私人有限公司Improve method, speech recognition equipment and the playing device of phonetic recognization rate
CN111083678A (en)*2018-10-222020-04-28深圳市冠旭电子股份有限公司 Bluetooth speaker playback control method, system and smart device
CN111949240A (en)*2019-05-162020-11-17阿里巴巴集团控股有限公司Interaction method, storage medium, service program, and device
CN110139127A (en)*2019-05-212019-08-16北京声智科技有限公司Audio file play method, server, intelligent sound box and play system
CN111654782A (en)*2020-06-052020-09-11百度在线网络技术(北京)有限公司Intelligent sound box and signal processing method

Cited By (7)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN113207042A (en)*2021-04-302021-08-03海信视像科技股份有限公司Media asset playing method and display equipment
CN114049877A (en)*2021-11-042022-02-15北京奇天大胜网络科技有限公司Voice digital human-television information interaction method and system based on Internet of things
CN114297471A (en)*2021-11-222022-04-08北京声智科技有限公司Audio playing method and device
CN114613364A (en)*2022-03-282022-06-10东莞中之科技股份有限公司 Sound control method and system based on voice control
CN114613364B (en)*2022-03-282022-11-01东莞中之科技股份有限公司Sound control method and system based on voice control
CN115454375A (en)*2022-09-222022-12-09星河智联汽车科技有限公司Volume adjusting method, device, equipment and vehicle
CN116684393A (en)*2023-06-012023-09-01中国联合网络通信集团有限公司Audio processing method, terminal, home gateway, computer device and storage medium

Similar Documents

PublicationPublication DateTitle
CN112669838A (en)Intelligent sound box audio playing method and device, electronic equipment and storage medium
US12301909B2 (en)Server and method for controlling server
US9190052B2 (en)Systems and methods for providing information discovery and retrieval
US20200151212A1 (en)Music recommending method, device, terminal, and storage medium
US11640832B2 (en)Emotion-based voice interaction method, storage medium and terminal device using pitch, fluctuation and tone
KR101309794B1 (en)Display apparatus, method for controlling the display apparatus and interactive system
CN108063969B (en)Display apparatus, method of controlling display apparatus, server, and method of controlling server
CN108882101B (en)Playing control method, device, equipment and storage medium of intelligent sound box
CN107221323A (en) Method for ordering songs by voice, terminal and storage medium
CN108492826B (en)Audio processing method and device, intelligent equipment and medium
US10693944B1 (en)Media-player initialization optimization
CN105278684B (en)A kind of intelligent playing method and device
US11114079B2 (en)Interactive music audition method, apparatus and terminal
CN115396709B (en)Display device, server and wake-up-free voice control method
KR102584324B1 (en)Method for providing of voice recognition service and apparatus thereof
CN112466304B (en) Offline voice interaction method, device, system, equipment and storage medium
CN112786031B (en)Man-machine conversation method and system
CN113196384B (en)Method and system for dynamically inserting supplemental audio content into an audio recording at a requested time
CN114822506A (en) A message broadcasting method, device, mobile terminal and storage medium
CN113948075A (en)Method, system, device and storage medium for man-machine conversation management
CN111506743A (en) A media resource storage method, media resource playback method and related equipment
US11367446B2 (en)Information dissemination system and method thereof
KR102091006B1 (en)Display apparatus and method for controlling the display apparatus
US12267286B1 (en)Sharing of content
KR101576683B1 (en)Method and apparatus for playing audio file comprising history storage

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
RJ01Rejection of invention patent application after publication
RJ01Rejection of invention patent application after publication

Application publication date:20210416


[8]ページ先頭

©2009-2025 Movatter.jp