JP2000259180A

Info

Publication number: JP2000259180A
Application number: JP11059059A
Authority: JP
Inventors: Atsushi Noguchi
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1999-03-05
Filing date: 1999-03-05
Publication date: 2000-09-22

Abstract

(57) [Abstract] [Problem] In a continuous speech text input device, to reduce the risk that speech uttered for text input is misrecognized as an acoustically similar voice command. [Solution] In a continuous speech text input device 10, a voice command history management unit 8 stores the usage frequency of each voice command that the user has used while inputting text by voice. A voice command dictionary management unit 9 monitors the contents stored in the voice command history management unit 8 and, when the voice commands have been used a predetermined number of times in total, checks whether each voice command has reached a predetermined minimum number of uses. Any voice command that has not reached the minimum is deleted from a voice command dictionary 4.

Description

Translated from Japanese

[Detailed Description of the Invention]

[0001]

[Field of the Invention] The present invention relates to a continuous speech text input device and a continuous speech text input method, and more particularly to an improved method of managing the recognition vocabulary for voice commands.

[0002]

[Description of the Related Art] In recent years, continuous speech text input devices that allow computers and other equipment to be operated by voice have been studied. In these devices, to improve operability, commands are entered by voice instead of being typed on a keyboard or selected with a mouse. For example, when the user speaks the command "Delete here", the speech recognition result entered immediately before is deleted.

[0003] An example of a conventional continuous speech text input device is shown in FIG. 10. The conventional continuous speech text input device 100 shown in FIG. 10 comprises: a voice input unit 101 through which the user inputs speech; a speech recognition unit 102 that performs recognition processing on the input speech; a continuous speech text input dictionary 103 that stores the recognition patterns for continuous speech text input used during recognition; a voice command dictionary 104 that stores the recognition patterns for voice commands; a recognition result management unit 105 that determines, based on the recognition result, whether the input speech is speech for continuous text input or speech for a voice command; a recognition result display unit 106 that displays the recognition result when the input speech is speech for continuous text input; and a voice command execution unit 107 that, when the recognition result is a voice command, executes the command corresponding to that predefined voice command.

[0004]

[Problems to Be Solved by the Invention] However, the conventional continuous speech text input device 100 described above has the following problems.

[0005] The first problem is that the performance of recognizing input text deteriorates.

[0006] This is because speech uttered for text input may be misrecognized as an acoustically similar voice command.

[0007] The second problem is that it is difficult to input a character string identical to a voice command prepared in advance.

[0008] This is because, when a character string identical to a voice command is spoken into the continuous speech text input device, the device cannot tell whether the utterance was meant as text input or as command input.

[0009] For example, suppose that the voice command "Delete here" has been prepared. If the user dictates a sentence containing the phrase "Delete here" in continuous speech, the device cannot distinguish whether the utterance "Delete here" is part of the sentence or a voice command.

[0010] Besides the continuous speech text input device shown in FIG. 10, many other continuous speech text input devices have been proposed.

[0011] For example, Japanese Unexamined Patent Publication No. 7-219584 proposes a continuous speech text input device that performs processing corresponding to a command input by voice and that prevents erroneous processing caused by misrecognition of speech.

[0012] In that device, the voice commands that require reconfirmation are determined in advance, and when one of those voice commands is input, the recognition result is reconfirmed.

[0013] However, that device requires the voice commands needing reconfirmation to be determined in advance, and it is extremely difficult to set an appropriate criterion for deciding whether reconfirmation is needed.

[0014] Japanese Unexamined Patent Publication No. 10-282987 proposes a speech recognition system in which a plurality of dictionaries are prepared, one of them is selected, and speech recognition is performed using the selected dictionary.

[0015] However, that system requires a plurality of dictionaries to be created and a large amount of memory to hold them, so the structure of the system as a whole cannot be simplified.

[0016] The present invention has been made in view of the above problems of conventional continuous speech text input devices. Its object is to provide a continuous speech text input device and a continuous speech text input method that prevent deterioration of the performance of recognizing input text, make it easy to input a character string identical to a voice command prepared in advance, and allow the structure of the device as a whole to be simplified.

[0017]

[Means for Solving the Problems] To achieve this object, the present invention provides, as set forth in claim 1, a continuous speech text input device comprising: voice input means for inputting text continuously by voice and for inputting control voice commands by voice; speech recognition means for recognizing the speech input to the voice input means; voice command dictionary storage means for storing a recognition dictionary of voice commands; recognition result management means for determining, based on the recognition result of the speech recognition means, whether the input speech is speech for continuous text input or speech for a voice command and, when the input speech is speech for a voice command, causing the control operation corresponding to that voice command to be executed; voice command history management means for storing a history of the voice commands input as speech for voice commands; and voice command dictionary management means for determining, based on the contents stored in the voice command history management means, whether each voice command should be deleted from the voice command dictionary storage means and deleting the voice command when necessary.

[0018] In the continuous speech text input device according to claim 1, the voice command history management means stores the number of times each voice command has been output as a recognition result by the speech recognition means. The voice command dictionary management means excludes infrequently used voice commands from the recognition targets according to the usage frequency of each voice command stored in the voice command history management means.

[0019] This makes it possible to narrow the voice commands contained in the voice command dictionary down to those the user uses often. As a result, speech uttered for text input is less likely to be misrecognized as a voice command, and recognition performance for the input speech improves. In addition, cases in which a character string identical to a prepared voice command is difficult to input become less likely.

[0020] Claim 2 provides a continuous speech text input device comprising: voice input means for inputting text continuously by voice and for inputting control voice commands by voice; voice command dictionary storage means for storing a recognition dictionary of voice commands and, for each voice command, a priority that governs how readily that voice command is recognized; speech recognition means for recognizing the speech input to the voice input means and, when recognizing a voice command, performing the recognition processing in accordance with the priority; recognition result management means for determining, based on the recognition result of the speech recognition means, whether the input speech is speech for continuous text input or speech for a voice command and, when the input speech is speech for a voice command, causing the control operation corresponding to that voice command to be executed; voice command history management means for storing a history of the voice commands input as speech for voice commands; and voice command dictionary management means for determining, based on the contents stored in the voice command history management means, whether to lower the priority of each voice command and sending the result of that determination to the voice command dictionary storage means.

[0021] In the continuous speech text input device according to this claim, the voice command dictionary management means determines, based on the contents stored in the voice command history management means, whether to lower the priority of each voice command. When the voice command dictionary management means lowers the priority of a voice command, the speech recognition means becomes less likely to recognize that command. As a result, speech uttered for text input is less likely to be misrecognized as a voice command, and recognition performance for the input speech improves. In addition, cases in which a character string identical to a prepared voice command is difficult to input become less likely.
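
The effect of lowering a priority can be illustrated with a minimal sketch. The text does not say how the priority enters the recognition processing, so the multiplicative weighting below, the `Candidate` type, the `pick_result` function, and all scores are assumptions made purely for illustration.

```python
from dataclasses import dataclass

@dataclass
class Candidate:
    text: str
    score: float       # hypothetical recognizer score (higher is better)
    is_command: bool   # True if matched against the voice command dictionary

def pick_result(candidates, priority):
    """Pick the best candidate after scaling command scores by their priority."""
    def weighted(c):
        # Assumption: a command's score is multiplied by its priority (default 1.0);
        # dictation candidates are left unchanged.
        return c.score * priority.get(c.text, 1.0) if c.is_command else c.score
    return max(candidates, key=weighted)

candidates = [
    Candidate("Delete here", 0.80, is_command=True),
    Candidate("delete here", 0.75, is_command=False),
]
before = pick_result(candidates, {"Delete here": 1.0})
after = pick_result(candidates, {"Delete here": 0.5})
print(before.is_command, after.is_command)  # True False
```

With the priority lowered, the same utterance is treated as dictated text rather than as a command, which is the behavior the paragraph describes.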

[0022] The voice command history management means preferably comprises, for example as set forth in claim 3, voice command usage frequency calculation means for calculating the usage frequency of the voice commands.

[0023] The voice command history management means also preferably comprises, for example as set forth in claim 4, voice command minimum usage frequency storage means for storing, for each voice command, the minimum usage frequency that serves as the criterion for deciding whether to delete it.

[0024] This allows the criterion for deletion to be varied from one voice command to another, so that the voice commands can be managed according to how they are actually used.
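
A per-command deletion criterion might look like the following sketch. The function name `commands_to_delete`, the default minimum, and the counts are hypothetical; only the idea of a separate minimum for each voice command comes from the text.

```python
def commands_to_delete(usage, minimum, default_minimum=1):
    """Return the commands whose use count is below their own minimum."""
    return sorted(
        cmd for cmd, count in usage.items()
        if count < minimum.get(cmd, default_minimum)
    )

# Hypothetical usage counts and per-command minimums.
usage = {"Delete here": 12, "Line break here": 2, "Shutdown": 0}
minimum = {"Line break here": 3, "Shutdown": 0}  # a minimum of 0 means "never delete"

print(commands_to_delete(usage, minimum))  # ['Line break here']
```

Setting a minimum of 0 for a rarely used but important command keeps it registered regardless of its count, which is one way the criterion can follow the usage situation of each command.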

[0025] As set forth in claim 5, the continuous speech text input device preferably further comprises deleted command storage means for storing the voice commands that the voice command dictionary management means has deleted from the voice command dictionary storage means, and deleted command display means for displaying the contents stored in the deleted command storage means.

[0026] This allows the user to see easily which voice commands have already been deleted and therefore to grasp easily which voice commands are currently recognition targets.

[0027] In this case, as set forth in claim 6, it is preferable to further provide deleted command re-registration means for re-registering a voice command stored in the deleted command storage means.

[0028] Even after a voice command has been deleted, it may later become necessary to make that command a recognition target again. By allowing a voice command stored in the deleted command storage means to be re-registered, the desired command can be restored to the recognition targets more easily than by registering it as a new voice command.
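
The deleted command storage, display, and re-registration described in claims 5 and 6 can be sketched as follows. The class `CommandDictionary`, its method names, and the placeholder pattern strings are hypothetical and are not taken from the text.

```python
class CommandDictionary:
    """Active voice commands plus a store of deleted ones that can be restored."""

    def __init__(self, commands):
        self.active = dict(commands)  # command -> recognition pattern (placeholder)
        self.deleted = {}             # stands in for the deleted command storage means

    def delete(self, command):
        # Move the entry aside instead of discarding its recognition pattern.
        self.deleted[command] = self.active.pop(command)

    def list_deleted(self):
        # What a deleted command display means would show to the user.
        return sorted(self.deleted)

    def reregister(self, command):
        # Restore the stored pattern; nothing has to be registered from scratch.
        self.active[command] = self.deleted.pop(command)

d = CommandDictionary({"Delete here": "pattern-A", "Shutdown": "pattern-B"})
d.delete("Shutdown")
print(d.list_deleted())  # ['Shutdown']
d.reregister("Shutdown")
print(sorted(d.active))  # ['Delete here', 'Shutdown']
```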

[0029] As set forth in claim 7, the voice command history management means preferably comprises per-user voice command history management means for storing a voice command usage history for each user.

[0030] Because the voice command history management means stores a usage history for each user, the voice commands deleted from the recognition targets can differ from user to user.
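
A per-user history can be sketched with one counter per user. The class `PerUserHistory`, the user names, and the minimum of one use are assumptions for illustration.

```python
from collections import Counter, defaultdict

class PerUserHistory:
    """Keeps a separate voice command use count for each user."""

    def __init__(self):
        self._counts = defaultdict(Counter)

    def record(self, user, command):
        self._counts[user][command] += 1

    def low_usage(self, user, commands, minimum=1):
        # Commands this particular user has used fewer than `minimum` times.
        counts = self._counts[user]
        return sorted(c for c in commands if counts[c] < minimum)

history = PerUserHistory()
history.record("alice", "Delete here")
history.record("bob", "Shutdown")

commands = ["Delete here", "Shutdown"]
print(history.low_usage("alice", commands))  # ['Shutdown']
print(history.low_usage("bob", commands))    # ['Delete here']
```

The same dictionary therefore yields different deletion candidates for each user.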

[0031] As set forth in claim 8, the continuous speech text input device preferably further comprises deletion display means for displaying a notice when a voice command is about to be deleted from the voice command dictionary management means.

[0032] This allows the user to confirm once more, before a voice command is deleted from the recognition targets, whether it should be deleted, which prevents a voice command that should not be deleted from being deleted by mistake.

[0033] As set forth in claim 9, the voice command history management means preferably starts storing the history of voice commands after a fixed time has elapsed or after voice commands have been used a fixed number of times.

[0034] For example, in the continuous speech text input device according to claim 1, whether a particular voice command is deleted from the recognition targets is decided based on the number of uses of all voice commands since the device was first used. However, the voice commands a user uses may differ between the period in which the user is unaccustomed to the device and the period in which the user has become reasonably familiar with it, so it may be preferable not to count voice command uses while the user is still unaccustomed to the device. For this reason, in claim 9, the voice command history management means starts storing the history of voice commands after a fixed time has elapsed or after voice commands have been used a fixed number of times.
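
The delayed start of history recording can be sketched as below for the use-count variant; the elapsed-time variant would compare a clock reading instead. The class `WarmupHistory` and the warm-up length are hypothetical.

```python
from collections import Counter

class WarmupHistory:
    """Ignores the first `warmup_uses` command uses, then counts normally."""

    def __init__(self, warmup_uses):
        self.warmup_uses = warmup_uses
        self.seen = 0            # every command use, including the warm-up period
        self.counts = Counter()  # uses recorded after the warm-up period

    def record(self, command):
        self.seen += 1
        if self.seen > self.warmup_uses:
            self.counts[command] += 1

h = WarmupHistory(warmup_uses=2)
for cmd in ["Shutdown", "Shutdown", "Delete here"]:
    h.record(cmd)
print(dict(h.counts))  # {'Delete here': 1}
```

The two early uses of "Shutdown" do not influence later deletion decisions, reflecting the idea that habits formed while the user is unfamiliar with the device may not be representative.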

[0035] The present invention also provides, as set forth in claim 10, a continuous speech text input method comprising: a first step of storing each voice command used during text input by voice; a second step of determining, when voice commands have been used a predetermined number of times, whether each voice command has reached a predetermined minimum number of uses; and a third step of deleting from the recognition targets any voice command that has not reached the minimum number of uses.

[0036] The method according to this claim provides the same effects as the continuous speech text input device according to claim 1.

[0037] The present invention further provides, as set forth in claim 11, a continuous speech text input method comprising: a first step of storing each voice command used during text input by voice; a second step of determining, when voice commands have been used a predetermined number of times, whether each voice command has reached a predetermined minimum number of uses; and a third step of lowering, for any voice command that has not reached the minimum number of uses, the priority that governs how readily that voice command is recognized.

[0038] The method according to this claim provides the same effects as the continuous speech text input device according to claim 2.

[0039] As set forth in claim 12, the continuous speech text input method preferably further comprises a step of storing the voice commands deleted from the recognition targets and a step of displaying the stored voice commands that have been deleted from the recognition targets.

[0040] The method according to this claim provides the same effects as the continuous speech text input device according to claim 5.

[0041] As set forth in claim 13, the continuous speech text input method preferably further comprises a step of making a stored voice command that was once deleted from the recognition targets a recognition target again.

[0042] The method according to this claim provides the same effects as the continuous speech text input device according to claim 6.

[0043] As set forth in claim 14, the continuous speech text input method preferably further comprises a step of displaying a notice when a voice command is about to be deleted from the recognition targets.

[0044] The method according to this claim provides the same effects as the continuous speech text input device according to claim 8.

[0045] As set forth in claim 15, the first step is preferably started after a fixed time has elapsed or after voice commands have been used a fixed number of times.

[0046] The method according to this claim provides the same effects as the continuous speech text input device according to claim 9.

[0047]

[Embodiments of the Invention] Next, a continuous speech text input device and a continuous speech text input method according to embodiments of the present invention will be described.

[0048] (First Embodiment) FIG. 1 is a block diagram of a continuous speech text input device 10 according to a first embodiment of the present invention.

[0049] The continuous speech text input device 10 according to this embodiment comprises: a voice input unit 11 through which the user inputs text continuously by voice and inputs control voice commands by voice; a speech recognition unit 12 that recognizes the speech input to the voice input unit 11; a continuous speech text input dictionary 13 that stores the recognition patterns for continuous speech text input used during speech recognition; a voice command dictionary 14 that stores the recognition patterns for voice commands; a recognition result management unit 15 that manages the speech recognition results of the speech recognition unit 12 and determines, based on those results, whether the input speech is speech for continuous text input or speech for a voice command; a recognition result display unit 16 that displays the recognition result when the input speech is speech for continuous text input; a voice command execution unit 17 that, when the input speech is speech for a voice command, executes the control operation corresponding to that voice command; a voice command history management unit 18 that stores a history of the voice commands input as speech for voice commands; and a voice command dictionary management unit 19 that manages the contents stored in the voice command dictionary 14.

[0050] The continuous speech text input device according to this embodiment, configured as described above, operates as follows.

[0051] The voice input unit 11 captures the speech input by the user and sends the speech data to the speech recognition unit 12.

[0052] The speech recognition unit 12 performs recognition processing on the input speech based on the contents stored in the continuous speech text input dictionary 13 and the voice command dictionary 14. It outputs to the recognition result management unit 15 information indicating the recognition result together with information indicating which of the two dictionaries, the continuous speech text input dictionary 13 or the voice command dictionary 14, produced that result.

[0053] The continuous speech text input dictionary 13 stores the speech recognition dictionary that the speech recognition unit 12 uses for recognizing continuous speech text input.

[0054] The voice command dictionary 14 stores the speech recognition dictionary that the speech recognition unit 12 uses for recognizing voice commands.

[0055] When the recognition result sent from the speech recognition unit 12 was produced using the continuous speech text input dictionary 13, the recognition result management unit 15 sends the result to the recognition result display unit 16. When the recognition result was produced using the voice command dictionary 14, the recognition result management unit 15 sends the result to the voice command execution unit 17.
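
The routing performed by the recognition result management unit 15 can be sketched as follows. The `Source` enum, the `route` function, and the use of plain lists as stand-ins for the display unit 16 and the execution unit 17 are assumptions for illustration.

```python
from enum import Enum

class Source(Enum):
    DICTATION = "continuous speech text input dictionary 13"
    COMMAND = "voice command dictionary 14"

def route(text, source, display, execute):
    """Forward a recognition result according to the dictionary that produced it."""
    if source is Source.COMMAND:
        execute(text)   # hand over to the voice command execution unit
    else:
        display(text)   # hand over to the recognition result display unit

shown, executed = [], []
route("Hello world", Source.DICTATION, shown.append, executed.append)
route("Delete here", Source.COMMAND, shown.append, executed.append)
print(shown, executed)  # ['Hello world'] ['Delete here']
```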

[0056] The recognition result display unit 16 displays the recognition result sent from the recognition result management unit 15, for example on a screen, to inform the user.

[0057] The voice command execution unit 17 executes the operation it has stored in advance for the recognition result sent from the recognition result management unit 15. For example, suppose that the voice command execution unit 17 stores, for the voice command "Delete here", the operation "delete the speech recognition result entered immediately before on the screen currently displayed to the user". In that case, when the recognition result management unit 15 reports that the voice command "Delete here" has been recognized, the voice command execution unit 17 executes that operation as the operation corresponding to the voice command.

[0058] The voice command execution unit 17 sends information on the voice command it has executed in this way to the voice command history management unit 18.

[0059] The voice command history management unit 18 stores a history of each voice command executed by the voice command execution unit 17.
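
The execution and history steps of paragraphs [0057] to [0059] can be sketched as follows. The class `CommandExecutor`, the list of recognition results, and the use of a `Counter` as the history are hypothetical stand-ins for units 17 and 18.

```python
from collections import Counter

class CommandExecutor:
    """Stand-in for execution unit 17; `history` stands in for history unit 18."""

    def __init__(self, history):
        self.history = history
        # Pre-stored mapping from voice command to operation.
        self.actions = {"Delete here": self._delete_last}

    def execute(self, command, results):
        self.actions[command](results)  # run the operation stored for the command
        self.history[command] += 1      # report the executed command to the history

    @staticmethod
    def _delete_last(results):
        # Delete the speech recognition result entered immediately before.
        if results:
            results.pop()

history = Counter()
results = ["first sentence", "second sentence"]
CommandExecutor(history).execute("Delete here", results)
print(results, history["Delete here"])  # ['first sentence'] 1
```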

[0060] The voice command dictionary management unit 19 determines, based on the contents stored in the voice command history management unit 18, whether each voice command should be deleted and, where necessary, deletes the voice command stored in the voice command dictionary 14. That is, as described later, the voice command dictionary management unit 19 deletes infrequently used voice commands from the voice command dictionary 14 according to the usage frequency of each voice command stored in the voice command history management unit 18.

[0061] Next, the continuous speech text input device 10 according to this embodiment will be described using specific data.

[0062] Assume that a plurality of voice commands and their corresponding operations are registered in the continuous speech text input device 10 according to this embodiment, as shown in FIG. 2. For example, when the voice command "Delete here" is input, the corresponding operation "delete the speech recognition result entered immediately before" is executed, and when the voice command "Line break here" is input, the corresponding operation "insert a line break immediately after the speech entered immediately before" is executed.

[0063] Assume that the user has used the continuous speech text input device 10 for some time, inputting text by continuous speech and using voice commands while doing so.

[0064] The voice command history management unit 18 counts the number of times each voice command has been used since the device was first used, as well as the total number of uses. For example, assume that when voice commands have been used 30 times in total, the use count of each voice command is as shown in FIG. 3.

【００６５】音声コマンド用辞書管理部１９は、音声コ
マンド履歴管理部１８の記憶内容を常時監視しており、
各音声コマンドが使用された回数の合計が音声コマンド
用辞書管理部１９においてあらかじめ定められた回数
（ここでは、３０回とする）だけ使用された時に、各音
声コマンドが予め定められた最低回数（ここでは、１回
とする）に達しているか否かを判定する。The voice command dictionary management unit 19 constantly monitors the contents stored in the voice command history management unit 18.
When the total number of times each voice command has been used is a predetermined number of times (here, 30 times) in the voice command dictionary management unit 19, each voice command has a predetermined minimum number of times (30 times). Here, it is determined whether or not it has reached once.

【００６６】図３に示した場合では、音声コマンド『シ
ャットダウン』が、最低回数に達していないため、音声
コマンド用辞書管理部１９は音声コマンド用辞書１４に
記憶されている音声コマンド『シャットダウン』を削除
する。In the case shown in FIG. 3, since the voice command “shutdown” has not reached the minimum number, the voice command dictionary management unit 19 executes the voice command “shutdown” stored in the voice command dictionary 14. delete.
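The check performed by the dictionary management unit can be sketched as follows. This is an illustrative sketch, not the device's implementation: the function name `prune_dictionary` is hypothetical, and because FIG. 3 is not reproduced here, the counts below are invented (only the fact that "shutdown" falls below the minimum of 1 comes from the text; the command "undo" is also made up).

```python
from collections import Counter
from typing import Iterable, Mapping, Set, Tuple


def prune_dictionary(
    dictionary: Iterable[str],
    usage: Mapping[str, int],
    total_trigger: int = 30,  # total uses that trigger the check
    min_uses: int = 1,        # minimum uses a command needs to stay
) -> Tuple[Set[str], Set[str]]:
    """Return (kept, removed) once the total usage reaches the trigger."""
    commands = set(dictionary)
    total = sum(usage.get(c, 0) for c in commands)
    if total < total_trigger:
        return commands, set()  # too early to judge; keep everything
    removed = {c for c in commands if usage.get(c, 0) < min_uses}
    return commands - removed, removed


# Invented counts that sum to 30; "shutdown" was never used.
usage = Counter({"delete here": 14, "new line here": 12, "undo": 4})
kept, removed = prune_dictionary(
    ["delete here", "new line here", "undo", "shutdown"], usage
)
```

With these assumed counts, only "shutdown" is removed, matching the outcome described for FIG. 3.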

[0067] As described above, in the continuous speech text input device according to the present embodiment, the voice command history management unit 18 stores the usage frequency of each voice command, and the voice command dictionary management unit 19 removes infrequently used voice commands from the voice command dictionary 14 according to that usage frequency.

[0068] As a result, the voice commands contained in the voice command dictionary 14 are narrowed down to those that are frequently used. This reduces the likelihood that speech intended as text input is misrecognized as a voice command. It also resolves the problem that a character string identical to a predefined voice command becomes difficult to enter as text.

[0069] (Second Embodiment) A continuous speech text input device according to a second embodiment of the present invention is described below.

[0070] The structure of the continuous speech text input device according to the present embodiment is the same as that of the continuous speech text input device 10 according to the first embodiment. However, the functions of the components differ as follows.

[0071] In the continuous speech text input device according to the present embodiment, the voice command dictionary 14 stores, for each command, a priority indicating how readily that command is to be recognized. When the voice recognition unit 12 performs recognition of voice commands, it does so according to this priority.

[0072] The priority stored in the voice command dictionary 14 is lowered as necessary, as described below.

[0073] In the continuous speech text input device according to the present embodiment, the voice command history management unit 18 stores the usage frequency of each voice command, as in the first embodiment.

[0074] The voice command dictionary management unit 19 constantly monitors the contents stored in the voice command history management unit 18. When the total number of voice command uses reaches a number predetermined in the voice command dictionary management unit 19 (here, 30), it determines whether each voice command has reached a predetermined minimum number of uses (here, 1).

[0075] For example, in the case shown in FIG. 3, the voice command "shutdown" has not reached the minimum number of uses, so the voice command dictionary management unit 19 lowers the priority of the voice command "shutdown" stored in the voice command dictionary 14. Because the voice recognition unit 12 recognizes each voice command according to its priority, the voice command "shutdown", whose priority has been lowered, becomes less likely to be recognized.
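The text does not say how the priority enters the recognition process. One possible reading, shown purely as an assumption, is that the priority acts as a weight on each command's recognition score before it is compared with the score of the plain-text hypothesis. The names `lower_priorities` and `pick_hypothesis`, the scaling factor, and all scores below are hypothetical.

```python
from typing import Dict, Mapping, Optional


def lower_priorities(
    priorities: Mapping[str, float],
    usage: Mapping[str, int],
    min_uses: int = 1,
    factor: float = 0.5,  # assumed reduction applied to rarely used commands
) -> Dict[str, float]:
    """Scale down the priority of every command used fewer than min_uses times."""
    return {
        cmd: (p * factor if usage.get(cmd, 0) < min_uses else p)
        for cmd, p in priorities.items()
    }


def pick_hypothesis(
    text_score: float,
    command_scores: Mapping[str, float],
    priorities: Mapping[str, float],
) -> Optional[str]:
    """Return the winning command, or None if the input is best taken as text."""
    best_cmd, best = None, text_score
    for cmd, score in command_scores.items():
        weighted = score * priorities.get(cmd, 1.0)
        if weighted > best:
            best_cmd, best = cmd, weighted
    return best_cmd


before = {"shutdown": 1.0, "delete here": 1.0}
after = lower_priorities(before, {"delete here": 14})
```

Under these made-up scores, an utterance that scores 0.8 as "shutdown" and 0.6 as text is taken as a command before the priority is lowered and as text afterwards. Unlike deletion, the command stays in the dictionary and can still win with a sufficiently high score.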

[0076] As described above, in the present embodiment, infrequently used voice commands become harder to recognize. Therefore, as in the first embodiment, the likelihood that speech intended as text input is misrecognized as a voice command is reduced, and recognition performance can be improved. It also resolves the problem that a character string identical to a predefined voice command becomes difficult to enter as text.

[0077] (Third Embodiment) FIG. 5 is a block diagram of a continuous speech text input device 20 according to a third embodiment of the present invention.

[0078] In addition to the components of the continuous speech text input device 10 according to the first embodiment shown in FIG. 1, the continuous speech text input device 20 according to the present embodiment includes a voice command usage frequency calculation unit 21, a voice command minimum usage frequency storage unit 22, a per-user voice command history management unit 23, and a timer 24. All of these are connected to the voice command history management unit 18.

[0079] In the first and second embodiments described above, the voice command history management unit 18 counts the number of times each voice command has been used since the start of use. As in the present embodiment, however, a separate voice command usage frequency calculation unit 21 can be provided to calculate the usage frequency of each voice command.

[0080] Because the voice command usage frequency calculation unit 21 can be replaced freely, the capacity required for the calculation can be set to a desired value.

[0081] The voice command minimum usage frequency storage unit 22 stores, for each voice command, the minimum usage frequency that serves as the criterion for deciding whether to delete that command.

[0082] In the first and second embodiments, a uniform usage frequency was applied to all voice commands as the criterion for deciding whether to delete each voice command from the voice command dictionary 14. Providing the voice command minimum usage frequency storage unit 22 allows the criterion to be varied for each voice command, so that voice commands can be managed according to how they are used.

[0083] The per-user voice command history management unit 23 stores the voice command usage history of each user.

[0084] Providing the per-user voice command history management unit 23 allows the voice command usage history of each user to be stored independently, so that the voice commands deleted from the voice command dictionary 14 can differ from user to user.
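Per-command thresholds and per-user histories combine naturally. The sketch below is illustrative only: the class `PerUserHistory`, the user names, and the threshold values are hypothetical, and treating a threshold of 0 as "never remove" is an assumption rather than something the text states.

```python
from collections import Counter, defaultdict
from typing import Dict, Iterable, Mapping, Set


class PerUserHistory:
    """Usage counts kept separately per user, with a per-command minimum."""

    def __init__(self, min_uses_by_command: Mapping[str, int], default_min: int = 1):
        self.min_uses_by_command: Dict[str, int] = dict(min_uses_by_command)
        self.default_min = default_min
        self.counts = defaultdict(Counter)  # user -> command -> uses

    def record(self, user: str, command: str) -> None:
        """Count one use of a command by a given user."""
        self.counts[user][command] += 1

    def removable(self, user: str, dictionary: Iterable[str]) -> Set[str]:
        """Commands this user has used fewer times than that command's minimum."""
        used = self.counts[user]
        return {
            c for c in dictionary
            if used[c] < self.min_uses_by_command.get(c, self.default_min)
        }


history = PerUserHistory({"shutdown": 0, "delete here": 3})
for _ in range(3):
    history.record("alice", "delete here")
history.record("bob", "delete here")
commands = {"delete here", "new line here", "shutdown"}
```

Here the same dictionary yields different removal candidates for the two hypothetical users, while "shutdown" is protected for both by its threshold of 0.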

[0085] The timer 24 measures the time elapsed since use of the continuous speech text input device began.

[0086] For example, in the continuous speech text input device according to the first embodiment, whether to delete a particular voice command from the voice command dictionary 14 is decided after the total number of voice command uses since the start of use reaches a predetermined number. However, the voice commands a user employs are likely to differ between the period in which the user is still unfamiliar with the device and the later period in which the user has become accustomed to it. It is therefore preferable not to count voice command uses during the period in which the user is unfamiliar with the device.

[0087] For this reason, the timer 24 is used to measure the time elapsed since use of the device began. Voice command uses are not counted within a predetermined time from the start of use; counting starts after the predetermined time has elapsed. This allows the decision on whether to delete a voice command to reflect the user's actual usage.

[0088] The start of counting can also be made to depend on the number of voice command uses since the start of use rather than on the elapsed time. For example, once the number of voice command uses since the start of use reaches a predetermined number (for example, 100), the uses from that point onward can be used as the basis for the decision in the voice command dictionary management unit 19.
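Both ways of delaying the count can be sketched with one small class. This is a sketch under assumptions: `WarmupCounter` and its parameters are hypothetical names, and the elapsed time is passed in explicitly instead of being read from a real timer so that the example stays deterministic.

```python
from collections import Counter
from typing import Optional


class WarmupCounter:
    """Counts command uses only after a warm-up period has passed."""

    def __init__(
        self,
        warmup_seconds: Optional[float] = None,  # time-based warm-up
        warmup_uses: Optional[int] = None,       # count-based warm-up
    ):
        self.warmup_seconds = warmup_seconds
        self.warmup_uses = warmup_uses
        self.seen = 0            # every command use, counted or not
        self.counts = Counter()  # uses that count toward the deletion decision

    def record(self, command: str, elapsed_seconds: float) -> bool:
        """Register one use; return True if it was counted."""
        self.seen += 1
        if self.warmup_seconds is not None and elapsed_seconds < self.warmup_seconds:
            return False  # still inside the time-based warm-up
        if self.warmup_uses is not None and self.seen <= self.warmup_uses:
            return False  # still inside the count-based warm-up
        self.counts[command] += 1
        return True
```

For example, `WarmupCounter(warmup_uses=100)` ignores the first 100 uses and starts counting from the 101st, which is one way to read the count-based variant.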

[0089] (Fourth Embodiment) FIG. 6 is a block diagram of a continuous speech text input device 30 according to a fourth embodiment of the present invention.

[0090] In addition to the components of the continuous speech text input device 10 according to the first embodiment shown in FIG. 1, the continuous speech text input device 30 according to the present embodiment further includes a deleted command storage unit 31, a deleted command display unit 32, a deleted command re-registration unit 33, and a deletion operation display unit 34.

[0091] The deleted command storage unit 31 stores the voice commands that the voice command dictionary management unit 19 has deleted from the voice command dictionary 14. The deleted command display unit 32 displays the contents stored in the deleted command storage unit 31, that is, a list of the deleted voice commands.

[0092] Because the deleted command display unit 32 displays the voice commands already deleted from the voice command dictionary 14, the user can easily tell which voice commands have been deleted and which remain.

[0093] The deleted command re-registration unit 33 re-registers voice commands stored in the deleted command storage unit 31 in the voice command dictionary 14.

[0094] Even a voice command that has been deleted may later need to be made a recognition target again. Allowing the voice commands stored in the deleted command storage unit 31 to be re-registered in the voice command dictionary 14 makes it easier to restore a desired voice command as a recognition target than registering it in the voice command dictionary 14 from scratch.

[0095] The deletion operation display unit 34 displays a notification when a voice command is about to be deleted from the voice command dictionary 14.

[0096] For example, when the voice command "shutdown" is to be deleted from the voice command dictionary 14, the deletion operation display unit 34 displays a dialog such as that shown in FIG. 4 on the screen.

[0097] This allows the user to confirm once more, before a voice command is deleted from the voice command dictionary 14, whether it should be deleted, which prevents a needed voice command from being deleted by mistake.
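The interplay of confirmation, storage of deleted commands, and re-registration can be sketched as a small class. The names are hypothetical, and the confirmation dialog of FIG. 4 is replaced here by a caller-supplied `confirm` function, which is an assumption made to keep the example self-contained.

```python
from typing import Callable, Iterable, List, Set


class CommandDictionary:
    """Active commands plus a record of deleted ones that can be restored."""

    def __init__(self, commands: Iterable[str]):
        self.active: Set[str] = set(commands)
        self.deleted: List[str] = []  # plays the role of the deleted command store

    def delete(self, command: str, confirm: Callable[[str], bool]) -> bool:
        """Delete a command only if it is active and the user confirms."""
        if command not in self.active or not confirm(command):
            return False
        self.active.remove(command)
        self.deleted.append(command)
        return True

    def list_deleted(self) -> List[str]:
        """Return a copy of the deleted-command list for display."""
        return list(self.deleted)

    def reregister(self, command: str) -> bool:
        """Move a previously deleted command back into the active set."""
        if command not in self.deleted:
            return False
        self.deleted.remove(command)
        self.active.add(command)
        return True
```

A call such as `d.delete("shutdown", confirm=lambda c: False)` leaves the dictionary unchanged, which corresponds to the user declining in the dialog.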

[0098] Embodiments of the continuous speech text input method according to the present invention are described below.

[0099] (Fifth Embodiment) FIG. 7 is a flowchart showing the steps of a continuous speech text input method according to a fifth embodiment of the present invention.

[0100] First, continuous speech text is input together with voice commands (step 10).

[0101] At this time, each voice command used is stored, and the number of uses is stored for each voice command (step 20).

[0102] Next, it is determined whether the total number of uses of all voice commands has reached a predetermined number A1 (for example, 30) (step 30).

[0103] If the total number of uses of all voice commands has not reached the predetermined number A1 (NO at step 30), counting of the number of uses of each voice command continues (step 20).

[0104] If the total number of uses of all voice commands has reached the predetermined number A1 (YES at step 30), it is determined whether the number of uses of each voice command has reached a predetermined number A2 (for example, 3) (step 40).

[0105] If the number of uses of a voice command has not reached the predetermined number A2 (NO at step 40), the process ends as is. That is, the voice command is not deleted from the recognition targets and remains a recognition target.

[0106] If the number of uses of a given voice command has reached the predetermined number A2 (YES at step 40), that voice command is deleted from the recognition targets (step 50). Thereafter, even if that voice command is input, the control operation corresponding to it is not executed.

[0107] As described above, in the continuous speech text input method according to the present embodiment, infrequently used voice commands are excluded from the recognition targets. Because only frequently used voice commands remain as recognition targets, the likelihood that speech intended as text input is misrecognized as a voice command is reduced. It also resolves the problem that a character string identical to a predefined voice command becomes difficult to enter as text.
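Steps 10 to 50 can be sketched as a single pass over a stream of utterances. This is a hedged sketch, not the claimed method itself: it assumes, in line with the summary paragraph above and the first embodiment, that the commands removed at step 50 are those used fewer than A2 times once the total reaches A1. The function name `run_session`, the command "undo", and the sample utterances are hypothetical.

```python
from collections import Counter
from typing import Iterable, Set, Tuple


def run_session(
    utterances: Iterable[str],
    dictionary: Set[str],
    a1: int = 30,  # total uses that trigger the check (step 30)
    a2: int = 3,   # per-command minimum (step 40)
) -> Tuple[Set[str], Set[str]]:
    """Return (remaining, removed) recognition targets after the session."""
    active = set(dictionary)
    counts = Counter()
    for utterance in utterances:            # step 10: speech input
        if utterance not in active:
            continue                        # ordinary dictated text
        counts[utterance] += 1              # step 20: count the command
        if sum(counts.values()) >= a1:      # step 30: total reached A1?
            removed = {c for c in active if counts[c] < a2}  # step 40
            return active - removed, removed                 # step 50
    return active, set()                    # A1 never reached: nothing removed


targets = {"delete here", "new line here", "undo", "shutdown"}
stream = ["delete here"] * 20 + ["some text"] + ["new line here"] * 8 + ["undo"] * 2
remaining, dropped = run_session(stream, targets)
```

With this invented stream, the 30th command use triggers the check, and the two commands used fewer than three times are dropped.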

[0108] (Sixth Embodiment) FIG. 8 is a flowchart showing the steps of a continuous speech text input method according to a sixth embodiment of the present invention.

[0109] In the continuous speech text input method according to the present embodiment, step 60 is performed in place of step 50 of the fifth embodiment. The other steps 10 to 40 are the same as in the fifth embodiment.

[0110] In step 60, when the number of uses of a given voice command has reached the predetermined number A2 (YES at step 40), the priority of that voice command is lowered. As a result, the voice command whose priority has been lowered becomes less likely to be recognized.

[0111] The present embodiment provides the same effects as the fifth embodiment.

[0112] (Seventh Embodiment) FIG. 9 is a flowchart showing the steps of a continuous speech text input method according to a seventh embodiment of the present invention.

[0113] The continuous speech text input method according to the present embodiment includes steps 70 to 100 in addition to steps 10 to 40 of the fifth embodiment shown in FIG. 7.

[0114] Steps 10 to 40 are performed in the same manner as in the fifth embodiment.

[0115] When the number of uses of a given voice command has reached the predetermined number A2 (YES at step 40), a prompt is displayed asking the user to confirm whether the voice command may be deleted from the recognition targets (step 70). For example, a dialog such as that shown in FIG. 4 is displayed on the screen.

[0116] If the user chooses not to allow deletion (NO at step 70), the process ends as is. That is, the voice command is not deleted from the recognition targets and remains a recognition target.

[0117] If the user chooses to allow deletion (YES at step 70), the voice command is deleted from the recognition targets (step 50). Thereafter, even if that voice command is input, the control operation corresponding to it is not executed.

[0118] A voice command excluded from the recognition targets in this way is stored in a predetermined storage unit (step 80).

[0119] The voice commands stored in the storage unit, that is, the list of voice commands deleted from the recognition targets, are displayed on the screen either at the user's request or at all times regardless of any request (step 90).

[0120] If the deleted voice commands displayed on the screen include one that the user wishes to register again as a recognition target (YES at step 100), the user selects the desired voice command from the list, and that voice command is re-registered as a recognition target (step 110).

【０１２１】[0121]

[Effects of the Invention] As described above, the present invention provides the following effects.

[0122] The first effect is that recognition performance can be improved.

[0123] The second effect is that it reduces the likelihood of cases in which a character string identical to a predefined voice command is difficult to enter.

[0124] The reason is that deleting infrequently used voice commands from the recognition targets narrows the voice command dictionary down to the commands the user uses frequently, which reduces the likelihood that speech intended as text input is misrecognized as a voice command.

【図面の簡単な説明】[Brief description of the drawings]

FIG. 1 is a block diagram showing the configuration of a continuous speech text input device according to a first embodiment of the present invention.

FIG. 2 shows a specific example of the registered voice commands used in the first embodiment of the present invention.

FIG. 3 shows a specific example of the stored voice command usage counts in the first embodiment of the present invention.

FIG. 4 shows a specific example of a dialog displayed to the user in the first embodiment of the present invention.

FIG. 5 is a block diagram showing the configuration of a continuous speech text input device according to a third embodiment of the present invention.

FIG. 6 is a block diagram showing the configuration of a continuous speech text input device according to a fourth embodiment of the present invention.

FIG. 7 is a flowchart showing the steps of a continuous speech text input method according to a fifth embodiment of the present invention.

FIG. 8 is a flowchart showing the steps of a continuous speech text input method according to a sixth embodiment of the present invention.

FIG. 9 is a flowchart showing the steps of a continuous speech text input method according to a seventh embodiment of the present invention.

FIG. 10 is a block diagram showing the configuration of a conventional continuous speech text input device.

【符号の説明】[Explanation of symbols]

10: Continuous speech text input device according to the first embodiment
11: Voice input unit
12: Voice recognition unit
13: Continuous speech text input dictionary
14: Voice command dictionary
15: Recognition result management unit
16: Recognition result display unit
17: Voice command execution unit
18: Voice command history management unit
19: Voice command dictionary management unit
20: Continuous speech text input device according to the third embodiment
21: Voice command usage frequency calculation unit
22: Voice command minimum usage frequency storage unit
23: Per-user voice command history management unit
24: Timer
30: Continuous speech text input device according to the fourth embodiment
31: Deleted command storage unit
32: Deleted command display unit
33: Deleted command re-registration unit
34: Deletion operation display unit

Claims


【特許請求の範囲】[Claims]

[Claim 1] A continuous speech text input device comprising:
voice input means for continuous speech input of text and speech input of control voice commands;
voice recognition means for recognizing the speech input to the voice input means;
voice command dictionary storage means for storing a dictionary for recognizing the voice commands;
recognition result management means for determining, based on the recognition result of the voice recognition means, whether the input speech is speech for continuous text input or speech for a voice command and, when the input speech is speech for a voice command, causing the control operation corresponding to that voice command to be executed;
voice command history management means for storing a history of the voice commands input when the input speech was speech for a voice command; and
voice command dictionary management means for determining, based on the contents stored in the voice command history management means, whether each voice command should be deleted from the voice command dictionary storage means and, where necessary, deleting that voice command.

[Claim 2] A continuous speech text input device comprising:
voice input means for continuous speech input of text and speech input of control voice commands;
voice command dictionary storage means for storing a dictionary for recognizing the voice commands and, for each voice command, a priority indicating how readily that voice command is to be recognized;
voice recognition means for recognizing the speech input to the voice input means and, when performing recognition of the voice commands, performing the recognition according to the priority;
recognition result management means for determining, based on the recognition result of the voice recognition means, whether the input speech is speech for continuous text input or speech for a voice command and, when the input speech is speech for a voice command, causing the control operation corresponding to that voice command to be executed;
voice command history management means for storing a history of the voice commands input when the input speech was speech for a voice command; and
voice command dictionary management means for determining, based on the contents stored in the voice command history management means, whether to lower the priority of each voice command, and sending the result of that determination to the voice command dictionary storage means.

[Claim 3] The continuous speech text input device according to claim 1 or 2, wherein the voice command history management means includes voice command usage frequency calculation means for calculating the usage frequency of the voice commands based on its stored contents.

[Claim 4] The continuous speech text input device according to claim 3, wherein the voice command history management means includes voice command minimum usage frequency storage means for storing, for each voice command, a minimum usage frequency serving as the criterion for deciding whether to delete that voice command.

[Claim 5] The continuous speech text input device according to any one of claims 1, 3, and 4, further comprising:
deleted command storage means for storing the voice commands that the voice command dictionary management means has deleted from the voice command dictionary storage means; and
deleted command display means for displaying the contents stored in the deleted command storage means.

[Claim 6] The continuous speech text input device according to claim 5, further comprising deleted command re-registration means for re-registering the voice commands stored in the deleted command storage means.

[Claim 7] The continuous speech text input device according to any one of claims 1 to 6, wherein the voice command history management means includes per-user voice command history management means for storing the voice command usage history of each user.

[Claim 8] The continuous speech text input device according to any one of claims 1 and 3 to 7, further comprising deletion display means for displaying a notification when a voice command is deleted from the voice command dictionary management means.

[Claim 9] The continuous speech text input device according to any one of claims 1 to 8, wherein the voice command history management means starts storing the history of the voice commands after a predetermined time has elapsed or after voice commands have been used a predetermined number of times.

[Claim 10] A continuous speech text input method comprising:
a first step of storing each voice command used during text input by speech;
a second step of determining, when voice commands have been used a predetermined number of times, whether each voice command has reached a predetermined minimum number of uses; and
a third step of deleting from the recognition targets any voice command that has not reached the minimum number of uses.

[Claim 11] A continuous speech text input method comprising:
a first step of storing each voice command used during text input by speech;
a second step of determining, when voice commands have been used a predetermined number of times, whether each voice command has reached a predetermined minimum number of uses; and
a third step of lowering, for any voice command that has not reached the minimum number of uses, the priority indicating how readily that voice command is to be recognized.

[Claim 12] The continuous speech text input method according to claim 10, further comprising:
a step of storing the voice commands deleted from the recognition targets; and
a step of displaying the stored voice commands that have been deleted from the recognition targets.

[Claim 13] The continuous speech text input method according to claim 12, further comprising a step of making a stored voice command that has been deleted from the recognition targets a recognition target again.

[Claim 14] The continuous speech text input method according to any one of claims 10, 12, and 13, further comprising a step of displaying a notification when the voice command is deleted from the recognition targets.

[Claim 15] The continuous speech text input method according to any one of claims 10 to 14, wherein the first step is started after a predetermined time has elapsed or after voice commands have been used a predetermined number of times.