JP2008124597A

Info

Publication number: JP2008124597A
Application number: JP2006303546A
Authority: JP
Inventors: Ryuichi Nariyama; 隆一成山
Original assignee: Yamaha Corp
Current assignee: Yamaha Corp
Priority date: 2006-11-09
Filing date: 2006-11-09
Publication date: 2008-05-29

Abstract

PROBLEM TO BE SOLVED: To provide an audio teleconference system capable of dynamically changing speech communication partners. SOLUTION: A management server 101 stores information on each audio teleconference device and personal information on each user, and authenticates each user. A relay server 102 is a relay that connects the audio teleconference devices. When the management server 101 authenticates the user of an audio teleconference device, it sets that device to connect to the relay server 102. A user can specify a desired destination for his or her speech. When the user specifies a destination, the relay server 102 identifies the audio teleconference device at that destination and transmits the user's speech only to that device. COPYRIGHT: (C)2008,JPO&INPIT

Description

Translated from Japanese

The present invention relates to an audio conference system, and more particularly to an audio conference system in which call partners can be switched dynamically during a conference.

In recent years, audio conference systems that hold an audio conference (teleconference) among multiple locations have become widespread. In such a system, when a participant at one location speaks, the audio is transmitted to all other locations. Listeners at the other locations must then judge who spoke from the characteristics of the voice, and identifying the speaker becomes difficult when many locations take part. Communication terminals with a speaker notification function have therefore been proposed (see, for example, Patent Document 1).
Patent Document 1: Japanese Patent Laid-Open No. 2006-100968

However, with the apparatus described in Patent Document 1, what is said is transmitted to all locations, as noted above. For example, when several people from one company are in a conference with people from outside the company, the in-house participants may want to confer temporarily among themselves only, but the apparatus described in Patent Document 1 cannot switch call partners dynamically.

An object of the present invention is to provide an audio conference system in which call partners can be switched dynamically.

The audio conference system of the present invention comprises a relay server and a plurality of audio conference devices connected via the relay server. Each audio conference device includes an authentication unit that authenticates a conference participant who is a user of the device, a transmission/reception unit that exchanges information with the relay server, and a reception unit that accepts, from a conference participant, an operation designating the destination of his or her speech. The relay server includes a management table listing the connected audio conference devices and the conference participants authenticated at each device, and a transfer unit that transmits audio information received from each audio conference device to the other audio conference devices listed in the management table. The transmission/reception unit of the audio conference device transmits to the relay server the destination designation information accepted by the reception unit. On receiving the designation information, the transfer unit of the relay server refers to the management table and performs a destination selection process, in which audio information received from the destination audio conference device named in the designation information and from the audio conference device that accepted the designation is transmitted only to those audio conference devices.

In this configuration, each audio conference device has an authentication unit that authenticates conference participants. The relay server has a management table that manages each audio conference device and the authenticated conference participants. When a conference participant performs an operation designating a destination for his or her speech, information indicating the destination is transmitted to the relay server, and from then on that participant's audio is transmitted only to the designated destination. Likewise, audio from the designated destination is transmitted only to the audio conference device on which the designating operation was performed.

Further, in the present invention, the transfer unit, in the destination selection process, transmits the audio information received from the destination audio conference device named in the designation information and from the audio conference device that accepted the designation to the remaining audio conference devices with its volume limited.

In this configuration, the relay server limits the volume of the received audio information before transmitting it to audio conference devices other than the designated destination.

Further, in the present invention, the authentication unit authenticates conference participants by biometric authentication.

In this configuration, conference participants are authenticated using biometric information such as fingerprints, voiceprints, or vein patterns.

Further, in the present invention, the authentication unit comprises a reading unit that reads a recording medium on which a conference participant's voice features are recorded, and a voice analysis unit that extracts the conference participant's voice features. The authentication unit authenticates the conference participant by comparing the voice features read from the recording medium by the reading unit with the voice features extracted by the voice analysis unit.

In this configuration, a recording medium (for example, an IC card) storing the conference participant's voice features is created in advance. When the conference participant sets the IC card in the audio conference device, the voice features are read out. The audio conference device extracts voice features from the participant's speech and authenticates the participant by comparing them with the voice features read from the IC card.

Further, in the present invention, the reception unit accepts the operation designating a destination by voice recognition.

In this configuration, the operation designating a destination is accepted by voice recognition. For example, when a conference participant says the personal name of another participant, the utterance is recognized and transmitted to the relay server as information indicating the destination.

According to the present invention, call partners can be switched dynamically. This is useful, for example, when several people from one company are in a conference with people from outside the company and the in-house participants want to confer temporarily among themselves only.

An audio conference system according to an embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram of the audio conference system according to this embodiment.

The audio conference system includes a management server 101, a relay server 102, an audio conference device 111A, an audio conference device 111B, and an audio conference device 111C, all connected via a network 100.

The management server 101 manages the audio conference devices connected within the network 100 and the users of those devices. It stores information on each audio conference device and personal information on each user, and authenticates each user. The relay server 102 is a relay that connects the audio conference devices, and it connects them by SSL communication. The audio conference devices are connected to one another via this relay server 102. When the management server 101 authenticates the user of an audio conference device, it sets that device to connect to the relay server 102.

FIG. 2 is a block diagram of the audio conference device 111A. The audio conference devices 111A, 111B, and 111C all have the same configuration and functions.
The audio conference device 111A includes a microphone 11, an amplifier 12, an A/D converter 13, a communication unit 14, a control unit 15, a display unit 16, a card reader/writer 17, an operation unit 18, a D/A converter 19, an amplifier 20, and a speaker 21.

The microphone 11 picks up sound around the audio conference device 111A.
The amplifier 12 amplifies the audio signal picked up by the microphone 11.
The A/D converter 13 converts the analog audio signal amplified by the amplifier 12 into digital form.

The communication unit 14 has a network interface and connects the audio conference device 111A to other devices via the network 100. In this embodiment, the communication unit 14 connects to the other audio conference devices 111B and 111C via the network 100 and the relay server 102, and transmits and receives audio information. The communication unit 14 converts the audio signal input from the A/D converter 13 into audio information suitable for transmission over the network and sends it to the other devices. It also converts audio information received from other devices into a digital audio signal and outputs it to the D/A converter 19.
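The conversion between a digital audio signal and network-ready audio information can be pictured with a minimal sketch. The source does not specify any packet format, so the 6-byte header (sequence number plus sample count), the signed 16-bit PCM payload, and the function names `encode_frame` and `decode_frame` are all assumptions made for illustration.

```python
import struct

# Hypothetical header: sequence number (uint32) and sample count (uint16),
# both in network byte order. The patent does not define a wire format.
HEADER = struct.Struct("!IH")


def encode_frame(seq, samples):
    """Pack signed 16-bit PCM samples into one frame for transmission."""
    payload = struct.pack(f"!{len(samples)}h", *samples)
    return HEADER.pack(seq, len(samples)) + payload


def decode_frame(packet):
    """Recover the sequence number and PCM samples from a received frame."""
    seq, count = HEADER.unpack_from(packet, 0)
    samples = struct.unpack_from(f"!{count}h", packet, HEADER.size)
    return seq, list(samples)
```

On the sending side the output of the A/D converter would pass through something like `encode_frame`, and on the receiving side `decode_frame` would yield the samples handed to the D/A converter.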

The D/A converter 19 converts the digital audio signal into analog form.
The amplifier 20 amplifies the analog audio signal converted by the D/A converter 19.

The speaker 21 emits the audio signal amplified by the amplifier 20.
In this way, the audio conference device 111A picks up the speech of its own user and transmits it to the other devices, and receives and emits the speech from the other devices.

The control unit 15 performs overall control of the audio conference device 111A. It also functionally implements a voice analysis unit 151 and a comparison unit 152.
The voice analysis unit 151 analyzes the audio signal input from the A/D converter 13 and extracts voice features (a voiceprint). The voice analysis unit 151 also performs voice recognition (extraction of what was said).
The comparison unit 152 receives the voice features extracted by the voice analysis unit 151 and the registered information described later (which includes voice features and personal information) from the card reader/writer 17, and compares them.

The display unit 16 is an LCD. During a conference it displays information on the connected audio conference devices, personal information on the users, and so on.

The operation unit 18 is a user interface through which the user performs operations, and consists of, for example, a plurality of push buttons on the housing of the audio conference device.

The card reader/writer 17 records information on an IC card 50 and reads information recorded on the IC card 50. The user's voice features and personal information are recorded on the IC card 50. The user registers voice features and personal information with the management server 101 in advance and has the IC card 50 issued.

FIG. 3 shows an example of registration and authentication. FIG. 3(A) shows user registration and issuance of the IC card 50. The user sends voice features and personal information to the management server 101. The personal information is entered from a registration terminal (not shown) or the like connected to the management server 101. The voice features are extracted by the registration terminal (or the management server 101) from the user's own voice, which is input through a microphone provided on the registration terminal. The management server 101 registers the received personal information and records it in association with the voice features. The management server 101 issues a user ID and records the personal information and voice features on the IC card 50 as registered information.

FIG. 3(B) shows an example of user authentication. The user sets the IC card 50 in the card reader/writer 17 of the audio conference device 111A and speaks. The card reader/writer 17 reads the registered information recorded on the IC card 50 and inputs it to the comparison unit 152. The voice analysis unit 151 extracts voice features from the audio signal of the user's speech and inputs them to the comparison unit 152. The comparison unit 152 compares the voice features input from the voice analysis unit 151 with the voice features contained in the registered information input from the card reader/writer 17. If the comparison unit 152 judges that the compared voice features match (or are close), it sends the personal information contained in the registered information to the management server 101. The management server 101 then performs authentication, and the audio conference device 111A is connected to the relay server 102. The same operation takes place at the audio conference devices 111B and 111C, so that all the audio conference devices are connected via the relay server 102.
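The comparison step can be sketched as follows. The source only says that the features must "match (or be close)", so representing the voice features as a numeric vector, using cosine similarity, and the threshold of 0.9 are all assumptions chosen for illustration, as are the names `cosine_similarity` and `verify_speaker`.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine similarity of two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def verify_speaker(extracted, registered_info, threshold=0.9):
    """Compare live features with the features read from the IC card.

    Returns the personal information to forward for authentication when the
    features are close enough, otherwise None. The threshold is hypothetical.
    """
    registered = registered_info["features"]
    if len(extracted) != len(registered):
        return None
    if cosine_similarity(extracted, registered) >= threshold:
        return registered_info["personal_info"]
    return None
```

A `None` result corresponds to the case where the device would not forward the personal information, so the user would not be authenticated.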

Note that the management server 101 is not an essential component of the present invention. If the comparison unit 152 judges that the compared voice features match (or are close), it may instead send the personal information contained in the registered information to the relay server 102, and the relay server, on receiving it, may connect the audio conference device 111A. The same operation takes place at the audio conference devices 111B and 111C, so that all the audio conference devices are connected via the relay server 102.

The relay server 102 holds a management table that manages the connected audio conference devices and their users during a conference. FIG. 4 shows the management table. As shown in the figure, the relay server 102 records the name of each audio conference device (IP address, unique name, etc.), user ID, personal name, department name, and so on. This information is received from the management server 101. When the management server 101 is not used, the information is received from each audio conference device.
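One way to picture the management table is as a list of rows, one per authenticated participant. The sketch below is an assumption about its shape: the field names, the `Participant` and `ManagementTable` classes, and the sample values in the usage note are hypothetical, and only the kinds of fields (device name, user ID, personal name, department) come from the description.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Participant:
    device: str      # device name (IP address or unique name)
    user_id: str
    name: str
    department: str


class ManagementTable:
    """Rows of connected devices and the participants authenticated on them."""

    def __init__(self):
        self._rows = []

    def add(self, row):
        self._rows.append(row)

    def remove_device(self, device):
        """Drop every row for a device that has disconnected."""
        self._rows = [r for r in self._rows if r.device != device]

    def devices(self):
        """Return connected device names, without duplicates, in join order."""
        seen = []
        for row in self._rows:
            if row.device not in seen:
                seen.append(row.device)
        return seen

    def users_at(self, device):
        return [r.name for r in self._rows if r.device == device]
```

Because one device can serve several participants, as with users 1B and 1C at device 111B, the table is keyed per participant rather than per device.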

The relay server 102 sends information on all connected audio conference devices (IP address, unique name, etc.) to the audio conference devices 111A, 111B, and 111C. The display unit 16 of each audio conference device then displays information on the connected audio conference devices.

The relay server 102 may also send the personal information of each device's users (for example, name and department) to the audio conference devices 111A, 111B, and 111C. In that case, each audio conference device displays the personal information of the connected users.

Alternatively, personal information may be displayed only while a user is speaking, as follows.
When the user speaks, the communication unit 14 of the audio conference device 111A sends audio information to the relay server 102. At the same time, the voice analysis unit 151 extracts voice features from the audio signal of the user's speech and inputs them to the comparison unit 152. The comparison unit 152 compares the voice features input from the voice analysis unit 151 with the voice features contained in the registered information input from the card reader/writer 17. If the comparison unit 152 judges that the compared voice features match, it sends the personal information contained in the registered information to the relay server 102. The relay server 102 sends the personal information, together with the received audio information, to the other devices (the audio conference devices 111B and 111C). The control units 15 of the audio conference devices 111B and 111C receive the audio information and personal information from the relay server 102, emit the audio, and display the personal information on the display unit 16.

If the comparison unit 152, having compared the voice features input from the voice analysis unit 151 with the voice features contained in the registered information input from the card reader/writer 17, judges that they do not match, the control unit 15 displays a warning on the display unit 16 and sends error information to the relay server 102 so that the relay server 102 does not forward the audio information. Alternatively, the control unit 15 may set the communication unit 14 not to send the audio information.

Next, the destination selection operation of the audio conference system will be described. In this audio conference system, the connections among the audio conference devices 111 can be switched dynamically.
FIG. 5 shows the destination selection operation of the audio conference system. In the figure, user 1A is at location a, where the audio conference device 111A is installed; users 1B and 1C are at location b, where the audio conference device 111B is installed; and user 1D is at location c, where the audio conference device 111C is installed.

Users 1A to 1D of the audio conference devices 111 can freely select the destination of their own speech. That is, they can set the system so that audio information is sent only to specific users and not to the others. FIG. 5(A) shows an example in which the speech of users 1B and 1C is transmitted to the audio conference devices 111A and 111C, and FIG. 5(B) shows an example in which the speech of users 1B and 1C is transmitted only to the audio conference device 111A.

Once the user authentication shown in FIG. 3(B) has been performed, the relay server 102 connects the audio conference devices 111A, 111B, and 111C and starts the audio conference. The display unit 16 of each audio conference device displays information on the connected audio conference devices (for example, IP addresses) and the personal information of the connected users (for example, name and department).

When the audio conference devices are first connected, the audio information picked up by each device is transmitted to all the audio conference devices. That is, as shown in FIG. 5(A), the speech of users 1B and 1C at the audio conference device 111B is transmitted to the audio conference devices 111A and 111C.

During the conference, users 1B and 1C can look at the display unit 16 of the audio conference device 111B to see which devices and users are currently taking part. If user 1B or user 1C wants to confer temporarily with user 1A only, he or she uses the operation unit 18 to set the device to send audio only to the audio conference device 111A, for example by specifying the IP address or unique name of the audio conference device 111A.

When user 1B or user 1C uses the operation unit 18 to set audio to be sent only to the audio conference device 111A, the audio conference device 111B sends the relay server 102 destination designation information naming the audio conference device 111A. After receiving the destination designation information, the relay server 102 transmits audio information received from the audio conference device 111B only to the audio conference device 111A, and transmits audio information received from the audio conference device 111A only to the audio conference device 111B. Audio information received from the audio conference device 111C is transmitted to the audio conference devices 111A and 111B, although it may instead be discarded without being transmitted.
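The forwarding rule described above can be sketched as a small routing object on the relay server. This is an illustration only: the class name `RelayRouter`, its methods, and the `discard_outsiders` flag are hypothetical, and the sketch assumes at most one private pairing is relevant to a given sender.

```python
class RelayRouter:
    """Tracks private pairings and computes the recipients for each sender."""

    def __init__(self, devices):
        self.devices = list(devices)
        self.partner = {}  # device -> its private partner, if any

    def select_destination(self, source, destination):
        """Handle destination designation information sent by `source`."""
        if source not in self.devices or destination not in self.devices:
            raise ValueError("unknown device")
        if source == destination:
            raise ValueError("a device cannot designate itself")
        # Audio flows only between the two devices, in both directions.
        self.partner[source] = destination
        self.partner[destination] = source

    def recipients(self, sender, discard_outsiders=False):
        """Return the devices that should receive audio from `sender`."""
        partner = self.partner.get(sender)
        if partner is not None:
            return [partner]
        if discard_outsiders and self.partner:
            # Optional behavior: drop audio from devices outside the pairing.
            return []
        return [d for d in self.devices if d != sender]
```

With devices 111A, 111B, and 111C, calling `select_destination("111B", "111A")` makes `recipients("111B")` return only 111A and `recipients("111A")` return only 111B, while audio from 111C still reaches both unless `discard_outsiders` is set.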

For example, if users 1A, 1B, and 1C belong to the same company (Company A) and only user 1D belongs to another company (Company B), users 1A, 1B, and 1C may want to talk temporarily among themselves about Company A's confidential information. The destination selection operation of this embodiment is well suited to such a situation.

A user can also set the destination of audio information by designating a user taking part in the conference rather than an audio conference device. User 1B or user 1C looks at the display unit 16 of the audio conference device 111B and sets audio to be sent only to user 1A, for example by specifying user 1A's name.

When user 1B or user 1C uses the operation unit 18 to set audio to be sent only to user 1A, the audio conference device 111B sends the relay server 102 destination designation information naming user 1A. The relay server 102 refers to the management table shown in FIG. 4 and sets itself to transmit the audio information only to user 1A's audio conference device, that is, the audio conference device 111A. From then on, the relay server 102 transmits audio information received from the audio conference device 111B only to the audio conference device 111A, and transmits audio information received from the audio conference device 111A only to the audio conference device 111B. Audio information received from the audio conference device 111C may be transmitted to the audio conference devices 111A and 111B, or may be discarded without being transmitted.
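Resolving a designated user to a device is a lookup against the management table. In the sketch below, the row layout, the `kind`/`value` shape of the designation information, and the function name `resolve_device` are assumptions; the source states only that the relay server consults the table to find the designated user's device.

```python
# Hypothetical management-table rows mirroring the situation in FIG. 5.
ROWS = [
    {"device": "111A", "user_id": "U001", "name": "1A"},
    {"device": "111B", "user_id": "U002", "name": "1B"},
    {"device": "111B", "user_id": "U003", "name": "1C"},
    {"device": "111C", "user_id": "U004", "name": "1D"},
]


def resolve_device(rows, designation):
    """Map destination designation information to a connected device name.

    `designation` is assumed to look like {"kind": "user", "value": "1A"} or
    {"kind": "device", "value": "111A"}. Returns None if nothing matches.
    """
    kind, value = designation["kind"], designation["value"]
    if kind == "device":
        return value if any(r["device"] == value for r in rows) else None
    if kind == "user":
        for row in rows:
            if value in (row["name"], row["user_id"]):
                return row["device"]
        return None
    raise ValueError(f"unsupported designation kind: {kind}")
```

Either form of designation ends up as a device name, so the forwarding logic after the lookup can be the same for both.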

The above describes an example in which the user selects the destination with the operation unit 18, but the destination can also be designated by voice, with the voice analysis unit 151 extracting what was said. For example, when user 1B says "transmission selection, Mr. 1A", the voice analysis unit 151 performs voice recognition and the device sends the relay server 102 destination designation information naming user 1A. Similarly, when user 1C says "transmission selection, audio conference device 111A", the voice analysis unit 151 performs voice recognition and the device sends the relay server 102 destination designation information naming the audio conference device 111A.

The above describes an example in which the relay server 102 does not transmit the audio information, but the audio may instead be sent to devices other than the designated one in muted form (with its volume level lowered). In the example of FIG. 5(B), the relay server 102 transmits the audio information received from the audio conference device 111B to the audio conference device 111A and also to the audio conference device 111C. The audio information sent to the audio conference device 111C, however, has its volume level lowered before transmission. As a result, user 1D at the audio conference device 111C cannot make out what is being said, and the situation is effectively the same as when no audio information is transmitted.
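Lowering the volume level before forwarding amounts to scaling the samples. The sketch below assumes signed 16-bit PCM samples and a hypothetical gain of 0.05 for devices outside the private pairing; the source gives neither a sample format nor a gain value, and the function names are illustrative.

```python
def attenuate_pcm16(samples, gain):
    """Scale signed 16-bit PCM samples by `gain` (0.0 to 1.0), with clipping."""
    if not 0.0 <= gain <= 1.0:
        raise ValueError("gain must be between 0.0 and 1.0")
    return [max(-32768, min(32767, int(round(s * gain)))) for s in samples]


def frames_for(devices, sender, private_partner, samples, outsider_gain=0.05):
    """Build the per-device audio to forward for one frame from `sender`.

    The private partner receives the audio unchanged; every other device
    receives a volume-limited copy. `outsider_gain` is a hypothetical value.
    """
    frames = {}
    for device in devices:
        if device == sender:
            continue
        if device == private_partner:
            frames[device] = list(samples)
        else:
            frames[device] = attenuate_pcm16(samples, outsider_gain)
    return frames
```

In practice the gain would have to be low enough that speech is unintelligible at the receiving side, which is the effect the description aims for.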

In the situation of FIG. 5(B), user 1B or user 1C can resume the conversation with user 1D at any time. To do so, user 1B or user 1C uses the operation unit 18 to cancel the transmission selection. The audio conference apparatus 111B then transmits release information to the relay server 102. After receiving the release information, the relay server 102 transmits the audio information received from the audio conference apparatus 111B to both the audio conference apparatus 111A and the audio conference apparatus 111C.

User 1B and user 1C can also cancel the transmission selection by voice. For example, when user 1B says "transmission selection cancellation", the voice analysis unit 151 performs speech recognition and transmits release information to the relay server 102. After receiving the release information, the relay server 102 transmits the audio information received from the audio conference apparatus 111B to both the audio conference apparatus 111A and the audio conference apparatus 111C.
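The selection and release behavior of the relay server 102 described above can be sketched as a small routing table keyed by the sending apparatus. The class `RelayRouter` and its method names are hypothetical, and the sketch covers only the behavior described for audio coming from the apparatus that made the selection.

```python
from typing import Dict, Iterable, List, Set


class RelayRouter:
    """Hypothetical relay-side state: which destinations each sender has selected."""

    def __init__(self, devices: Iterable[str]) -> None:
        self.devices: List[str] = list(devices)
        self.designations: Dict[str, Set[str]] = {}

    def designate(self, sender: str, targets: Iterable[str]) -> None:
        """Handle connection destination designation information from a sender."""
        self.designations[sender] = set(targets)

    def release(self, sender: str) -> None:
        """Handle release information: the sender goes back to reaching everyone."""
        self.designations.pop(sender, None)

    def recipients(self, sender: str) -> List[str]:
        """Devices that should receive audio arriving from the given sender."""
        others = [d for d in self.devices if d != sender]
        selected = self.designations.get(sender)
        if selected is None:
            return others  # no selection in effect: normal conference routing
        return [d for d in others if d in selected]


# Example following FIG. 5(B): 111B selects 111A, then releases the selection.
router = RelayRouter(["111A", "111B", "111C"])
router.designate("111B", ["111A"])
during_selection = router.recipients("111B")
router.release("111B")
after_release = router.recipients("111B")
```

Keeping the selection as per-sender state means a release only has to remove one entry, after which routing falls back to the normal case without the conference being torn down.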

As described above, the audio conference system of this embodiment can dynamically switch call partners. A user who temporarily wants to talk only with specific users can easily switch the connection destination without ending the conference.

This embodiment describes an example in which each user is authenticated by recording a voice feature amount on an IC card and comparing it with the user's uttered speech. However, authentication may instead be performed using other biometric information (fingerprints, veins, etc.).

The recording medium on which the voice feature amount or other biometric information is recorded is not limited to an IC card. For example, a magnetic card, a medium such as a CD-ROM, or a portable storage device (a so-called USB memory or the like) may be used. In that case, the card reader/writer 17 is replaced by a reading unit suited to the recording medium (a CD-ROM drive if the recording medium is a CD-ROM).
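The comparison between the voice feature amount read from the recording medium and the one extracted from the user's speech could be sketched as follows. The specification does not say how the two feature amounts are compared; the use of cosine similarity, the fixed-length feature vectors, the threshold of 0.9, and the function names are all assumptions chosen only to make the idea concrete.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two feature vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an all-zero vector carries no usable feature information
    return dot / (norm_a * norm_b)


def authenticate(
    stored_features: Sequence[float],
    extracted_features: Sequence[float],
    threshold: float = 0.9,  # hypothetical acceptance threshold
) -> bool:
    """Accept the user when the stored and extracted features are similar enough."""
    return cosine_similarity(stored_features, extracted_features) >= threshold
```

The same comparison step would apply whichever recording medium supplies `stored_features`, since only the reading unit changes.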

A block diagram showing the configuration of the audio conference system.
A block diagram showing the configuration of the audio conference apparatus.
A diagram showing an example of registration and authentication.
A diagram showing the management table.
A diagram showing the transmission destination selection operation of the audio conference system.

Explanation of symbols

11 - microphone
12 - amplifier
13 - A/D converter
14 - communication unit
15 - control unit
16 - display unit
17 - card reader/writer
18 - operation unit
19 - D/A converter
20 - amplifier
21 - speaker

Claims

The audio conference system according to claim 1, wherein, in the transmission destination selection process, the transfer unit transmits audio information received from the destination audio conference apparatus described in the designation information and from the audio conference apparatus that received the designation information to audio conference apparatuses other than these, with the volume limited.

The audio conference system according to claim 1 or claim 2, wherein the authentication unit authenticates conference participants by biometric authentication.

The audio conference system according to claim 3, wherein the authentication unit comprises:
a reading unit that reads a recording medium on which voice feature amounts of conference participants are recorded; and
a voice analysis unit that extracts voice feature amounts of conference participants,
and wherein a conference participant is authenticated by comparing the voice feature amount read from the recording medium by the reading unit with the voice feature amount extracted by the voice analysis unit.

The audio conference system according to any one of claims 1 to 4, wherein the reception unit accepts an operation for designating a transmission destination by voice recognition.