CN101697581A

Movatterモバイル変換

Info

Publication number: CN101697581A
Application number: CN200910179642A
Authority: CN
Inventors: 詹五洲; 王东琦
Original assignee: Shenzhen Huawei Communication Technologies Co Ltd
Current assignee: Global Innovation Polymerization LLC; Tanous Co
Priority date: 2009-10-26
Filing date: 2009-10-26
Publication date: 2010-04-21
Anticipated expiration: 2029-10-26
Also published as: CN101697581B

Abstract

The invention discloses a method, a device and a system for supporting a simultaneous interpretation video conference. The method comprises the following steps of: receiving interpretation voice which is obtained by interpreting original voice of a meeting place where a conference terminal is arranged and sent by the conference terminal; according to the language type of the interpretation voice, performing the voice mixing treatment of the interpretation voice to obtain the interpretation mixing voice of all types of language after the voice mixing treatment; and sending the interpretation mixing voice of all types of languages to the conference terminal corresponding to the meeting place supporting the corresponding language type. The method, the device and the system have the advantages that: the interpretation mechanisms are supported to be set up in all conference terminals, independent interpretation conference terminals are unnecessarily set up, the technical scheme is easily realized, and the practicability is much high.

Description

Support method, the Apparatus and system of simultaneous interpretation video conference

Technical field

The present invention relates to communication technical field, particularly a kind of method, Apparatus and system of supporting simultaneous interpretation video conference.

Background technology

Video conference is a kind of multimedia communication means, can realize the image between two or more places, the interactive function of voice-and-data simultaneously, utilizes television equipment and communication network to hold a meeting.Video meeting system generally by video conference terminal, transmission network and multipoint control unit (Multipoint Control Unit, hereinafter to be referred as: MCU) etc. several parts are formed.

When using multilingual in the video meeting system, need be to meeting-place speech carrying out simultaneous interpretation.Prior art is provided with special translation meeting-place in conference system, be responsible for certain a pair of language is translated mutually, as Chinese and English intertranslation.All be provided with the language form in meeting-place separately for other common meeting-place, the meeting-place of certain language form sends and receives all is the voice of the language form that self is provided with.MCU can judge the language form of the voice after the translation of uploading in the translation meeting-place, when MCU carries out the audio mixing processing, carry out audio mixing according to language form, for example will be together from the voice mixing in all Chinese meeting-place, if what uploaded in the translation meeting-place in addition is Chinese, then also be in the same place with the voice mixing in Chinese meeting-place.After all types of voice being carried out the audio mixing processing, send to the meeting-place of corresponding language type.

By above-mentioned analysis as can be known, the scheme of existing technology needs the dedicated translation meeting-place that is independent of each meeting-place, and whole video-signal system is comparatively complicated, realizes inconvenient.

Summary of the invention

The invention provides a kind of method, Apparatus and system of supporting simultaneous interpretation video conference, need set up special-purpose translation conference terminal in order to solve existing video-signal system, system is comparatively complicated, implements inconvenient problem.

The embodiment of the invention provides a kind of method of supporting simultaneous interpretation video conference, comprising:

Receive the translated speech that conference terminal sends, described translated speech obtains after by described conference terminal the raw tone in self meeting-place, place being translated;

According to the language form of described translated speech described translated speech is carried out audio mixing and handle, obtain the translation audio mixing of each language form after audio mixing is handled;

The described translation audio mixing of each language form is sent to the conference terminal of the meeting-place correspondence of supporting the corresponding language type.

The embodiment of the invention also provides a kind of method of supporting simultaneous interpretation video conference, comprising:

The raw tone in meeting-place, conference terminal place translated obtain translated speech, and send described translated speech to MCU;

Receive the translation audio mixing that described MCU sends, described translation audio mixing carries out the audio mixing processing by described MCU according to the language form of the translated speech of each conference terminal transmission and obtains, and the language form of described translation audio mixing is identical with the language form that described meeting-place is supported.

The embodiment of the invention also provides a kind of multipoint control unit of supporting simultaneous interpretation video conference, comprising:

Receiver module is used to receive the translated speech that conference terminal sends, and described translated speech obtains after by described conference terminal the raw tone in self meeting-place, place being translated;

The audio mixing module is used for according to the language form of described translated speech described translated speech being carried out audio mixing and handles, and obtains the translation audio mixing of each language form after audio mixing is handled;

Sending module is used for the conference terminal that described translation audio mixing with each language form sends to the meeting-place correspondence of supporting the corresponding language type.

The embodiment of the invention also provides a kind of conference terminal of supporting simultaneous interpretation video conference, comprising:

Translation module is used for the raw tone in meeting-place, conference terminal place translated and obtains translated speech;

Sending module is used for sending described translated speech to MCU;

Receiver module, be used to receive described MCU and send the translation audio mixing, described translation audio mixing carries out the audio mixing processing by described MCU according to the language form of the translated speech of each conference terminal transmission and obtains, and the language form of described translation audio mixing is identical with the language form that described meeting-place is supported.

The embodiment of the invention also provides a kind of system that supports simultaneous interpretation video conference, comprise MCU and at least one conference terminal, described conference terminal is used for the raw tone in meeting-place, conference terminal place translated and obtains translated speech, sends described translated speech to MCU; And receive the translation audio mixing that described MCU sends;

Described MCU, be used to receive the described translated speech that conference terminal sends, according to the language form of described translated speech described translated speech being carried out audio mixing handles, obtain the translation audio mixing of each language form after audio mixing is handled, the described translation audio mixing of each language form is sent to the conference terminal of the meeting-place correspondence of supporting the corresponding language type.

Method, the Apparatus and system of the support simultaneous interpretation video conference that the embodiment of the invention provides, each conference terminal can be translated back output to the raw tone in meeting-place according to the interpreter language type of self, by MCU the translated speech after the meeting terminal translation is carried out after audio mixing handles, send to other conference terminal again according to language form, independently translate conference terminal thereby make whole video-signal system need not to set up, scheme is easy to realize having very high practicality.

Description of drawings

The method flow diagram of the support simultaneous interpretation video conference that Fig. 1 provides for first embodiment of the invention;

The method flow diagram of the support simultaneous interpretation video conference that Fig. 2 provides for second embodiment of the invention;

The method flow diagram of the support simultaneous interpretation video conference that Fig. 3 provides for third embodiment of the invention;

The method flow diagram of the support simultaneous interpretation video conference that Fig. 4 provides for fourth embodiment of the invention;

The video conference application scenarios schematic diagram that Fig. 5 provides for fifth embodiment of the invention;

The video conference application scenarios schematic diagram that Fig. 6 provides for sixth embodiment of the invention;

The structural representation of the MCU that Fig. 7 provides for seventh embodiment of the invention;

The structural representation of the conference terminal that Fig. 8 provides for eighth embodiment of the invention;

The system configuration schematic diagram of the support simultaneous interpretation video conference that Fig. 9 provides for ninth embodiment of the invention.

Embodiment

For the purpose, technical scheme and the advantage that make the embodiment of the invention clearer, below in conjunction with the accompanying drawing in the embodiment of the invention, technical scheme in the embodiment of the invention is clearly and completely described, obviously, described embodiment is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills belong to the scope of protection of the invention not making all other embodiment that obtained under the creative work prerequisite.

In the system of the support simultaneous interpretation video conference that the embodiment of the invention provides, comprise the conference terminal at least one meeting-place, and the MCU that is connected with each conference terminal.The equipment of conference terminal mainly comprises video input-output apparatus, audio input/output device, Video Codec, audio codec, information communication device and multiplexed/signal separated time equipment etc.Video meeting system couples together conference terminal and MCU through communication network.MCU is the control core of system, and conference terminal all will be connected to MCU by standard interface, realizes the mixing and the exchange of image and voice.Main taking into account system is to processing of audio data in the embodiment of the invention, and at up direction, conference terminal utilizes microphone to pick up the voice data in meeting-place, after the encoded packing, is sent to other conference terminal by transmission network.At down direction, conference terminal receives the voice data of other conference terminals by transmission network, obtains the original sound data after decoding, and plays in the meeting-place by loud speaker.

In embodiments of the present invention, the conference terminal in each meeting-place can be provided with body translation, and the voice that receive are translated.Conference terminal can set in advance the language form of translated speech, according to the language form of the interpreter language that is provided with the raw tone in self meeting-place, place is translated.The language form of the translated speech that conference terminal is provided with can be a kind of for what fix, also can be for multiple.

The method flow diagram of the support simultaneous interpretation video conference that Fig. 1 provides for first embodiment of the invention, as shown in Figure 1, present embodiment supports the executive agent of the method for simultaneous interpretation video conference to can be MCU, method comprises:

Step 11, receive the translated speech that conference terminal sends, this translated speech obtains after by this conference terminal the raw tone in self meeting-place, place being translated.

In the practical application, pick up the raw tone of meeting-place speech by the conference terminal in meeting-place, obtain translated speech after translating, can carry out the transmission of translated speech by designated lane, this passage is the translated speech passage.Conference terminal sends to MCU by the translated speech passage with translated speech, and MCU receives the translated speech that conference terminal sends by the translated speech passage.

Step 12, according to the language form of this translated speech this translated speech is carried out audio mixing and handle, obtain the translation audio mixing of each language form after audio mixing is handled.

Because each conference terminal is translated the raw tone of self meeting-place, place speech, therefore, MCU need not to translate again, and the translated speech of each conference terminal is carried out sending to each conference terminal again and getting final product after audio mixing handles.

The audio mixing that MCU sends to certain conference terminal does not comprise the voice in the meeting-place at this conference terminal self place.The audio mixing processing policy that is MCU should make arbitrary meeting-place can't hear the sound of self, only hears the sound in other meeting-place.In addition, when the meeting-place of speech has when a plurality of, can set a plurality of meeting-place of participating in audio mixing is the bigger meeting-place of volume.

For example realize 3 side meeting-place audio mixings, the meeting-place of then participating in audio mixing is 3 side meeting-place of volume maximum, the audio mixing processing policy of 3 side meeting-place audio mixings is as follows: when having only a meeting-place to make a speech, the sound of oneself is can't hear in the meeting-place of then making a speech, and the sound in speech meeting-place all can be heard in other meeting-place; When make a speech simultaneously in two meeting-place, the both sides that then make a speech all can hear the other side's sound, can't hear the sound of oneself, and other meeting-place all can be heard the sound in two meeting-place of speech simultaneously; Current have 3 meeting-place or surpass meeting-place more than 3 when making a speech simultaneously, then audio mixing is participated in 3 side meeting-place of volume maximum, as T1, T2 and T3 is 3 side meeting-place of volume maximum in the active conference, then the sound in other two sides meeting-place can be heard in any one meeting-place among T1, T2 and the T3, and all the other meeting-place outside T1, T2 and the T3 then can be heard the audio mixing in T1, T2 and T3 meeting-place simultaneously.

Respectively describe in detail below among the embodiment, the audio mixing that can adopt the audio mixing processing policy identical with present embodiment to carry out raw tone or translated speech is handled, and repeats no more.

Step 13, this translation audio mixing of each language form sent to the conference terminal of the meeting-place correspondence of supporting the corresponding language type.

MCU just can send to this translation audio mixing the conference terminal of the meeting-place correspondence of supporting the corresponding language type according to the language form of each meeting-place support after having finished the audio mixing processing.As Chinese audio mixing is sent to Chinese meeting-place, English audio mixing is sent to English meeting-place, make each meeting field energy receive the translated speech of the language form of self supporting thus.

By MCU the translated speech after the meeting terminal translation is carried out will translate the conference terminal that audio mixing sends to the meeting-place of support corresponding language type according to language form again after audio mixing handles in the present embodiment.Whole video-signal system need not to set up independently translates conference terminal, be supported in each conference terminal and set up body translation, the language form of the translated speech of each conference terminal output all sets in advance, be convenient to MCU and voice carried out the audio mixing processing by the language type, scheme is easy to realize having very high practicality.

The method flow diagram of the support simultaneous interpretation video conference that Fig. 2 provides for second embodiment of the invention.The difference of first embodiment that present embodiment is corresponding with Fig. 1 is that present embodiment describes from the angle of handling raw tone, and as shown in Figure 2, present embodiment comprises:

The raw tone in this meeting-place, conference terminal place thatstep 21, reception conference terminal send.

Can carry out the transmission of raw tone by designated lane, this passage is the raw tone passage, and MCU receives the raw tone in this meeting-place, conference terminal place of conference terminal transmission by the raw tone passage.

Step 22, this raw tone is carried out audio mixing handle, obtain the original audio mixing after audio mixing is handled.

MCU can carry out audio mixing to raw tone according to the audio mixing processing policy to be handled, and each raw tone is carried out after audio mixing handles, and the original audio mixing of passing to certain meeting-place does not comprise the raw tone in this meeting-place, includes only the raw tone in other meeting-place.For example, pass to the original audio mixing in meeting-place 1, do not comprise the raw tone in meeting-place 1.

Optionally, can the raw tone that this meeting-place need be translated be carried out sending to this meeting-place after audio mixing is handled according to the specific requirement in certain meeting-place.For example, the body translation in meeting-place 2 is responsible for English is become Chinese with translator of French, and then MCU can only carry out the audio mixing processing with original English voice and French voice, and the original audio mixing that will obtain sends to the conference terminal in meeting-place 2.

Step 23, send this original audio mixing to this conference terminal.

MCU is after having carried out the audio mixing processing, just can original audio mixing be sent to each conference terminal by the raw tone passage, after each conference terminal receives original audio mixing, just can translate original audio mixing according to the language form that self meeting-place, place is supported, in the meeting-place, play then, perhaps, when requiring to play raw tone, do not translate and play-over raw tone in the meeting-place.Just realized the intercommunication between the different language type meeting-place thus.

MCU carries out the raw tone of each conference terminal to send to each conference terminal again after audio mixing is handled in the present embodiment, the language form of being supported according to self meeting-place, place by each conference terminal is handled the audio mixing of raw tone, makes the meeting-place of different language type can carry out the intercommunication of language.

The first embodiment of the present invention is described from the angle of handling translated speech, and second embodiment is described from the angle of handling raw tone.According to actual needs, both can be carried out combination, i.e. the present invention can set up bilingual sound passage and carry out the transmission of voice, and the translated speech passage is exclusively used in the transmission translated speech, and the raw tone passage is exclusively used in the transmission raw tone.Adopt the method for first embodiment and second embodiment to realize the respectively processing of system to raw tone and translated speech by bilingual sound passage.

The method flow diagram of the support simultaneous interpretation video conference that Fig. 3 provides for third embodiment of the invention.As shown in Figure 3, present embodiment supports the executive agent of the method for simultaneous interpretation video conference to can be conference terminal, and method comprises:

Step 31, the raw tone in meeting-place, conference terminal place translated obtain translated speech, and send this translated speech to MCU.

Conference terminal sends to MCU by the translated speech passage with translated speech after the raw tone in self meeting-place, place is translated, and by MCU translated speech is carried out audio mixing and handles.

Step 32, receive the translation audio mixing that this MCU sends, this translation audio mixing carries out audio mixing processing according to the language form of translated speech to the translated speech of each conference terminal transmission by this MCU and obtains, and language form of this translation audio mixing is identical with the language form that this meeting-place is supported.

Conference terminal receives the translation audio mixing that MCU sends by the translated speech passage.

Each conference terminal all is provided with the language form of translated speech in the present embodiment, and each conference terminal all has interpretative function, can translate back output to the raw tone in meeting-place.By MCU the translated speech after the meeting terminal translation is carried out sending to each conference terminal according to language form again after audio mixing handles.Make whole video-signal system need not to set up and independently translate conference terminal, and be convenient to MCU and by the language type voice carried out audio mixing and handle that scheme is easy to realize having very high practicality.

The method flow diagram of the support simultaneous interpretation video conference that Fig. 4 provides for fourth embodiment of the invention.The difference of the 3rd embodiment that present embodiment is corresponding with Fig. 3 is that present embodiment describes from the angle of handling raw tone, and as shown in Figure 4, present embodiment comprises:

Step 41, send the raw tone in meeting-place, conference terminal place to this MCU.

Conference terminal carries out audio mixing by MCU to raw tone and handles by the raw tone of raw tone passage to MCU transmission self meeting-place, place.

Step 42, receive the original audio mixing that this MCU sends, the raw tone that this original audio mixing is sent each conference terminal by this MCU is carried out audio mixing and is handled and obtain.

Conference terminal receives the original audio mixing that MCU sends by the raw tone passage, afterwards, just can this original audio mixing be translated, and play in the meeting-place according to the sound-type of self meeting-place, place support; When perhaps requiring to listen to raw tone, play-over this raw tone in the meeting-place.

Each conference terminal is transferred to MCU with raw tone in the present embodiment, by MCU the raw tone of each conference terminal is carried out sending to each conference terminal again after audio mixing is handled, the language form that each conference terminal is supported according to self meeting-place, place is handled the audio mixing of raw tone, makes the meeting-place of different language type can carry out the intercommunication of language.

The video conference application scenarios schematic diagram that Fig. 5 provides for fifth embodiment of the invention.Present embodiment is elaborated in conjunction with the technical scheme of concrete application scenarios to the method embodiment of support simultaneous interpretation video conference.

In the application scenarios as shown in Figure 5, video conference is carried out in Chinese meeting-place 1, Chinese meeting-place 2, English meeting-place 1 and French meeting-place 1, each meeting-place receives translated speech according to the language form of each self-supporting, the language form of supporting as Chinese meeting-place is Chinese, the translated speech that then Chinese meeting-place receives is a Chinese speech, other meeting-place are similar in this, repeat no more.The multilingual speech may be used in a meeting-place, the raw tone in meeting-place also should be the multilingual type mutually, in the present embodiment, the sound-type after the translation of the conference terminal in each meeting-place is for what fix, and promptly conference terminal is translated into the fixedly voice of language form with the raw tone in this meeting-place.Chinese is all translated into the voice in this meeting-place in Chinese meeting-place 1, and English is all translated into the voice in this meeting-place in Chinese meeting-place 2, and English is all translated into the voice in this meeting-place in English meeting-place 1, and French is all translated into the voice in this meeting-place in French meeting-place 1.

In the present embodiment, when meeting was carried out, the method for work of the conference terminal in MCU and each meeting-place was as follows:

(1) conference terminal in each meeting-place obtains the raw tone in self meeting-place, place, and raw tone is sent to MCU by the raw tone passage.Simultaneously the raw tone in self meeting-place, place is translated, obtained translated speech, and translated speech is sent to MCU by the translated speech passage.As, the conference terminal in Chinese meeting-place 1 sends to MCU by the translated speech passage after the voiced translation in this meeting-place is become Chinese.After the conference terminal in Chinese meeting-place 2 becomes English with the voiced translation in this meeting-place, send to MCU by the translated speech passage.

(2) MCU carries out the audio mixing processing with the raw tone of each conference terminal transmission, obtains original audio mixing, and original audio mixing is sent to each conference terminal.Simultaneously, the translated speech that MCU sends each conference terminal is carried out the audio mixing processing, obtains translating audio mixing, and according to the language form that each meeting-place, conference terminal place is supported, will translate audio mixing and send to each conference terminal by the translated speech passage.As MCU Chinese audio mixing is sent to Chinese meeting-place 1 and Chinese meeting-place 2, English audio mixing is sent to English meeting-place 1, the French audio mixing is sent to French meeting-place 1.

(3) each conference terminal receives the original audio mixing that MCU sends.Simultaneously, each conference terminal receives the translation audio mixing that MCU sends by the translated speech passage.

Because it is identical with the language form of this meeting-place support that MCU sends to the translation audio mixing of conference terminal in certain meeting-place, the translation audio mixing that receives as Chinese meeting-place is Chinese audio mixing.Therefore, each conference terminal can directly be play this translation audio mixing in the meeting-place, place.

For receiving original audio mixing, then can handle according to concrete demand, if any the meeting-place need translate, and then play, and the raw tone of not translating is wished to hear in the meeting-place that has, and then can not translate and play-over original audio mixing.

Each conference terminal all is provided with the language form of translated speech in the present embodiment, and each conference terminal all has interpretative function, can translate back output to the raw tone in meeting-place.By MCU the translated speech after the meeting terminal translation is carried out after audio mixing handles, send to other conference terminal again according to language form.Scheme is easy to realize having very high practicality.

The video conference application scenarios schematic diagram that Fig. 6 provides for sixth embodiment of the invention.Present embodiment is elaborated in conjunction with the technical scheme of concrete application scenarios to the method embodiment of support simultaneous interpretation video conference.

Difference at present embodiment and the 5th embodiment is that the language form of each meeting-place translation is a kind of for what fix among the 5th embodiment, and the language form of each meeting-place translation can be for multiple in the present embodiment.Make the translated resources in arbitrary meeting-place to share thus, can finish by same translator as Chinese and English intertranslation and Great Britain and France's literary composition intertranslation.Body translation as Chinese meeting-place 1 can carry out the translation that Chinese arrives English to English translation and French, and the translated speech that upload in then Chinese meeting-place 1 may be a Chinese or English.In the present embodiment, can increase the sound-type that the module of an identifiable language type uploads the meeting-place in MCU discerns.

In the present embodiment, the language form that is uploaded to the voice of MCU after the conference terminal translation can have multiple, the language form of the voice that 1 translation of Chinese meeting-place is uploaded is Chinese or English, the language form of the voice that 2 translations of Chinese meeting-place are uploaded is Chinese, the language form of the voice that 1 translation of English meeting-place is uploaded is English or French, and the language form of the voice that 1 translation of French meeting-place is uploaded is Chinese or French.

The processing method of the raw tone that MCU uploads conference terminal in the present embodiment is similar to the 5th embodiment, repeats no more herein.

In the present embodiment,, then discern the language form of interpreter language earlier, carry out audio mixing according to the language form of the translated speech of discerning again and handle by MCU for the translated speech that each conference terminal is uploaded.As at a time, MCU identifies the translated speech of uploading in Chinese meeting-place 1 and is Chinese, then this translated speech and other translator of Chinese voice is together carried out the audio mixing processing; At another constantly, MCU identifies the translated speech of uploading in Chinese meeting-place 1 and is English, then this translated speech and other translator of English voice is together carried out the audio mixing processing.The method of the translated speech after the identification being carried out the audio mixing processing is similar to the method among the 5th embodiment, repeats no more herein.

Wherein, the method of the language form of identification translated speech can for: in the packet of the translated speech that conference terminal is uploaded, add the language form sign, after MCU receives the packet of voice, discern the language form of this translated speech according to the sign of VoP.Perhaps, MCU adopts the language identification engine to discern the language form of the translated speech of this conference terminal transmission.

When the translated speech that present embodiment is uploaded when conference terminal has the multilingual type, earlier the language form of the translated speech uploaded is discerned, again the translated speech after the identification is carried out audio mixing and handle.Make the translated resources in meeting-place to share thus.Whole video-signal system need not to set up independently translates conference terminal, is supported in each conference terminal and sets up body translation, and scheme is easy to realize having very high practicality.

The structural representation of the MCU that Fig. 7 provides for seventh embodiment of the invention.As shown in Figure 7, present embodiment MCU comprises:receiver module 71,audio mixing module 72 and sendingmodule 73.

Receiver module 71 is used to receive the translated speech that conference terminal sends, and this translated speech obtains after by this conference terminal the raw tone in self meeting-place, place being translated.

Audio mixing module 72 is used for according to the language form of this translated speech this translated speech being carried out audio mixing to be handled, and obtains the translation audio mixing of each language form after audio mixing is handled.

Sendingmodule 73 is used for this translation audio mixing of each language form is sent to the conference terminal of the meeting-place correspondence of supporting the corresponding language type.

Receive the raw tone in other meeting-place and carry out respective handling according to real needs for the ease of each meeting-place.Thisreceiver module 71 also is used to receive the raw tone in this conference terminal self meeting-place, place that conference terminal sends.

Thisaudio mixing module 72 is used for that also this raw tone is carried out audio mixing to be handled, and obtains the original audio mixing of each language form after audio mixing is handled.

This sendingmodule 73 also is used for sending this original audio mixing to this conference terminal.

Wherein,receiver module 71 also is used for receiving the raw tone that conference terminal sends by the raw tone passage, receives the translated speech that conference terminal sends by the translated speech passage.Accordingly, sendingmodule 73 also is used for sending raw tone by the raw tone passage to conference terminal, sends translated speech by the translated speech passage to conference terminal.In the present embodiment, the raw tone passage is different passages with the translated speech passage.

The raw tone that the conference terminal that MCU receives sends can have multiple, and as being Chinese or English, at this moment, thisreceiver module 71 also is used for receiving by the raw tone passage of this conference terminal the raw tone of at least a language form.

The language form of the translated speech that conference terminal is provided with can be a kind of for what fix, also can be for multiple.Therefore the language form of the translated speech that receives of MCU can be for fixing a kind of, also can be for multiple.At this moment, thisreceiver module 71 also is used for receiving by the translated speech passage of this conference terminal the translated speech of at least a language form.

When the language form of the translated speech that sends when certain conference terminal is multiple, MCU also needs the language form of translated speech is judged, at this moment, MCU also comprises:identification module 74, when being used for translated speech that a meeting terminal in office sends and comprising at least two kinds of language forms, discern the language form of the translated speech that arbitrary conference terminal sends.

The function of MCU in the present embodiment, and MCU can not repeat them here referring to the record of the corresponding embodiment of Fig. 1～Fig. 6 with mutual mechanism and effect between the conference terminal.

Raw tone that MCU sends each conference terminal in the present embodiment and translated speech are carried out after audio mixing handles, by the raw tone passage original audio mixing is sent to each conference terminal, will translate audio mixing according to language form by the interpreter language passage and send to each conference terminal.Make video-signal system need not to set up and independently translate conference terminal, be supported in each conference terminal and set up body translation, make that video-signal system is easy to realize having very high practicality.

The structural representation of the conference terminal that Fig. 8 provides for eighth embodiment of the invention.As shown in Figure 8, the present embodiment conference terminal comprises: translation module 81, sending module 82 and receiver module 83.

Translation module 81 is used for the raw tone in meeting-place, conference terminal place translated and obtains translated speech.Conference terminal can set in advance the language form of translated speech, and the raw tone in meeting-place is translated back output, and the language form of the translated speech that conference terminal is provided with can be a kind of for what fix, also can be for multiple.

Sending module 82 is used for sending this translated speech to multipoint control unit MCU.After translated speech was sent to MCU, MCU carries out audio mixing to translated speech to be handled, and the translation audio mixing that audio mixing is handled sends to conference terminal.

Receiver module 83 is used to receive this MCU and sends the translation audio mixing, this translation audio mixing carries out audio mixing processing according to the language form of translated speech to the translated speech of each conference terminal transmission by this MCU and obtains, and language form of this translation audio mixing is identical with the language form that this meeting-place is supported.

For the raw tone in meeting-place, conference terminal directly sends to MCU, and this sending module 82 also is used for sending to this MCU the raw tone in meeting-place, conference terminal place.

After MCU receives original audio mixing, after carrying out the audio mixing processing, audio mixing is handled the original audio mixing that generates send to each conference terminal, the receiver module 83 of conference terminal also is used to receive this MCU and sends original audio mixing, and the raw tone that this original audio mixing is sent according to each conference terminal by described MCU is carried out the audio mixing processing and obtained.

Wherein, sending module 82 also is used for sending raw tone by the raw tone passage, sends translated speech by the translated speech passage.Accordingly, receiver module 83 also is used for receiving original audio mixing by the raw tone passage, receives the translation audio mixing by the translated speech passage.In the present embodiment, the raw tone passage is different passages with the translated speech passage.

The conference terminal of present embodiment can be set up body translation, and the language form of the translated speech of conference terminal output all sets in advance, and is convenient to MCU and by the language type voice is carried out the audio mixing processing, makes that video-signal system is easy to realize.

The system configuration schematic diagram of the support simultaneous interpretation video conference that Fig. 9 provides for ninth embodiment of the invention.As shown in Figure 9, the system of present embodiment comprisesconference terminal 91 and MCU92.

Thisconference terminal 91 is used for the raw tone in meeting-place, conference terminal place translated and obtains translated speech, sends this translated speech to MCU92; And receive the translation audio mixing that this MCU92 sends.

This MCU92 is used to receive this translated speech thatconference terminal 91 sends, according to the language form of this translated speech this translated speech being carried out audio mixing handles, obtain the translation audio mixing of each language form after audio mixing is handled, this translation audio mixing of each language form is sent to theconference terminal 91 of the meeting-place correspondence of supporting the corresponding language type.

For making each conference terminal can receive the raw tone of other conference terminals, this MCU also is used to receive the original language in this meeting-place, conference terminal place thatconference terminal 91 sends, this raw tone is carried out audio mixing to be handled, obtain the original audio mixing of each language form after audio mixing is handled, send this original audio mixing to thisconference terminal 91.

Conference terminal 91 also is used for the raw tone to this MCU transmission self meeting-place, place, and receives this original audio mixing that this MCU92 sends.

Optionally, be connected with raw tone passage and translated speech passage betweenconference terminal 91 and the MCU92, by raw tone channel transfer raw tone, by translated speech channel transfer translated speech.The quantity ofconference terminal 91 is disposed according to actual needs, does not limit.The function of MCU92 and structure can be referring to the records of the corresponding embodiment of Fig. 7, the function ofconference terminal 91 and structure can be referring to the records of the corresponding embodiment of Fig. 8,conference terminal 91 can not repeat them here referring to the record of the corresponding embodiment of Fig. 1～Fig. 6 with mutual mechanism and effect between the MCU92.

One of ordinary skill in the art will appreciate that: accompanying drawing is the schematic diagram of an embodiment, and module in the accompanying drawing or flow process might not be that enforcement the present invention is necessary.

One of ordinary skill in the art will appreciate that: the module in the device among the embodiment can be described according to embodiment and be distributed in the device of embodiment, also can carry out respective change and be arranged in the one or more devices that are different from present embodiment.The module of the foregoing description can be merged into a module, also can further split into a plurality of submodules.

The invention described above embodiment sequence number is not represented the quality of embodiment just to description.

One of ordinary skill in the art will appreciate that: all or part of step that realizes said method embodiment can be finished by the relevant hardware of program command, aforesaid program can be stored in the computer read/write memory medium, this program is carried out the step that comprises said method embodiment when carrying out; And aforesaid storage medium comprises: various media that can be program code stored such as ROM, RAM, magnetic disc or CD.

It should be noted that at last: above embodiment only in order to technical scheme of the present invention to be described, is not intended to limit; Although with reference to previous embodiment the present invention is had been described in detail, those of ordinary skill in the art is to be understood that: it still can be made amendment to the technical scheme that previous embodiment is put down in writing, and perhaps part technical characterictic wherein is equal to replacement; And these modifications or replacement do not make the essence of appropriate technical solution break away from the spirit and scope of embodiment of the invention technical scheme.

Claims

1. a method of supporting simultaneous interpretation video conference is characterized in that, comprising:

2. the method for support simultaneous interpretation video conference according to claim 1 is characterized in that, also comprises:

Receive the raw tone in the meeting-place, described conference terminal place of conference terminal transmission;

Described raw tone is carried out audio mixing handle, obtain the original audio mixing after audio mixing is handled;

Send described original audio mixing to described conference terminal.

3. the method for support simultaneous interpretation video conference according to claim 2, it is characterized in that, receive the translated speech that described conference terminal sends, comprising: the described translated speech that receives at least a language form by the translated speech passage of described conference terminal;

Receive the raw tone in the meeting-place, described conference terminal place of conference terminal transmission, comprising: the raw tone passage by described conference terminal receives described raw tone;

Described translated speech passage is different passages with described raw tone passage.

4. the method for support simultaneous interpretation video conference according to claim 3, it is characterized in that, described translated speech is carried out before audio mixing handles, also comprise: when the translated speech of arbitrary described conference terminal transmission comprises at least two kinds of language forms, discern the language form of the translated speech of arbitrary described conference terminal transmission.

5. the method for support simultaneous interpretation video conference according to claim 2, it is characterized in that, the described translation audio mixing of each language form is sent to the conference terminal of the meeting-place correspondence of supporting the corresponding language type, and comprising: the translated speech passage by described conference terminal sends described translation audio mixing;

Send described original audio mixing to described conference terminal, comprising: the raw tone passage by described conference terminal sends described original audio mixing;

6. a method of supporting simultaneous interpretation video conference is characterized in that, comprising:

The raw tone in meeting-place, conference terminal place translated obtain translated speech, and send described translated speech to multipoint control unit MCU;

Receive the translation audio mixing that described MCU sends, described translation audio mixing carries out audio mixing processing according to the language form of translated speech to the translated speech of each conference terminal transmission by described MCU and obtains, and the language form of described translation audio mixing is identical with the language form that described meeting-place is supported.

7. the method for support simultaneous interpretation video conference according to claim 6 is characterized in that, also comprises:

Send the raw tone in meeting-place, conference terminal place to described MCU;

Receive the original audio mixing that described MCU sends, the raw tone that described original audio mixing is sent each conference terminal by described MCU is carried out the audio mixing processing and is obtained.

8. the method for support simultaneous interpretation video conference according to claim 7 is characterized in that, sends described translated speech to MCU, comprising: the translated speech passage by described conference terminal sends described translated speech;

To the raw tone in described MCU transmission meeting-place, conference terminal place, comprising: the raw tone passage by described conference terminal sends described raw tone;

9. the method for support simultaneous interpretation video conference according to claim 7 is characterized in that, receives the translation audio mixing that described MCU sends, and comprising: the translated speech passage by described conference terminal receives described translation audio mixing;

Receive the original audio mixing that described MCU sends, comprising: the raw tone passage by described conference terminal receives described original audio mixing;

10. a multipoint control unit of supporting simultaneous interpretation video conference is characterized in that, comprising:

11. the multipoint control unit of support simultaneous interpretation video conference according to claim 10 is characterized in that,

Described receiver module also is used to receive the raw tone in the meeting-place, described conference terminal place that conference terminal sends;

Described audio mixing module is used for that also described raw tone is carried out audio mixing and handles, and obtains the original audio mixing of each language form after audio mixing is handled;

Described sending module also is used for sending described original audio mixing to described conference terminal.

12. the multipoint control unit of support simultaneous interpretation video conference according to claim 11 is characterized in that,

Described receiver module also is used for receiving by the translated speech passage of described conference terminal the described translated speech of at least a language form; Described receiver module also is used for receiving described raw tone by the raw tone passage of described conference terminal;

13. the multipoint control unit of support simultaneous interpretation video conference according to claim 12 is characterized in that, also comprises:

Identification module is used for discerning the language form of the translated speech of arbitrary described conference terminal transmission when the translated speech that arbitrary described conference terminal sends comprises at least two kinds of language forms.

14. the multipoint control unit of support simultaneous interpretation video conference according to claim 11 is characterized in that,

Described sending module also is used for sending described translation audio mixing by the translated speech passage of described conference terminal; Described sending module also is used for sending described original audio mixing by the raw tone passage of described conference terminal; Described translated speech passage is different passages with described raw tone passage.

15. a conference terminal of supporting simultaneous interpretation video conference is characterized in that, comprising:

Sending module is used for sending described translated speech to multipoint control unit MCU;

Receiver module, be used to receive described MCU and send the translation audio mixing, described translation audio mixing carries out audio mixing processing according to the language form of translated speech to the translated speech of each conference terminal transmission by described MCU and obtains, and the language form of described translation audio mixing is identical with the language form that described meeting-place is supported.

16. the conference terminal of support simultaneous interpretation video conference according to claim 15 is characterized in that,

Described sending module also is used for the raw tone to described MCU transmission meeting-place, conference terminal place;

Described receiver module also is used to receive the original audio mixing that described MCU sends, and the raw tone that described original audio mixing is sent according to each conference terminal by described MCU is carried out the audio mixing processing and obtained.

17. the conference terminal of support simultaneous interpretation video conference according to claim 16 is characterized in that,

Described sending module also is used for sending described translated speech by the translated speech passage of described conference terminal; Described sending module also is used for sending described raw tone by the raw tone passage of described conference terminal; Described translated speech passage is different passages with described raw tone passage.

18. the conference terminal of support simultaneous interpretation video conference according to claim 16 is characterized in that,

Described receiver module also is used for receiving described translation audio mixing by the translated speech passage of described conference terminal; Described receiver module also is used for receiving described original audio mixing by the raw tone passage of described conference terminal; Described translated speech passage is different passages with described raw tone passage.

19. a system that supports simultaneous interpretation video conference comprises multipoint control unit MCU and at least one conference terminal, it is characterized in that,

Described conference terminal is used for the raw tone in meeting-place, conference terminal place translated and obtains translated speech, sends described translated speech to MCU; And receive the translation audio mixing that described MCU sends;

20. the system of support simultaneous interpretation video conference according to claim 19 is characterized in that,

Described MCU also is used to receive the original language in the meeting-place, described conference terminal place that conference terminal sends, and described raw tone is carried out audio mixing handle, and obtains the original audio mixing of each language form after audio mixing is handled, and sends described original audio mixing to described conference terminal;

Described conference terminal also is used for the raw tone to described MCU transmission self meeting-place, place, and receives the described original audio mixing that described MCU sends.