Movatterモバイル変換


[0]ホーム

URL:


CN101697581A - Method, device and system for supporting simultaneous interpretation video conference - Google Patents

Method, device and system for supporting simultaneous interpretation video conference
Download PDF

Info

Publication number
CN101697581A
CN101697581ACN200910179642ACN200910179642ACN101697581ACN 101697581 ACN101697581 ACN 101697581ACN 200910179642 ACN200910179642 ACN 200910179642ACN 200910179642 ACN200910179642 ACN 200910179642ACN 101697581 ACN101697581 ACN 101697581A
Authority
CN
China
Prior art keywords
conference terminal
audio mixing
place
translated speech
meeting
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN200910179642A
Other languages
Chinese (zh)
Other versions
CN101697581B (en
Inventor
詹五洲
王东琦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Global Innovation Polymerization LLC
Tanous Co
Original Assignee
Shenzhen Huawei Communication Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Huawei Communication Technologies Co LtdfiledCriticalShenzhen Huawei Communication Technologies Co Ltd
Priority to CN2009101796421ApriorityCriticalpatent/CN101697581B/en
Publication of CN101697581ApublicationCriticalpatent/CN101697581A/en
Application grantedgrantedCritical
Publication of CN101697581BpublicationCriticalpatent/CN101697581B/en
Expired - Fee Relatedlegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Images

Landscapes

Abstract

The invention discloses a method, a device and a system for supporting a simultaneous interpretation video conference. The method comprises the following steps of: receiving interpretation voice which is obtained by interpreting original voice of a meeting place where a conference terminal is arranged and sent by the conference terminal; according to the language type of the interpretation voice, performing the voice mixing treatment of the interpretation voice to obtain the interpretation mixing voice of all types of language after the voice mixing treatment; and sending the interpretation mixing voice of all types of languages to the conference terminal corresponding to the meeting place supporting the corresponding language type. The method, the device and the system have the advantages that: the interpretation mechanisms are supported to be set up in all conference terminals, independent interpretation conference terminals are unnecessarily set up, the technical scheme is easily realized, and the practicability is much high.

Description

Support method, the Apparatus and system of simultaneous interpretation video conference
Technical field
The present invention relates to communication technical field, particularly a kind of method, Apparatus and system of supporting simultaneous interpretation video conference.
Background technology
Video conference is a kind of multimedia communication means, can realize the image between two or more places, the interactive function of voice-and-data simultaneously, utilizes television equipment and communication network to hold a meeting.Video meeting system generally by video conference terminal, transmission network and multipoint control unit (Multipoint Control Unit, hereinafter to be referred as: MCU) etc. several parts are formed.
When using multilingual in the video meeting system, need be to meeting-place speech carrying out simultaneous interpretation.Prior art is provided with special translation meeting-place in conference system, be responsible for certain a pair of language is translated mutually, as Chinese and English intertranslation.All be provided with the language form in meeting-place separately for other common meeting-place, the meeting-place of certain language form sends and receives all is the voice of the language form that self is provided with.MCU can judge the language form of the voice after the translation of uploading in the translation meeting-place, when MCU carries out the audio mixing processing, carry out audio mixing according to language form, for example will be together from the voice mixing in all Chinese meeting-place, if what uploaded in the translation meeting-place in addition is Chinese, then also be in the same place with the voice mixing in Chinese meeting-place.After all types of voice being carried out the audio mixing processing, send to the meeting-place of corresponding language type.
By above-mentioned analysis as can be known, the scheme of existing technology needs the dedicated translation meeting-place that is independent of each meeting-place, and whole video-signal system is comparatively complicated, realizes inconvenient.
Summary of the invention
The invention provides a kind of method, Apparatus and system of supporting simultaneous interpretation video conference, need set up special-purpose translation conference terminal in order to solve existing video-signal system, system is comparatively complicated, implements inconvenient problem.
The embodiment of the invention provides a kind of method of supporting simultaneous interpretation video conference, comprising:
Receive the translated speech that conference terminal sends, described translated speech obtains after by described conference terminal the raw tone in self meeting-place, place being translated;
According to the language form of described translated speech described translated speech is carried out audio mixing and handle, obtain the translation audio mixing of each language form after audio mixing is handled;
The described translation audio mixing of each language form is sent to the conference terminal of the meeting-place correspondence of supporting the corresponding language type.
The embodiment of the invention also provides a kind of method of supporting simultaneous interpretation video conference, comprising:
The raw tone in meeting-place, conference terminal place translated obtain translated speech, and send described translated speech to MCU;
Receive the translation audio mixing that described MCU sends, described translation audio mixing carries out the audio mixing processing by described MCU according to the language form of the translated speech of each conference terminal transmission and obtains, and the language form of described translation audio mixing is identical with the language form that described meeting-place is supported.
The embodiment of the invention also provides a kind of multipoint control unit of supporting simultaneous interpretation video conference, comprising:
Receiver module is used to receive the translated speech that conference terminal sends, and described translated speech obtains after by described conference terminal the raw tone in self meeting-place, place being translated;
The audio mixing module is used for according to the language form of described translated speech described translated speech being carried out audio mixing and handles, and obtains the translation audio mixing of each language form after audio mixing is handled;
Sending module is used for the conference terminal that described translation audio mixing with each language form sends to the meeting-place correspondence of supporting the corresponding language type.
The embodiment of the invention also provides a kind of conference terminal of supporting simultaneous interpretation video conference, comprising:
Translation module is used for the raw tone in meeting-place, conference terminal place translated and obtains translated speech;
Sending module is used for sending described translated speech to MCU;
Receiver module, be used to receive described MCU and send the translation audio mixing, described translation audio mixing carries out the audio mixing processing by described MCU according to the language form of the translated speech of each conference terminal transmission and obtains, and the language form of described translation audio mixing is identical with the language form that described meeting-place is supported.
The embodiment of the invention also provides a kind of system that supports simultaneous interpretation video conference, comprise MCU and at least one conference terminal, described conference terminal is used for the raw tone in meeting-place, conference terminal place translated and obtains translated speech, sends described translated speech to MCU; And receive the translation audio mixing that described MCU sends;
Described MCU, be used to receive the described translated speech that conference terminal sends, according to the language form of described translated speech described translated speech being carried out audio mixing handles, obtain the translation audio mixing of each language form after audio mixing is handled, the described translation audio mixing of each language form is sent to the conference terminal of the meeting-place correspondence of supporting the corresponding language type.
Method, the Apparatus and system of the support simultaneous interpretation video conference that the embodiment of the invention provides, each conference terminal can be translated back output to the raw tone in meeting-place according to the interpreter language type of self, by MCU the translated speech after the meeting terminal translation is carried out after audio mixing handles, send to other conference terminal again according to language form, independently translate conference terminal thereby make whole video-signal system need not to set up, scheme is easy to realize having very high practicality.
Description of drawings
The method flow diagram of the support simultaneous interpretation video conference that Fig. 1 provides for first embodiment of the invention;
The method flow diagram of the support simultaneous interpretation video conference that Fig. 2 provides for second embodiment of the invention;
The method flow diagram of the support simultaneous interpretation video conference that Fig. 3 provides for third embodiment of the invention;
The method flow diagram of the support simultaneous interpretation video conference that Fig. 4 provides for fourth embodiment of the invention;
The video conference application scenarios schematic diagram that Fig. 5 provides for fifth embodiment of the invention;
The video conference application scenarios schematic diagram that Fig. 6 provides for sixth embodiment of the invention;
The structural representation of the MCU that Fig. 7 provides for seventh embodiment of the invention;
The structural representation of the conference terminal that Fig. 8 provides for eighth embodiment of the invention;
The system configuration schematic diagram of the support simultaneous interpretation video conference that Fig. 9 provides for ninth embodiment of the invention.
Embodiment
For the purpose, technical scheme and the advantage that make the embodiment of the invention clearer, below in conjunction with the accompanying drawing in the embodiment of the invention, technical scheme in the embodiment of the invention is clearly and completely described, obviously, described embodiment is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills belong to the scope of protection of the invention not making all other embodiment that obtained under the creative work prerequisite.
In the system of the support simultaneous interpretation video conference that the embodiment of the invention provides, comprise the conference terminal at least one meeting-place, and the MCU that is connected with each conference terminal.The equipment of conference terminal mainly comprises video input-output apparatus, audio input/output device, Video Codec, audio codec, information communication device and multiplexed/signal separated time equipment etc.Video meeting system couples together conference terminal and MCU through communication network.MCU is the control core of system, and conference terminal all will be connected to MCU by standard interface, realizes the mixing and the exchange of image and voice.Main taking into account system is to processing of audio data in the embodiment of the invention, and at up direction, conference terminal utilizes microphone to pick up the voice data in meeting-place, after the encoded packing, is sent to other conference terminal by transmission network.At down direction, conference terminal receives the voice data of other conference terminals by transmission network, obtains the original sound data after decoding, and plays in the meeting-place by loud speaker.
In embodiments of the present invention, the conference terminal in each meeting-place can be provided with body translation, and the voice that receive are translated.Conference terminal can set in advance the language form of translated speech, according to the language form of the interpreter language that is provided with the raw tone in self meeting-place, place is translated.The language form of the translated speech that conference terminal is provided with can be a kind of for what fix, also can be for multiple.
The method flow diagram of the support simultaneous interpretation video conference that Fig. 1 provides for first embodiment of the invention, as shown in Figure 1, present embodiment supports the executive agent of the method for simultaneous interpretation video conference to can be MCU, method comprises:
Step 11, receive the translated speech that conference terminal sends, this translated speech obtains after by this conference terminal the raw tone in self meeting-place, place being translated.
In the practical application, pick up the raw tone of meeting-place speech by the conference terminal in meeting-place, obtain translated speech after translating, can carry out the transmission of translated speech by designated lane, this passage is the translated speech passage.Conference terminal sends to MCU by the translated speech passage with translated speech, and MCU receives the translated speech that conference terminal sends by the translated speech passage.
Step 12, according to the language form of this translated speech this translated speech is carried out audio mixing and handle, obtain the translation audio mixing of each language form after audio mixing is handled.
Because each conference terminal is translated the raw tone of self meeting-place, place speech, therefore, MCU need not to translate again, and the translated speech of each conference terminal is carried out sending to each conference terminal again and getting final product after audio mixing handles.
The audio mixing that MCU sends to certain conference terminal does not comprise the voice in the meeting-place at this conference terminal self place.The audio mixing processing policy that is MCU should make arbitrary meeting-place can't hear the sound of self, only hears the sound in other meeting-place.In addition, when the meeting-place of speech has when a plurality of, can set a plurality of meeting-place of participating in audio mixing is the bigger meeting-place of volume.
For example realize 3 side meeting-place audio mixings, the meeting-place of then participating in audio mixing is 3 side meeting-place of volume maximum, the audio mixing processing policy of 3 side meeting-place audio mixings is as follows: when having only a meeting-place to make a speech, the sound of oneself is can't hear in the meeting-place of then making a speech, and the sound in speech meeting-place all can be heard in other meeting-place; When make a speech simultaneously in two meeting-place, the both sides that then make a speech all can hear the other side's sound, can't hear the sound of oneself, and other meeting-place all can be heard the sound in two meeting-place of speech simultaneously; Current have 3 meeting-place or surpass meeting-place more than 3 when making a speech simultaneously, then audio mixing is participated in 3 side meeting-place of volume maximum, as T1, T2 and T3 is 3 side meeting-place of volume maximum in the active conference, then the sound in other two sides meeting-place can be heard in any one meeting-place among T1, T2 and the T3, and all the other meeting-place outside T1, T2 and the T3 then can be heard the audio mixing in T1, T2 and T3 meeting-place simultaneously.
Respectively describe in detail below among the embodiment, the audio mixing that can adopt the audio mixing processing policy identical with present embodiment to carry out raw tone or translated speech is handled, and repeats no more.
Step 13, this translation audio mixing of each language form sent to the conference terminal of the meeting-place correspondence of supporting the corresponding language type.
MCU just can send to this translation audio mixing the conference terminal of the meeting-place correspondence of supporting the corresponding language type according to the language form of each meeting-place support after having finished the audio mixing processing.As Chinese audio mixing is sent to Chinese meeting-place, English audio mixing is sent to English meeting-place, make each meeting field energy receive the translated speech of the language form of self supporting thus.
By MCU the translated speech after the meeting terminal translation is carried out will translate the conference terminal that audio mixing sends to the meeting-place of support corresponding language type according to language form again after audio mixing handles in the present embodiment.Whole video-signal system need not to set up independently translates conference terminal, be supported in each conference terminal and set up body translation, the language form of the translated speech of each conference terminal output all sets in advance, be convenient to MCU and voice carried out the audio mixing processing by the language type, scheme is easy to realize having very high practicality.
The method flow diagram of the support simultaneous interpretation video conference that Fig. 2 provides for second embodiment of the invention.The difference of first embodiment that present embodiment is corresponding with Fig. 1 is that present embodiment describes from the angle of handling raw tone, and as shown in Figure 2, present embodiment comprises:
The raw tone in this meeting-place, conference terminal place thatstep 21, reception conference terminal send.
Can carry out the transmission of raw tone by designated lane, this passage is the raw tone passage, and MCU receives the raw tone in this meeting-place, conference terminal place of conference terminal transmission by the raw tone passage.
Step 22, this raw tone is carried out audio mixing handle, obtain the original audio mixing after audio mixing is handled.
MCU can carry out audio mixing to raw tone according to the audio mixing processing policy to be handled, and each raw tone is carried out after audio mixing handles, and the original audio mixing of passing to certain meeting-place does not comprise the raw tone in this meeting-place, includes only the raw tone in other meeting-place.For example, pass to the original audio mixing in meeting-place 1, do not comprise the raw tone in meeting-place 1.
Optionally, can the raw tone that this meeting-place need be translated be carried out sending to this meeting-place after audio mixing is handled according to the specific requirement in certain meeting-place.For example, the body translation in meeting-place 2 is responsible for English is become Chinese with translator of French, and then MCU can only carry out the audio mixing processing with original English voice and French voice, and the original audio mixing that will obtain sends to the conference terminal in meeting-place 2.
Step 23, send this original audio mixing to this conference terminal.
MCU is after having carried out the audio mixing processing, just can original audio mixing be sent to each conference terminal by the raw tone passage, after each conference terminal receives original audio mixing, just can translate original audio mixing according to the language form that self meeting-place, place is supported, in the meeting-place, play then, perhaps, when requiring to play raw tone, do not translate and play-over raw tone in the meeting-place.Just realized the intercommunication between the different language type meeting-place thus.
MCU carries out the raw tone of each conference terminal to send to each conference terminal again after audio mixing is handled in the present embodiment, the language form of being supported according to self meeting-place, place by each conference terminal is handled the audio mixing of raw tone, makes the meeting-place of different language type can carry out the intercommunication of language.
The first embodiment of the present invention is described from the angle of handling translated speech, and second embodiment is described from the angle of handling raw tone.According to actual needs, both can be carried out combination, i.e. the present invention can set up bilingual sound passage and carry out the transmission of voice, and the translated speech passage is exclusively used in the transmission translated speech, and the raw tone passage is exclusively used in the transmission raw tone.Adopt the method for first embodiment and second embodiment to realize the respectively processing of system to raw tone and translated speech by bilingual sound passage.
The method flow diagram of the support simultaneous interpretation video conference that Fig. 3 provides for third embodiment of the invention.As shown in Figure 3, present embodiment supports the executive agent of the method for simultaneous interpretation video conference to can be conference terminal, and method comprises:
Step 31, the raw tone in meeting-place, conference terminal place translated obtain translated speech, and send this translated speech to MCU.
Conference terminal sends to MCU by the translated speech passage with translated speech after the raw tone in self meeting-place, place is translated, and by MCU translated speech is carried out audio mixing and handles.
Step 32, receive the translation audio mixing that this MCU sends, this translation audio mixing carries out audio mixing processing according to the language form of translated speech to the translated speech of each conference terminal transmission by this MCU and obtains, and language form of this translation audio mixing is identical with the language form that this meeting-place is supported.
Conference terminal receives the translation audio mixing that MCU sends by the translated speech passage.
Each conference terminal all is provided with the language form of translated speech in the present embodiment, and each conference terminal all has interpretative function, can translate back output to the raw tone in meeting-place.By MCU the translated speech after the meeting terminal translation is carried out sending to each conference terminal according to language form again after audio mixing handles.Make whole video-signal system need not to set up and independently translate conference terminal, and be convenient to MCU and by the language type voice carried out audio mixing and handle that scheme is easy to realize having very high practicality.
The method flow diagram of the support simultaneous interpretation video conference that Fig. 4 provides for fourth embodiment of the invention.The difference of the 3rd embodiment that present embodiment is corresponding with Fig. 3 is that present embodiment describes from the angle of handling raw tone, and as shown in Figure 4, present embodiment comprises:
Step 41, send the raw tone in meeting-place, conference terminal place to this MCU.
Conference terminal carries out audio mixing by MCU to raw tone and handles by the raw tone of raw tone passage to MCU transmission self meeting-place, place.
Step 42, receive the original audio mixing that this MCU sends, the raw tone that this original audio mixing is sent each conference terminal by this MCU is carried out audio mixing and is handled and obtain.
Conference terminal receives the original audio mixing that MCU sends by the raw tone passage, afterwards, just can this original audio mixing be translated, and play in the meeting-place according to the sound-type of self meeting-place, place support; When perhaps requiring to listen to raw tone, play-over this raw tone in the meeting-place.
Each conference terminal is transferred to MCU with raw tone in the present embodiment, by MCU the raw tone of each conference terminal is carried out sending to each conference terminal again after audio mixing is handled, the language form that each conference terminal is supported according to self meeting-place, place is handled the audio mixing of raw tone, makes the meeting-place of different language type can carry out the intercommunication of language.
The video conference application scenarios schematic diagram that Fig. 5 provides for fifth embodiment of the invention.Present embodiment is elaborated in conjunction with the technical scheme of concrete application scenarios to the method embodiment of support simultaneous interpretation video conference.
In the application scenarios as shown in Figure 5, video conference is carried out in Chinese meeting-place 1, Chinese meeting-place 2, English meeting-place 1 and French meeting-place 1, each meeting-place receives translated speech according to the language form of each self-supporting, the language form of supporting as Chinese meeting-place is Chinese, the translated speech that then Chinese meeting-place receives is a Chinese speech, other meeting-place are similar in this, repeat no more.The multilingual speech may be used in a meeting-place, the raw tone in meeting-place also should be the multilingual type mutually, in the present embodiment, the sound-type after the translation of the conference terminal in each meeting-place is for what fix, and promptly conference terminal is translated into the fixedly voice of language form with the raw tone in this meeting-place.Chinese is all translated into the voice in this meeting-place in Chinese meeting-place 1, and English is all translated into the voice in this meeting-place in Chinese meeting-place 2, and English is all translated into the voice in this meeting-place in English meeting-place 1, and French is all translated into the voice in this meeting-place in French meeting-place 1.
In the present embodiment, when meeting was carried out, the method for work of the conference terminal in MCU and each meeting-place was as follows:
(1) conference terminal in each meeting-place obtains the raw tone in self meeting-place, place, and raw tone is sent to MCU by the raw tone passage.Simultaneously the raw tone in self meeting-place, place is translated, obtained translated speech, and translated speech is sent to MCU by the translated speech passage.As, the conference terminal in Chinese meeting-place 1 sends to MCU by the translated speech passage after the voiced translation in this meeting-place is become Chinese.After the conference terminal in Chinese meeting-place 2 becomes English with the voiced translation in this meeting-place, send to MCU by the translated speech passage.
(2) MCU carries out the audio mixing processing with the raw tone of each conference terminal transmission, obtains original audio mixing, and original audio mixing is sent to each conference terminal.Simultaneously, the translated speech that MCU sends each conference terminal is carried out the audio mixing processing, obtains translating audio mixing, and according to the language form that each meeting-place, conference terminal place is supported, will translate audio mixing and send to each conference terminal by the translated speech passage.As MCU Chinese audio mixing is sent to Chinese meeting-place 1 and Chinese meeting-place 2, English audio mixing is sent to English meeting-place 1, the French audio mixing is sent to French meeting-place 1.
(3) each conference terminal receives the original audio mixing that MCU sends.Simultaneously, each conference terminal receives the translation audio mixing that MCU sends by the translated speech passage.
Because it is identical with the language form of this meeting-place support that MCU sends to the translation audio mixing of conference terminal in certain meeting-place, the translation audio mixing that receives as Chinese meeting-place is Chinese audio mixing.Therefore, each conference terminal can directly be play this translation audio mixing in the meeting-place, place.
For receiving original audio mixing, then can handle according to concrete demand, if any the meeting-place need translate, and then play, and the raw tone of not translating is wished to hear in the meeting-place that has, and then can not translate and play-over original audio mixing.
Each conference terminal all is provided with the language form of translated speech in the present embodiment, and each conference terminal all has interpretative function, can translate back output to the raw tone in meeting-place.By MCU the translated speech after the meeting terminal translation is carried out after audio mixing handles, send to other conference terminal again according to language form.Scheme is easy to realize having very high practicality.
The video conference application scenarios schematic diagram that Fig. 6 provides for sixth embodiment of the invention.Present embodiment is elaborated in conjunction with the technical scheme of concrete application scenarios to the method embodiment of support simultaneous interpretation video conference.
Difference at present embodiment and the 5th embodiment is that the language form of each meeting-place translation is a kind of for what fix among the 5th embodiment, and the language form of each meeting-place translation can be for multiple in the present embodiment.Make the translated resources in arbitrary meeting-place to share thus, can finish by same translator as Chinese and English intertranslation and Great Britain and France's literary composition intertranslation.Body translation as Chinese meeting-place 1 can carry out the translation that Chinese arrives English to English translation and French, and the translated speech that upload in then Chinese meeting-place 1 may be a Chinese or English.In the present embodiment, can increase the sound-type that the module of an identifiable language type uploads the meeting-place in MCU discerns.
In the present embodiment, the language form that is uploaded to the voice of MCU after the conference terminal translation can have multiple, the language form of the voice that 1 translation of Chinese meeting-place is uploaded is Chinese or English, the language form of the voice that 2 translations of Chinese meeting-place are uploaded is Chinese, the language form of the voice that 1 translation of English meeting-place is uploaded is English or French, and the language form of the voice that 1 translation of French meeting-place is uploaded is Chinese or French.
The processing method of the raw tone that MCU uploads conference terminal in the present embodiment is similar to the 5th embodiment, repeats no more herein.
In the present embodiment,, then discern the language form of interpreter language earlier, carry out audio mixing according to the language form of the translated speech of discerning again and handle by MCU for the translated speech that each conference terminal is uploaded.As at a time, MCU identifies the translated speech of uploading in Chinese meeting-place 1 and is Chinese, then this translated speech and other translator of Chinese voice is together carried out the audio mixing processing; At another constantly, MCU identifies the translated speech of uploading in Chinese meeting-place 1 and is English, then this translated speech and other translator of English voice is together carried out the audio mixing processing.The method of the translated speech after the identification being carried out the audio mixing processing is similar to the method among the 5th embodiment, repeats no more herein.
Wherein, the method of the language form of identification translated speech can for: in the packet of the translated speech that conference terminal is uploaded, add the language form sign, after MCU receives the packet of voice, discern the language form of this translated speech according to the sign of VoP.Perhaps, MCU adopts the language identification engine to discern the language form of the translated speech of this conference terminal transmission.
When the translated speech that present embodiment is uploaded when conference terminal has the multilingual type, earlier the language form of the translated speech uploaded is discerned, again the translated speech after the identification is carried out audio mixing and handle.Make the translated resources in meeting-place to share thus.Whole video-signal system need not to set up independently translates conference terminal, is supported in each conference terminal and sets up body translation, and scheme is easy to realize having very high practicality.
The structural representation of the MCU that Fig. 7 provides for seventh embodiment of the invention.As shown in Figure 7, present embodiment MCU comprises:receiver module 71,audio mixing module 72 and sendingmodule 73.
Receiver module 71 is used to receive the translated speech that conference terminal sends, and this translated speech obtains after by this conference terminal the raw tone in self meeting-place, place being translated.
Audio mixing module 72 is used for according to the language form of this translated speech this translated speech being carried out audio mixing to be handled, and obtains the translation audio mixing of each language form after audio mixing is handled.
Sendingmodule 73 is used for this translation audio mixing of each language form is sent to the conference terminal of the meeting-place correspondence of supporting the corresponding language type.
Receive the raw tone in other meeting-place and carry out respective handling according to real needs for the ease of each meeting-place.Thisreceiver module 71 also is used to receive the raw tone in this conference terminal self meeting-place, place that conference terminal sends.
Thisaudio mixing module 72 is used for that also this raw tone is carried out audio mixing to be handled, and obtains the original audio mixing of each language form after audio mixing is handled.
This sendingmodule 73 also is used for sending this original audio mixing to this conference terminal.
Wherein,receiver module 71 also is used for receiving the raw tone that conference terminal sends by the raw tone passage, receives the translated speech that conference terminal sends by the translated speech passage.Accordingly, sendingmodule 73 also is used for sending raw tone by the raw tone passage to conference terminal, sends translated speech by the translated speech passage to conference terminal.In the present embodiment, the raw tone passage is different passages with the translated speech passage.
The raw tone that the conference terminal that MCU receives sends can have multiple, and as being Chinese or English, at this moment, thisreceiver module 71 also is used for receiving by the raw tone passage of this conference terminal the raw tone of at least a language form.
The language form of the translated speech that conference terminal is provided with can be a kind of for what fix, also can be for multiple.Therefore the language form of the translated speech that receives of MCU can be for fixing a kind of, also can be for multiple.At this moment, thisreceiver module 71 also is used for receiving by the translated speech passage of this conference terminal the translated speech of at least a language form.
When the language form of the translated speech that sends when certain conference terminal is multiple, MCU also needs the language form of translated speech is judged, at this moment, MCU also comprises:identification module 74, when being used for translated speech that a meeting terminal in office sends and comprising at least two kinds of language forms, discern the language form of the translated speech that arbitrary conference terminal sends.
The function of MCU in the present embodiment, and MCU can not repeat them here referring to the record of the corresponding embodiment of Fig. 1~Fig. 6 with mutual mechanism and effect between the conference terminal.
Raw tone that MCU sends each conference terminal in the present embodiment and translated speech are carried out after audio mixing handles, by the raw tone passage original audio mixing is sent to each conference terminal, will translate audio mixing according to language form by the interpreter language passage and send to each conference terminal.Make video-signal system need not to set up and independently translate conference terminal, be supported in each conference terminal and set up body translation, make that video-signal system is easy to realize having very high practicality.
The structural representation of the conference terminal that Fig. 8 provides for eighth embodiment of the invention.As shown in Figure 8, the present embodiment conference terminal comprises: translation module 81, sending module 82 and receiver module 83.
Translation module 81 is used for the raw tone in meeting-place, conference terminal place translated and obtains translated speech.Conference terminal can set in advance the language form of translated speech, and the raw tone in meeting-place is translated back output, and the language form of the translated speech that conference terminal is provided with can be a kind of for what fix, also can be for multiple.
Sending module 82 is used for sending this translated speech to multipoint control unit MCU.After translated speech was sent to MCU, MCU carries out audio mixing to translated speech to be handled, and the translation audio mixing that audio mixing is handled sends to conference terminal.
Receiver module 83 is used to receive this MCU and sends the translation audio mixing, this translation audio mixing carries out audio mixing processing according to the language form of translated speech to the translated speech of each conference terminal transmission by this MCU and obtains, and language form of this translation audio mixing is identical with the language form that this meeting-place is supported.
For the raw tone in meeting-place, conference terminal directly sends to MCU, and this sending module 82 also is used for sending to this MCU the raw tone in meeting-place, conference terminal place.
After MCU receives original audio mixing, after carrying out the audio mixing processing, audio mixing is handled the original audio mixing that generates send to each conference terminal, the receiver module 83 of conference terminal also is used to receive this MCU and sends original audio mixing, and the raw tone that this original audio mixing is sent according to each conference terminal by described MCU is carried out the audio mixing processing and obtained.
Wherein, sending module 82 also is used for sending raw tone by the raw tone passage, sends translated speech by the translated speech passage.Accordingly, receiver module 83 also is used for receiving original audio mixing by the raw tone passage, receives the translation audio mixing by the translated speech passage.In the present embodiment, the raw tone passage is different passages with the translated speech passage.
The function of MCU in the present embodiment, and MCU can not repeat them here referring to the record of the corresponding embodiment of Fig. 1~Fig. 6 with mutual mechanism and effect between the conference terminal.
The conference terminal of present embodiment can be set up body translation, and the language form of the translated speech of conference terminal output all sets in advance, and is convenient to MCU and by the language type voice is carried out the audio mixing processing, makes that video-signal system is easy to realize.
The system configuration schematic diagram of the support simultaneous interpretation video conference that Fig. 9 provides for ninth embodiment of the invention.As shown in Figure 9, the system of present embodiment comprisesconference terminal 91 and MCU92.
Thisconference terminal 91 is used for the raw tone in meeting-place, conference terminal place translated and obtains translated speech, sends this translated speech to MCU92; And receive the translation audio mixing that this MCU92 sends.
This MCU92 is used to receive this translated speech thatconference terminal 91 sends, according to the language form of this translated speech this translated speech being carried out audio mixing handles, obtain the translation audio mixing of each language form after audio mixing is handled, this translation audio mixing of each language form is sent to theconference terminal 91 of the meeting-place correspondence of supporting the corresponding language type.
For making each conference terminal can receive the raw tone of other conference terminals, this MCU also is used to receive the original language in this meeting-place, conference terminal place thatconference terminal 91 sends, this raw tone is carried out audio mixing to be handled, obtain the original audio mixing of each language form after audio mixing is handled, send this original audio mixing to thisconference terminal 91.
Conference terminal 91 also is used for the raw tone to this MCU transmission self meeting-place, place, and receives this original audio mixing that this MCU92 sends.
Optionally, be connected with raw tone passage and translated speech passage betweenconference terminal 91 and the MCU92, by raw tone channel transfer raw tone, by translated speech channel transfer translated speech.The quantity ofconference terminal 91 is disposed according to actual needs, does not limit.The function of MCU92 and structure can be referring to the records of the corresponding embodiment of Fig. 7, the function ofconference terminal 91 and structure can be referring to the records of the corresponding embodiment of Fig. 8,conference terminal 91 can not repeat them here referring to the record of the corresponding embodiment of Fig. 1~Fig. 6 with mutual mechanism and effect between the MCU92.
Method, the Apparatus and system of the support simultaneous interpretation video conference that the embodiment of the invention provides, each conference terminal can be translated back output to the raw tone in meeting-place according to the interpreter language type of self, by MCU the translated speech after the meeting terminal translation is carried out after audio mixing handles, send to other conference terminal again according to language form, independently translate conference terminal thereby make whole video-signal system need not to set up, scheme is easy to realize having very high practicality.
One of ordinary skill in the art will appreciate that: accompanying drawing is the schematic diagram of an embodiment, and module in the accompanying drawing or flow process might not be that enforcement the present invention is necessary.
One of ordinary skill in the art will appreciate that: the module in the device among the embodiment can be described according to embodiment and be distributed in the device of embodiment, also can carry out respective change and be arranged in the one or more devices that are different from present embodiment.The module of the foregoing description can be merged into a module, also can further split into a plurality of submodules.
The invention described above embodiment sequence number is not represented the quality of embodiment just to description.
One of ordinary skill in the art will appreciate that: all or part of step that realizes said method embodiment can be finished by the relevant hardware of program command, aforesaid program can be stored in the computer read/write memory medium, this program is carried out the step that comprises said method embodiment when carrying out; And aforesaid storage medium comprises: various media that can be program code stored such as ROM, RAM, magnetic disc or CD.
It should be noted that at last: above embodiment only in order to technical scheme of the present invention to be described, is not intended to limit; Although with reference to previous embodiment the present invention is had been described in detail, those of ordinary skill in the art is to be understood that: it still can be made amendment to the technical scheme that previous embodiment is put down in writing, and perhaps part technical characterictic wherein is equal to replacement; And these modifications or replacement do not make the essence of appropriate technical solution break away from the spirit and scope of embodiment of the invention technical scheme.

Claims (20)

CN2009101796421A2009-10-262009-10-26Method, device and system for supporting simultaneous interpretation video conferenceExpired - Fee RelatedCN101697581B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN2009101796421ACN101697581B (en)2009-10-262009-10-26Method, device and system for supporting simultaneous interpretation video conference

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN2009101796421ACN101697581B (en)2009-10-262009-10-26Method, device and system for supporting simultaneous interpretation video conference

Publications (2)

Publication NumberPublication Date
CN101697581Atrue CN101697581A (en)2010-04-21
CN101697581B CN101697581B (en)2012-11-21

Family

ID=42142648

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN2009101796421AExpired - Fee RelatedCN101697581B (en)2009-10-262009-10-26Method, device and system for supporting simultaneous interpretation video conference

Country Status (1)

CountryLink
CN (1)CN101697581B (en)

Cited By (18)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN103200223A (en)*2013-02-212013-07-10中国对外翻译出版有限公司Method for achieving long-distance interpretation service
CN103218761A (en)*2011-08-242013-07-24卡西欧计算机株式会社Information processing device, information processing method, and computer readable storage medium
CN103716171A (en)*2013-12-312014-04-09广东公信数字设备有限公司Method, host computer and terminals for transmitting audio data
CN104780335A (en)*2015-03-262015-07-15中兴通讯股份有限公司Method and device for WebRTC P2P (web real-time communication peer-to-peer) audio and video call
CN105930322A (en)*2016-07-142016-09-07无锡科技职业学院Long-distance and manuscript-free simultaneous interpreting device system capable of realizing high-efficiency conversion
CN106294328A (en)*2016-07-262017-01-04四川传意荟能翻译有限公司A kind of online interpretation intelligent service system and method
CN108009161A (en)*2017-12-272018-05-08王全志Information output method, device
CN108090052A (en)*2018-01-052018-05-29深圳市沃特沃德股份有限公司Voice translation method and device
CN108615527A (en)*2018-05-102018-10-02腾讯科技(深圳)有限公司 Data processing method, device and storage medium based on simultaneous interpretation
CN108712271A (en)*2018-04-022018-10-26深圳市沃特沃德股份有限公司Interpretation method and translating equipment
CN109451574A (en)*2018-09-292019-03-08与德科技有限公司A kind of method, apparatus, terminal and the system of data transmission
CN109688367A (en)*2018-12-312019-04-26深圳爱为移动科技有限公司The method and system of the multilingual real-time video group chat in multiple terminals
CN109688363A (en)*2018-12-312019-04-26深圳爱为移动科技有限公司The method and system of private chat in the multilingual real-time video group in multiple terminals
WO2019191877A1 (en)*2018-04-022019-10-10深圳市沃特沃德股份有限公司Translation method, device and translation apparatus
WO2020047719A1 (en)*2018-09-032020-03-12深圳市欢太科技有限公司Shorthand method and device, terminal, and storage medium
CN111447397A (en)*2020-03-272020-07-24深圳市贸人科技有限公司Translation method and translation device based on video conference
CN111639503A (en)*2020-05-222020-09-08腾讯科技(深圳)有限公司Conference data processing method and device, storage medium and equipment
CN111868732A (en)*2020-06-192020-10-30深圳市台电实业有限公司Portable remote simultaneous interpretation translation platform

Cited By (24)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN103218761A (en)*2011-08-242013-07-24卡西欧计算机株式会社Information processing device, information processing method, and computer readable storage medium
CN103200223B (en)*2013-02-212016-08-03中国对外翻译出版有限公司The method realizing long-distance oral interpreting service
CN103200223A (en)*2013-02-212013-07-10中国对外翻译出版有限公司Method for achieving long-distance interpretation service
CN103716171A (en)*2013-12-312014-04-09广东公信数字设备有限公司Method, host computer and terminals for transmitting audio data
CN103716171B (en)*2013-12-312017-04-05广东公信智能会议股份有限公司A kind of audio data transmission method and main frame, terminal
CN104780335A (en)*2015-03-262015-07-15中兴通讯股份有限公司Method and device for WebRTC P2P (web real-time communication peer-to-peer) audio and video call
CN105930322B (en)*2016-07-142018-11-20无锡科技职业学院A kind of conversion of long distance high efficiency is without original text synchronous translation apparatus system
CN105930322A (en)*2016-07-142016-09-07无锡科技职业学院Long-distance and manuscript-free simultaneous interpreting device system capable of realizing high-efficiency conversion
CN106294328A (en)*2016-07-262017-01-04四川传意荟能翻译有限公司A kind of online interpretation intelligent service system and method
CN108009161A (en)*2017-12-272018-05-08王全志Information output method, device
CN108090052A (en)*2018-01-052018-05-29深圳市沃特沃德股份有限公司Voice translation method and device
CN108712271A (en)*2018-04-022018-10-26深圳市沃特沃德股份有限公司Interpretation method and translating equipment
WO2019191877A1 (en)*2018-04-022019-10-10深圳市沃特沃德股份有限公司Translation method, device and translation apparatus
CN108615527B (en)*2018-05-102021-10-15腾讯科技(北京)有限公司 Data processing method, device and storage medium based on simultaneous interpretation
CN108615527A (en)*2018-05-102018-10-02腾讯科技(深圳)有限公司 Data processing method, device and storage medium based on simultaneous interpretation
US12087290B2 (en)2018-05-102024-09-10Tencent Technology (Shenzhen) Company LimitedData processing method based on simultaneous interpretation, computer device, and storage medium
WO2020047719A1 (en)*2018-09-032020-03-12深圳市欢太科技有限公司Shorthand method and device, terminal, and storage medium
CN109451574A (en)*2018-09-292019-03-08与德科技有限公司A kind of method, apparatus, terminal and the system of data transmission
CN109688363A (en)*2018-12-312019-04-26深圳爱为移动科技有限公司The method and system of private chat in the multilingual real-time video group in multiple terminals
CN109688367A (en)*2018-12-312019-04-26深圳爱为移动科技有限公司The method and system of the multilingual real-time video group chat in multiple terminals
CN111447397A (en)*2020-03-272020-07-24深圳市贸人科技有限公司Translation method and translation device based on video conference
CN111639503A (en)*2020-05-222020-09-08腾讯科技(深圳)有限公司Conference data processing method and device, storage medium and equipment
CN111868732A (en)*2020-06-192020-10-30深圳市台电实业有限公司Portable remote simultaneous interpretation translation platform
CN111868732B (en)*2020-06-192023-07-07深圳市台电实业有限公司Portable remote simultaneous interpretation translation table

Also Published As

Publication numberPublication date
CN101697581B (en)2012-11-21

Similar Documents

PublicationPublication DateTitle
CN101697581B (en)Method, device and system for supporting simultaneous interpretation video conference
CN1937664B (en)System and method for realizing multi-language conference
CN112543297B (en)Video conference live broadcast method, device and system
CN101370114B (en)Video and audio processing method, multi-point control unit and video conference system
CN102460487B (en)The system and method for mixing course teaching
CN101702762B (en)Multipoint control unit for realizing multi-language conference and conference terminal
CN106816055A (en)A kind of low-power consumption live teaching broadcast recording and broadcasting system for interacting and method
US20140118471A1 (en)Video Conferencing Method and Device Thereof
CN111601069B (en)Intelligent conference system
CN105959613A (en)Digital conference equipment and system
US20110224969A1 (en)Method, a Media Server, Computer Program and Computer Program Product For Combining a Speech Related to a Voice Over IP Voice Communication Session Between User Equipments, in Combination With Web Based Applications
US11089164B2 (en)Teleconference recording management system
CN103096128A (en)Method capable of achieving video interaction, server, terminal and system
CN107079069A (en) Interpreter table for conference systems
NollThe evolution of media
CN111355918A (en)Intelligent remote video conference system
CN211509180U (en)Multifunctional audio and video processing equipment
CN104427295A (en)Method for processing video in video conference and terminal
CN110570701A (en)two-way voice interactive teaching system and method for synchronous video
CN102664900B (en)Media business supplying method and device, media business display packing and device
EP2351022A1 (en)Method, a media server, computer program and computer program product for combining a speech related to a voice over ip voice communication session between user equipments, in combination with web based applications
CN111696552B (en)Translation method, translation device and earphone
CN101179695A (en)Method for implementing recorded broadcast of session, session television system and terminal
CN201418130Y (en)Improved structure of video conference control instrument
CN114679550B (en)Universal recording device and method thereof

Legal Events

DateCodeTitleDescription
C06Publication
PB01Publication
C10Entry into substantive examination
SE01Entry into force of request for substantive examination
C14Grant of patent or utility model
GR01Patent grant
TR01Transfer of patent right
TR01Transfer of patent right

Effective date of registration:20180223

Address after:California, USA

Patentee after:Global innovation polymerization LLC

Address before:California, USA

Patentee before:Tanous Co.

Effective date of registration:20180223

Address after:California, USA

Patentee after:Tanous Co.

Address before:518129 Longgang District, Guangdong, Bantian HUAWEI base B District, building 2, building No.

Patentee before:HUAWEI DEVICE Co.,Ltd.

CF01Termination of patent right due to non-payment of annual fee
CF01Termination of patent right due to non-payment of annual fee

Granted publication date:20121121

Termination date:20211026


[8]ページ先頭

©2009-2025 Movatter.jp