CN109120947A

Movatterモバイル変換

Info

Publication number: CN109120947A
Application number: CN201811031975.5A
Authority: CN
Inventors: 潘璠
Original assignee: Beijing Youku Technology Co Ltd
Current assignee: Alibaba China Co Ltd
Priority date: 2018-09-05
Filing date: 2018-09-05
Publication date: 2019-01-01

Abstract

The application embodiment discloses the voice private chat method and client of a kind of direct broadcasting room, wherein the described method includes: initiating the private chat request for being directed toward target user to voice server；The private chat voice messaging of the initiator is acquired, and the private chat voice messaging of acquisition is uploaded to the voice server, so that the voice server provides the private chat voice messaging to the target user by private chat channel；The private chat voice messaging provided by the target user that the voice server is sent by the private chat channel is provided, and receives the group chat voice messaging for the other users for being in same live streaming group with the initiator that the voice server is sent；After private chat voice messaging that the target user provides and the group chat voice messaging are synthesized one voice flow, the voice flow of synthesis is played.Technical solution provided by the present application can be improved the convenience linked up with other users.

Description

A kind of the voice private chat method and client of direct broadcasting room

Technical field

This application involves Internet technical field, in particular to the voice private chat method and client of a kind of direct broadcasting room.

Background technique

With the rise of net cast, a large amount of net cast platform is emerged.In net cast platform, it can divideBetween multiple net casts, usually presided over by main broadcaster between these net casts.Live content can be pushed to direct broadcast service by main broadcasterDevice, then the user in net cast can download from direct broadcast server and watch the live content between the net cast.

Currently, user is in watching video live broadcast, if it is desired to individually linked up with other users, usually can with thinkThe target user to be linked up jumps to an idle direct broadcasting room together, can be linked up by voice in the direct broadcasting room.Further, it is also possible to by way of text chat, by sending private chat text to target user, to realize with single user'sIt links up.

However, on the one hand will affect the net cast that user is currently viewed by way of jumping to idle direct broadcasting roomOn the other hand content will increase the operation complexity of user, in addition, equally will increase the behaviour of user in the way of communication textMake complexity.Therefore, when currently individually being linked up in direct broadcasting room with other users, operation can be faced and not enough easily askedTopic.

Summary of the invention

The purpose of the application embodiment is to provide the voice private chat method and client of a kind of direct broadcasting room, can be improved withThe convenience that other users are linked up.

To achieve the above object, the application embodiment provides a kind of voice private chat method of direct broadcasting room, the method packetIt includes: the private chat request for being directed toward target user is initiated to voice server, so that the voice server is requested in the private chatInitiator and the target user between establish private chat channel；The private chat voice messaging of the initiator is acquired, and will acquisitionThe private chat voice messaging be uploaded to the voice server so that the voice server by the private chat channel toThe target user provides the private chat voice messaging；Receive the voice server by the private chat channel send by instituteThe private chat voice messaging of target user's offer is stated, and receive that the voice server sends is in the initiator with alwaysBroadcast the group chat voice messaging of the other users of group；The private chat voice messaging that the target user is provided and group chat voice letterAfter breath synthesizes one voice flow, the voice flow of synthesis is played.

To achieve the above object, the application embodiment also provides a kind of client, and the client includes: private chat requestUnit is initiated, the private chat request for initiating to be directed toward target user to voice server, so that the voice server is in instituteIt states and establishes private chat channel between the initiator and the target user of private chat request；Private chat voice collecting unit, for acquiringThe private chat voice messaging of initiator is stated, and the private chat voice messaging of acquisition is uploaded to the voice server, so thatThe voice server provides the private chat voice messaging to the target user by the private chat channel；Voice messaging receivesUnit, the private chat voice provided by the target user sent for receiving the voice server by the private chat channelInformation, and receive the group chat voice for the other users that same live streaming group is in the initiator that the voice server is sentInformation；Voice flow synthesis unit, private chat voice messaging and the group chat voice messaging for providing the target user closeAfter one voice flow, the voice flow of synthesis is played.

To achieve the above object, the application embodiment also provides a kind of client, the client include processor andMemory, the memory is for storing computer program, when the computer program is executed by the processor, realizes above-mentionedMethod.

Detailed description of the invention

It, below will be to embodiment in order to illustrate more clearly of the application embodiment or technical solution in the prior artOr attached drawing needed to be used in the description of the prior art is briefly described, it should be apparent that, the accompanying drawings in the following description is onlyIt is some embodiments as described in this application, for those of ordinary skill in the art, in not making the creative labor propertyUnder the premise of, it is also possible to obtain other drawings based on these drawings.

Fig. 1 is the live broadcast system schematic diagram that voice connects wheat in the application embodiment；

Fig. 2 is the voice private chat method and step figure of direct broadcasting room in the application embodiment；

Fig. 3 is the schematic diagram of chat interface in the application embodiment；

Fig. 4 is the functional block diagram of client in the application embodiment；

Fig. 5 is the structural schematic diagram of client in the application embodiment.

Specific embodiment

In order to make those skilled in the art better understand the technical solutions in the application, below in conjunction with the application realityThe attached drawing in mode is applied, the technical solution in the application embodiment is clearly and completely described, it is clear that describedEmbodiment is only a part of embodiment of the application, rather than whole embodiments.Based on the embodiment party in the applicationFormula, every other embodiment obtained by those of ordinary skill in the art without making creative efforts, is all answeredWhen the range for belonging to the application protection.

The application provides a kind of voice private chat method of direct broadcasting room, and this method can be applied in system as shown in Figure 1.Referring to Fig. 1, video living transmission system may include voice server, direct broadcast server and client.Wherein, the clientIt can be the terminal device that user uses, in the terminal device, can have net cast software, and the terminal is setThe standby microphone that can above have the voice messaging for including user.In addition, the client can also refer to that the terminal is setThe net cast software of standby middle operation.The net cast software can call the microphone on the terminal device, to includeThe voice messaging of user.The voice server can be used for receiving the voice messaging for the user that each client uploads, and canThese voice messagings according to preset stream media protocol, are converted to voice flow.The direct broadcast server, then can receive masterThe live content that the terminal device broadcast is sent, and the live content can be converted to live streaming audio/video flow.

Referring to Fig. 2, the voice private chat method of direct broadcasting room provided by the present application may comprise steps of.

S1: the private chat request for being directed toward target user is initiated to voice server, so that the voice server is describedPrivate chat channel is established between the initiator and the target user of private chat request.

In the present embodiment, it when user watches live content in direct broadcasting room, can be used with the part in the direct broadcasting roomSame live streaming group is added in family.Different user in same live streaming group can carry out mutual communication by way of voice.ToolBody, user can open the function that voice in group connects wheat.In the case that voice connects the unlatching of wheat function in organizing, the Mike of userWind can acquire the voice messaging of user in real time.The voice messaging of acquisition can be uploaded to voice service by the client of userDevice.In voice server, voice messaging can be converted to the voice flow of user according to preset stream media protocol.This is pre-If stream media protocol for example can be HLS (HTTP Live Streaming, HTTP live stream) agreement.Certainly, this is presetStream media protocol can also be in the light of actual conditions modified.For example, the preset stream media protocol can also be WebRTC(Web Real-Time Communication, page real time communication) agreement.Subsequent, the user that unlatching voice connects wheat function canTo listen to the voice messaging of other users in same live streaming group.At this point, the client of user can initiate number to voice serverAccording to acquisition request.The user identifier of the user can be carried in the data acquisition request.In this way, voice server is receiving thisAfter data acquisition request, user identifier wherein included can be identified.By the user identifier, voice server can be determinedLive streaming group locating for the user identifier, then can by the live streaming group except the user identifier characterization voice flow in addition to otherThe voice flow of user is supplied to the client of the user.On the one hand it can enable the user to hear other in same live streaming groupOn the other hand the Instant audio messages of user also avoid the user from the voice messaging of uppick itself.In this way, user is from voiceThe voice messaging of other users can be used as group chat voice messaging in the same live streaming group obtained at server.The group chat voiceInformation can be obtained by the user in same live streaming group.

In the present embodiment, user carries out group in the live content of viewing direct broadcasting room and with the user of same live streaming groupWhen merely, it can choose with some user in live streaming group or individually linked up with the other users outside live streaming group.At this point, withThe client at family can initiate the private chat request for being directed toward target user to voice server, may include request in private chat requestThe user identifier of initiator and the user identifier of the target user.In this way, voice server is receiving private chat requestAfterwards, the user identifier of both sides can be therefrom extracted, and based on the user identifier extracted, between initiator and target userEstablish private chat channel.The private chat channel can be the channel for being used for transmission private chat voice messaging, the letter transmitted in the private chat channelBreath is independent of each other with the information in group chat channel.

S3: acquiring the private chat voice messaging of the initiator, and the private chat voice messaging of acquisition is uploaded to describedVoice server, so that the voice server provides the private chat voice to the target user by the private chat channelInformation.

In the present embodiment, after establishing private chat channel, in the client of two parties, private chat can occurThe prompt information of state.As shown in figure 3, may include the control of group chat and private chat in the operation interface of user, when private chat frequencyAfter road is established, private chat control can be activated.User can choose the private chat control of the activation in early operation interface.So whenWhen private chat control is selected, the voice messaging of user's typing can be used as private chat voice messaging, rather than group chat voice messaging.Certainly, in practical applications, private chat control can mutually be bound with target user.In this way, as shown in figure 3, when user need with it is moreWhen a user carries out private chat simultaneously, can occur multiple private chat controls, and each private chat control in the interface of the userIn can show the mark (Zhang San, Li Si) of other side, consequently facilitating user distinguishes the object of private chat.Correspondingly, when itIn one or more private chat control it is selected when, the private chat voice messaging of user will be sent to these selected private chat controlsAt the corresponding target user of part.

In the present embodiment, when the initiator of private chat request is under private chat state, the microphone of initiator can be withThe private chat voice messaging of initiator is acquired in the way of in step S1, and the private chat voice messaging of acquisition is uploaded to instituteState voice server.In the private chat voice messaging, the user identifier of the target user can be carried, so as to inform languageWhich user sound server, current private chat voice messaging should be sent to.

In the present embodiment, after voice server receives the private chat voice messaging, the mesh wherein carried can be identifiedThe user identifier of user is marked, so as to the private chat channel for establishing the private chat voice messaging before, is sent to target useFamily.Similarly, target user can reply private chat voice messaging to it after the private chat voice messaging of uppick initiator.ThisSample, voice server can also receive the private chat voice messaging of target user's offer.

In one embodiment, the client of user, can be to private after acquiring the private chat voice messaging of userMerely voice messaging carries out some optimization processings, so that the private chat voice messaging for being uploaded to voice server has higher sound quality.Firstly, client can all remove the sound in private chat voice messaging in addition to voice, so as to reduce environmental noise pairThe influence of voice.Specifically, client can identify the audio frequency characteristics in the private chat voice messaging.The audio frequency characteristics can wrapThe audio frequency characteristics for characterizing voice are included, can also include the audio frequency characteristics for characterizing environmental noise.Typically, voice is pastIt is past to have fixed frequency separation.For example, male sound may be typically located between 64~523Hz, female's sound is usually located at 160~Between 1200Hz.So, this corresponding relationship of voice and fixed frequency separation, can be used as standard voice feature.

In the present embodiment, it when the audio frequency characteristics for including in the private chat voice messaging of identification acquisition, can will be inThe private chat voice messaging of time-domain is converted to frequency domain, and the voice messaging in frequency domain can be and carry out according to frequencyDistribution, and each Frequency point can correspond to certain signal strength.At this point it is possible to be identified from the voice messaging of frequency domainSignal strength reaches the corresponding target frequency of information of specified intensity threshold value.The specified intensity threshold value can be set to human ear canThe intensity of sound obviously heard.In this way, the voice messaging of frequency domain can according to the specified intensity threshold value, be divided into it is multiple fromScattered voice segments, the intensity of voice messaging reaches the specified intensity threshold value in these voice segments.Voice in these voice segmentsInformation can have respective target frequency.These target frequencies can be as the audio for including in the private chat voice messagingFeature.It is then possible to calculate the frequency difference between target frequency frequency corresponding with standard voice feature.Specifically,The center frequency value of the frequency separation of male voice and female voice can be determined respectively.Then, it when calculating frequency difference, can first determineCurrent target frequency and which center frequency value are closer, it is then possible to calculate current target frequency and immediate frequencyFrequency difference between rate central value.The frequency difference can be as between current audio frequency characteristics and standard voice featureDifference value.

In the present embodiment, if the difference value is more than or equal to specified threshold, then it represents that current audio frequency characteristicsDiffer larger with standard voice feature, current audio frequency characteristics are likely to be environmental noise.Therefore, in this case, may be usedTo remove the corresponding information of the audio frequency characteristics from the private chat voice messaging, to filter in the private chat voice messagingComponent environment noise.Wherein, above-mentioned difference value can refer to the absolute value being calculated.The specified threshold can be according to realityBorder situation flexible setting.

In one embodiment, it is contemplated that after handling in a manner mentioned above private chat voice messaging, due toEnvironmental noise is eliminated, then there may be the mute of big section between voice adjacent in private chat voice messaging.FromFor the auditory effect of human ear, that people can be allowed to generate is uncomfortable for big section mute, while people can also be allowed to generate the illusion of communication disruption.MirrorIn this, can big section it is mute in be properly added the lower noise signal of some intensity, to eliminate above-mentioned problem.SpecificallyGround can identify target language segment in the private chat voice messaging, and the intensity value of any information is equal in the target language segmentLower than specified intensity threshold value.Wherein, it is lower than the specified intensity threshold value, shows for the angle of human ear, in the target language segmentVoice messaging can not be gone out by ear recognition, therefore, the target language segment be mute section.At this point it is possible to identify this mute sectionLasting duration show the target voice if the duration of the target language segment is more than or equal to specified duration threshold valueThe duration of Duan Chixu is too long, at this point it is possible to add specified noise signal in the target language segment.The specified noise signalCan be sound of the wind, sound of sea wave etc. will not allow human ear to generate uncomfortable white noise (White Noise).

In one embodiment, private chat voice messaging is carried out to handle it according to above-mentioned removal environmental noise the step ofAfterwards, it is more likely that the part signal in the initial position of normal voice and/or final position can be removed, so as to cause normal languageImperfect or normal voice the starting and/or termination of sound are excessively lofty.In consideration of it, can by the way of signal fitting,It is suitably the starting of voice and final position addition a part fitting information, to solve the problem above-mentioned.It specifically, can be withInitial position and the final position of voice are identified in the private chat voice messaging.Typically, there is language in voice messagingThe place of sound, the waveform that raising and lowering can all occur in the intensity of information can by the identification to information strength in voice messagingTo identify initial position and the final position of voice.At this point it is possible to according to the information waveform for the initial position identified and endThe information waveform that stop bit is set generates corresponding voice fitting information.Voice fitting information and the information of corresponding position splice itAfterwards, continuous waveform can be formed.In this way, adding the voice to match respectively at the initial position and the final positionIt is fitted information, the starting of voice and termination can be enabled more smooth, lofty feeling will not be generated.

In one embodiment, in the private chat voice messaging of the microphone acquisition of user, there may be echo signal,In order to enhance the audio experience of user, the echo signal in the private chat voice messaging can be identified, and from the private chat voiceThe echo signal is removed in information.Specifically, convergence algorithm can be carried out to input signal by sef-adapting filter, madeIt obtains and matches by the shock response that sef-adapting filter obtains with true echo path, so that it is corresponding to obtain echo pathThe estimated value of echo signal.It is then possible to the private chat voice messaging be subtracted to the estimated value of the echo signal, thus from the privateMerely echo signal is removed in voice messaging.

In one embodiment, user might have other people at one's side and speaking in typing private chat voice messaging, fromAnd lead to the sound in the voice messaging of typing there are others.In order to avoid other people sound causes to do to the sound of userIt disturbs, client is after collecting the private chat voice messaging of user, other people language that can will include in the private chat voice messagingMessage breath removal.Specifically, present embodiment can remove other people voice messaging by the method for Application on Voiceprint Recognition.The useIt family can be in advance in the client by a certain number of voice messagings of typing, so that client saves the vocal print of the userFeature.In this way, after client collects the private chat voice messaging of user, can identify the private chat language between net castThe vocal print feature for including in message breath, and the vocal print feature that will identify that is compared with the vocal print feature of the user.If the vocal print feature identified and the vocal print feature of the user are inconsistent, the vocal print feature that can be will identify thatCorresponding information is removed from the private chat voice messaging.Above-mentioned vocal print feature can be and utilize special Application on Voiceprint Recognition groupThe sound wave spectrum that part obtains after analyzing voice messaging.The generation of human language be Body Languages maincenter and vocal organs itBetween a complicated physiology physical process, tongue that people uses in speech, tooth, larynx, lung, nasal cavity is in terms of size and formEveryone is widely different, so the sound wave spectrum of different people is all variant, so that the vocal print feature between different userIt can also be different.Therefore, it is possible to remove the voice messaging of other users by vocal print feature.

S5: the private chat language provided by the target user that the voice server is sent by the private chat channel is receivedMessage breath, and receive the group chat language for the other users that same live streaming group is in the initiator that the voice server is sentMessage breath.

In the present embodiment, voice server again may be by the private chat channel, the private that target user is providedMerely voice messaging is sent to the initiator of private chat request.In addition, voice server can also be by group chat channel, it will be with the hairThe group chat voice messaging for playing other users of the side in same live streaming group is sent to the initiator together.In this way, in initiatorPrivate chat voice messaging and group chat voice messaging can be locally provided simultaneously with.

S7: private chat voice messaging that the target user provides and the group chat voice messaging are synthesized into one voice flowAfterwards, the voice flow of synthesis is played.

In the present embodiment, due to initiator's local reception to two kinds of voice messagings, in order to listen to simultaneously this twoKind of voice messaging needs private chat voice messaging that the target user provides and the group chat voice messaging synthesizing one languageSound stream, and the voice flow after synthesis is played by loudspeaker.In this way, both believing comprising group chat voice in voice flow in post synthesisBreath, and include private chat voice messaging, in addition, in practical applications, the live content of direct broadcasting room can also be added in voice flowVoice messaging so that user when carrying out private chat, will not miss other useful informations in direct broadcasting room.

Referring to Fig. 4, the application also provides a kind of client, the client includes:

Private chat request initiating cell, the private chat request for initiating to be directed toward target user to voice server, so that instituteIt states voice server and establishes private chat channel between the initiator and the target user that the private chat is requested；

Private chat voice collecting unit, for acquiring the private chat voice messaging of the initiator, and by the private chat of acquisitionVoice messaging is uploaded to the voice server, so that the voice server is used by the private chat channel to the targetFamily provides the private chat voice messaging；

Voice messaging receiving unit, for receive the voice server by the private chat channel send by the meshThe private chat voice messaging that user provides is marked, and receive that the voice server sends is in same live streaming group with the initiatorOther users group chat voice messaging；

Voice flow synthesis unit, private chat voice messaging and the group chat voice messaging for providing the target userAfter synthesizing one voice flow, the voice flow of synthesis is played.

In one embodiment, the client further include:

Difference value determination unit, for identification audio frequency characteristics in the private chat voice messaging, and the determining audio spyDifference value between sign and standard voice feature；

Voice messaging removal unit, if being more than or equal to specified threshold for the difference value, by the audio frequency characteristicsCorresponding information is removed from the private chat voice messaging.

In one embodiment, the client further include:

Vocal print feature recognition unit, the vocal print feature for including in the private chat voice messaging for identification, and will identify thatThe vocal print feature be compared with the vocal print feature of the initiator；

Voiceprint removal unit, if the vocal print feature for identifying and the vocal print feature of the initiator are differentIt causes, the corresponding information of the vocal print feature that will identify that is removed from the private chat voice messaging.

In one embodiment, the voice flow synthesis unit includes:

Group chat speech volume adjustment module, the volume of the private chat voice messaging for identification, and according to the institute identifiedVolume is stated, the volume of the group chat voice messaging is adjusted；

Track merging module, for merging into the group chat voice messaging after the private chat voice messaging and adjusting volumeOne track, and the information after track is merged is as the voice flow after synthesis.

Referring to Fig. 5, the application also provides a kind of client, the client includes memory and processor, described to depositReservoir when the computer program is executed by the processor, realizes the language of above-mentioned direct broadcasting room for storing computer programSound private chat method.

In the present embodiment, the memory may include the physical unit for storing information, usually by informationIt is stored again with the media using the methods of electricity, magnetic or optics after digitlization.Memory described in present embodiment again may be usedTo include: to store the device of information, such as RAM, ROM in the way of electric energy；The device of information is stored in the way of magnetic energy, it is such as hardDisk, floppy disk, tape, core memory, magnetic bubble memory, USB flash disk；Using the device of optical mode storage information, such as CD or DVD.Certainly, there are also memories of other modes, such as quantum memory, graphene memory etc..

In the present embodiment, the processor can be implemented in any suitable manner.For example, the processor can be withTake such as microprocessor or processor and storage can by (micro-) processor execute computer readable program code (such asSoftware or firmware) computer-readable medium, logic gate, switch, specific integrated circuit (Application SpecificIntegrated Circuit, ASIC), programmable logic controller (PLC) and the form etc. for being embedded in microcontroller.

The concrete function that the device that this specification embodiment provides, memory and processor are realized, can be with this theoryAforementioned embodiments in bright book contrast explanation, and can reach the technical effect of aforementioned embodiments, just no longer superfluous hereIt states.

In the 1990s, the improvement of a technology can be distinguished clearly be on hardware improvement (for example,Improvement to circuit structures such as diode, transistor, switches) or software on improvement (improvement for method flow).SoAnd with the development of technology, the improvement of current many method flows can be considered as directly improving for hardware circuit.Designer nearly all obtains corresponding hardware circuit by the way that improved method flow to be programmed into hardware circuit.CauseThis, it cannot be said that the improvement of a method flow cannot be realized with hardware entities module.For example, programmable logic device(Programmable Logic Device, PLD) (such as field programmable gate array (Field Programmable GateArray, FPGA)) it is exactly such a integrated circuit, logic function determines device programming by user.By designerVoluntarily programming comes a digital display circuit " integrated " on a piece of PLD, designs and makes without asking chip makerDedicated IC chip.Moreover, nowadays, substitution manually makes IC chip, this programming is also used instead mostly " is patrolledVolume compiler (logic compiler) " software realizes that software compiler used is similar when it writes with program development,And the source code before compiling also write by handy specific programming language, this is referred to as hardware description language(Hardware Description Language, HDL), and HDL is also not only a kind of, but there are many kind, such as ABEL(Advanced Boolean Expression Language)、AHDL(Altera Hardware DescriptionLanguage)、Confluence、CUPL(Cornell University Programming Language)、HDCal、JHDL(Java Hardware Description Language)、Lava、Lola、MyHDL、PALASM、RHDL(RubyHardware Description Language) etc., VHDL (Very-High-Speed is most generally used at presentIntegrated Circuit Hardware Description Language) and Verilog.Those skilled in the art also answerThis understands, it is only necessary to method flow slightly programming in logic and is programmed into integrated circuit with above-mentioned several hardware description languages,The hardware circuit for realizing the logical method process can be readily available.

It is also known in the art that other than realizing server in a manner of pure computer readable program code, it is completeEntirely can by by method and step carry out programming in logic come so that server with logic gate, switch, specific integrated circuit, programmableLogic controller realizes identical function with the form for being embedded in microcontroller etc..Therefore this server is considered one kindHardware component, and the structure that the unit for realizing various functions for including in it can also be considered as in hardware component.OrEven, can will be considered as realizing the unit of various functions either the software module of implementation method can be Hardware Subdivision againStructure in part.

As seen through the above description of the embodiments, those skilled in the art can be understood that the application canIt realizes by means of software and necessary general hardware platform.Based on this understanding, the technical solution essence of the applicationOn in other words the part that contributes to existing technology can be embodied in the form of software products, the computer software productIt can store in storage medium, such as ROM/RAM, magnetic disk, CD, including some instructions are used so that a computer equipment(can be personal computer, server or the network equipment etc.) executes each embodiment of the application or embodimentMethod described in certain parts.

Each embodiment in this specification is described in a progressive manner, same and similar between each embodimentPart may refer to each other, what each embodiment stressed is the difference with other embodiments.In particular, needleFor the embodiment of client, the introduction control for being referred to the embodiment of preceding method is explained.

The application can describe in the general context of computer-executable instructions executed by a computer, such as programModule.Generally, program module includes routines performing specific tasks or implementing specific abstract data types, programs, objects, groupPart, data structure etc..The application can also be practiced in a distributed computing environment, in these distributed computing environments, byTask is executed by the connected remote processing devices of communication network.In a distributed computing environment, program module can be withIn the local and remote computer storage media including storage equipment.

Although depicting the application by embodiment, it will be appreciated by the skilled addressee that there are many deformations by the applicationWith variation without departing from spirit herein, it is desirable to which the attached claims include these deformations and change without departing from the applicationSpirit.

Claims

1. a kind of voice private chat method of direct broadcasting room, which is characterized in that the described method includes:

The private chat request for being directed toward target user is initiated to voice server, so that the voice server is requested in the private chatInitiator and the target user between establish private chat channel；

The private chat voice messaging of the initiator is acquired, and the private chat voice messaging of acquisition is uploaded to the voice serviceDevice, so that the voice server provides the private chat voice messaging to the target user by the private chat channel；

The private chat voice messaging provided by the target user that the voice server is sent by the private chat channel is provided,And receive the group chat voice messaging for the other users that same live streaming group is in the initiator that the voice server is sent；

After private chat voice messaging that the target user provides and the group chat voice messaging are synthesized one voice flow, playThe voice flow of synthesis.

2. the method according to claim 1, wherein after the private chat voice messaging for acquiring the initiator,The method also includes:

It identifies the audio frequency characteristics in the private chat voice messaging, and determines the difference between the audio frequency characteristics and standard voice featureDifferent value；

If the difference value is more than or equal to specified threshold, the corresponding information of the audio frequency characteristics is believed from the private chat voiceIt is removed in breath.

3. according to the method described in claim 2, it is characterized in that, identify the audio frequency characteristics in the private chat voice messaging, andDetermine that the difference value between the audio frequency characteristics and standard voice feature includes:

The private chat voice messaging in time-domain is converted to frequency domain, and identifies letter from the voice messaging of frequency domainNumber intensity reaches the corresponding target frequency of information of specified intensity threshold value, and the target frequency that will identify that is as the privateThe audio frequency characteristics for merely including in voice messaging；

The frequency difference between the target frequency and standard voice frequency is calculated, and using the frequency difference as the audioDifference value between feature and standard voice feature.

4. according to the method described in claim 2, it is characterized in that, by the corresponding information of the audio frequency characteristics from the private chatAfter being removed in voice messaging, the method also includes:

Target language segment is identified in the private chat voice messaging, the intensity value of any information is below in the target language segmentSpecified intensity threshold value；

If the duration of the target language segment is more than or equal to specified duration threshold value, added in the target language segment specifiedNoise signal.

5. according to the method described in claim 2, it is characterized in that, by the corresponding information of the audio frequency characteristics from the private chatAfter being removed in voice messaging, the method also includes:

Identify initial position and the final position of voice in the private chat voice messaging, and in the initial position and describedThe voice fitting information to match is added at final position respectively.

6. the method according to claim 1, wherein after the private chat voice messaging for acquiring the initiator,The method also includes:

It identifies the echo signal in the private chat voice messaging, and removes the echo signal from the private chat voice messagingIt removes.

7. the method according to claim 1, wherein after the private chat voice messaging for acquiring the initiator,The method also includes:

The vocal print feature that identifies the vocal print feature for including in the private chat voice messaging, and will identify that and the initiatorVocal print feature be compared；

If the vocal print feature identified and the vocal print feature of the initiator are inconsistent, the vocal print feature that will identify thatCorresponding information is removed from the private chat voice messaging.

8. the method according to claim 1, wherein the private chat voice messaging that the target user is provided and instituteIt states group chat voice messaging and synthesizes one voice flow and include:

It identifies the volume of the private chat voice messaging, and according to the volume identified, adjusts the group chat voice messagingVolume；

Group chat voice messaging after the private chat voice messaging and adjusting volume is merged into a track, and track is mergedInformation afterwards is as the voice flow after synthesis.

9. according to the method described in claim 8, it is characterized in that, adjusting the group chat language according to the volume identifiedMessage breath volume include:

If the volume of the private chat voice messaging identified is more than or equal to specified volume threshold, the group chat voice is believedThe volume adjustment of breath is to the first volume；

When the volume of the group chat voice messaging is in first volume, if the sound of the private chat voice messaging identifiedAmount is less than the specified volume threshold, by the volume adjustment of the group chat voice messaging to the second volume；Wherein, first soundAmount is less than second volume.

10. a kind of client, which is characterized in that the client includes:

Private chat request initiating cell, the private chat request for initiating to be directed toward target user to voice server, so that institute's predicateSound server establishes private chat channel between the initiator that the private chat is requested and the target user；

Private chat voice collecting unit, for acquiring the private chat voice messaging of the initiator, and by the private chat voice of acquisitionInformation is uploaded to the voice server, so that the voice server is mentioned by the private chat channel to the target userFor the private chat voice messaging；

Voice messaging receiving unit is used by what the private chat channel was sent by the target for receiving the voice serverThe private chat voice messaging that family provides, and receive that the voice server sends with the initiator be in same live streaming group itsThe group chat voice messaging of his user；

Voice flow synthesis unit, private chat voice messaging and group chat voice messaging synthesis for providing the target userAfter one voice flow, the voice flow of synthesis is played.

11. client according to claim 10, which is characterized in that the client further include:

Difference value determination unit, audio frequency characteristics in the private chat voice messaging for identification, and determine the audio frequency characteristics withDifference value between standard voice feature；

Voice messaging removal unit, it is if being more than or equal to specified threshold for the difference value, the audio frequency characteristics are correspondingInformation removed from the private chat voice messaging.

12. client according to claim 10, which is characterized in that the client further include:

Vocal print feature recognition unit, the vocal print feature for including in the private chat voice messaging for identification, and the institute that will identify thatVocal print feature is stated to be compared with the vocal print feature of the initiator；

Voiceprint removal unit, if the vocal print feature for identifying and the vocal print feature of the initiator are inconsistent,The corresponding information of the vocal print feature that will identify that is removed from the private chat voice messaging.

13. client according to claim 10, which is characterized in that the voice flow synthesis unit includes:

Group chat speech volume adjustment module, the volume of the private chat voice messaging for identification, and according to the sound identifiedAmount, adjusts the volume of the group chat voice messaging；

Track merging module, for the group chat voice messaging after the private chat voice messaging and adjusting volume to be merged into oneTrack, and the information after track is merged is as the voice flow after synthesis.

14. a kind of client, which is characterized in that the client includes processor and memory, and the memory is for storingComputer program when the computer program is executed by the processor, is realized such as any claim in claim 1 to 9The method.