CN107613400B

Info

Publication number: CN107613400B
Application number: CN201710862107.0A
Authority: CN
Inventors: 蔡西颖
Original assignee: Beijing QIYI Century Science and Technology Co Ltd
Current assignee: Beijing QIYI Century Science and Technology Co Ltd
Priority date: 2017-09-21
Filing date: 2017-09-21
Publication date: 2021-03-26
Anticipated expiration: 2037-09-21
Also published as: CN107613400A

Abstract

The invention provides a method and a device for realizing a voice barrage. The method comprises: receiving voice barrage information sent by a client, wherein the voice barrage information comprises a video identifier corresponding to a video playing interface of the client, voice content, and a video playing time point at which a client user triggers a voice barrage operation; determining a target client whose playing video matches the video identifier; and, when the video playing progress of the video played by the target client reaches the video playing time point, controlling the target client to output the voice content, wherein the playing frequency band of the voice content is different from the playing frequency band of the video. The invention thus outputs the voice barrage and the video sound simultaneously without interfering with the sound of the currently played video, thereby realizing normal playing of the voice barrage.

Description

Method and device for realizing voice barrage

Technical Field

The invention relates to the technical field of video playing, in particular to a method and a device for realizing a voice barrage.

Background

With the rapid development of internet technology, internet videos are increasingly popular with users, and almost all video platforms provide a bullet screen interaction function: while watching videos or playing games, a user can send text bullet screens, expression bullet screens and/or voice bullet screens, and the internet video platform displays the text and/or expression bullet screens on the video playing interface to improve the user's sense of participation. For the output of a voice barrage, however, to prevent the user's voice and the sound of the played video from interfering with each other and affecting viewing, the played video is paused while the voice barrage is played.

Therefore, the prior-art implementation of the voice bullet screen can only play the voice bullet screen and the video sound separately, and cannot output the voice bullet screen without interfering with the currently played video.

Disclosure of Invention

The invention provides a method and a device for realizing a voice barrage, to solve the problem that prior-art voice barrage implementations cannot output a voice barrage without interfering with the currently played video.

In order to solve the above problem, according to an aspect of the present invention, the present invention discloses a method for implementing a voice barrage, including:

receiving voice barrage information sent by a client, wherein the voice barrage information comprises a video identifier and voice content corresponding to a video playing interface of the client, and a video playing time point when a client user triggers voice barrage operation;

determining a video identifier of a playing video and a target client matched with the video identifier;

and when the video playing progress of the video played by the target client reaches the video playing time point, controlling the target client to output the voice content, wherein the playing frequency band of the voice content is different from the playing frequency band of the video.

Optionally, if the number of the voice barrage information is multiple and multiple pieces of voice barrage information correspond to the same video identifier, when the video playing progress of the video played by the target client reaches the video playing time point, before the target client is controlled to output the voice content, the method further includes:

acquiring the voice length of each piece of voice content in each piece of received voice bullet screen information;

calculating a video playing time range corresponding to each piece of voice bullet screen information according to the video playing time point and the voice length of each piece of voice bullet screen information;

scoring each of the plurality of pieces of voice bullet screen information whose video playing time ranges overlap;

determining the voice bullet screen information with the highest score as the target voice bullet screen information;

when the video playing progress of the video played by the target client reaches the video playing time point, controlling the target client to output the voice content, including:

and when the video playing progress of the video played by the target client reaches the video playing time point in the target voice bullet screen information, controlling the target client to output the voice content in the target voice bullet screen information.

Optionally, the scoring the multiple pieces of voice barrage information overlapping in the video playing time range includes:

acquiring preset parameter information of each piece of voice bullet screen information;

determining the weight corresponding to each parameter information in the preset parameter information according to a comparison table of the preset parameter information and the weight;

summing the weight of each parameter information in the preset parameter information to obtain the score of each voice bullet screen information;

the preset parameter information comprises at least one of the following parameters:

historical comments issued by a client user;

a user level of a client user;

the content relevancy of the voice content and the playing video;

whether the client user is a water army (that is, a paid poster).

Optionally, when the preset parameter information includes the historical comment issued by the client user, the determining, according to the comparison table between the preset parameter information and the weight, the weight corresponding to each parameter information in the preset parameter information includes:

judging whether malicious comments exist in the historical comments issued by the client user;

if the malicious comments exist, determining the weight corresponding to the malicious comments according to a comparison table of preset parameter information and the weight;

and if the malicious comments do not exist, setting the weight of the historical comment issued by the client user to be zero.

Optionally, when the preset parameter information includes a content correlation degree between a voice content and a playing video, before the preset parameter information of each piece of voice bullet screen information is acquired, the method further includes:

performing semantic recognition on the voice content;

judging whether the recognized semantics are matched with preset junk characters;

if the recognized semantics are not matched with the preset junk characters, calculating the matching degree between the recognized semantics and a preset video label corresponding to the playing video;

determining the matching degree obtained by calculation as the content correlation degree of the voice content and the played video;

and if the recognized semantics are matched with the preset junk characters, recording the content correlation degree of the voice content and the played video as zero.

Optionally, after determining the target client whose playing video matches the video identifier, the method further includes:

and displaying a voice bullet screen icon at the video playing time point of the video playing interface of the target client.

According to another aspect of the present invention, the present invention also discloses a device for implementing a voice barrage, comprising:

the receiving module is used for receiving voice barrage information sent by a client, wherein the voice barrage information comprises a video identifier and voice content corresponding to a video playing interface of the client, and a video playing time point when a client user triggers a voice barrage operation;

the first determining module is used for determining a video identifier of a playing video and a target client matched with the video identifier;

and the control module is used for controlling the target client to output the voice content when the video playing progress of the video played by the target client reaches the video playing time point, wherein the playing frequency band of the voice content is different from that of the video.

Optionally, the apparatus further comprises:

the acquisition module is used for acquiring the voice length of each piece of voice content in each piece of received voice bullet screen information if the number of the voice bullet screen information is multiple and the multiple pieces of voice bullet screen information correspond to the same video identifier;

the first calculation module is used for calculating a video playing time range corresponding to each piece of voice bullet screen information according to the video playing time point and the voice length of each piece of voice bullet screen information;

the scoring module is used for scoring the voice barrage information overlapped in the video playing time range;

the second determining module is used for determining the target voice barrage information with the highest score;

the control module includes:

and the control submodule is used for controlling the target client to output the voice content in the target voice bullet screen information when the video playing progress of the video played by the target client reaches the video playing time point in the target voice bullet screen information.

Optionally, the scoring module comprises:

the acquisition submodule is used for acquiring preset parameter information of each piece of voice bullet screen information;

the determining submodule is used for determining the weight corresponding to each parameter information in the preset parameter information according to a comparison table of the preset parameter information and the weight;

the calculation submodule is used for summing the weight of each piece of parameter information in the preset parameter information to obtain the score of each piece of voice bullet screen information;

the preset parameter information comprises at least one of the following parameters:

historical comments issued by a client user;

a user level of a client user;

the content relevancy of the voice content and the playing video;

whether the client user is a water army.

Optionally, the determining sub-module includes:

the judging unit is used for judging whether malicious comments exist in the historical comments issued by the client user or not when the preset parameter information comprises the historical comments issued by the client user;

the first determining unit is used for determining the weight corresponding to the malicious comment according to a comparison table of preset parameter information and the weight if the malicious comment exists;

and the setting unit is used for setting the weight of the historical comment issued by the client user to be zero if no malicious comment exists.

Optionally, the apparatus further comprises:

the recognition module is used for performing semantic recognition on the voice content when the preset parameter information comprises the content correlation degree of the voice content and the played video;

the judging module is used for judging whether the recognized semantics are matched with the preset junk characters;

the second calculation module is used for calculating the matching degree between the recognized semantics and a preset video tag corresponding to the playing video if the recognized semantics are not matched with the preset junk characters;

the third determining module is used for determining the matching degree obtained by calculation as the content correlation degree of the voice content and the played video;

and the setting module is used for recording the content relevancy of the voice content and the played video as zero if the recognized semantics are matched with the preset junk characters.

Optionally, the apparatus further comprises:

and the display module is used for displaying the voice barrage icon on the video playing time point of the video playing interface of the target client.

Compared with the prior art, the invention has the following advantages:

according to the method and the device, the voice bullet screen information sent by the client is received, the voice bullet screen information comprises the video playing time point, the voice content and the video identification of the bullet screen, the target client for playing the video is determined through the video identification, and when the video playing progress of the target client reaches the video playing time point, the target client is controlled to output the voice content in a playing channel different from the playing channel for playing the video, so that the voice bullet screen and the video sound are output simultaneously, the sound of the currently played video cannot be interfered, and the normal playing of the voice bullet screen is realized.

In addition, the voice barrage is scored, so that the voice barrage with the highest score is selected to be played when a plurality of voice barrages with overlapped time ranges exist.

Drawings

FIG. 1 is a flowchart illustrating steps of an embodiment of a method for implementing a voice bullet screen according to the present invention;

FIG. 2 is a schematic view of a video playback interface of the present invention;

FIG. 3 is a block diagram of an embodiment of a device for implementing a voice bullet screen according to the present invention.

Detailed Description

In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.

Referring to fig. 1, a flowchart illustrating steps of an embodiment of a method for implementing a voice bullet screen according to the present invention is shown, which may specifically include the following steps:

step 101, receiving voice barrage information sent by a client, wherein the voice barrage information comprises a video identifier and voice content corresponding to a video playing interface of the client, and a video playing time point when a client user triggers a voice barrage operation;

When the user plays a network video on the client, the user can send a voice barrage for the played network video; that is, the user speaks what they want to say about the currently played content/picture, so that other users watching the network video can also hear it.

Specifically, the user may first turn on the voice barrage switch on the client's video playing interface (shown in fig. 2, for example), so that the interface supports sending and receiving voice barrages, and then click the voice bullet screen button 11 on the video playing interface to send a voice bullet screen. As shown in fig. 2, when the user clicks the voice bullet screen button 11, the video playing progress bar is at the video playing time point of 5 minutes and 10 seconds; the user then records the voice content and ends the recording by clicking a recording completion button (not shown). After the recording is finished, the client may send the recorded voice content, the video playing time point at which the user clicked the voice barrage button (for example, 5 minutes and 20 seconds), and the video identifier of the playing video to the video server as the voice barrage information sent by the user.

Therefore, the video server can receive the voice barrage information sent by the client.
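As an illustration only, the three fields carried by one piece of voice barrage information could be modeled as below. The class name, field names, the `parse_time_point` helper, and the use of whole seconds are assumptions made for this sketch; the source does not specify a data format or an audio encoding.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceBarrageInfo:
    """One piece of voice barrage information (field names are hypothetical)."""
    video_id: str         # video identifier of the video in the playing interface
    time_point_s: int     # playing time point, in seconds, when the user triggered the operation
    voice_content: bytes  # recorded audio; the encoding is not specified in the source


def parse_time_point(text: str) -> int:
    """Convert a 'minutes:seconds' string into seconds."""
    minutes, seconds = text.split(":")
    return int(minutes) * 60 + int(seconds)


# Example: a barrage triggered at 5 minutes and 20 seconds.
info = VoiceBarrageInfo("video-1", parse_time_point("5:20"), b"\x00\x01")
```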

It should be noted that, in the above example, the user triggers the voice bullet screen operation by clicking the voice bullet screen button. In practical applications, the trigger is not limited to clicking a button; it may also be a preset sliding gesture, a double-click on a certain button, and the like, which the present invention does not specifically limit.

Step 102, determining a video identifier of a playing video and a target client matched with the video identifier;

After receiving the voice barrage information sent by a client, the video server extracts the video identifier from the voice barrage information and searches, among all clients that are playing videos, for the target clients that are playing the video corresponding to that video identifier. There may be one or more target clients, and they also include the client that sent the voice barrage information in step 101.

Thus, it can be determined to which clients the voice bullet needs to be sent.
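The lookup in step 102 can be sketched as a simple filter over the clients currently playing videos. The `ClientSession` type and the client identifiers are hypothetical; how the video server actually tracks its connected clients is not described in the source.

```python
from dataclasses import dataclass


@dataclass
class ClientSession:
    client_id: str
    playing_video_id: str  # identifier of the video this client is currently playing


def find_target_clients(sessions, video_id):
    """Return the ids of all clients playing the given video, the sender included."""
    return [s.client_id for s in sessions if s.playing_video_id == video_id]


sessions = [
    ClientSession("client-a", "video-1"),  # the sender of the voice barrage
    ClientSession("client-b", "video-2"),
    ClientSession("client-c", "video-1"),
]
targets = find_target_clients(sessions, "video-1")
```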

Step 103, when the video playing progress of the video played by the target client reaches the video playing time point, controlling the target client to output the voice content, wherein the playing frequency band of the voice content is different from the playing frequency band of the video.

Therefore, the video server according to the embodiment of the present invention may extract the voice content and the video playing time point (i.e., 5 minutes and 20 seconds) in the voice barrage information, and when the video playing progress of the video (corresponding to the video identifier) played by the target client reaches, for example, 5 minutes and 20 seconds, control the target client to output the voice content in the voice barrage information, thereby completing the output of the voice barrage.

To prevent the played voice content and the sound of the played video from interfering with each other, which would leave the user unable to hear either of them clearly, the embodiment of the invention controls the playing frequency band of the voice content when it is played so that it differs from that of the played video, thereby avoiding mutual interference between the two sounds.

In addition, step 103 may be implemented in either of two optional manners. In the first, the video server detects in real time the video playing progress of each target client and, once the detected progress reaches the video playing time point, controls the target client to output the voice content; this reduces the processing pressure on the target client. In the second, the voice content is associated in advance with the video playing time point of the video corresponding to the video identifier, so that the playing progress of each target client does not need to be detected in real time: as soon as the playing progress of a target client reaches the video playing time point, the video server according to the embodiment of the present invention may control the target client to output the voice content associated with that time point, which reduces the processing pressure on the video server.
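A minimal sketch of the association-based manner follows, assuming time points are tracked in whole seconds and that a target client reports the second it has just reached. The `BarrageIndex` class and its method names are hypothetical and only illustrate keying voice content by video identifier and time point.

```python
from collections import defaultdict


class BarrageIndex:
    """Maps (video identifier, playing time point) to the associated voice contents."""

    def __init__(self):
        self._by_video = defaultdict(lambda: defaultdict(list))

    def associate(self, video_id, time_point_s, voice_content):
        """Associate a voice content with a playing time point of a video."""
        self._by_video[video_id][time_point_s].append(voice_content)

    def due(self, video_id, progress_s):
        """Return the voice contents associated with the second just reached."""
        time_points = self._by_video.get(video_id, {})
        return list(time_points.get(progress_s, []))


index = BarrageIndex()
index.associate("video-1", 5 * 60 + 20, b"voice-bytes")
```

With this layout, nothing has to be polled between time points: a lookup happens only when a playing progress value is reported.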

In addition, the client of the embodiment of the present invention may be any terminal device, including a mobile terminal (e.g., a mobile phone, a tablet computer) and a non-mobile terminal (e.g., a desktop computer).

By means of the technical scheme of the embodiment of the invention, the voice barrage information sent by the client is received, the voice barrage information comprises the video playing time point, the voice content and the video identification of the barrage, the target client for playing the video is determined through the video identification, and when the video playing progress of the target client reaches the video playing time point, the target client is controlled to output the voice content in a playing channel different from the playing video, so that the simultaneous output of the voice barrage and the video sound is realized, the sound of the currently played video is not interfered, and the normal playing of the voice barrage is realized.

For the same network video, multiple clients may send voice barrages, or the same client may send multiple voice barrages, so multiple pieces of voice barrage information may be received in step 101.

Therefore, in an embodiment, if there are multiple pieces of voice barrage information and they correspond to the same video identifier (that is, the video server receives multiple pieces of voice barrage information for the same video), then before step 103 the method according to the embodiment of the present invention further includes the following.

The video server may acquire the voice length of the voice content in each piece of received voice bullet screen information. For example, the video server receives 4 voice barrages for video 1, namely voice barrage 1, voice barrage 2, voice barrage 3 and voice barrage 4, whose voice lengths are 5 seconds, 10 seconds, 15 seconds and 15 seconds respectively.

Each piece of voice bullet screen information further comprises the video playing time point at which the user triggered the voice bullet screen operation, that is, the time point at which the voice bullet screen was triggered. For example, the video playing time point of voice bullet screen 1 is 5 minutes and 15 seconds, that of voice bullet screen 2 is 5 minutes and 10 seconds, that of voice bullet screen 3 is 5 minutes and 20 seconds, and that of voice bullet screen 4 is 5 minutes and 25 seconds.

Therefore, the video playing time range corresponding to each piece of voice bullet screen information (i.e. the video playing time range covered by its voice content) can be calculated from its video playing time point and voice length: the range for voice bullet screen 1 is 5 minutes 15 seconds to 5 minutes 20 seconds; the range for voice bullet screen 2 is 5 minutes 10 seconds to 5 minutes 20 seconds; the range for voice bullet screen 3 is 5 minutes 20 seconds to 5 minutes 35 seconds; and the range for voice bullet screen 4 is 5 minutes 25 seconds to 5 minutes 40 seconds.
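The range calculation for the four example barrages can be reproduced with a few lines of arithmetic. The function names are hypothetical, and treating ranges that merely touch at an endpoint (such as barrage 2 ending and barrage 3 starting at 5 minutes 20 seconds) as non-overlapping is an assumption; the source does not state how boundary cases are handled.

```python
def time_range(time_point_s, voice_length_s):
    """Playing time range covered by a voice content, in seconds."""
    return (time_point_s, time_point_s + voice_length_s)


def overlaps(a, b):
    """Assumed half-open comparison: ranges that only touch do not overlap."""
    return a[0] < b[1] and b[0] < a[1]


# Time points and voice lengths taken from the example in the text.
barrages = {
    1: time_range(5 * 60 + 15, 5),
    2: time_range(5 * 60 + 10, 10),
    3: time_range(5 * 60 + 20, 15),
    4: time_range(5 * 60 + 25, 15),
}

overlapping_pairs = [
    (i, j)
    for i in barrages
    for j in barrages
    if i < j and overlaps(barrages[i], barrages[j])
]
```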

In one embodiment, the scoring of the plurality of pieces of voice barrage information whose video playing time ranges overlap can be realized through the following sub-steps:

S21, acquiring preset parameter information of each piece of voice bullet screen information;

S22, determining the weight corresponding to each parameter information in the preset parameter information according to a comparison table of the preset parameter information and the weight;

S23, summing the weight of each parameter information in the preset parameter information to obtain the score of each piece of voice bullet screen information;

wherein the preset parameter information includes at least one of the following parameters:

historical comments issued by a client user;

a user level of a client user;

the content relevancy of the voice content and the playing video;

whether the client user is a water army.

In one embodiment, when the preset parameter information includes the historical comments issued by the client user, S22 may be implemented as follows:

A malicious comment information table may be preset, and whether the historical comments issued by the client user include malicious comments is judged by looking up the preset malicious comment information table.

The comparison table of the preset parameter information and the weight provides a corresponding weight for each parameter information; for example, the weight corresponding to a malicious comment is -10.

Therefore, the embodiment of the invention can grade the voice barrage of the user according to the historical comments of the user.
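A minimal sketch of the historical-comment check follows. The weight of -10 is the example value from the text; the table entries and the substring matching rule are hypothetical, since the source does not say what the malicious comment information table contains or how a comment is matched against it.

```python
# Hypothetical entries; the real malicious comment information table is not given.
MALICIOUS_TABLE = {"spam link", "insult"}
MALICIOUS_WEIGHT = -10  # example weight from the text


def history_weight(historical_comments):
    """Return the malicious-comment weight if any comment matches the table, else zero."""
    for comment in historical_comments:
        text = comment.lower()
        if any(entry in text for entry in MALICIOUS_TABLE):
            return MALICIOUS_WEIGHT
    return 0
```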

In addition, in another embodiment, when the preset parameter information includes the user level of the client user, in executing S22, the weight corresponding to the user level may be looked up in the comparison table of the preset parameter information and the weight. For example, the weight set for VIP users in the lookup table is 10, the weight set for VVIP users is 20, the weight set for star users is 15, and so on.

For example, if the level of the client user who sends the voice bullet screen 1 is VVIP, the weight 20 may be obtained.

In addition, in another embodiment, when the preset parameter information includes a content correlation degree between a voice content and a playing video, the obtaining of the preset parameter information of each piece of voice bullet screen information includes:

performing semantic recognition on the voice content;

The preset junk text may be meaningless words, such as 'haha', 'la', and the like.

If the recognized semantic meaning is matched with the preset junk text (for example, if the matching degree reaches a preset matching degree threshold value, for example, 80%, it indicates that most contents in the voice content are invalid contents), the content relevance between the voice content and the played video is recorded as zero, that is, the value of the parameter information is zero.

If the identified semantics are not matched with the preset junk characters (for example, the matching degree is smaller than a preset matching degree threshold value, for example, 80%), calculating the matching degree between the identified semantics and a preset video tag corresponding to the played video;

that is to say, in the embodiment of the present invention, video tags, such as a video type tag (including but not limited to a speech, a scenario, a fun, and the like), a director name tag, a role name tag, a tag describing a video scenario, and the like, may be set in advance for each video provided by the video server. Then for the voice content that is not spam, a degree of match between the identified semantics and the video tags corresponding to the video is calculated. The degree of match may indicate a degree of content correlation between the content spoken by the user and the video.

therefore, when the preset parameter information includes the content relevance between the voice content and the played video, the content relevance between the voice content and the played video can be obtained when the preset parameter information of each piece of voice bullet screen information is obtained.

Therefore, the embodiment of the invention can combine the content correlation degree between the voice content of the voice bullet screen and the playing video when scoring the voice bullet screen, and set higher weight for the content correlation degree.
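The two-stage relevance check can be sketched as follows, assuming semantic recognition has already turned the voice content into a list of words. The 80% threshold and the junk words come from the examples in the text; measuring the matching degree as the share of non-junk words that equal a video tag is an assumption made only for illustration.

```python
JUNK_WORDS = {"haha", "la"}  # example junk text from the description
JUNK_THRESHOLD = 0.8         # example matching degree threshold (80%)


def content_relevance(recognized_words, video_tags):
    """Return a relevance value in [0, 1]; zero when the content is mostly junk."""
    words = [w.lower() for w in recognized_words]
    if not words:
        return 0.0
    junk_ratio = sum(w in JUNK_WORDS for w in words) / len(words)
    if junk_ratio >= JUNK_THRESHOLD:
        return 0.0  # matched the preset junk text: relevance recorded as zero
    tags = {t.lower() for t in video_tags}
    meaningful = [w for w in words if w not in JUNK_WORDS]
    # Assumed matching degree: fraction of meaningful words that are video tags.
    return sum(w in tags for w in meaningful) / len(meaningful)
```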

In addition, in another embodiment, when the preset parameter information includes whether the client user is a water army, in step S22 the weight corresponding to a water army user may be looked up in the comparison table of the preset parameter information and the weight. For example, the weight set for a water army user in the comparison table is -15.

Whether the client user is a water army may be determined from the voice barrage information using any prior-art approach, which is not specifically limited by the present invention.

In summary, the present invention can perform addition processing on the weights respectively corresponding to each preset parameter information to obtain the score of a certain voice barrage.
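The summation in S23 can be sketched with a small comparison table. The weights for malicious comments, VIP, VVIP, star users and water army users are the example values from the text; the key names, the `RELEVANCE_SCALE` factor that turns a relevance value into a weight, and the function signature are hypothetical.

```python
# Comparison table of parameter information and weight (example values from the text).
WEIGHTS = {
    "malicious_comment": -10,
    "level:VIP": 10,
    "level:VVIP": 20,
    "level:star": 15,
    "water_army": -15,
}
RELEVANCE_SCALE = 30  # hypothetical: the text only says relevance gets a higher weight


def score(has_malicious_history, user_level, relevance, is_water_army):
    """Sum the weights of all preset parameters for one voice barrage."""
    total = 0
    if has_malicious_history:
        total += WEIGHTS["malicious_comment"]
    total += WEIGHTS.get(f"level:{user_level}", 0)  # unknown levels contribute zero
    total += round(relevance * RELEVANCE_SCALE)
    if is_water_army:
        total += WEIGHTS["water_army"]
    return total
```

For instance, a VVIP user with no malicious history whose content has a relevance of 0.5 would receive 20 + 15 = 35 under these assumed values.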

Optionally, in an embodiment, before scoring the plurality of pieces of voice barrage information with overlapping time ranges in the video playing time range, the method according to an embodiment of the present invention may further include:

removing noise and murmur from the voice content in the voice bullet screen information.

Therefore, the scoring accuracy can be improved by processing the voice content in the voice bullet screen to eliminate noise and murmur.

The voice bullet screen information with the highest score is then determined as the target voice bullet screen information.

Correspondingly, when step 103 is executed and the video playing progress of the video played by the target client reaches the video playing time point in the target voice barrage information, the target client may be controlled to output the voice content in the target voice barrage information.

Specifically, when the video playing progress of the video (corresponding to the video identifier) played by the target client reaches 5 minutes and 10 seconds, controlling the target client to output the voice content of the voice barrage 2; when the video playing progress of the video (corresponding to the video identifier) played by the target client reaches 5 minutes and 25 seconds, controlling the target client to output the voice content of the voice barrage 4.

Here, the scheme of the embodiment of the present invention is only schematically illustrated by taking 4 voice bullet screens as an example, and in practical application, the scheme is not limited to the above 4 voice bullet screens.
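The selection among overlapping barrages can be sketched as below. Grouping barrages into runs of overlapping time ranges and keeping the highest score in each run is an assumed reading of the scheme, and the scores are invented so that barrages 2 and 4 win, as in the example; the source does not give the actual scores.

```python
def select_targets(barrages):
    """barrages: list of (barrage_id, start_s, length_s, score) tuples.

    Returns the id of the highest-scoring barrage in each group of
    overlapping time ranges, ordered by playing time.
    """
    ordered = sorted(barrages, key=lambda b: b[1])
    groups, group_end = [], None
    for barrage in ordered:
        _, start, length, _ = barrage
        if group_end is not None and start < group_end:
            groups[-1].append(barrage)  # overlaps the current group
            group_end = max(group_end, start + length)
        else:
            groups.append([barrage])    # starts a new group
            group_end = start + length
    return [max(group, key=lambda b: b[3])[0] for group in groups]


# Time points and lengths from the text; the scores are hypothetical.
example = [(1, 315, 5, 10), (2, 310, 10, 35), (3, 320, 15, 5), (4, 325, 15, 20)]
selected = select_targets(example)
```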

Optionally, in an embodiment, after step 102, the method according to an embodiment of the present invention further includes: displaying a voice bullet screen icon at the video playing time point of the video playing interface of the target client.

Specifically, as shown in fig. 2, the voice barrage 2 with the highest score has a playing time of 5 minutes and 10 seconds. In the embodiment of the present invention, a voice barrage icon 12 may be displayed at 5 minutes and 10 seconds on the video playing progress bar of the video playing interface shown in fig. 2 to remind the user that there is a voice barrage. In addition, voice barrage icons 13, 14, and 15 at other time points are also shown on the video playing progress bar in fig. 2.

Optionally, in another embodiment, before step 101, the method according to an embodiment of the present invention may further include:

and if the fact that the client user triggers the voice bullet screen operation is detected, reducing the video volume corresponding to the client video playing interface.

Therefore, when the user is about to input the voice content of the voice bullet screen, the volume of the currently played video is reduced, and the voice content collected by the video server side can be clearer.

In another embodiment, the method of the embodiment of the present invention may further limit the duration of the voice content input by each client user, for example, the maximum length is 20 seconds.

In addition, in another embodiment, the client may first pseudo-write the collected voice content locally (that is, keep a local copy on the client) and then upload the voice content to the video server together with the video identifier and the video playing time point. Therefore, when step 103 is executed, the locally pseudo-written voice content can be played directly, which reduces the playing delay of the voice barrage.

In addition, in another embodiment, a user may set on the client side whether to enable the voice barrage function, so that the method of the embodiment of the present invention controls only those target clients that have enabled the voice barrage function to play the voice content in the target voice barrage information.

If the user disables the voice barrage function on the client side, the method of the embodiment of the present invention may further convert the voice content to be played (this voice content need not be scored, and may be the voice content corresponding to all the received voice barrage information) into characters, so as to control the target clients that have disabled the voice barrage function to output the characters.
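The choice between voice output and character output can be sketched as follows. The dictionary message format, the `build_output` name, and the `fake_transcribe` placeholder are all hypothetical; the text does not specify how the speech-to-text conversion is performed.

```python
from typing import Callable


def build_output(voice_content: bytes, voice_enabled: bool,
                 transcribe: Callable[[bytes], str]) -> dict:
    """Choose what a target client should output for one voice barrage."""
    if voice_enabled:
        return {"type": "voice", "payload": voice_content}
    # Voice barrage function is off on this client: output characters instead.
    return {"type": "text", "payload": transcribe(voice_content)}


def fake_transcribe(audio: bytes) -> str:
    # Placeholder for a real speech-to-text step, which the text does not specify.
    return "<transcript of %d bytes>" % len(audio)


on_result = build_output(b"\x00\x01", True, fake_transcribe)
off_result = build_output(b"\x00\x01", False, fake_transcribe)
```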

In the above embodiment, the method according to the embodiment of the present invention defaults to playing the voice content in the voice bullet screen information with the highest score for the voice bullet screen information with overlapping time ranges.

In other embodiments, the user may freely choose which voice barrage to output as needed. For example, the user may choose to output, on the video playing interface, the voice barrage input by user A, or to output the voice barrage with the lowest score among the voice barrage information with overlapping time ranges, and so on.
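The default and user-selected choices among overlapping voice barrages can be sketched as below. The `VoiceBarrage` fields, the policy names, and the rule that chains of overlapping ranges fall into one group are assumptions for illustration; the scores are made-up values.

```python
from dataclasses import dataclass


@dataclass
class VoiceBarrage:
    user: str
    start_s: float
    duration_s: float
    score: float

    @property
    def end_s(self) -> float:
        return self.start_s + self.duration_s


def group_overlapping(barrages):
    """Group barrages whose playing time ranges overlap (sweep over start times)."""
    groups = []
    current_end = None
    for b in sorted(barrages, key=lambda x: x.start_s):
        if groups and b.start_s < current_end:
            groups[-1].append(b)
            current_end = max(current_end, b.end_s)
        else:
            groups.append([b])
            current_end = b.end_s
    return groups


def select(group, policy="highest", preferred_user=None):
    """Pick one barrage from a group; 'highest' mirrors the default behaviour."""
    if preferred_user is not None:
        for b in group:
            if b.user == preferred_user:
                return b
    if policy == "lowest":
        return min(group, key=lambda b: b.score)
    return max(group, key=lambda b: b.score)


barrages = [
    VoiceBarrage("user_a", 300.0, 15.0, 0.4),
    VoiceBarrage("user_b", 310.0, 10.0, 0.9),
    VoiceBarrage("user_c", 400.0, 5.0, 0.5),
]
groups = group_overlapping(barrages)
default_pick = select(groups[0])
lowest_pick = select(groups[0], policy="lowest")
user_pick = select(groups[0], preferred_user="user_a")
```

In this sketch a preferred user takes precedence over the score policy, which is one design choice among several that the text leaves open.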

In summary, the method of the embodiment of the present invention enables video playing and the voice barrage to coexist, so that a user of the video player can express the video watching experience more efficiently without interrupting or disturbing the video playing, thereby improving the user experience; in addition, various users can send voice barrages, which enriches communication between special users (including high-quality observation users, star users, householders and the like) and common users.

It should be noted that, for simplicity of description, the method embodiments are described as a series of acts or a combination of acts, but those skilled in the art will recognize that the present invention is not limited by the described order of acts, as some steps may occur in other orders or concurrently in accordance with the embodiments of the present invention. Further, those skilled in the art will appreciate that the embodiments described in the specification are preferred embodiments and that the acts involved are not necessarily required by the embodiments of the present invention.

Corresponding to the method provided by the embodiment of the present invention, referring to fig. 3, a block diagram of a structure of an embodiment of a device for implementing a voice barrage according to the present invention is shown, and the device specifically includes the following modules:

the receiving module 31 is configured to receive voice barrage information sent by a client, where the voice barrage information includes a video identifier and a voice content corresponding to a video playing interface of the client, and a video playing time point when a client user triggers a voice barrage operation;

a first determining module 32, configured to determine a video identifier of a playing video and a target client that matches the video identifier;

a control module 33, configured to control the target client to output the voice content when the video playing progress of the video played by the target client reaches the video playing time point, where a playing frequency band of the voice content is different from a playing frequency band of the video.
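The cooperation of the three modules can be sketched as a single class, shown below under several assumptions: the class and field names are hypothetical, clients are modelled as in-memory objects, and "output" is modelled as appending to a list. The separation of playing frequency bands is not modelled here.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class VoiceBarrageInfo:
    video_id: str
    voice_content: bytes
    time_point_s: float


@dataclass
class Client:
    client_id: str
    playing_video_id: str
    progress_s: float = 0.0
    outputs: list = field(default_factory=list)


class VoiceBarrageDevice:
    def __init__(self) -> None:
        self.pending: Dict[str, List[VoiceBarrageInfo]] = {}

    def receive(self, info: VoiceBarrageInfo) -> None:
        # Receiving module 31: store the information by video identifier.
        self.pending.setdefault(info.video_id, []).append(info)

    def target_clients(self, video_id: str, clients: List[Client]) -> List[Client]:
        # First determining module 32: clients whose playing video matches the identifier.
        return [c for c in clients if c.playing_video_id == video_id]

    def control(self, clients: List[Client]) -> None:
        # Control module 33: output once the playing progress reaches the time point.
        for video_id, infos in self.pending.items():
            for client in self.target_clients(video_id, clients):
                for info in infos:
                    if client.progress_s >= info.time_point_s and info not in client.outputs:
                        client.outputs.append(info)


device = VoiceBarrageDevice()
device.receive(VoiceBarrageInfo("v1", b"hello", 310.0))
c1 = Client("c1", "v1", progress_s=309.0)
c2 = Client("c2", "v2", progress_s=400.0)
device.control([c1, c2])
before = len(c1.outputs)
c1.progress_s = 310.0
device.control([c1, c2])
device.control([c1, c2])  # a repeated call does not output the same barrage twice
after = len(c1.outputs)
```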

Optionally, the apparatus further comprises:

optionally, the control module 33 includes:

Optionally, the scoring module comprises:

optionally, the preset parameter information includes at least one of the following parameters:

historical comments issued by a client user;

a user level of a client user;

the content relevancy of the voice content and the playing video;

whether the client user is a paid poster (a so-called "water army" account).
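As a rough sketch of how such parameters might feed a score, the example below looks up a weight for each available parameter and sums the weighted values. The weighted-sum form, the weight values, the parameter keys, and the normalisation of each parameter to the range 0 to 1 are all assumptions; the last key stands for the final parameter in the list (whether the user is a paid "water army" poster).

```python
# Hypothetical comparison table of parameters and weights; the values are illustrative only.
WEIGHTS = {
    "historical_comments": 0.2,
    "user_level": 0.3,
    "content_relevancy": 0.4,
    "is_paid_poster": -0.5,
}


def score_barrage(params: dict) -> float:
    """Weighted sum over whichever preset parameters are present (each in the range 0..1)."""
    return sum(WEIGHTS[name] * float(value)
               for name, value in params.items() if name in WEIGHTS)


score = score_barrage({"user_level": 0.5, "content_relevancy": 1.0, "is_paid_poster": False})
penalised = score_barrage({"user_level": 0.5, "content_relevancy": 1.0, "is_paid_poster": True})
```

Because the parameter list is "at least one of", the function simply skips parameters that are not supplied.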

Optionally, the determining sub-module includes:

Optionally, the apparatus further comprises:

Since the device embodiment is basically similar to the method embodiment, its description is relatively brief; for relevant details, refer to the corresponding description of the method embodiment.

The embodiments in the present specification are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other.

As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, apparatus, or computer program product. Accordingly, embodiments of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.

Embodiments of the present invention are described with reference to flowchart illustrations and/or block diagrams of methods, terminal devices (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing terminal to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing terminal, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing terminal to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be loaded onto a computer or other programmable data processing terminal to cause a series of operational steps to be performed on the computer or other programmable terminal to produce a computer implemented process such that the instructions which execute on the computer or other programmable terminal provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

While preferred embodiments of the present invention have been described, additional variations and modifications of these embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all such alterations and modifications as fall within the scope of the embodiments of the invention.

Finally, it should also be noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or terminal that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or terminal. Without further limitation, an element defined by the phrase "comprising a …" does not exclude the presence of other like elements in a process, method, article, or terminal that comprises the element.

The method and the device for implementing the voice barrage provided by the present invention have been described in detail above. Specific examples are used herein to explain the principle and implementation of the present invention, and the description of the above embodiments is only intended to help understand the method and core idea of the present invention. Meanwhile, a person skilled in the art may, according to the idea of the present invention, make changes to the specific embodiments and the scope of application. In summary, the content of this specification should not be construed as limiting the present invention.

Claims

1. A method for realizing a voice barrage is characterized by comprising the following steps:

when the video playing progress of the video played by the target client reaches the video playing time point, if the target client starts a voice barrage function, controlling the target client to output the voice content, wherein the playing frequency band of the voice content is different from the playing frequency band of the video; if the target client side closes the voice barrage function, converting the voice content into characters, and controlling the target client side to output the characters;

if the number of the voice barrage information is multiple and multiple pieces of voice barrage information correspond to the same video identifier, when the video playing progress of the video played by the target client reaches the video playing time point, if the target client starts the voice barrage function, before controlling the target client to output the voice content, the method further includes:

scoring is respectively carried out on a plurality of pieces of voice bullet screen information overlapped in the video playing time range; the overlapping of the video playing time ranges means that two or more time points are overlapped;

determining the voice bullet screen information with the highest score as the target voice bullet screen information;

when the video playing progress of the video played by the target client reaches the video playing time point, if the target client starts a voice barrage function, controlling the target client to output the voice content, including:

and when the video playing progress of the video played by the target client reaches the video playing time point in the target voice barrage information, if the target client starts the voice barrage function, controlling the target client to output the voice content in the target voice barrage information.

2. The method according to claim 1, wherein the scoring of the pieces of voice barrage information with overlapping video playing time ranges comprises:

historical comments issued by a client user;

a user level of a client user;

the content relevancy of the voice content and the playing video;

whether the client user is a paid poster (a so-called "water army" account).

3. The method according to claim 2, wherein when the preset parameter information includes historical comments issued by the client user, the determining the weight corresponding to each parameter information in the preset parameter information according to the comparison table of the preset parameter information and the weights includes:

4. The method according to claim 2, wherein when the preset parameter information includes a content correlation between the voice content and the playing video, before acquiring the preset parameter information of each piece of voice barrage information, the method further includes:

performing semantic recognition on the voice content;

5. The method of claim 1, wherein after determining that the video identifier of the playing video matches the target client of the video identifier, the method further comprises:

6. A device for realizing a voice barrage, characterized by comprising:

the control module is used for controlling the target client to output the voice content if the target client starts a voice barrage function when the video playing progress of the video played by the target client reaches the video playing time point, wherein the playing frequency band of the voice content is different from that of the video; if the target client side closes the voice barrage function, converting the voice content into characters, and controlling the target client side to output the characters;

the scoring module is used for scoring the voice barrage information overlapped in the video playing time range; the overlapping of the video playing time ranges means that two or more time points are overlapped;

the control module includes:

and the control submodule is used for controlling the target client to output the voice content in the target voice bullet screen information if the target client starts the voice bullet screen function when the video playing progress of the video played by the target client reaches the video playing time point in the target voice bullet screen information.

7. The apparatus of claim 6, wherein the scoring module comprises:

historical comments issued by a client user;

a user level of a client user;

the content relevancy of the voice content and the playing video;

whether the client user is a paid poster (a so-called "water army" account).

8. The apparatus of claim 7, wherein the determination submodule comprises:

9. The apparatus of claim 7, further comprising:

10. The apparatus of claim 6, further comprising: