CN107613400B - Method and device for realizing voice barrage - Google Patents

Method and device for realizing voice barrage

Info

Publication number
CN107613400B
Authority
CN
China
Prior art keywords
voice
video
content
client
bullet screen
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710862107.0A
Other languages
Chinese (zh)
Other versions
CN107613400A (en)
Inventor
蔡西颖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing QIYI Century Science and Technology Co Ltd
Original Assignee
Beijing QIYI Century Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing QIYI Century Science and Technology Co Ltd
Priority to CN201710862107.0A
Publication of CN107613400A
Application granted
Publication of CN107613400B
Active
Anticipated expiration

Abstract

The invention provides a method and a device for realizing a voice barrage. The method comprises the following steps: receiving voice barrage information sent by a client, wherein the voice barrage information comprises a video identifier corresponding to a video playing interface of the client, voice content, and a video playing time point at which a client user triggers a voice barrage operation; determining a video identifier of a playing video and a target client matched with the video identifier; and, when the video playing progress of the video played by the target client reaches the video playing time point, controlling the target client to output the voice content, wherein the playing frequency band of the voice content is different from the playing frequency band of the video. The invention thus outputs the voice bullet screen and the video sound simultaneously without interfering with the sound of the currently played video, so that the voice bullet screen is played normally.

Description

Method and device for realizing voice barrage
Technical Field
The invention relates to the technical field of video playing, in particular to a method and a device for realizing a voice barrage.
Background
With the rapid development of internet technology, internet videos are increasingly popular with users, and almost all video platforms provide a bullet screen interaction function. That is, while watching a video or playing a game, a user can send text bullet screens, expression bullet screens and/or voice bullet screens, and the internet video platform displays the text bullet screens and/or expression bullet screens on the video playing interface to improve the user's sense of participation. For the output of a voice barrage, however, the played video may be paused while the voice barrage is played, so that the user's voice and the sound of the played video do not interfere with each other and affect the user's viewing.
Therefore, the prior-art implementation of the voice bullet screen can only play the voice bullet screen or the video sound separately, and cannot output the voice bullet screen without interfering with the currently played video.
Disclosure of Invention
The invention provides a method and a device for realizing a voice barrage, which solve the problem that prior-art implementations of the voice barrage cannot output the voice barrage without interfering with the currently played video.
In order to solve the above problem, according to an aspect of the present invention, the present invention discloses a method for implementing a voice barrage, including:
receiving voice barrage information sent by a client, wherein the voice barrage information comprises a video identifier and voice content corresponding to a video playing interface of the client, and a video playing time point when a client user triggers voice barrage operation;
determining a video identifier of a playing video and a target client matched with the video identifier;
and when the video playing progress of the video played by the target client reaches the video playing time point, controlling the target client to output the voice content, wherein the playing frequency band of the voice content is different from the playing frequency band of the video.
Optionally, if the number of the voice barrage information is multiple and multiple pieces of voice barrage information correspond to the same video identifier, when the video playing progress of the video played by the target client reaches the video playing time point, before the target client is controlled to output the voice content, the method further includes:
acquiring the voice length of each piece of voice content in each piece of received voice bullet screen information;
calculating a video playing time range corresponding to each piece of voice bullet screen information according to the video playing time point and the voice length of each piece of voice bullet screen information;
scoring each of the plurality of pieces of voice bullet screen information whose video playing time ranges overlap;
determining the voice bullet screen information with the highest score as the target voice bullet screen information;
when the video playing progress of the video played by the target client reaches the video playing time point, controlling the target client to output the voice content, including:
and when the video playing progress of the video played by the target client reaches the video playing time point in the target voice bullet screen information, controlling the target client to output the voice content in the target voice bullet screen information.
Optionally, the scoring the multiple pieces of voice barrage information overlapping in the video playing time range includes:
acquiring preset parameter information of each piece of voice bullet screen information;
determining the weight corresponding to each parameter information in the preset parameter information according to a comparison table of the preset parameter information and the weight;
summing the weight of each parameter information in the preset parameter information to obtain the score of each voice bullet screen information;
the preset parameter information comprises at least one of the following parameters:
historical comments issued by a client user;
a user level of a client user;
the content relevancy of the voice content and the playing video;
whether the client user is a member of a water army (a paid poster).
Optionally, when the preset parameter information includes the historical comment issued by the client user, the determining, according to the comparison table between the preset parameter information and the weight, the weight corresponding to each parameter information in the preset parameter information includes:
judging whether malicious comments exist in the historical comments issued by the client user;
if the malicious comments exist, determining the weight corresponding to the malicious comments according to a comparison table of preset parameter information and the weight;
and if the malicious comments do not exist, setting the weight of the historical comment issued by the client user to be zero.
Optionally, when the preset parameter information includes a content correlation degree between a voice content and a playing video, before the preset parameter information of each piece of voice bullet screen information is acquired, the method further includes:
performing semantic recognition on the voice content;
judging whether the recognized semantics are matched with preset junk characters;
if the recognized semantics are not matched with the preset junk characters, calculating the matching degree between the recognized semantics and a preset video label corresponding to the playing video;
determining the matching degree obtained by calculation as the content correlation degree of the voice content and the played video;
and if the recognized semantics are matched with the preset junk characters, recording the content correlation degree of the voice content and the played video as zero.
Optionally, after determining the video identifier of the playing video and the target client matched with the video identifier, the method further includes:
and displaying a voice bullet screen icon at the video playing time point of the video playing interface of the target client.
According to another aspect of the present invention, the present invention also discloses a device for implementing a voice barrage, comprising:
the receiving module is used for receiving voice barrage information sent by a client, wherein the voice barrage information comprises a video identifier and voice content corresponding to a video playing interface of the client, and a video playing time point when a client user triggers a voice barrage operation;
the first determining module is used for determining a video identifier of a playing video and a target client matched with the video identifier;
and the control module is used for controlling the target client to output the voice content when the video playing progress of the video played by the target client reaches the video playing time point, wherein the playing frequency band of the voice content is different from that of the video.
Optionally, the apparatus further comprises:
the acquisition module is used for acquiring the voice length of each piece of voice content in each piece of received voice bullet screen information if the number of the voice bullet screen information is multiple and the multiple pieces of voice bullet screen information correspond to the same video identifier;
the first calculation module is used for calculating a video playing time range corresponding to each piece of voice bullet screen information according to the video playing time point and the voice length of each piece of voice bullet screen information;
the scoring module is used for scoring the voice barrage information overlapped in the video playing time range;
the second determining module is used for determining the target voice barrage information with the highest score;
the control module includes:
and the control submodule is used for controlling the target client to output the voice content in the target voice bullet screen information when the video playing progress of the video played by the target client reaches the video playing time point in the target voice bullet screen information.
Optionally, the scoring module comprises:
the acquisition submodule is used for acquiring preset parameter information of each piece of voice bullet screen information;
the determining submodule is used for determining the weight corresponding to each parameter information in the preset parameter information according to a comparison table of the preset parameter information and the weight;
the calculation submodule is used for summing the weight of each piece of parameter information in the preset parameter information to obtain the score of each piece of voice bullet screen information;
the preset parameter information comprises at least one of the following parameters:
historical comments issued by a client user;
a user level of a client user;
the content relevancy of the voice content and the playing video;
whether the client user is a member of a water army (a paid poster).
Optionally, the determining sub-module includes:
the judging unit is used for judging whether malicious comments exist in the historical comments issued by the client user or not when the preset parameter information comprises the historical comments issued by the client user;
the first determining unit is used for determining the weight corresponding to the malicious comment according to a comparison table of preset parameter information and the weight if the malicious comment exists;
and the setting unit is used for setting the weight of the historical comment issued by the client user to be zero if no malicious comment exists.
Optionally, the apparatus further comprises:
the recognition module is used for performing semantic recognition on the voice content when the preset parameter information comprises the content correlation degree of the voice content and the played video;
the judging module is used for judging whether the recognized semantics are matched with the preset junk characters;
the second calculation module is used for calculating the matching degree between the identified semantics and a preset video tag corresponding to the playing video if the identified semantics are not matched with the preset spam characters;
the third determining module is used for determining the matching degree obtained by calculation as the content correlation degree of the voice content and the played video;
and the setting module is used for recording the content relevancy of the voice content and the played video as zero if the recognized semantics are matched with the preset junk characters.
Optionally, the apparatus further comprises:
and the display module is used for displaying the voice barrage icon on the video playing time point of the video playing interface of the target client.
Compared with the prior art, the invention has the following advantages:
According to the method and the device, the voice bullet screen information sent by the client is received, the voice bullet screen information comprising the video playing time point of the bullet screen, the voice content and the video identifier. The target client playing the video is determined from the video identifier, and when the video playing progress of the target client reaches the video playing time point, the target client is controlled to output the voice content in a playing frequency band different from that of the played video. The voice bullet screen and the video sound are thus output simultaneously without interfering with the sound of the currently played video, and the voice bullet screen is played normally.
In addition, the voice barrage is scored, so that the voice barrage with the highest score is selected to be played when a plurality of voice barrages with overlapped time ranges exist.
Drawings
FIG. 1 is a flowchart illustrating steps of an embodiment of a method for implementing a voice bullet screen according to the present invention;
FIG. 2 is a schematic view of a video playback interface of the present invention;
FIG. 3 is a block diagram of an embodiment of a device for implementing a voice bullet screen according to the present invention.
Detailed Description
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.
Referring to fig. 1, a flowchart illustrating steps of an embodiment of a method for implementing a voice bullet screen according to the present invention is shown, which may specifically include the following steps:
step 101, receiving voice barrage information sent by a client, wherein the voice barrage information comprises a video identifier and voice content corresponding to a video playing interface of the client, and a video playing time point when a client user triggers a voice barrage operation;
When a user plays a network video on the client, the user can send a voice barrage to the played network video. That is, the user speaks what he or she wants to say about the currently played content or picture, and it is sent as voice so that other users watching the network video can also hear it.
Specifically, at the client side, the user may, for example, first turn on the voice barrage switch on the video playing interface shown in fig. 2, so that the video playing interface supports sending and receiving voice barrages, and then click the voice bullet screen button 11 on the video playing interface to send a voice barrage. As shown in fig. 2, when the user clicks the voice bullet screen button 11, the video playing time point indicated on the video playing progress bar is 5 minutes and 20 seconds. The user then enters the voice content and, once finished, ends the voice entry by clicking an entry completion button (not shown). After the voice entry ends, the client may send the voice content entered by the user, the video playing time point at which the user clicked the voice bullet screen button (for example, 5 minutes and 20 seconds), and the video identifier of the playing video to the video server as the voice barrage information sent by the user.
Therefore, the video server can receive the voice barrage information sent by the client.
It should be noted that, in the above example, the user triggers the voice bullet screen operation by clicking the voice bullet screen button. In practical applications, the manner of triggering the voice bullet screen operation is not limited to clicking a button; it may also be a preset sliding gesture, double-clicking a certain button, and the like, which is not specifically limited by the present invention.
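As a rough illustration of the voice barrage information described in step 101, the following Python sketch models one message. The class name, the field names and the choice of seconds as the unit of the time point are assumptions made for illustration; the description does not prescribe a data format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceBarrageInfo:
    """One voice barrage message sent by a client (hypothetical layout)."""
    video_id: str         # identifier of the video shown in the playing interface
    voice_content: bytes  # audio entered by the user
    time_point_s: int     # playing time point when the operation was triggered


def to_seconds(minutes: int, seconds: int) -> int:
    """Convert a 'minutes and seconds' playing time point to seconds."""
    return minutes * 60 + seconds


# Example: the voice bullet screen button is clicked at 5 minutes 20 seconds.
info = VoiceBarrageInfo("video-1", b"\x00\x01", to_seconds(5, 20))
```

Keeping the time point as a plain number of seconds makes the later range and overlap calculations simple integer arithmetic.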
Step 102, determining a video identifier of a playing video and a target client matched with the video identifier;
After receiving the voice barrage information sent by a certain client, the video server extracts the video identifier from the voice barrage information and searches, among all clients that are playing videos, for the target clients that are playing the video corresponding to the video identifier. There may be one or more target clients, and they also include the client that sent the voice barrage information in step 101.
Thus, it can be determined to which clients the voice bullet screen needs to be sent.
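The lookup in step 102 can be sketched as a simple filter. The mapping from client to currently played video, and the function and identifier names, are hypothetical; how the video server actually tracks its clients is not specified in the description.

```python
from typing import Dict, List


def find_target_clients(now_playing: Dict[str, str], video_id: str) -> List[str]:
    """Return the ids of clients currently playing the video with this identifier.

    now_playing maps a client id to the id of the video that client is playing.
    """
    return [client for client, playing in now_playing.items() if playing == video_id]


now_playing = {"client-a": "video-1", "client-b": "video-2", "client-c": "video-1"}
targets = find_target_clients(now_playing, "video-1")
```

If client-a is the sender of the voice barrage, it is still returned, which matches the statement that the target clients include the sending client.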
Step 103, when the video playing progress of the video played by the target client reaches the video playing time point, controlling the target client to output the voice content, wherein the playing frequency band of the voice content is different from the playing frequency band of the video.
Therefore, the video server according to the embodiment of the present invention may extract the voice content and the video playing time point (i.e., 5 minutes and 20 seconds) in the voice barrage information, and when the video playing progress of the video (corresponding to the video identifier) played by the target client reaches, for example, 5 minutes and 20 seconds, control the target client to output the voice content in the voice barrage information, thereby completing the output of the voice barrage.
If the played voice content and the sound of the played video interfered with each other, the user might be unable to hear one or the other clearly. To avoid this, in the embodiment of the invention the playing frequency band of the voice content is controlled when the voice content is played, so that it differs from the playing frequency band of the played video and the two sounds do not interfere with each other.
In addition, step 103 may be implemented in either of two optional manners. In the first, the video server detects the video playing progress of each target client in real time and, once the detected video playing progress reaches the video playing time point, controls the voice content to be output at the target client; this manner reduces the processing pressure on the target client. In the second, the voice content is associated in advance with the video playing time point of the video corresponding to the video identifier, so that the video playing progress of each target client does not need to be detected in real time; as soon as the video playing progress of a target client reaches the video playing time point, the target client is controlled to output the voice content associated with that time point, which reduces the processing pressure on the video server.
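The second manner, in which voice content is associated with a video playing time point ahead of time, might look like the sketch below. The schedule structure, the function names and the whole-second granularity are assumptions for illustration only.

```python
from collections import defaultdict
from typing import DefaultDict, List, Tuple

# (video id, playing time point in seconds) -> voice contents due at that point
Schedule = DefaultDict[Tuple[str, int], List[bytes]]


def associate(schedule: Schedule, video_id: str, time_point_s: int, voice: bytes) -> None:
    """Associate a voice content with a playing time point of a video."""
    schedule[(video_id, time_point_s)].append(voice)


def due_voice_contents(schedule: Schedule, video_id: str, progress_s: int) -> List[bytes]:
    """Return the voice contents to output when playback reaches progress_s."""
    return list(schedule.get((video_id, progress_s), []))


schedule: Schedule = defaultdict(list)
associate(schedule, "video-1", 320, b"hello")
```

With such an association in place, a single lookup keyed by the current progress replaces continuous polling of every target client.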
In addition, the client of the embodiment of the present invention may be any terminal device, including a mobile terminal (e.g., a mobile phone, a tablet computer) and a non-mobile terminal (e.g., a desktop computer).
By means of the technical scheme of the embodiment of the invention, the voice barrage information sent by the client is received, the voice barrage information comprising the video playing time point of the barrage, the voice content and the video identifier. The target client playing the video is determined from the video identifier, and when the video playing progress of the target client reaches the video playing time point, the target client is controlled to output the voice content in a playing frequency band different from that of the played video. The voice barrage and the video sound are thus output simultaneously without interfering with the sound of the currently played video, and the voice barrage is played normally.
For the same network video, multiple clients may send voice barrages, or the same client may send multiple voice barrages, so that multiple voice barrages may be received in step 101.
Therefore, in an embodiment, if there are multiple pieces of voice barrage information and they correspond to the same video identifier (that is, the video server receives multiple pieces of voice barrage information for the same video), before step 103, the method according to the embodiment of the present invention further includes:
acquiring the voice length of each piece of voice content in each piece of received voice bullet screen information;
the video server side can acquire the voice length of the voice content in each piece of received voice bullet screen information. For example, the video server receives 4 voice barrages for video 1, namely a voice barrage 1, a voice barrage 2, a voice barrage 3 and a voice barrage 4, and the corresponding voice lengths are 5 seconds, 10 seconds, 15 seconds and 15 seconds respectively.
Calculating a video playing time range corresponding to each piece of voice bullet screen information according to the video playing time point and the voice length of each piece of voice bullet screen information;
each piece of voice bullet screen information further comprises a video playing time point when the user triggers the voice bullet screen operation, namely the voice bullet screen triggered at which time point. For example, the video playing time point of the voice bullet screen 1 is 5 minutes and 15 seconds, the video playing time point of the voice bullet screen 2 is 5 minutes and 10 seconds, the video playing time point of the voice bullet screen 3 is 5 minutes and 20 seconds, the video playing time point of the voice bullet screen 4 is 5 minutes and 25 seconds, and the voice lengths of the four voice bullet screens are 5 seconds, 10 seconds, 15 seconds and 15 seconds respectively.
Therefore, according to the video playing time point and the voice length of each piece of voice bullet screen information, the video playing time range (i.e. the video playing time range covered by the voice content) corresponding to the voice bullet screen 1 can be calculated to be 5 minutes 15 seconds to 5 minutes 20 seconds; the video playing time range corresponding to the voice bullet screen 2 is 5 minutes 10 seconds to 5 minutes 20 seconds; the video playing time range corresponding to the voice bullet screen 3 is 5 minutes 20 seconds to 5 minutes 35 seconds; the video playing time range corresponding to the voice bullet screen 4 is 5 minutes 25 seconds to 5 minutes 40 seconds.
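The range calculation above is simply the time point plus the voice length. A minimal sketch reproducing the four example voice bullet screens follows; the function names and the representation of times in seconds are assumptions.

```python
from typing import Dict, Tuple


def play_range(time_point_s: int, voice_length_s: int) -> Tuple[int, int]:
    """Video playing time range covered by a voice content, in seconds."""
    return (time_point_s, time_point_s + voice_length_s)


def fmt(t: int) -> str:
    """Format a number of seconds as minutes:seconds."""
    return f"{t // 60}:{t % 60:02d}"


# voice bullet screen number -> (time point in seconds, voice length in seconds)
barrages = {1: (315, 5), 2: (310, 10), 3: (320, 15), 4: (325, 15)}
ranges: Dict[int, Tuple[int, int]] = {k: play_range(*v) for k, v in barrages.items()}
```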
Scoring is respectively carried out on a plurality of pieces of voice bullet screen information overlapped in the video playing time range;
Video playing time ranges overlap when they share a time interval, that is, two or more time points, rather than only a single time point. For example, the voice bullet screen 1 and the voice bullet screen 3 share only the time point 5 minutes 20 seconds, as do the voice bullet screen 2 and the voice bullet screen 3, so neither of these pairs overlaps. Two groups of overlapping voice bullet screens therefore exist: one group is the voice bullet screen 1 and the voice bullet screen 2 (the overlapped time range is 5 minutes 15 seconds to 5 minutes 20 seconds), and the other group is the voice bullet screen 3 and the voice bullet screen 4 (the overlapped time range is 5 minutes 25 seconds to 5 minutes 35 seconds). Therefore, it is necessary to score the voice bullet screen 1 against the voice bullet screen 2, and the voice bullet screen 3 against the voice bullet screen 4.
In one embodiment, scoring the plurality of pieces of voice barrage information whose video playing time ranges overlap can be realized through the following sub-steps:
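The grouping rule, under which ranges that merely touch at one time point do not count as overlapping, can be sketched with a sort-and-sweep pass. The function name is hypothetical, and chaining ranges into one group through intermediate overlaps is an assumption; the description only gives the pairwise example.

```python
from typing import Dict, List, Optional, Tuple


def group_overlapping(ranges: Dict[int, Tuple[int, int]]) -> List[List[int]]:
    """Group voice bullet screens whose playing time ranges share an interval.

    Ranges that only touch at a single time point are kept in separate groups.
    """
    groups: List[List[int]] = []
    group_end: Optional[int] = None
    # Sort by (start, end) so each range is compared with the current group.
    for key, (start, end) in sorted(ranges.items(), key=lambda kv: kv[1]):
        if group_end is not None and start < group_end:  # strict: shares an interval
            groups[-1].append(key)
            group_end = max(group_end, end)
        else:
            groups.append([key])
            group_end = end
    return groups


ranges = {1: (315, 320), 2: (310, 320), 3: (320, 335), 4: (325, 340)}
groups = [sorted(g) for g in group_overlapping(ranges)]
```

The strict comparison `start < group_end` is what keeps the voice bullet screen 3, which starts exactly at 5 minutes 20 seconds, out of the first group.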
s21, acquiring preset parameter information of each piece of voice bullet screen information;
s22, determining the weight corresponding to each parameter information in the preset parameter information according to a comparison table of the preset parameter information and the weight;
s23, summing the weight of each parameter information in the preset parameter information to obtain the score of each piece of voice bullet screen information;
wherein the preset parameter information includes at least one of the following parameters:
historical comments issued by a client user;
a user level of a client user;
the content relevancy of the voice content and the playing video;
whether the client user is a member of a water army (a paid poster).
In one embodiment, when the preset parameter information includes the history comment issued by the client user, then in executing S22, the following method may be implemented:
judging whether malicious comments exist in the historical comments issued by the client user;
A malicious comment information table may be preset, and whether the historical comments issued by the client user contain malicious comments is judged by searching this preset table.
If the malicious comments exist, determining the weight corresponding to the malicious comments according to a comparison table of preset parameter information and the weight;
The comparison table of preset parameter information and weights sets a corresponding weight for each item of parameter information; for example, the weight corresponding to a malicious comment is -10.
And if the malicious comments do not exist, setting the weight of the historical comment issued by the client user to be zero.
Therefore, the embodiment of the invention can grade the voice barrage of the user according to the historical comments of the user.
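A minimal sketch of the historical-comment check follows. It assumes the malicious comment information table is a set of exact comment strings, which is a simplification made here for illustration; the weight of -10 is the example value from the description.

```python
from typing import Iterable, Set

MALICIOUS_COMMENT_WEIGHT = -10  # example value from the comparison table


def history_weight(history: Iterable[str], malicious_table: Set[str]) -> int:
    """Weight contributed by a user's historical comments.

    Returns the malicious-comment weight if any historical comment is found in
    the preset malicious comment table (exact match assumed), otherwise zero.
    """
    has_malicious = any(comment in malicious_table for comment in history)
    return MALICIOUS_COMMENT_WEIGHT if has_malicious else 0


malicious_table = {"some abusive text"}
```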
In addition, in another embodiment, when the preset parameter information includes the user level of the client user, in executing S22, the weight corresponding to the user level may be looked up in the comparison table of the preset parameter information and the weight. For example, the weight set for VIP users in the lookup table is 10, the weight set for VVIP users is 20, the weight set for star users is 15, and so on.
For example, if the level of the client user who sends the voice bullet screen 1 is VVIP, the weight 20 may be obtained.
In addition, in another embodiment, when the preset parameter information includes a content correlation degree between a voice content and a playing video, the obtaining of the preset parameter information of each piece of voice bullet screen information includes:
performing semantic recognition on the voice content;
judging whether the recognized semantics are matched with preset junk characters;
The preset junk characters may be meaningless characters, such as 'haha', 'la', and the like.
If the recognized semantic meaning is matched with the preset junk text (for example, if the matching degree reaches a preset matching degree threshold value, for example, 80%, it indicates that most contents in the voice content are invalid contents), the content relevance between the voice content and the played video is recorded as zero, that is, the value of the parameter information is zero.
If the identified semantics are not matched with the preset junk characters (for example, the matching degree is smaller than a preset matching degree threshold value, for example, 80%), calculating the matching degree between the identified semantics and a preset video tag corresponding to the played video;
that is to say, in the embodiment of the present invention, video tags, such as a video type tag (including but not limited to a speech, a scenario, a fun, and the like), a director name tag, a role name tag, a tag describing a video scenario, and the like, may be set in advance for each video provided by the video server. Then for the voice content that is not spam, a degree of match between the identified semantics and the video tags corresponding to the video is calculated. The degree of match may indicate a degree of content correlation between the content spoken by the user and the video.
Determining the matching degree obtained by calculation as the content correlation degree of the voice content and the played video;
therefore, when the preset parameter information includes the content relevance between the voice content and the played video, the content relevance between the voice content and the played video can be obtained when the preset parameter information of each piece of voice bullet screen information is obtained.
Similarly, in step S22, the weights corresponding to different content relevance degrees are searched in the comparison table of the preset parameter information and the weights, so as to determine the weight corresponding to the content relevance degree of the voice content and the playing video.
Therefore, the embodiment of the invention can combine the content correlation degree between the voice content of the voice bullet screen and the playing video when scoring the voice bullet screen, and set higher weight for the content correlation degree.
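The junk check and tag matching can be sketched as below. The description does not define how a matching degree is computed, so both measures here are assumptions: the junk matching degree is taken as the fraction of recognized words found in the junk list, and the tag matching degree as the fraction of video tags mentioned. Speech recognition is assumed to have already produced the word list.

```python
from typing import List, Set

JUNK_THRESHOLD = 0.8  # the 80% example threshold from the description


def content_relevance(words: List[str], junk_words: Set[str], video_tags: Set[str]) -> float:
    """Content relevance between recognized speech and a video's preset tags."""
    if not words or not video_tags:
        return 0.0
    # Hypothetical junk matching degree: share of words that are junk.
    junk_degree = sum(w in junk_words for w in words) / len(words)
    if junk_degree >= JUNK_THRESHOLD:
        return 0.0  # mostly invalid content, relevance recorded as zero
    # Hypothetical tag matching degree: share of tags mentioned in the speech.
    return len(video_tags & set(words)) / len(video_tags)


junk = {"haha", "la"}
```

Any real system would likely use a more robust semantic similarity measure; the point of the sketch is only the order of the two checks.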
In addition, in another embodiment, when the preset parameter information includes whether the client user is a member of a water army, in step S22, the weight corresponding to a water army user may be looked up in the comparison table of the preset parameter information and the weight. For example, the weight set for a water army user in the comparison table is -15.
Whether the client user belongs to a water army may be determined from the voice barrage information by any scheme in the prior art, which is not specifically limited by the present invention.
In summary, the present invention can perform addition processing on the weights respectively corresponding to each preset parameter information to obtain the score of a certain voice barrage.
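The summation in sub-step S23 can be sketched as a table lookup followed by a sum. The weight values are the examples given in the description; the key names and the treatment of unknown keys as zero are assumptions.

```python
from typing import Dict, List

# Comparison table of preset parameter information and weights (example values).
WEIGHT_TABLE: Dict[str, int] = {
    "malicious_comment": -10,
    "level:VIP": 10,
    "level:VVIP": 20,
    "level:star": 15,
    "water_army": -15,
}


def score(parameter_keys: List[str], table: Dict[str, int] = WEIGHT_TABLE) -> int:
    """Sum the weights of the parameters that apply to one voice bullet screen."""
    return sum(table.get(key, 0) for key in parameter_keys)


# A VVIP user with a malicious comment in their history: 20 + (-10).
example_score = score(["level:VVIP", "malicious_comment"])
```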
Optionally, in an embodiment, before scoring the plurality of pieces of voice barrage information with overlapping time ranges in the video playing time range, the method according to an embodiment of the present invention may further include:
and removing noise and murmur from the voice content in the voice bullet screen information.
Therefore, the scoring accuracy can be improved by processing the voice content in the voice bullet screen to eliminate noise and murmur.
Determining the voice bullet screen information with the highest score as the target voice bullet screen information;
For each group of voice bullet screens with overlapping time ranges, the target voice bullet screen information with the highest score can be determined. For example, the score of voice bullet screen 1 is 80 and the score of voice bullet screen 2 is 90; the score of voice bullet screen 3 is 50 and the score of voice bullet screen 4 is 80. Therefore, the highest-scoring target voice bullet screen in the first group is voice bullet screen 2, and the highest-scoring target voice bullet screen in the second group is voice bullet screen 4.
Correspondingly, when step 103 is executed, when the video playing progress of the video played by the target client reaches the video playing time point in the target voice barrage information, the target client may be controlled to output the voice content in the target voice barrage information.
Specifically, when the video playing progress of the video (corresponding to the video identifier) played by the target client reaches 5 minutes and 10 seconds, controlling the target client to output the voice content of the voice barrage 2; when the video playing progress of the video (corresponding to the video identifier) played by the target client reaches 5 minutes and 25 seconds, controlling the target client to output the voice content of the voice barrage 4.
Here, the scheme of the embodiment of the present invention is only schematically illustrated by taking 4 voice bullet screens as an example, and in practical application, the scheme is not limited to the above 4 voice bullet screens.
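The grouping and selection in this example can be sketched as follows. The scores (80, 90, 50, 80) and the playing time points of voice barrages 2 and 4 (5 minutes 10 seconds and 5 minutes 25 seconds) come from the text; the time points of barrages 1 and 3, all voice lengths, and the names `Barrage` and `pick_targets` are hypothetical. Treating transitively overlapping ranges as one group is also an assumption.

```python
from dataclasses import dataclass


@dataclass
class Barrage:
    barrage_id: int
    time_point: float  # video playing time point, in seconds
    length: float      # voice length, in seconds
    score: int

    @property
    def end(self):
        # The video playing time range is [time_point, time_point + length).
        return self.time_point + self.length


def pick_targets(barrages):
    """Group barrages with overlapping time ranges; keep the top score per group."""
    groups = []
    group_end = None
    for b in sorted(barrages, key=lambda b: b.time_point):
        if groups and b.time_point < group_end:
            groups[-1].append(b)              # overlaps the current group
            group_end = max(group_end, b.end)
        else:
            groups.append([b])                # starts a new group
            group_end = b.end
    return [max(group, key=lambda b: b.score) for group in groups]
```

With barrages 1 and 2 overlapping around 5:10 and barrages 3 and 4 overlapping around 5:25, this sketch selects barrage 2 for the first group and barrage 4 for the second.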
Optionally, in an embodiment, after step 102, the method according to an embodiment of the present invention further includes: displaying a voice bullet screen icon at the video playing time point of the video playing interface of the target client.
Specifically, as shown in fig. 2, for the voice barrage 2 with the highest score, the playing time point is 5 minutes and 10 seconds. In the embodiment of the present invention, a voice barrage icon 12 may be displayed at 5 minutes and 10 seconds on the video playing progress bar of the video playing interface shown in fig. 2 to remind the user that there is a voice barrage. In addition, voice barrage icons 13, 14, and 15 at other time points are also shown on the video playing progress bar in fig. 2.
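One possible way to place such an icon on the progress bar is a simple proportional mapping from the playing time point to a horizontal offset. The function name `icon_offset_px`, the pixel-based layout, and the clamping behavior are assumptions for illustration; the text does not describe how the icon position is computed.

```python
def icon_offset_px(time_point_s, video_duration_s, bar_width_px):
    """Map a video playing time point to a horizontal offset on the progress bar."""
    if video_duration_s <= 0:
        raise ValueError("video duration must be positive")
    # Clamp so that out-of-range time points stay on the bar.
    fraction = min(max(time_point_s / video_duration_s, 0.0), 1.0)
    return round(fraction * bar_width_px)
```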
Optionally, in another embodiment, before step 101, the method according to an embodiment of the present invention may further include:
and if the fact that the client user triggers the voice bullet screen operation is detected, reducing the video volume corresponding to the client video playing interface.
Therefore, when the user is about to input the voice content of the voice bullet screen, the volume of the currently played video is reduced, and the voice content collected by the video server side can be clearer.
In another embodiment, the method of the embodiment of the present invention may further limit the duration of the voice content input by each client user, for example, the maximum length is 20 seconds.
In addition, in another embodiment, the client may first pseudo-write the collected voice content locally on the client, and then upload the voice content to the video server together with the video identifier and the video playing time point. Therefore, when step 103 is executed, the locally pseudo-written voice content can be played directly, which reduces the playing delay of the voice barrage.
In addition, in another embodiment, a user may set on the client side whether to enable the voice barrage function, so that the method of the embodiment of the present invention controls only those target clients that have enabled the voice barrage function to play the voice content in the target voice barrage information.
If the user disables the voice barrage function on the client side, the method of the embodiment of the present invention may further convert the voice content to be played (which need not be scored, and may be the voice content corresponding to all received voice barrage information) into text, so as to control the target clients that have disabled the voice barrage function to output the text.
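The choice between voice output and text output can be sketched as a small dispatch on the client's setting. The function `build_output`, the dictionary shape of the result, and the `transcribe` callable (standing in for whatever speech-to-text component is used) are all hypothetical.

```python
def build_output(voice_content, voice_enabled, transcribe):
    """Decide what a target client should output for one voice barrage."""
    if voice_enabled:
        # Voice barrage function is on: play the voice content itself.
        return {"type": "voice", "payload": voice_content}
    # Voice barrage function is off: output the content converted to text.
    return {"type": "text", "payload": transcribe(voice_content)}
```

Passing the speech-to-text step in as a parameter keeps the sketch independent of any particular recognition service.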
In the above embodiment, the method according to the embodiment of the present invention defaults to playing the voice content in the voice bullet screen information with the highest score for the voice bullet screen information with overlapping time ranges.
In other embodiments, the user may freely set the desired voice barrage according to needs, for example, the user may select to output the voice barrage input by the user a on the video playing interface, or output the voice barrage with the lowest score in the voice barrage information with overlapping time ranges, and so on.
In summary, the method of the embodiment of the present invention enables video playing and the voice barrage to coexist in a friendly manner, so that the user of the video player can express the viewing experience more efficiently without interrupting or disturbing the video playing, thereby improving the user experience; in addition, various users can send voice barrages, so that communication between special users (including high-quality viewing users, star users, householders, and the like) and common users can be enriched.
It should be noted that, for simplicity of description, the method embodiments are described as a series of acts or combinations of acts, but those skilled in the art will recognize that the present invention is not limited by the described order of acts, as some steps may occur in other orders or concurrently in accordance with the embodiments of the present invention. Further, those skilled in the art will appreciate that the embodiments described in the specification are preferred embodiments and that the acts involved are not necessarily required by the invention.
Corresponding to the method provided by the embodiment of the present invention, referring to fig. 3, a block diagram of a structure of an embodiment of a device for implementing a voice barrage according to the present invention is shown, and the device specifically includes the following modules:
the receiving module 31 is configured to receive voice barrage information sent by a client, where the voice barrage information includes a video identifier and a voice content corresponding to a video playing interface of the client, and a video playing time point when a client user triggers a voice barrage operation;
a first determining module 32, configured to determine a video identifier of a playing video and a target client that matches the video identifier;
a control module 33, configured to control the target client to output the voice content when the video playing progress of the video played by the target client reaches the video playing time point, where a playing frequency band of the voice content is different from a playing frequency band of the video.
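The cooperation of the three modules can be sketched as a small in-memory server. The class `VoiceBarrageServer`, its method names, and the dictionary-based client records are hypothetical, and the sketch does not model the separate playing frequency bands of the voice content and the video.

```python
class VoiceBarrageServer:
    def __init__(self):
        self.barrages = []  # received voice barrage information

    def receive(self, video_id, voice_content, time_point):
        """Receiving module 31: store one piece of voice barrage information."""
        self.barrages.append(
            {"video_id": video_id, "voice": voice_content, "time_point": time_point}
        )

    def target_clients(self, clients, video_id):
        """First determining module 32: clients playing the matching video."""
        return [c for c in clients if c["video_id"] == video_id]

    def due_outputs(self, client, previous_progress, progress):
        """Control module 33: voice content whose time point was just reached."""
        return [
            b["voice"]
            for b in self.barrages
            if b["video_id"] == client["video_id"]
            and previous_progress < b["time_point"] <= progress
        ]
```

Checking the half-open interval between the previous and the current playing progress is one way to make sure each voice barrage is triggered once as playback passes its time point.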
Optionally, the apparatus further comprises:
the acquisition module is used for acquiring the voice length of each piece of voice content in each piece of received voice bullet screen information if the number of the voice bullet screen information is multiple and the multiple pieces of voice bullet screen information correspond to the same video identifier;
the first calculation module is used for calculating a video playing time range corresponding to each piece of voice bullet screen information according to the video playing time point and the voice length of each piece of voice bullet screen information;
the scoring module is used for scoring the voice barrage information overlapped in the video playing time range;
the second determining module is used for determining the target voice barrage information with the highest score;
optionally, the control module 33 includes:
and the control submodule is used for controlling the target client to output the voice content in the target voice bullet screen information when the video playing progress of the video played by the target client reaches the video playing time point in the target voice bullet screen information.
Optionally, the scoring module comprises:
the acquisition submodule is used for acquiring preset parameter information of each piece of voice bullet screen information;
the determining submodule is used for determining the weight corresponding to each parameter information in the preset parameter information according to a comparison table of the preset parameter information and the weight;
the calculation submodule is used for summing the weight of each piece of parameter information in the preset parameter information to obtain the score of each piece of voice bullet screen information;
optionally, the preset parameter information includes at least one of the following parameters:
historical comments issued by a client user;
a user level of a client user;
the content relevancy of the voice content and the playing video;
whether the client user is a paid poster (water army).
Optionally, the determining sub-module includes:
the judging unit is used for judging whether malicious comments exist in the historical comments issued by the client user or not when the preset parameter information comprises the historical comments issued by the client user;
the first determining unit is used for determining the weight corresponding to the malicious comment according to a comparison table of preset parameter information and the weight if the malicious comment exists;
and the setting unit is used for setting the weight of the historical comment issued by the client user to be zero if no malicious comment exists.
Optionally, the apparatus further comprises:
the recognition module is used for performing semantic recognition on the voice content when the preset parameter information comprises the content correlation degree of the voice content and the played video;
the judging module is used for judging whether the recognized semantics are matched with the preset junk characters;
the second calculation module is used for calculating the matching degree between the identified semantics and a preset video tag corresponding to the playing video if the identified semantics are not matched with the preset spam characters;
the third determining module is used for determining the matching degree obtained by calculation as the content correlation degree of the voice content and the played video;
and the setting module is used for recording the content relevancy of the voice content and the played video as zero if the recognized semantics are matched with the preset junk characters.
Optionally, the apparatus further comprises:
and the display module is used for displaying the voice barrage icon on the video playing time point of the video playing interface of the target client.
Since the device embodiment is basically similar to the method embodiment, it is described briefly; for relevant details, refer to the corresponding description of the method embodiment.
The embodiments in the present specification are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, apparatus, or computer program product. Accordingly, embodiments of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
Embodiments of the present invention are described with reference to flowchart illustrations and/or block diagrams of methods, terminal devices (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing terminal to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing terminal, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing terminal to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing terminal to cause a series of operational steps to be performed on the computer or other programmable terminal to produce a computer implemented process such that the instructions which execute on the computer or other programmable terminal provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
While preferred embodiments of the present invention have been described, additional variations and modifications of these embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all such alterations and modifications as fall within the scope of the embodiments of the invention.
Finally, it should also be noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or terminal that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or terminal. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or terminal that comprises the element.
The method for implementing the voice barrage and the device for implementing the voice barrage provided by the invention are described in detail, a specific example is applied in the text to explain the principle and the implementation mode of the invention, and the description of the above embodiment is only used for helping to understand the method and the core idea of the invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims (10)

CN201710862107.0A | 2017-09-21 | 2017-09-21 | Method and device for realizing voice barrage | Active | CN107613400B (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
CN201710862107.0A, CN107613400B (en) | 2017-09-21 | 2017-09-21 | Method and device for realizing voice barrage


Publications (2)

Publication Number | Publication Date
CN107613400A (en) | 2018-01-19
CN107613400B (en) | 2021-03-26

Family

ID=61061622

Family Applications (1)

Application Number | Status | Publication | Priority Date | Filing Date | Title
CN201710862107.0A | Active | CN107613400B (en) | 2017-09-21 | 2017-09-21 | Method and device for realizing voice barrage

Country Status (1)

Country | Link
CN (1) | CN107613400B (en)

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN108632658B (en) * | 2018-03-14 | 2021-03-16 | 维沃移动通信有限公司 | Bullet screen display method and terminal
CN110620718A (en) * | 2018-06-19 | 2019-12-27 | 中移(杭州)信息技术有限公司 | Comment method and comment device for audio file
CN109121014A (en) * | 2018-10-10 | 2019-01-01 | 四川长虹电器股份有限公司 | Barrage dissemination method and system
CN110149549B (en) * | 2019-02-26 | 2022-09-13 | 腾讯科技(深圳)有限公司 | Information display method and device
CN110267113B (en) * | 2019-06-14 | 2021-10-15 | 北京字节跳动网络技术有限公司 | Video file processing method, system, medium, and electronic device
CN113132785A (en) * | 2019-12-30 | 2021-07-16 | 阿里巴巴集团控股有限公司 | Multimedia data method, device, electronic equipment and computer storage medium
CN111432284B (en) * | 2020-04-30 | 2022-09-16 | 成都新潮传媒集团有限公司 | Bullet screen interaction method of multimedia terminal and multimedia terminal
CN111586469B (en) * | 2020-05-12 | 2021-10-26 | 腾讯科技(深圳)有限公司 | Bullet screen display method and device and electronic equipment
CN113938723B (en) * | 2020-06-29 | 2023-07-21 | 大众问问(北京)信息科技有限公司 | Bullet screen playing method, bullet screen playing device and bullet screen playing equipment
CN114125543B (en) * | 2020-08-28 | 2023-03-21 | 上海哔哩哔哩科技有限公司 | Bullet screen processing method, computing equipment and bullet screen processing system
CN114390361A (en) * | 2020-10-20 | 2022-04-22 | 上海哔哩哔哩科技有限公司 | Audio bullet screen processing method and device
CN113573155A (en) * | 2021-07-22 | 2021-10-29 | 深圳创维-Rgb电子有限公司 | Voice barrage implementation method, device, intelligent device and readable storage medium
CN113691838A (en) * | 2021-08-24 | 2021-11-23 | 北京快乐茄信息技术有限公司 | Audio bullet screen processing method and device, electronic equipment and storage medium
CN114296604B (en) * | 2021-12-14 | 2024-07-16 | 咪咕音乐有限公司 | Bullet screen information display method, device, equipment and medium
CN114598922A (en) * | 2022-03-07 | 2022-06-07 | 深圳创维-Rgb电子有限公司 | Voice message interaction method, device, equipment and storage medium
CN115665496B (en) * | 2022-12-05 | 2023-03-10 | 深圳市华曦达科技股份有限公司 | Online video voice interaction method, system and storage medium
CN115942061A (en) * | 2022-12-23 | 2023-04-07 | 百度时代网络技术(北京)有限公司 | Voice interaction information processing method and device, electronic equipment and storage medium
CN116546260A (en) * | 2023-05-18 | 2023-08-04 | 维沃移动通信有限公司 | Audio bullet screen processing method and device, electronic equipment and readable storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN103389971A (en) * | 2013-07-04 | 2013-11-13 | 北京卓易讯畅科技有限公司 | Method and equipment for determining high-quality grade of comment content corresponding to application
CN103873935A (en) * | 2012-12-17 | 2014-06-18 | 联想(北京)有限公司 | Data processing method and device
CN104125483A (en) * | 2014-07-07 | 2014-10-29 | 乐视网信息技术(北京)股份有限公司 | Audio comment information generating method and device and audio comment playing method and device
CN104335558A (en) * | 2012-05-27 | 2015-02-04 | 高通股份有限公司 | System and methods for managing concurrent audio messages
CN104756473A (en) * | 2012-09-14 | 2015-07-01 | 谷歌公司 | Handling concurrent speech
CN105681820A (en) * | 2016-01-08 | 2016-06-15 | 天脉聚源(北京)科技有限公司 | Video barrage recording method and device
CN105828100A (en) * | 2016-03-21 | 2016-08-03 | 乐视网信息技术(北京)股份有限公司 | Audio and video files simultaneous playing method, device and system
CN105847939A (en) * | 2016-05-12 | 2016-08-10 | 乐视控股(北京)有限公司 | Bullet screen play method, bullet screen play device and bullet screen play system
CN105872822A (en) * | 2015-12-15 | 2016-08-17 | 乐视网信息技术(北京)股份有限公司 | Video playing method and video playing system


Also Published As

Publication number | Publication date
CN107613400A (en) | 2018-01-19


Legal Events

Code | Title
PB01 | Publication
SE01 | Entry into force of request for substantive examination
GR01 | Patent grant
