Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
At present, few methods exist for identifying advertisements, so targeted delivery of advertisements cannot be achieved, which harms both the value conversion efficiency of the advertisements and the viewing experience of users. The few existing methods identify advertisements slowly, process large amounts of data, and cannot monitor advertisements intelligently.
Sound masking (audio masking) refers to a phenomenon in which the threshold of one sound rises due to the presence of another sound. When the pure tone is masked by white noise, the decibel value of the pure tone threshold rise is mainly determined by the power of a narrow-band noise centered on the frequency of the pure tone.
As noted above, the masking of a pure tone is determined mainly by the power of a narrow band of noise centered on the pure tone frequency. The frequency width of this narrow band varies with the frequency of the pure tone and is called the critical band. On the auditory frequency scale, the width of one critical band is equal to 1 Bark. The concept of critical bands was proposed by the American scientist H. These frequency bands are associated with auditory filters. In frequency, a strong low-frequency pure tone may mask a high-frequency pure tone. In time, besides simultaneous masking, a sound can mask a sound that follows it, which is called forward masking, and a sound can also mask a sound that precedes it, which is called backward masking. The masking produced by a complex tone is complicated and depends on its frequency composition and phase relationships; it sometimes produces a "time window" effect, in which the masking produced during a certain time period is small. If a signal is input to one ear and noise to the other, masking may still occur centrally.
If an acoustic signal is presented together with noise, it becomes weaker or completely inaudible; that is, the hearing threshold of the signal rises. This is the masking effect: the noise masks the signal. The magnitude of the masking effect depends on the frequency relationship between the noise and the signal. Generally, the closer the frequencies of the signal and the noise, the greater the masking, and low-frequency noise masks a higher-frequency signal more effectively than high-frequency noise masks a lower-frequency signal. In masking experiments that apply noise of variable bandwidth to pure tone signals, the masking of a 1000 Hz pure tone by noise centered at 1000 Hz increases as the noise bandwidth increases. Once the bandwidth reaches 100 Hz, however, widening it further no longer changes the masking. That is, only the noise within this band contributes to the masking, and noise outside it has no effect. This 100 Hz band is the critical band, and it widens as the frequency increases.
Because of the time-domain masking effect of the human ear, people can hear faint sounds in a quiet environment but often cannot perceive relatively quiet sounds in a noisy environment. Therefore, when the audio signal is loud, the intensity of the masking sound may be increased, and when the audio signal is quiet, the intensity of the masking sound is appropriately reduced.
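The level adaptation described above can be sketched as follows. This is only an illustrative sketch: the function names frame_rms and masking_gain, the proportional rule, and the values of ratio, min_gain and max_gain are assumptions, not values given in this description.

```python
import math

def frame_rms(frame):
    """Root-mean-square level of one frame of samples in [-1.0, 1.0]."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))

def masking_gain(frame, ratio=0.05, min_gain=0.001, max_gain=0.05):
    """Hypothetical rule: embed at a fixed fraction of the host level.

    A louder host frame hides a stronger masking sound, so the gain grows
    with the frame RMS; the clamp keeps the gain inside a fixed range.
    """
    return max(min_gain, min(max_gain, ratio * frame_rms(frame)))

# A loud 440 Hz tone and the same tone attenuated by 40 dB (8 kHz sampling).
loud = [0.8 * math.sin(2 * math.pi * 440 * n / 8000) for n in range(800)]
quiet = [0.01 * s for s in loud]

loud_gain = masking_gain(loud)    # proportional to the host level
quiet_gain = masking_gain(quiet)  # clamped to min_gain
```

In this sketch the gain for the quiet frame falls to the lower clamp, which corresponds to reducing the intensity of the masking sound when the audio signal is quiet.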
The masking sound can be parsed by a masking sound decoder. Put simply, a masking sound decoder is a spread spectrum receiver implemented in software, and it is based on correlation with a pseudo-random signal.
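A correlation-based spread spectrum receiver of this kind can be illustrated with the minimal sketch below. It is not the decoder of the invention: the function names, the chip count, the embedding strength, and the use of Gaussian noise as a stand-in for the host audio are all assumptions made for illustration.

```python
import random

def pn_sequence(length, seed):
    """Pseudo-random +/-1 chip sequence shared by embedder and decoder."""
    rng = random.Random(seed)
    return [rng.choice((-1.0, 1.0)) for _ in range(length)]

def embed_bits(host, bits, pn, strength):
    """Add one low-level copy of the PN sequence per bit (sign carries the bit)."""
    chips = len(pn)
    out = list(host)
    for i, bit in enumerate(bits):
        sign = 1.0 if bit else -1.0
        for k in range(chips):
            out[i * chips + k] += strength * sign * pn[k]
    return out

def decode_bits(signal, n_bits, pn):
    """Correlate each block with the PN sequence; the sign of the sum is the bit."""
    chips = len(pn)
    bits = []
    for i in range(n_bits):
        block = signal[i * chips:(i + 1) * chips]
        corr = sum(s * c for s, c in zip(block, pn))
        bits.append(1 if corr > 0 else 0)
    return bits

CHIPS = 2048                       # chips per bit (assumed)
pn = pn_sequence(CHIPS, seed=1234)
payload = [1, 0, 1, 1, 0, 0, 1, 0]

# Gaussian noise stands in for the host audio in this sketch.
noise = random.Random(7)
host = [noise.gauss(0.0, 1.0) for _ in range(CHIPS * len(payload))]

marked = embed_bits(host, payload, pn, strength=0.2)
recovered = decode_bits(marked, len(payload), pn)
```

The embedded component is far weaker than the host in every sample, but the correlation sums it coherently over all chips while the host contribution grows only with the square root of the chip count, which is why the bits can be recovered.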
For example, to decode the masking sound, a decoder may be provided that detects the masking sound embedded or encoded by the masking sound embedder. The decoder may gain access to the audio presented by the playback device through a physical (e.g., electrical) connection to the playback device's speakers and/or to an audio line output (if available).
Fig. 1 is a schematic diagram illustrating a scene architecture of an advertisement recognition system in an embodiment of the present invention.
The playing device 102 may be a terminal such as a digital television, a digital television box, a smart phone, a tablet computer, or a computer. Although not shown in fig. 1, the playing device 102 may also be a large-scale media playing device, such as a public media advertisement delivery device or a subway video player.
In fig. 1, the identification terminal is disposed in the playback device 102, or, put differently, the identification terminal is the playback device 102 itself. For example, the playback device 102 is a smart phone, a tablet computer, a computer, or the like, and the identification module is embedded as software, a plug-in, or the like in, for example, an Android system, a Windows system, or an Apple system.
Although not shown in fig. 1, the identification terminal may also be independent of the playback device 102 and connected to it physically or by signal, so as to identify the audio played by the playback device 102.
The recognition terminal extracts an audio signal from the played audio and video advertisement, recognizes a masking sound in the audio signal, and transmits the recognized masking sound to the server 100.
Specifically, as shown in fig. 2, the identification terminal 120 in this embodiment includes:
the identification module 121, configured to extract an audio signal from the played audio/video advertisement and identify a masking sound in the audio signal;
the transmission module 122, configured to transmit the identified masking sound to the server 100, so that the server 100 determines the playing condition of the advertisement according to the identified masking sound.
In some embodiments, the identification terminal 120 may extract the masking sound from the played audio signal through a masking sound decoding technique and transmit the masking sound to the server 100 through a network.
The identification terminal 120 may decode in real time and transmit to the server 100 in real time, or the identification terminal 120 may decode after playing for a certain time, which is not limited in the present invention.
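The two timing options can be sketched on the transmission side as follows. The class MaskingSoundReporter, its batch_size parameter, and the string payloads are hypothetical and only illustrate the difference between immediate and deferred reporting; a real terminal would send over a network rather than append to a list.

```python
class MaskingSoundReporter:
    """Hypothetical transmission helper for an identification terminal.

    batch_size=1 forwards every decoded result immediately (real time);
    a larger batch_size holds results until enough have accumulated.
    """

    def __init__(self, send, batch_size=1):
        self._send = send              # callable that delivers a list of results
        self._batch_size = batch_size
        self._pending = []

    def report(self, payload):
        self._pending.append(payload)
        if len(self._pending) >= self._batch_size:
            self.flush()

    def flush(self):
        """Send whatever is pending, e.g. when playback ends."""
        if self._pending:
            self._send(list(self._pending))
            self._pending.clear()

# Lists stand in for the network connection to the server in this sketch.
sent_realtime, sent_deferred = [], []
realtime = MaskingSoundReporter(sent_realtime.append, batch_size=1)
deferred = MaskingSoundReporter(sent_deferred.append, batch_size=3)

for code in ("ad-A", "ad-A", "ad-B", "ad-B"):
    realtime.report(code)
    deferred.report(code)
deferred.flush()
```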
The media platform 101 is capable of delivering audio-video advertisements and audio-video programs, and includes, but is not limited to, broadcast television stations, network video service providers, and the like.
The media platform 101 delivers the audio-video advertisement embedded with the masking sound to the playing device 102.
Specifically, the media platform 101 in fig. 3 includes:
the acquisition module 111, configured to acquire the audio-video advertisement embedded with the masking sound; and
the delivery module 112, configured to deliver the audio-video advertisement, so that the identification terminal 120 extracts an audio signal from the played audio-video advertisement, identifies the masking sound in the audio signal, and transmits the masking sound to the server 100, and the server 100 determines the playing condition of the advertisement according to the identified masking sound.
The media platform 101 may deliver uniformly to all the playing devices 102. For example, audio-video advertisement A is delivered to all the playing devices 102 in the network through a network channel in one time period, and audio-video advertisement B is delivered in another time period, so that any playing device 102 locked to the network channel can receive the advertisement data and then play it in real time or record it.
The media platform 101 may also perform targeted delivery for different playing devices, for example, playing a first type of advertisement for smart televisions and a second type of advertisement for mobile playing terminals. Because users focus on different things when using a smart phone and a smart television, such delivery is more targeted. Of course, the present invention is not limited thereto.
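The two delivery modes can be illustrated with the short sketch below. The schedule, the device categories, and the function names are hypothetical placeholders chosen for illustration and do not come from this description.

```python
# Hypothetical schedule and device categories (assumptions for illustration).
UNIFORM_SCHEDULE = [
    (0, 1800, "advertisement A"),     # first time period, in seconds
    (1800, 3600, "advertisement B"),  # second time period
]
TARGETED_TYPES = {
    "smart_tv": "first type of advertisement",
    "mobile": "second type of advertisement",
}

def uniform_delivery(second_in_hour):
    """Every device locked to the channel receives the same advertisement."""
    for start, end, ad in UNIFORM_SCHEDULE:
        if start <= second_in_hour < end:
            return ad
    return None  # nothing scheduled at this time

def targeted_delivery(device_type):
    """Pick the advertisement type by device category; None if unknown."""
    return TARGETED_TYPES.get(device_type)
```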
In some alternative embodiments, the advertisement embedded with the masking sound may be provided through the server 100. The interaction between the server 100 and the playing device, the interaction between the media platform 101 and the server 100, and the interaction between the media platform 101 and the playing device are implemented through a transmission network, which may be a broadcast television network, the Internet, a cable transmission, etc., and the present invention is not limited thereto.
In this embodiment, the server 100 provides the audio-video advertisement embedded with the masking sound, so that the media platform 101 delivers the audio-video advertisement.
Specifically, the server 100 in fig. 4 includes: the receiving module 101, configured to receive a masking sound and determine the playing condition of an advertisement according to the masking sound, wherein the masking sound is obtained by the identification terminal 120 extracting an audio signal from a played audio/video advertisement and identifying the masking sound in the audio signal; and
the advertisement providing module 102, configured to provide the audio-video advertisement embedded with the masking sound, so that the media platform 101 delivers the audio-video advertisement.
Since the masking sound corresponds to the audio signal, after the identification terminal 120 transmits the identified masking sound to the server 100, the server 100 can determine the length (i.e., the playing duration) of the audio/video advertisement played by the playing device 102 according to this correspondence, determine the playing time according to the transmission time, and determine the transmitting device terminal according to the transmission path, thereby monitoring the advertisement. On this basis, a user preference database can be established (for example, binding the degree of preference for an advertisement to the device and evaluating that degree of preference according to the playing duration). A complete intelligent advertisement monitoring system is thus established, in which advertisement information is collected directly from the identification of the advertisement without additional steps or links, saving manpower and material resources.
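One possible shape for such a user preference database is sketched below. The record fields, the class names, and the preference formula (average fraction of the advertisement that was played) are assumptions made for illustration; this description does not specify how the degree of preference is computed.

```python
from collections import defaultdict
from dataclasses import dataclass

@dataclass(frozen=True)
class MaskingReport:
    terminal_id: str       # derived from the transmission path (assumed field)
    ad_id: str             # identifier carried by the masking sound (assumed field)
    received_at: float     # transmission time, used as the playing time
    masked_seconds: float  # length of the recognized masking sound

class PreferenceDatabase:
    """Binds an advertisement preference score to each reporting terminal."""

    def __init__(self):
        self._reports = defaultdict(list)

    def record(self, report):
        self._reports[(report.terminal_id, report.ad_id)].append(report)

    def play_times(self, terminal_id, ad_id):
        return [r.received_at for r in self._reports.get((terminal_id, ad_id), [])]

    def preference(self, terminal_id, ad_id, ad_length):
        """Hypothetical score: average fraction of the advertisement played."""
        reports = self._reports.get((terminal_id, ad_id), [])
        if not reports or ad_length <= 0:
            return 0.0
        watched = sum(min(r.masked_seconds, ad_length) for r in reports)
        return watched / (ad_length * len(reports))

db = PreferenceDatabase()
db.record(MaskingReport("terminal-1", "ad-A", 1000.0, 30.0))  # played in full
db.record(MaskingReport("terminal-1", "ad-A", 5000.0, 15.0))  # stopped halfway
score = db.preference("terminal-1", "ad-A", ad_length=30.0)
```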
The data extraction error rate is also an important technical index of a masking sound scheme. Interference from the physical space, attenuation and distortion of signals transmitted in the channel, and deliberate data transformations and attacks all increase the data extraction error rate. To reduce this error rate, feature codes can be set for comparison and correction.
In the preferred embodiment, the server 100 further comprises a feature code providing module 104 for providing a feature code containing the masking sound, so that the identification terminal 120 receives the feature code and identifies the masking sound accordingly.
The feature code may be used for verification or auxiliary identification. The feature code may be masking sound data that has already been parsed, so that after parsing, the playing device compares the masking sound data it parsed with the feature code. If the two are consistent, or if the similarity reaches a certain threshold (e.g., 99%) and the proportion of the parsed masking sound data that reaches this similarity exceeds a certain ratio (for example, the part reaching 99% similarity makes up 80% of the parsed masking sound), the masking sound is considered to match the feature code.
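The two-level comparison can be read as follows; this is one possible interpretation rather than the definitive rule. The sketch assumes that the parsed masking sound and the feature code are bit sequences, that similarity is measured per fixed-length segment as the fraction of equal bits, and that segment_len is 100; none of these details are stated in this description.

```python
def segment_similarity(a, b):
    """Fraction of positions at which two equal-length segments agree."""
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches / len(a)

def matches_feature_code(decoded, feature, segment_len=100,
                         similarity_threshold=0.99, ratio_threshold=0.80):
    """True if enough segments of the decoded data are close to the feature code."""
    n_segments = min(len(decoded), len(feature)) // segment_len
    if n_segments == 0:
        return False
    good = 0
    for i in range(n_segments):
        part = slice(i * segment_len, (i + 1) * segment_len)
        if segment_similarity(decoded[part], feature[part]) >= similarity_threshold:
            good += 1
    return good / n_segments >= ratio_threshold

def corrupt(bits, segment_index, flips=10, segment_len=100):
    """Flip the first `flips` bits of one segment (simulated extraction errors)."""
    out = list(bits)
    start = segment_index * segment_len
    for k in range(start, start + flips):
        out[k] ^= 1
    return out

feature = [i % 2 for i in range(1000)]                     # 10 segments
one_bad = corrupt(feature, 0)                              # 9 of 10 segments intact
three_bad = corrupt(corrupt(corrupt(feature, 0), 1), 2)    # 7 of 10 segments intact
```

With these inputs, the data with one damaged segment still matches the feature code (90% of segments pass), while the data with three damaged segments does not (70% of segments pass).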
Further, the server 100 further includes: the path identification module 103, configured to determine, according to the path over which the masking sound is transmitted, the identification terminal 120 that identified the masking sound.
Since the identification terminal 120 establishes a connection with the playback device 102, when the server 100 determines the identification terminal 120, it is equivalent to determining the playback device 102.
Therefore, the playing condition of each advertisement on a specific playing device 102 can be known, and the advertisement can be conveniently monitored point to point, so that subsequent advertisement pushing can be more targeted.
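The lookup from transmission path to playback device can be sketched as two table lookups. The tables, the use of a source address as the "path", and the identifiers below are hypothetical; the addresses are taken from a range reserved for documentation.

```python
# Hypothetical registration tables kept by the server (assumptions).
TERMINAL_BY_PATH = {
    "192.0.2.10": "terminal-1",
    "192.0.2.11": "terminal-2",
}
DEVICE_BY_TERMINAL = {
    "terminal-1": "living-room-tv",
    "terminal-2": "phone",
}

def resolve_playback_device(source_address):
    """Map the path of a report to (identification terminal, playback device)."""
    terminal = TERMINAL_BY_PATH.get(source_address)
    if terminal is None:
        return None  # report arrived over an unknown path
    return terminal, DEVICE_BY_TERMINAL.get(terminal)
```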
In other embodiments, the audio-visual advertisement embedded with the masking sound may be provided by an advertisement production terminal.
Specifically, the advertisement production terminal includes:
the advertisement providing module is used for providing audio and video advertisements embedded with masking sounds so as to enable the media platform to put the audio and video advertisements, wherein the identification terminal extracts audio signals from the played audio and video advertisements, identifies the masking sounds in the audio signals and transmits the masking sounds to the server, so that the server determines the playing conditions of the advertisements according to the identified masking sounds.
It should be understood that the number of digital playback devices 102, media platforms 101, and servers 100 in fig. 1 is merely illustrative. There may be any number of digital playback devices 102, media platforms 101, and servers 100, as desired for the implementation.
According to the advertisement identification system provided by the invention, a masking sound is embedded into the advertisement, the identification module identifies the masking sound corresponding to the played advertisement, and the identified masking sound is transmitted to the server. The server then analyzes the playing condition of the advertisement according to the masking sound, for example deriving the advertisement playing duration, the advertisement type, and the advertisement playing time period from the length, interval, and the like of the identified masking sound. This improves the efficiency and directness of advertisement identification, saves manpower and material resources, enables monitoring of advertisements, and establishes a complete advertisement monitoring system. The invention is suitable not only for televisions, public media equipment, and the like, but also for mobile video terminals such as mobile phones, and therefore has wide application.
FIG. 5 is a flow chart illustrating an advertisement recognition method according to an embodiment of the present invention. As shown in fig. 5, the specific process includes:
S11, providing the audio-video advertisement embedded with the masking sound.
First, the audio-video advertisement embedded with the masking sound is produced. For example, the embedding may operate on the audio-video advertisement data in the linear domain, or the masking sound may be embedded into compressed audio-video advertisement data, but the invention is not limited to this.
The manufactured audio and video advertisements embedded with the masking sounds are stored in the internal storage of the server, and the server calls the audio and video advertisements in the internal storage and transmits the audio and video advertisements to the media platform through the network.
The audio and video advertisement can also be stored in a memory outside the server, and the server transmits a calling instruction to the memory so as to call the audio and video advertisement and transmit the audio and video advertisement to the media platform according to the network address indicated by the instruction.
In the produced audio-video advertisement embedded with the masking sound, the masking sound is kept aligned with the audio signal on the time axis; that is, at each moment, the masking sound has a part corresponding to the audio signal at that moment.
Alternatively, for each preset time interval, the masking sound has a part corresponding to the audio signal at at least one moment within that interval.
Thus, it is possible to recognize information such as the length of time of playback by the length of time of the recognized masking sound.
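As an illustration of how the playing duration might be derived from the recognized masking sound, the sketch below groups detection timestamps into continuous runs. It assumes that the masking sound is detectable once per preset interval; the function name, the interval of 1 second, and the tolerance are assumptions for illustration.

```python
def playback_intervals(detection_times, interval=1.0, tolerance=0.5):
    """Group detection timestamps into continuous playback runs.

    Two detections belong to the same run if they are at most
    interval + tolerance apart. Each run is returned as
    (start_time, duration), where the duration covers the last interval too.
    """
    runs = []
    for t in sorted(detection_times):
        if runs and t - runs[-1][1] <= interval + tolerance:
            runs[-1][1] = t          # extend the current run
        else:
            runs.append([t, t])      # start a new run
    return [(start, end - start + interval) for start, end in runs]

# Masking sound detected every second from 0 s to 4 s, then from 20 s to 22 s.
detections = [0, 1, 2, 3, 4, 20, 21, 22]
runs = playback_intervals(detections)
```

Here the first run corresponds to 5 seconds of playback starting at 0 s and the second to 3 seconds starting at 20 s, so an interrupted playback shows up as a shorter run.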
S12, delivering the audio-video advertisement.
The media platform may deliver uniformly to all the playing devices. For example, audio-video advertisement A is delivered to all the playing devices in the network through a network channel in one time period, and audio-video advertisement B is delivered in another time period, so that any playing device locked to the network channel can receive the advertisement data and then play it in real time or record it.
The media platform may also perform targeted delivery for different playing devices, for example, playing a first type of advertisement for smart televisions and a second type of advertisement for mobile playing devices. Because users focus on different things when using a smart phone and a smart television, such delivery is more targeted. Of course, the present invention is not limited thereto.
S13, extracting the audio signal from the played audio-video advertisement.
S14, identifying the masking sound in the audio signal.
The identification module can extract the masking sound from the played audio signal through a masking sound decoding technology and transmit the masking sound to the server through the network.
The identification module may decode in real time and transmit to the server in real time, or the identification module may decode after playing for a certain time, which is not limited in the present invention.
S15, transmitting the identified masking sound to a server.
S16, determining the playing condition of the advertisement according to the identified masking sound.
Because the masking sound corresponds to the audio signal, after the recognition terminal transmits the recognized masking sound to the server, the server can determine the length (namely, the playing duration) of the audio-video advertisement played by the playing device according to this correspondence, determine the playing time according to the transmission time, determine the recognition terminal according to the transmission path, and further determine the playing device according to the correspondence between the recognition terminal and the playing device, so that the advertisement can be monitored. On this basis, a user preference database can be established (for example, binding the degree of preference for an advertisement to the device and evaluating that degree of preference according to the playing duration). A complete intelligent advertisement monitoring system is thus established, in which advertisement information is collected directly from the recognition of the advertisement without additional steps or links, saving manpower and material resources.
In this embodiment, the advertisement embedded with the masking sound is provided by the advertisement producing terminal, and in other embodiments, as described above, the advertisement embedded with the masking sound may be provided by the server.
The data extraction error rate is also an important technical index of a masking sound scheme. Interference from the physical space, attenuation and distortion of signals transmitted in the channel, and deliberate data transformations and attacks all increase the data extraction error rate. To reduce this error rate, feature codes can be set for comparison and correction.
According to the advertisement identification method provided by the invention, a masking sound is embedded into the advertisement, the identification terminal identifies the masking sound corresponding to the played advertisement, and the identified masking sound is transmitted to the server. The server then analyzes the playing condition of the advertisement according to the masking sound, for example deriving the advertisement playing duration, the advertisement type, and the advertisement playing time period from the length, interval, and the like of the identified masking sound. This improves the efficiency and directness of advertisement identification, saves manpower and material resources, enables monitoring of advertisements, and establishes a complete advertisement monitoring system. The invention is suitable not only for televisions, public media equipment, and the like, but also for mobile video terminals such as mobile phones, and therefore has wide application.
Therefore, in a preferred embodiment, a feature code may be set, and in conjunction with fig. 6, the advertisement identification method includes:
S21, providing the audio-video advertisement embedded with the masking sound.
S22, delivering the audio-video advertisement.
S23, providing a feature code containing the masking sound.
S24, extracting the audio signal from the played audio-video advertisement.
S25, identifying the masking sound in the audio signal according to the feature code.
S26, transmitting the identified masking sound to a server.
S27, determining the advertisement delivery condition according to the identified masking sound.
The feature code may be provided by a server or other terminals, but the invention is not limited thereto.
The feature code may be used for verification or auxiliary identification. The feature code may be masking sound data that has already been parsed, so that after parsing, the identification terminal compares the masking sound data it parsed with the feature code. If the two are consistent, or if the similarity reaches a certain threshold (e.g., 99%) and the proportion of the parsed masking sound data that reaches this similarity exceeds a certain ratio (for example, the part reaching 99% similarity makes up 80% of the parsed masking sound), the masking sound is considered to match the feature code.
Therefore, the feature code can be compared with the audio and video advertisement, so that the calculated amount of finding the masking sound from the audio signal is reduced on one hand, and on the other hand, the analyzed masking sound is more accurate through feature comparison.
Further, in the embodiments of fig. 5 and 6, the server may further determine an identification terminal for identifying the masking sound according to the path of the transmission of the masking sound.
Since the identification terminal establishes a connection with the playback device, when the server determines the identification terminal, it is equivalent to determining the playback device.
Therefore, the playing condition of each advertisement by specific playing equipment can be known, the advertisement can be conveniently monitored point to point, and the subsequent advertisement pushing can be more targeted.
Reference is now made to fig. 7, which shows a schematic structural diagram of a computer device suitable for implementing the terminal device or the server of the embodiments of the present application.
As shown in fig. 7, the computer device 700 includes a Central Processing Unit (CPU) 701, which can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM) 702 or a program loaded from a storage section 708 into a Random Access Memory (RAM) 703. The RAM 703 also stores various programs and data necessary for the operation of the computer device 700. The CPU 701, the ROM 702, and the RAM 703 are connected to each other via a bus 704. An input/output (I/O) interface 705 is also connected to the bus 704.
The following components are connected to the I/O interface 705: an input portion 706 including a keyboard, a mouse, and the like; an output section 707 including a display such as a Cathode Ray Tube (CRT) or a Liquid Crystal Display (LCD), and a speaker; a storage section 708 including a hard disk and the like; and a communication section 709 including a network interface card such as a LAN card, a modem, or the like. The communication section 709 performs communication processing via a network such as the Internet. A drive 710 is also connected to the I/O interface 705 as needed. A removable medium 711 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 710 as necessary, so that a computer program read out therefrom is installed in the storage section 708 as necessary.
In particular, according to an embodiment of the present invention, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the invention include a computer program product comprising a computer program tangibly embodied on a machine-readable medium, the computer program comprising program code for performing the method illustrated in the flow chart. In such an embodiment, the computer program can be downloaded and installed from a network through the communication section 709, and/or installed from the removable medium 711.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The principle and the implementation mode of the invention are explained by applying specific embodiments in the invention, and the description of the embodiments is only used for helping to understand the method and the core idea of the invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.