Movatterモバイル変換


[0]ホーム

URL:


CN101673262B - Method for searching audio content - Google Patents

Method for searching audio content
Download PDF

Info

Publication number
CN101673262B
CN101673262BCN200810042853ACN200810042853ACN101673262BCN 101673262 BCN101673262 BCN 101673262BCN 200810042853 ACN200810042853 ACN 200810042853ACN 200810042853 ACN200810042853 ACN 200810042853ACN 101673262 BCN101673262 BCN 101673262B
Authority
CN
China
Prior art keywords
audio
fingerprint
frequency fingerprint
index
search
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN200810042853A
Other languages
Chinese (zh)
Other versions
CN101673262A (en
Inventor
连惠城
程建章
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba China Co Ltd
Original Assignee
Chuanxian Network Technology Shanghai Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chuanxian Network Technology Shanghai Co LtdfiledCriticalChuanxian Network Technology Shanghai Co Ltd
Priority to CN200810042853ApriorityCriticalpatent/CN101673262B/en
Publication of CN101673262ApublicationCriticalpatent/CN101673262A/en
Application grantedgrantedCritical
Publication of CN101673262BpublicationCriticalpatent/CN101673262B/en
Expired - Fee Relatedlegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Images

Landscapes

Abstract

The invention discloses a method for searching an audio content, which comprises:1) an audio fingerprint extraction step, which is to extract audio fingerprints of a plurality of audio files; 2) an audio fingerprint segmentation step, which is to segment the audio fingerprints extracted by the step 1); 3) an index generation step, which is to generate an audio fingerprint index according to the result of the segmentation in the step 2); and 4) a searching step, which is to search a matched audio file by using the audio fingerprint index. In the method, a segmentation technique in a text search engine is used to perform segmentation processing of the audio fingerprint files first, then an index technique in the field of text search is used to perform the index processing of the audio fingerprints, and finally, after the index processing is completed, the search engine can search an audio segment input by a user. Thus, the method facilitates the search of the user and improves search efficiency.

Description

The searching method of audio content
Technical field
The present invention relates to the searching method of audio content.
Background technology
Along with Internet development, search engine becomes people's one of necessary tool of surfing the Net.Traditional search engine all is based on text search (Text Search), and being called is text search engine.Its principle is: search engine server is collected a large amount of webpages; And according to the text in the existing Rule Extraction webpage and do participle (Word Segmentation) and handle, common segmenting method, for example: based on the segmenting method of string matching, based on the segmenting method of understanding with based on the segmenting method of statistics; Text search engine utilizes the text dictionary to index and shows to be used for quick search.The user is input to server with text when searching for, server is searched for according to concordance list, then return results after the text is carried out word segmentation processing fast.
At present, search engine all is based on text, searches for even the search engine of some search pictures or audio frequency also is text messages such as title, explanation, introduction, label through picture or audio program.Search engine does not also have directly to search for through the signal content of audio frequency.
Audio-frequency fingerprint (audio fingerprinting) just is being suggested a long time ago; For example, Jaap Haitsma and TonKalke have delivered " a kind of audio fingerprint system of high reliability " (A Highly Robust AudioFingerprinting System) on music searching in 2002 makes progress international conference (Proceedings of International Conference on MusicInformation Retrieval).This system passes through method for processing signals; With the sound signal of (for example 11.6ms) at set intervals in the audio file; Be converted into the fingerprint (fingerprint) of one 32 bit (bit) size, an audio file just can be converted into a file fingerprint by this method.System just can carry out fast audio-frequency fingerprint and retrieve behind table that all audio-frequency fingerprint files are indexed.
Under audio-frequency fingerprint number of files less (for example 10,000 s') situation, can all file fingerprints be deposited in the calculator memory, carry out index after, can retrieve fast easily.Above-mentioned " a kind of audio fingerprint system of high reliability " promptly provided the detailed step of this method.Yet under actual conditions, the number of audio file will be considerably beyond 10,000 number.For example, the number of audio files that occurs on the internet at present surpasses 10,000,000 numbers, and quantity is in continuous growth.Therefore adopt this method to be difficult to make practical search engine.
Summary of the invention
In order to solve the problems of the technologies described above, the present invention provides a kind of searching method of audio content, and it is audio-frequency fingerprint search engine (audio fingerprint search engine) that this search engine is called.
The present invention adopts following technical scheme:
A kind of searching method of audio content comprises:
1) audio-frequency fingerprint extraction step extracts the audio-frequency fingerprint of a plurality of audio files;
2) audio-frequency fingerprint participle step, the audio-frequency fingerprint that step 1) is extracted carries out participle;
3) index generates step, according to step 2) word segmentation result generate the audio-frequency fingerprint index;
4) search step, the audio file that utilizes this audio-frequency fingerprint indexed search to mate.
Wherein, said step 4) specifically may further comprise the steps:
According to audio file or the audio file fragment that the needs of input are retrieved, extract its audio-frequency fingerprint, this audio-frequency fingerprint is carried out participle, according to the audio file of word segmentation result search matched in said audio-frequency fingerprint index.
Wherein, further comprising the steps of before the said step 4) after the said step 3):
Storing step, store audio fingerprints, said audio-frequency fingerprint index and corresponding audio files thereof.
The present invention is through adopting the participle technique in the text search engine; On the audio-frequency fingerprint file, carry out word segmentation processing; Adopt the index technology in the text search field that audio-frequency fingerprint is carried out index process then; After index process was accomplished, search engine can be searched for the audio fragment of user's input.Not only make things convenient for user's search, and improved the efficient of search.
Further specify the present invention below in conjunction with accompanying drawing and embodiment.
Description of drawings
Fig. 1 is the searching method embodiment schematic flow sheet of audio content of the present invention.
Embodiment
As shown in Figure 1, a kind of searching method of audio content comprises:
1) audio-frequency fingerprint extraction step extracts the audio-frequency fingerprint of a plurality of audio files;
2) audio-frequency fingerprint participle step, the audio-frequency fingerprint that step 1) is extracted carries out participle;
3) index generates step, according to step 2) word segmentation result generate the audio-frequency fingerprint index;
4) search step, the audio file that utilizes this audio-frequency fingerprint indexed search to mate.
Wherein, said step 4) specifically may further comprise the steps:
According to audio file or the audio file fragment that the needs of input are retrieved, extract its audio-frequency fingerprint, this audio-frequency fingerprint is carried out participle, according to the audio file of word segmentation result search matched in said audio-frequency fingerprint index.
Wherein, further comprising the steps of before the said step 4) after the said step 3):
Storing step, store audio fingerprints, said audio-frequency fingerprint index and corresponding audio files thereof.
Participle mode in the foregoing description can adopt multiple mode to realize, below enumerates several kinds of modes and explains respectively.
Mode one
Employing is carried out word segmentation processing based on the Statistic for Chinese segmenting method to audio-frequency fingerprint.At first with the file fingerprint of 15000 audio files by the method generation fixed width of above-mentioned Jaap Haitsma and Ton Kalke, its width can be 32 bits or 16 bits, and each file fingerprint that obtains on average is made up of the fingerprint of about 10000 fixed width.The data of each 32 bit or 16 bits are counted as a word in the Chinese.It is 15000 pieces " articles " that all 15000 file fingerprints that comprise " word " are taken as, and these " articles " then carry out participle as the language material of Chinese word segmentation.In statistic processes, the frequency of the combination of each " word " of adjacent co-occurrence in the audio frequency language material is added up.The combination that the co-occurrence frequency is high is considered to a speech, is called " fingerprint speech ".For example; The combination of the fingerprint of 7 continuous scale-of-two " 00000000000000000000000000000000 " that frequency is higher; With the combination of the fingerprint of 5 continuous scale-of-two " 11111111111111111111111111111111 " be the higher fingerprint combination of frequency by statistics, they are used as " fingerprint speech ".
Mode two
Adopting the fingerprint width is the audio-frequency fingerprint method for distilling of 16 bits.Specifically be to be that the fingerprint of 32 bits carries out the fingerprint that interval sampling obtains 16 bits with width in the mode one.Adopt identical with mode one word segmentation processing of carrying out audio-frequency fingerprint based on the Statistic for Chinese segmenting method then.

Claims (3)

CN200810042853A2008-09-122008-09-12Method for searching audio contentExpired - Fee RelatedCN101673262B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN200810042853ACN101673262B (en)2008-09-122008-09-12Method for searching audio content

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN200810042853ACN101673262B (en)2008-09-122008-09-12Method for searching audio content

Publications (2)

Publication NumberPublication Date
CN101673262A CN101673262A (en)2010-03-17
CN101673262Btrue CN101673262B (en)2012-10-10

Family

ID=42020491

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN200810042853AExpired - Fee RelatedCN101673262B (en)2008-09-122008-09-12Method for searching audio content

Country Status (1)

CountryLink
CN (1)CN101673262B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN103180847B (en)*2011-10-192016-03-02华为技术有限公司Music query method and apparatus
US8949872B2 (en)*2011-12-202015-02-03Yahoo! Inc.Audio fingerprint for content identification
CN103294696B (en)*2012-02-272018-01-19上海果壳电子有限公司Audio-video frequency content search method and system
CN102663112A (en)*2012-04-182012-09-12上海大学Music retrieval system based on mobile embedded device
CN103995890A (en)*2014-05-302014-08-20杭州智屏软件有限公司Method for updating and searching for data of real-time audio fingerprint search library
CN105825850B (en)*2016-04-292021-08-24腾讯科技(深圳)有限公司Audio processing method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN1592906A (en)*2000-07-312005-03-09沙扎姆娱乐有限公司 System and method for recognizing sound and music signals under strong noise and distortion
CN1655500A (en)*2004-02-112005-08-17微软公司Desynchronized fingerprinting method and system for digital multimedia data
CN1708758A (en)*2002-11-012005-12-14皇家飞利浦电子股份有限公司 Improved audio data fingerprint search
CN101014953A (en)*2003-09-232007-08-08音乐Ip公司Audio fingerprinting system and method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN1592906A (en)*2000-07-312005-03-09沙扎姆娱乐有限公司 System and method for recognizing sound and music signals under strong noise and distortion
CN1708758A (en)*2002-11-012005-12-14皇家飞利浦电子股份有限公司 Improved audio data fingerprint search
CN101014953A (en)*2003-09-232007-08-08音乐Ip公司Audio fingerprinting system and method
CN1655500A (en)*2004-02-112005-08-17微软公司Desynchronized fingerprinting method and system for digital multimedia data

Also Published As

Publication numberPublication date
CN101673262A (en)2010-03-17

Similar Documents

PublicationPublication DateTitle
CN101673266B (en)Method for searching audio and video contents
CN100405371C (en)Method and system for abstracting new word
CN101673262B (en)Method for searching audio content
CN102053991B (en)Method and system for multi-language document retrieval
CN101727500A (en)Text classification method of Chinese web page based on steam clustering
CN102411578A (en)Multimedia playing system and method
CN102682000A (en)Text clustering method, question-answering system applying same and search engine applying same
CN102789464A (en)Natural language processing method, device and system based on semanteme recognition
CN101673263B (en)Method for searching video content
CN108491512A (en)The method of abstracting and device of headline
CN102542061A (en)Intelligent product classification method
CN108399265A (en)Real-time hot news providing method based on search and device
CN102339294A (en)Searching method and system for preprocessing keywords
CN102375863A (en)Method and device for keyword extraction in geographic information field
CN105574004B (en)A kind of removing duplicate webpages method and apparatus
CN109101491B (en)Author information extraction method and device, computer device and computer readable storage medium
WO2015062377A1 (en)Device and method for detecting similar text, and application
CN101673267B (en)Method for searching audio and video content
CN103853771B (en)A kind of method for pushing and system of search result
CN105608137A (en)Method and device for extracting identity label
WO2015024429A1 (en)Method and device for acquiring movie and television subject from webpage
EP1965312A3 (en)Information processing apparatus and method, program, and storage medium
CN109740147A (en)A kind of big quantity personnel resume duplicate removal Match Analysis
CN107943937B (en)Debtor asset monitoring method and system based on judicial public information analysis
CN102929862B (en)New word acquiring method and system

Legal Events

DateCodeTitleDescription
C06Publication
PB01Publication
C10Entry into substantive examination
SE01Entry into force of request for substantive examination
C14Grant of patent or utility model
GR01Patent grant
ASSSuccession or assignment of patent right

Owner name:TRANSMISSION LINE NETWORK TECHNOLOGY (SHANGHAI) CO

Free format text:FORMER OWNER: WEIXU NETWORK TECHNOLOGY (SHANGHAI) CO., LTD.

Effective date:20140409

C41Transfer of patent application or patent right or utility model
CORChange of bibliographic data

Free format text:CORRECT: ADDRESS; FROM: 200003 HUANGPU, SHANGHAI TO: 200241 MINHANG, SHANGHAI

TR01Transfer of patent right

Effective date of registration:20140409

Address after:200241 Shanghai City, Dongchuan Road, No. 555, floor floor, room f, F, F, F, F, No. 02, Minhang District

Patentee after:WEIXU NETWORK TECHNOLOGY (SHANGHAI) CO., LTD.

Address before:200003 gate 1305, 6 South Suzhou Road, Shanghai

Patentee before:Weixu Network Technology (Shanghai) Co., Ltd.

TR01Transfer of patent right
TR01Transfer of patent right

Effective date of registration:20171227

Address after:100080 Beijing Haidian District city Haidian street A Sinosteel International Plaza No. 8 block 5 layer A, C

Patentee after:Youku network technology (Beijing) Co., Ltd.

Address before:200241 Shanghai City, Dongchuan Road, No. 555, floor floor, room f, F, F, F, F, No. 02, Minhang District

Patentee before:WEIXU NETWORK TECHNOLOGY (SHANGHAI) CO., LTD.

TR01Transfer of patent right
TR01Transfer of patent right

Effective date of registration:20200710

Address after:310052 room 508, floor 5, building 4, No. 699, Wangshang Road, Changhe street, Binjiang District, Hangzhou City, Zhejiang Province

Patentee after:Alibaba (China) Co.,Ltd.

Address before:100080 Beijing Haidian District city Haidian street A Sinosteel International Plaza No. 8 block 5 layer A, C

Patentee before:Youku network technology (Beijing) Co.,Ltd.

CF01Termination of patent right due to non-payment of annual fee
CF01Termination of patent right due to non-payment of annual fee

Granted publication date:20121010

Termination date:20200912


[8]ページ先頭

©2009-2025 Movatter.jp