Movatterモバイル変換


[0]ホーム

URL:


CN101582080A - Web image clustering method based on image and text relevant mining - Google Patents

Web image clustering method based on image and text relevant mining
Download PDF

Info

Publication number
CN101582080A
CN101582080ACNA2009101000718ACN200910100071ACN101582080ACN 101582080 ACN101582080 ACN 101582080ACN A2009101000718 ACNA2009101000718 ACN A2009101000718ACN 200910100071 ACN200910100071 ACN 200910100071ACN 101582080 ACN101582080 ACN 101582080A
Authority
CN
China
Prior art keywords
image
word
text
correlation
vocabulary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2009101000718A
Other languages
Chinese (zh)
Other versions
CN101582080B (en
Inventor
庄越挺
吴飞
韩亚洪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJUfiledCriticalZhejiang University ZJU
Priority to CN2009101000718ApriorityCriticalpatent/CN101582080B/en
Publication of CN101582080ApublicationCriticalpatent/CN101582080A/en
Application grantedgrantedCritical
Publication of CN101582080BpublicationCriticalpatent/CN101582080B/en
Expired - Fee Relatedlegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Images

Landscapes

Abstract

Translated fromChinese

本发明公开了一种基于图像和文本相关性挖掘的Web图像聚类方法。包括如下步骤:(1)根据查询提取Google图片搜索结果中的图像及其伴随文本;(2)提取伴随文本中名词构成词汇表;(3)计算词汇表中单词的可见度,并将其与TF-IDF方法集成以计算单词和图像相关性关联;(4)计算词汇表中任意两个单词间的主题相关度;(5)利用复杂图对相关性关联建模;(6)应用复杂图聚类算法对图像进行聚类。本发明将单词可见度与TF-IDF方法结合定义单词和图像的相关性关联,突破了TF-IDF方法作为一种文本处理技术不能直接度量单词和图像之间相关性的限制,通过复杂图对单词和图像以及单词和单词相关性关联建模提出了一种Web图像聚类框架,使得图像检索结果根据主题进行归类,方便用户进行检索。The invention discloses a Web image clustering method based on image and text correlation mining. It includes the following steps: (1) extract images and their accompanying texts in Google image search results according to the query; (2) extract nouns in the accompanying texts to form a vocabulary; (3) calculate the visibility of words in the vocabulary and compare them with TF -IDF method integration to calculate word and image correlation; (4) calculate topic correlation between any two words in the vocabulary; (5) use complex graph to model correlation; (6) apply complex graph aggregation Class algorithm to cluster images. The present invention combines the word visibility with the TF-IDF method to define the correlation between words and images, breaking through the limitation that the TF-IDF method as a text processing technology cannot directly measure the correlation between words and images. A web image clustering framework is proposed based on correlation modeling with images and words and words, so that the image retrieval results can be classified according to topics, which is convenient for users to retrieve.

Description

A kind of Web image clustering method based on image and text relevant mining
Technical field
The present invention relates to multimedia retrieval, relate in particular to a kind of Web image clustering method based on image and text relevant mining.
Background technology
On Web, use the keyword search image to remain retrieval method effectively commonly used, as the picture searching of commercial search engine Google and AltaVista.In the Web image retrieval, the key word that the user submits to is the vision polysemant often, and this class word comprises a plurality of different vision implications.For example " computer mouse ", " mouse animal " and a plurality of themes such as " Mickey mouse " can be represented in word " mouse ".Therefore, with these vision polysemant query image, the image searching result that is returned can comprise a plurality of themes, and the image blend of different themes together.This just need provide a kind of last handling process of retrieving to come the image of expressing different themes is sorted out.Recently, Many researchers has proposed the Web image clustering method and has solved this problem.Because have " semantic wide gap " between level image feature and the high-level semantic, these clustering methods have often utilized the multi-modal information such as vision, text and link that comprised by the cluster image collection simultaneously.The multi-modal information that belongs to the different characteristic space is to be mutually related, and excavating with utilizing these correlativitys related is an emphasis problem of recent machine learning research with the study of carrying out multi-modal information fusion, and representative work has various visual angles study and transfer learning.The former utilizes the various features space representation of same data to learn simultaneously, and latter's research and training data and test data have different distributions or belong to the problem concerning study in different characteristic space.It is related that the present invention excavates the correlativity of two kinds of modal informations of text and image, by graph model its incidence relation carried out modeling, and utilize the figure clustering algorithm that the Web image is carried out cluster.
The Web image follows text to coexist as among the html page with it usually, follows text and some text labels to describe the semantic content of image.In Web image retrieval and mark field, a lot of research and utilizations the correlativity association between image and the text.But, follow in the text various words that image, semantic is described the difference of contributing.For a plurality of words in the text, the word that has can find suitable image to vividly describe the implication of this word, for example " chairs "; The word that has is more abstract, then is difficult to find a suitable images to vividly describe the implication of this word, for example " statistics ".From the angle of thinking in images, this species diversity has reflected and has had different semantic associations between word and the image, reflects that also word has " visibility " attribute.So-called visibility is the probability that certain word can be visually perceived.As a kind of text-processing technology, TF-IDF can not directly measure the correlativity between word and the image, and tradition is weighed by the TF-IDF method and followed in the text word that the importance of image has been ignored the visual signature that image itself has to a certain extent.Therefore, the present invention proposes a kind of word visibility model, and with this model and TF-IDF method in conjunction with define a kind of new word and image correlation related.
On the other hand, for the Web image collection that comprises a plurality of themes, it follows implicit subject information in the text to reflect topic relativity between image indirectly.For this topic relativity is introduced the Web image clustering, the present invention utilizes implicit Di Li Cray to distribute and learns to obtain being distributed in the implicit theme probability on each word, by the degree of subject relativity function calculation word and the word topic relativity of definition.Latent dirichlet allocation model, be Latent Dirichlet Allocation, it is a kind of unsupervised learning model that can extract the implicit theme of text that proposes in recent years, as a kind of generating probability model, implicit Di Li Cray distribution is modeled in the set of a discrete data, as text data set.In the text representation field, it is typical case's representative of topic model that implicit Di Li Cray distributes, and can carry out modeling to the subject information that text data comprises.
Therefore, the present invention is by excavating image and following the correlativity association between the text to obtain two kinds of incidence relations: word is related with image correlation and word is related with the word topic relativity, and this cross correlation can carry out modeling with graph model.Traditional graph model can only carry out modeling to the link of the isomorphism between single type node and node.Bigraph (bipartite graph) can carry out modeling to two types of nodes, but this graph model only comprises the isomery link between the dissimilar nodes.Because two kinds of incidence relations that the present invention relates to had both comprised the isomery link between word and the image two class different node, comprise the isomorphism link between word and the word node of the same type again, therefore propose these two kinds of incidence relations to be carried out modeling, and use complicated figure clustering algorithm image is carried out cluster with more generally complicated graph model.
Summary of the invention
The objective of the invention is to make the same subject image gather into a class, retrieve, propose a kind of Web image clustering method based on image and text relevant mining to make things convenient for the user for the Web image searching result is carried out cluster.
Web image clustering method based on image and text relevant mining comprises the steps:
(1) extracts the image in the result for retrieval of Google picture searching and follow text according to user inquiring, extract the noun of following in the text and constitute vocabulary;
(2) to following text to carry out text-processing and extracting text feature;
(3) visibility of each word in the calculating vocabulary;
(4) visibility of word is integrated related to calculate word and image correlation with the TF-IDF method;
(5) according to topic model to following text collection analysis, extract implicit theme probability distribution to calculate the degree of subject relativity between any two words in the vocabulary;
(6) utilize complicated graph model and word related with image correlation is related with the word topic relativity to word and carry out modeling;
(7) use complicated figure clustering algorithm image is carried out cluster.
Image in the described result for retrieval that extracts the Google picture searching according to user inquiring and follow text, extracting the noun of following in the text, to constitute the step of vocabulary as follows:
(1) writes image in the result for retrieval that the reptile program downloads the Google picture searching, composing images set IMG={Image1..., ImageNd, N whereindIt is the total number of images among the set IMG;
(2) each image place webpage among the download images set IMG utilizes page analysis program that each webpage is resolved, and behind removal HTML mark and the punctuation mark, the content of text on the reservation page is as the text of following of image;
(3) text of following to each image carries out part-of-speech tagging, removes non-noun word, keeps the noun in the text, constitutes and follows text collection D={d1..., dNd, N whereindBe that set is followed the text sum among the D;
(4) sequential scanning follows among the text collection D each to follow text diIn all words, i=1 wherein ..., Nd, each various words keeps one, forms the vocabulary VOL={w that word list is representedi..., wNw, N whereinwIt is the total words among the vocabulary VOL.
Described to following text to carry out text-processing and to extract the step of text feature as follows:
(1) to each the word w among the vocabulary VOLi, i=1 wherein ..., Nw, NwBe total words in the vocabulary, sequential scanning follows among the text collection D each to follow text dj, add up each word wiAt each document djThe middle frequency n that occursIj, j=1 wherein ..., Nd, NdBe to follow the text sum, and comprise word w among the statistics set DiFollow text number n um (wi);
(2) calculate each word w according to formula (1)iFollow text d at eachjIn word frequency freq (wi, dj), i=1 wherein ..., Nw, NwBe total words in the vocabulary, j=1 ..., Nd, NdBe to follow the text sum among the set D;
freq(wi,dj)=nij/Σk=1Nwnkj.---(1)
(3) to each the word w among the vocabulary VOLi, calculate its contrary document word frequency idf (w according to formula (2)i);
idf(wi)=log(Nd/num(wi)). (2)
(4), will gather that each follows text d among the D according to vector space modeljBe expressed as NwDimensional vector, i are tieed up the word w in the corresponding vocabularyi, its value is tfidf (wi), computing formula is as follows:
tfidf(wi)=freq(wi,dj)×idf(wi). (3)。
The method of the visibility of each word is in the described calculating vocabulary: each word w among the vocabulary VOLiVisual scale value vis (wi) calculate by formula (4);
vis(wi)=((C1+10-9)/(C2+10-9))-IDFGoogle(wi).---(4)
Wherein, C1Be with word wiSubmit to the result for retrieval sum that the Google picture searching returns, C as inquiry2Be with word wiSubmit to the result for retrieval sum that the Google text search returns as inquiry; Exponential factor IDFGoogle(wi) computing formula as follows:
IDFGoogle(wi)=log(|DGoogle|/C2). (5)
Wherein, DGoogleBe all Web page set of Google index, | DGoogle| expression set DGoogleIn page sum.
Described visibility with word is integrated with the TF-IDF method to calculate the related method of word and image correlation to be: word wiWith image I magejThe related r (w of correlativityi, Imagei) calculate by formula (6), j=1 wherein ..., Nd, NdBe to follow the text sum;
r(wi,Imagej)=tfidf(wi)×vis(wi). (6)。
Described according to topic model to following text collection analysis, it is as follows with the step of calculating the degree of subject relativity between any two words in the vocabulary to extract implicit theme probability distribution:
(1) with vocabulary VOL, follow text collection D and the implicit number of topics k of set among the D as the input that the implicit Di Li Cray of topic model distributes, export each implicit theme zjProbability distribution P (zj) and zjAt each word wiOn probability distribution P (wi| zj), j=1 wherein ..., k;
(2) any two word w among the set VOLsAnd wtBetween degree of subject relativity Topic_r (ws, wt) by the defined degree of subject relativity function calculation of formula (7), wherein σ is a normaliztion constant,
Topic_r(ws,wt)=maxjP(z=j|ws)P(z=j|wt)
=maxjp(ws|z=j)P(z=j)P(ws)·p(wt|z=j)P(z=j)P(wt)---(7).
=maxjp(ws|z=j)p(wt|z=j)P(z=j)σ.
And word with word topic relativity related method of carrying out modeling related with image correlation is the complicated graph model of described utilization to word: complicated graph model comprises image node and two kinds of dissimilar nodes of word node, the link of isomery link between word and image and the isomorphism between word and word is as the limit between node, and word and image links weight are by the related r (w with image correlation of the defined word of formula (6)i, Imagei) calculate, word and word link weight are the word and the word degree of subject relativity function T opic_r (w of formula (7) definitions, wt) calculate, complicated graph model is expressed as set of matrices as shown in Equation (8);
{S∈R+Nw×Nw,A∈R+Nd×Nd}. (8)
Wherein, symmetric matrixS∈R+Nw×NwExpression word and word correlation matrix, NwBe total words in the vocabulary, R+Be the arithmetic number set, matrix element SIj(the expression of i ≠ j) word wiAnd wjBetween degree of subject relativity, SIj=Topic_r (wi, wj) matrixA∈R+Nw×NdExpression word and image correlation matrix, NdBe total number of images, matrix element AIjExpression word wiWith j image I magejBetween the correlativity association, AIj=tfidf (wi) vis (wi).
The complicated figure clustering algorithm of described application can be expressed as the defined optimization problem of formula (9) the method that image carries out cluster;
minC(1),C(2),D,B||S-C(1)D(C(1))T||2+||A-C(1)B(C(2))T||2s.t.C(1)∈{0,1}Nw×k1,C(2)∈{0,1}Nd×k2,C(1)1=1,C(2)1=1.---(9)
Wherein, each component ofvector 1 all is 1, k1And k2The cluster number of representing word and image respectively, generic oriental matrix C(1)And C(2)Be the output of complicated figure clustering algorithm, matrix element CPq(2)Represent p image I magepBelong to the q class, the complicated figure clustering algorithm that the defined optimization problem of formula (9) is found the solution is shown in algorithm 1:
Algorithm 1. complicated figure G1Clustering algorithm CGC.
Input: matrix S and A;
Output: generic oriental matrix C(1)And C(2), k1And k2It is respectively the cluster number of word and image;
2-5 is up to convergence forstep 1. iteration step;
Step 2. is calculated D=((C(1))TC(1))-1(C(1))TSC(1)(C(1))TC(1))-1
Step 3. is calculated B=((C(1))TC(1))-1(C(1))TAC(2)(C(2))TC(2))-1
Step 4. is D fixedly, B and C(2), upgrade C line by line(1), make to minimize L that L is calculated as follows:
L=||S-C(1)D(C(1))T||2+||A-C(1)B(C(2))T||2
Step 5. is D fixedly, B and C(1), upgrade C line by line(2), make to minimize L that L is calculated as follows:
L=||S-C(1)D(C(1))T||2+||A-C(1)B(C(2))T||2.
Generic oriental matrix C according toalgorithm 1 output(2)The method of image collection IMG being carried out cluster is, if matrix elementCpq(2)=1Then p image I magepBe classified as the q class, p=1 wherein ..., Nd, NdTotal number of images among the expression set IMG, q=1 ..., k2, k2The cluster number of image among the expression IMG.
The useful effect that the present invention has is: the present invention is related in conjunction with the correlativity of definition word and image with traditional TF-IDF method with word visibility model, has broken through TF-IDF method a kind of text-processing technology of writing a composition and can not directly measure the restriction of correlativity between word and the image; And word related with image correlation is related with the word topic relativity to word carries out modeling and proposed a kind of new Web image clustering framework by complicated figure, improved Web image clustering precision, make image searching result sort out, make things convenient for the user to retrieve according to theme.
Description of drawings
Fig. 1 is based on the committed step workflow diagram of the Web image clustering method of image and text relevant mining, wherein (a) is according to the parts of images that extracts from Goole picture search return results of inquiry " bass " and follows text accordingly, (b) be complicated graph model example, solid line represents word related with image correlation, dotted line represents word related with the word topic relativity, (c) be the cluster result of output, treatment step (1) is to after following text to carry out text-processing and extracting text feature, it is related with the correlativity between image to excavate text, the word that obtains and image and word and word are closed the complicated figure of couplings for two kinds carry out modeling, treatment step (2) is to utilize complicated figure clustering algorithm that complicated figure shown in Fig. 1 (b) is carried out cluster;
Fig. 2 is based in the Web image clustering method of image and text relevant mining the Web image and follows the text synoptic diagram, and italic is represented noun among the figure;
Fig. 3 is that Fig. 2 follows noun visibility result of calculation synoptic diagram in the text;
Fig. 4 is the mutual information comparison diagram to the complicated figure cluster result of 5 query cases;
Fig. 5 (a) is inquiry jaguar synoptic diagram of preceding 5 images among three theme class " jaguar car ", " the jaguar animal " and " jaguar car " in the complicated figure cluster result under not introducing the visibility situation, and the image of red dotted border is the cluster item of mistake among the figure;
Fig. 5 (b) is inquiry jaguar synoptic diagram of preceding 5 images among three theme class " jaguar car ", " the jaguar animal " and " jaguar car " in the complicated figure cluster result after introducing visibility, and the image of red dotted border is the cluster item of mistake among the figure;
Fig. 6 is the synoptic diagram of inquiry mouse by preceding 10 images among three theme class " computermouse ", " the mouse animal " and " Mickey mouse " in the clustering method cluster result of the present invention, and the image of red dotted border is wrong cluster item among the figure.
Embodiment
The present invention proposes a kind of Web image clustering method based on image and text relevant mining, and in conjunction with the accompanying drawings, its enforcement is described in detail as follows.
Web image clustering method based on image and text relevant mining comprises the steps:
(1) extracts the image in the result for retrieval of Google picture searching and follow text according to user inquiring, extract the noun of following in the text and constitute vocabulary;
(2) to following text to carry out text-processing and extracting text feature;
(3) visibility of each word in the calculating vocabulary;
(4) visibility of word is integrated related to calculate word and image correlation with the TF-IDF method;
(5) according to topic model to following text collection analysis, extract implicit theme probability distribution to calculate the degree of subject relativity between any two words in the vocabulary;
(6) utilize complicated graph model and word related with image correlation is related with the word topic relativity to word and carry out modeling;
(7) use complicated figure clustering algorithm image is carried out cluster.
Image in the described result for retrieval that extracts the Google picture searching according to user inquiring and follow text, extracting the noun of following in the text, to constitute the step of vocabulary as follows:
(1) writes image in the result for retrieval that the reptile program downloads the Google picture searching, composing images set IMG={Image1..., ImageNd, N whereindIt is the total number of images among the set IMG;
(2) each image place webpage among the download images set IMG utilizes page analysis program that each webpage is resolved, and behind removal HTML mark and the punctuation mark, the content of text on the reservation page is as the text of following of image;
(3) text of following to each image carries out part-of-speech tagging, removes non-noun word, keeps the noun in the text, constitutes and follows text collection D={d1..., dNd, N whereindBe that set is followed the text sum among the D;
(4) sequential scanning follows among the text collection D each to follow text diIn all words, i=1 wherein ..., Nd, each various words keeps one, forms the vocabulary VOL={w that word list is representedi..., wNw, N whereinwIt is the total words among the vocabulary VOL.
Described to following text to carry out text-processing and to extract the step of text feature as follows:
(1) to each the word w among the vocabulary VOLi, i=1 wherein ..., Nw, NwBe total words in the vocabulary, sequential scanning follows among the text collection D each to follow text dj, add up each word wiAt each document djThe middle frequency n that occursIj, j=1 wherein ..., Nd, NdBe to follow the text sum, and comprise word w among the statistics set DiFollow text number n um (wi);
(2) calculate each word w according to formula (1)iFollow text d at eachjIn word frequency freq (wi, dj), i=1 wherein ..., Nw, NwBe total words in the vocabulary, j=1 ..., Nd, NdBe to follow the text sum among the set D;
freq(wi,dj)=nij/Σk=1Nwnkj.---(1)
(3) to each the word w among the vocabulary VOLi, calculate its contrary document word frequency idf (w according to formula (2)i);
idf(wi)=log(Nd/num(wi)). (2)
(4), will gather that each follows text d among the D according to vector space modeljBe expressed as NwDimensional vector, i are tieed up the word w in the corresponding vocabularyi, its value is tfidf (wi), computing formula is as follows:
tfidf(wi)=freq(wi,dj)×idf(wi). (3)。
The method of the visibility of each word is in the described calculating vocabulary: each word w among the vocabulary VOLiVisual scale value vis (wi) calculate by formula (4);
vis(wi)=((C1+10-9)/(C2+10-9))-IDFGoogle(wi).---(4)
Wherein, C1Be with word wiSubmit to the result for retrieval sum that the Google picture searching returns, C as inquiry2Be with word wiSubmit to the result for retrieval sum that the Google text search returns as inquiry; Exponential factor IDFGoogle(wi) computing formula as follows:
IDFGoogle(wi)=log(|DGoogle|/C2). (5)
Wherein, DGoogleBe all Web page set of Google index, | DGoogle| expression set DGoogleIn page sum.
Described visibility with word is integrated with the TF-IDF method to calculate the related method of word and image correlation to be: word wiWith image I magejThe related r (w of correlativityi, Imagei) calculate by formula (6), j=1 wherein ..., Nd, NdBe to follow the text sum;
r(wi,Imagej)=tfidf(wi)×vis(wi). (6)。
Described according to topic model to following text collection analysis, it is as follows with the step of calculating the degree of subject relativity between any two words in the vocabulary to extract implicit theme probability distribution:
(1) with vocabulary VOL, follow text collection D and the implicit number of topics k of set among the D as the input that the implicit Di Li Cray of topic model distributes, export each implicit theme zjProbability distribution P (zj) and zjAt each word wiOn probability distribution P (wi| zj), j=1 wherein ..., k;
(2) any two word w among the set VOLsAnd wtBetween degree of subject relativity Topic_r (ws, wt) by the defined degree of subject relativity function calculation of formula (7), wherein σ is a normaliztion constant,
Topic_r(ws,wt)=maxjP(z=j|ws)P(z=j|wt)
=maxjp(ws|z=j)P(z=j)P(ws)·p(wt|z=j)P(z=j)P(wt)---(7).
=maxjp(ws|z=j)p(wt|z=j)P(z=j)σ.
And word with word topic relativity related method of carrying out modeling related with image correlation is the complicated graph model of described utilization to word: complicated graph model comprises image node and two kinds of dissimilar nodes of word node, the link of isomery link between word and image and the isomorphism between word and word is as the limit between node, and word and image links weight are by the related r (w with image correlation of the defined word of formula (6)i, Imagei) calculate, word and word link weight are the word and the word degree of subject relativity function T opic_r (w of formula (7) definitions, wt) calculate, complicated graph model is expressed as set of matrices as shown in Equation (8);
{S∈R+Nw×Nw,A∈R+Nd×Nd}.(8)
Wherein, symmetric matrixS∈R+Nw×NwExpression word and word correlation matrix, NwBe total words in the vocabulary, R+Be the arithmetic number set, matrix element SIj(the expression of i ≠ j) word wiAnd wjBetween degree of subject relativity, SIj=Topic_r (wi, wj) matrixA∈R+Nw×NdExpression word and image correlation matrix, NdBe total number of images, matrix element AIjExpression word wiWith j image I magejBetween the correlativity association, AIj=tfidf (wi) vis (wi).
The complicated figure clustering algorithm of described application can be expressed as the defined optimization problem of formula (9) the method that image carries out cluster;
minC(1),C(2),D,B||S-C(1)D(C(1))T||2+||A-C(1)B(C(2))T||2s.t.C(1)∈{0,1}Nw×k1,C(2)∈{0,1}Nd×k2,C(1)1=1,C(2)1=1.---(9)
Wherein, each component ofvector 1 all is 1, k1And k2The cluster number of representing word and image respectively, generic oriental matrix C(1)And C(2)Be the output of complicated figure clustering algorithm, matrix element CPq(2)Represent p image I magepBelong to the q class, the complicated figure clustering algorithm that the defined optimization problem of formula (9) is found the solution is shown in algorithm 1:
Algorithm 1. complicated figure G1Clustering algorithm CGC.
Input: matrix S and A;
Output: generic oriental matrix C(1)And C(2), k1And k2It is respectively the cluster number of word and image;
2-5 is up to convergence forstep 1. iteration step;
Step 2. is calculated D=((C(1))TC(1))-1(C(1))TSC(1)(C(1))TC(1))-1
Step 3. is calculated B=((C(1))TC(1))-1(C(1))TAC(2)(C(2))TC(2))-1
Step 4. is D fixedly, B and C(2), upgrade C line by line(1), make to minimize L that L is calculated as follows:
L=||S-C(1)D(C(1))T||2+||A-C(1)B(C(2))T||2
Step 5. is D fixedly, B and C(1), upgrade C line by line(2), make to minimize L that L is calculated as follows:
L=||S-C(1)D(C(1))T||2+||A-C(1)B(C(2))T||2.
Generic oriental matrix C according toalgorithm 1 output(2)The method of image collection IMG being carried out cluster is, if matrix elementCpq(2)=1Then p image I magepBe classified as the q class, p=1 wherein ..., Nd, NdTotal number of images among the expression set IMG, q=1 ..., k2, k2The cluster number of image among the expression IMG.
Embodiment
Selected 5 vision polysemants as inquiry, they are: " apple ", " bass ", " jaguar ", " mouse " and " tower ".Write the reptile program, extracted Goolge ImageSearch automatically as inquiry according to the key word of submitting toTMReturn results.To each image in the return results, downloaded the Web page at image file and this image place.Because Google has limited and searched for the actual quantity of returning as a result, data set comprises about 4000 data item altogether.In order to extract the text of following of image, the Web page at image place is resolved, extract the follow text of the text of image word on every side as this image.All texts of following pass through part-of-speech tagging, extract noun wherein.Inquiring about its noun vocabulary scale of following text for each is 1000~2000 words.In order to obtain benchmark generic listing vector, we mark the manual image category that data are concentrated.
The workflow diagram of committed step of the present invention is an example with submit queries " bass " as shown in Figure 1, and concrete implementation step is:
1. write all images and image place webpage in the result for retrieval that the reptile program downloads the Google picture searching, by page resolver each html page is resolved, remove HTML mark and punctuation mark, obtain the image collection IMG={Image shown in Fig. 1 (a)1..., ImageNdAnd follow text collection D={d1..., dNd, NdBeing to follow the text sum, also is total number of images simultaneously;
2. utilize the part-of-speech tagging program that each is followed text diCarry out part-of-speech tagging, i=1 wherein ..., Nd, remove the non-noun word in the text, keep the noun in the text;
3. sequential scanning follows among the text collection D each to follow text diIn all words, each various words keeps one, forms the vocabulary VOL={w that word list is representedi..., wNw, N whereinwBe the total words among the vocabulary VOL, to each the word w among the vocabulary VOLiAdd up each word wiAt each document djThe middle frequency n that occursIj, and comprise word w among the set DiFollow text number n um (wi);
4. each is followed text dj(j=1 ..., Nd) extract its text feature, concrete steps are:
(1) to each word w among the vocabulary VOLi, i=1 wherein ..., Nw, NwBe total words in the vocabulary, calculate wiFollowing text djIn word frequencyfreq(wi,dj)=nij/Σk=1Nwnkj;
(2) to each word w among the vocabulary VOLi, calculate wiContrary document word frequency idf (wi)=log (Nd/ num (wi));
(3) according to vector space model, with document djBe expressed as NwDimensional vector:dj=(tfidf(w1),...,tfidf(wNw)),I ties up the word w in the corresponding vocabularyi, its value is tfidf (wi)=freq (wi, dj) * idf (wi);
5. to each word w among the vocabulary VOLiCalculate its visibilityvis(wi)=((C1+10-9)/(C2+10-9))-IDFGoogle(wi),Wherein, C1Be with word wiSubmit to the result for retrieval sum that the Google picture searching returns, C as inquiry2Be with word wiSubmit to the result for retrieval sum that the Google text search returns as inquiry; Exponential factor IDFGoogle(wi) computing formula as follows:
IDFGoogle(wi)=log(|DGoogle|/C2)
Wherein, DGoogleBe all Web page set of Google index, | DGoogle| expression set DGoogleIn page sum, in the present embodiment | DGoogle|=5 * 1011
The visibility of word has embodied word, noun especially, contain the degree that semantic usable image is described.From the angle of cognitive psychology and thinking in images, the word of high-visibility, as " banana ", than the word of low visibility, as " Bayesian ", the easier direct vision image that in human brain, forms.Can be used for expressing the semantic association between word and the image with visibility as a kind of new attribute of word.In the Web page, each word has visibility in various degree around the image, and the high-visibility word has stronger descriptive power to the semanteme of image.With C1/ C2Value can be weighed the visibility of various words, for example C of word " banana " to a certain extent as quantizating index1/ C2Value is greater than " Bayesian ".With Fig. 2 is example, this image be this image be with key word " bass " as inquiry, among preceding 5 results that return by the Google image search engine one.Follow noun C in the text1And C2Be worth to retrieve and obtain in May, 2009 from Google, as shown in table 1.As shown in Figure 3, the C of speech such as " legend ", " record ", " scale "1/ C2Value greater than " largemouth " and " fishermen ".But according to the visibility definition, because " largemouth " and " fishermen " is two main objects in this width of cloth image, they should have more high-visibility.Cause this result's reason to be, the more wide in range words of theme such as " record " appears on the Web page in large quantities, also appears at following in the text of image simultaneously in large quantities, thereby has improved their C1/ C2Value.The C of the wide in range word of theme2Be worth often very big, therefore proposed by the invention visibility model utilization " the contrary document word frequency factor " IDFGoogle(wi)=log (| DGoogle|/C2) come its visibility is suppressed, | DGoogle| be all Web page sums of Google index.The vis of noun (w) is worth as shown in Figure 3 among Fig. 3, and the vis (w) of " largemouth " and " fishermen " is worth maximum, and visible the present invention puies forward the rationality of visibility model.
Table 1
Figure A20091010007100161
6. calculate each word w among the VOLiWith image I magejThe related r (w of correlativityi, Imagej)=tfidf (wi) * vis (wi); Structure word and image correlation matrixA∈R+Nw×Nd,Matrix element AIjExpression word wiWith j image I magejBetween the degree of correlation, AIj=r (wi, Imagej).
7. to any two word w among the vocabulary VOLsAnd wtCalculate its degree of subject relativity, and structure word and word correlation matrix, concrete steps are as follows:
(1) with vocabulary VOL, follow text collection D and implicit number of topics k as the input that the implicit Di Li Cray of topic model distributes, export each implicit theme zj(j=1 ..., probability distribution P (z k)j) and zjAt each word wiOn probability distribution P (wi| zj);
(2) any two word wsAnd wtBetween degree of subject relativity Topic_r (ws, wt) be calculated as follows, σ is a normaliztion constant.
Topic_r(ws,wt)=maxjP(z=j|ws)P(z=j|wt)
=maxjp(ws|z=j)P(z=j)P(ws)·p(wt|z=j)P(z=j)P(wt)
=maxjp(ws|z=j)p(wt|z=j)P(z=j)σ.
(3) structure word and word correlation matrix are symmetric matrixS∈R+Nw×Nw,Matrix element SIj(the expression of i ≠ j) word wiAnd wjBetween degree of subject relativity, SIj=Topic_r (wi, wj).
8. obtain complicated graph model shown in Fig. 1 (b) through above step, this complexity graph model can be expressed as set of matricesS∈R+Nw×Nw,A∈R+Nd×Nd. using complicated figure clustering algorithm can carry out cluster to image collection IMG, and complicated figure clustering algorithm is expressed as optimization problem;
minC(1),C(2),D,B||S-C(1)D(C(1))T||2+||A-C(1)B(C(2))T||2s.t.C(1)∈{0,1}Nw×k1,C(2)∈{0,1}Nd×k2,C(1)1=1,C(2)1=1.
Wherein, each component ofvector 1 all is 1, k1And k2The cluster number of representing word and image respectively, generic oriental matrix C(1)And C(2)Be the output of complicated figure clustering algorithm, matrix element CPq(2)Represent p image I magepBelong to the q class, the concrete steps of complicated figure clustering algorithm are shown in algorithm 1:
Algorithm 1. complicated figure G1Clustering algorithm CGC.
Input: matrix S and A;
Output: generic oriental matrix C(1)And C(2), k1And k2It is respectively the cluster number of word and image;
2-5 is up to convergence forstep 1. iteration step;
Step 2. is calculated D=((C(1))TC(1))-1(C(1))TSC(1)(C(1))TC(1))-1
Step 3. is calculated B=((C(1))TC(1))-1(C(1))TAC(2)(C(2))TC(2))-1
Step 4. is D fixedly, B and C(2), upgrade C line by line(1), make to minimize L that L is calculated as follows:
L=||S-C(1)D(C(1))T||2+||A-C(1)B(C(2))T||2
Step 5. is D fixedly, B and C(1), upgrade C line by line(2), make to minimize L that L is calculated as follows:
L=||S-C(1)D(C(1))T||2+||A-C(1)B(C(2))T||2.
Generic oriental matrix C according toalgorithm 1 output(2)The method of image collection IMG being carried out cluster is, if matrix elementCpq(2)=1Then p image I magepBe classified as the q class, p=1 wherein ..., Nd, NdTotal number of images among the expression set IMG, q=1 ..., k2, k2The cluster number of image among the expression IMG.
As shown in Figure 1, obtain cluster result through step (2), image is classified as 3 theme class in this example, is respectively " bass fishing ", " bass fish " and " bass guitar ".
For the validity that shows core content of the present invention and the overall performance of cluster framework, we carry out following cluster result contrast:
(1) distinguishes r (wi, Imagej)=tfidf (wi) and r (wi, Imagej)=tfidf (wi) * vis (wi) two kinds of situations carry out cluster;
(2) with the related Topic_r (w of the topic relativity between words, wt) with the related P (w of word symbiosis correlativitys, wt) contrast, image follow in the text any two word wsAnd wtThe symbiosis correlativity be defined as the probability P (w in the text of following that they appear at certain image simultaneouslys, wt)=num (ws, wt)/Nd, num (ws, wt) be that it is followed and comprises word w in the text simultaneouslysAnd wtThe number of image.Related with the symbiosis correlativity in conjunction with topic relativity, word and word isomorphism link weight are defined as: λ p (ws, wt)+(1-λ) Topic_r (ws, wt), wherein λ (0<λ<1) is an adjustable parameter.
The cluster Performance evaluation criterion adopts normalized cluster mutual information, i.e. Normalized MutualInformation.Normalized cluster mutual information is defined as: given cluster number k, generic listing vector λ=(λ1..., λK) middle λiSpan be λi=1 ... k, λi=j represents that i data item belongs to CjClass.Use λ(a)And λ(b)Respectively ecbatic with benchmark generic listing vector, then λ(a)And λ(b)Normalization cluster mutual information φ(NMI)Be defined as:
φ(NMI)(λ(a),λ(b))=Σh-1kΣl-1knhllog(n·nhlnh(a)nl(b))(Σh-1knh(a)lognh(a)n)(Σl-1knl(b)lognl(b)n).
Wherein, nh(a)Be corresponding to λ(a)Class ChIn the data item number, nl(b)Be corresponding to λ(b)Class ClIn the data item number.CHlExpression is gathered at λ simultaneously(a)Class ChIn and λ(b)Class ClIn the number of data item.The λ of certain cluster result(a)With benchmark generic λ(b)Between mutual information value φ(NMI)(a), λ(b)) big more, represent that this cluster effect is good more.Desirable cluster is φ(NMI)(a), λ(b))=1.
For parameter lambda, consider three kinds of situations:
1)λ=1;
2)λ=0;
3)λ=0.15;
As shown in Figure 4: the NMI value for all 5 the complicated figure clusters of inquiry all o'clock reaches best in λ=0, so can show the rationality of word proposed by the invention and word degree of subject relativity.
As shown in Figure 4: " λ=0 (vis (w)) " expression word and image links weight adopt AIj=tfidf (wi) * vis (wi).The result can see by the cluster mutual information, in complicated figure cluster, the visibility of word is introduced word make the high-visibility word to the related with it more topic relativity information of image node transmission with the image links weight, has improved the cluster performance.
With shown in Figure 5 be example to inquiry " jaguar " retrieving images cluster result, contrast (a) and (b) figure can see to such an extent that strengthen under some word of describing the image special object and the image links weight situation introducing visibility, the cluster performance improves.
Be to adopt the Web image clustering method the present invention is based on image and text relevant mining that inquiry mouse is submitted to Google picture searching institute return results to carry out preceding 10 images in three themes of cluster gained as shown in Figure 6, first row are theme " computer mouse ", secondary series is theme " mouse animal ", and the 3rd row are theme " Mickey mouse "; The image of red dotted border is wrong cluster item.

Claims (8)

Translated fromChinese
1.一种基于图像和文本相关性挖掘的Web图像聚类方法,其特征在于包括如下步骤:1. a Web image clustering method based on image and text correlation mining, is characterized in that comprising the steps:(1)根据用户查询提取Google图片搜索的检索结果中的图像及其伴随文本,提取伴随文本中的名词构成词汇表;(1) Extract images and accompanying texts in the retrieval results of Google Image Search according to user queries, and extract nouns in the accompanying texts to form a vocabulary;(2)对伴随文本进行文本处理并提取文本特征;(2) Perform text processing on the accompanying text and extract text features;(3)计算词汇表中每个单词的可见度;(3) Calculate the visibility of each word in the vocabulary;(4)将单词的可见度与TF-IDF方法集成以计算单词和图像相关性关联;(4) Integrate word visibility with TF-IDF method to calculate word and image correlation;(5)根据主题模型对伴随文本集合进行分析,提取隐含主题概率分布以计算词汇表中任意两个单词间的主题相关度;(5) Analyze the accompanying text set according to the topic model, and extract the hidden topic probability distribution to calculate the topic correlation between any two words in the vocabulary;(6)利用复杂图模型对单词和图像相关性关联以及单词和单词主题相关性关联进行建模;(6) Modeling word and image correlation associations and word and word topic correlation associations using a complex graph model;(7)应用复杂图聚类算法对图像进行聚类。(7) Apply complex graph clustering algorithm to cluster images.2.根据权利要求1所述的一种基于图像和文本相关性挖掘的Web图像聚类方法,其特征在于所述的根据用户查询提取Google图片搜索的检索结果中的图像及其伴随文本,提取伴随文本中的名词构成词汇表的步骤如下:2. a kind of Web image clustering method based on image and text correlation mining according to claim 1 is characterized in that described according to user query extracts the image and its accompanying text in the retrieval result of Google picture search, extracts The steps to form a vocabulary with nouns in the accompanying text are as follows:(1)编写爬虫程序下载Google图片搜索的检索结果中的图像,构成图像集合IMG={Image1,...,ImageNd},其中Nd是集合IMG中的图像总数;(1) write crawler program to download the image in the retrieval result of Google image search, constitute image collection IMG={Image1 ,...,ImageNd }, wherein Nd is the total number of images in the collection IMG;(2)下载图像集合IMG中每个图像所在网页,利用页面解析程序对每个网页进行解析,去除HTML标记和标点符号后,保留页面上的文本内容作为图像的伴随文本;(2) download the webpage where each image is located in the image collection IMG, utilize the page analysis program to analyze each webpage, after removing HTML tags and punctuation marks, keep the text content on the page as the accompanying text of the image;(3)对每个图像的伴随文本进行词性标注,去除非名词单词,保留文本中的名词,构成伴随文本集合D={d1,...,dNd},其中Nd是集合D中的伴随文本总数;(3) Perform part-of-speech tagging on the accompanying text of each image, remove non-noun words, retain nouns in the text, and form an accompanying text set D={d1 ,...,dNd }, where Nd is the The total number of accompanying texts for ;(4)顺序扫描伴随文本集合D中的每个伴随文本di中的所有单词,其中i=1,…,Nd,每个不同单词保留一个,形成单词列表表示的词汇表VOL={wi,…,wNw},其中Nw是词汇表VOL中的单词总数。(4) Sequentially scan all the words in each accompanying text di in the accompanying text set D, wherein i=1,..., Nd , keep one for each different word, forming a vocabulary VOL={w represented by a word listi ,...,wNw }, whereNw is the total number of words in the vocabulary VOL.3.根据权利要求1所述的一种基于图像和文本相关性挖掘的Web图像聚类方法,其特征在于所述的对伴随文本进行文本处理并提取文本特征的步骤如下:3. a kind of Web image clustering method based on image and text correlation mining according to claim 1, it is characterized in that described accompanying text is carried out text processing and the step of extracting text feature is as follows:(1)对词汇表VOL中的每个单词wi,其中i=1,…,Nw,Nw是词汇表中单词总数,顺序扫描伴随文本集合D中的每个伴随文本dj,统计每个单词wi在每个文档dj中出现的次数nij,其中j=1,…,Nd,Nd是伴随文本总数,并统计集合D中包含单词wi的伴随文本个数num(wi);(1) For each word wi in the vocabulary VOL, where i=1,..., Nw , Nw is the total number of words in the vocabulary, sequentially scan each accompanying text dj in the accompanying text set D, and count The number of occurrences nij of each word wi in each document dj , where j=1,..., Nd , Nd is the total number of accompanying texts, and count the number of accompanying texts num containing word wi in the set D (wi );(2)根据公式(1)计算每个单词wi在每个伴随文本dj中的词频freq(wi,dj),其中i=1,…,Nw,Nw是词汇表中单词总数,j=1,…,Nd,Nd是集合D中伴随文本总数;(2) Calculate the word frequency freq(wi , dj ) of each word wi in each accompanying text dj according to formula (1), where i=1,..., Nw , Nw is the word in the vocabulary Total number, j=1,..., Nd , Nd is the total number of accompanying texts in the set D;freqfreq((wwii,,ddjj))==nnoijij//ΣΣkk==11NNwwnnokjkj..------((11))(3)对词汇表VOL中的每个单词wi,根据公式(2)计算其逆文档词频idf(wi);(3) For each word wi in the vocabulary VOL, calculate its inverse document word frequency idf(wi ) according to formula (2);idf(wi)=log(Nd/num(wi)).    (2)idf(wi )=log(Nd /num(wi )). (2)(4)根据向量空间模型,将集合D中每个伴随文本dj表示成Nw维向量,第i维对应词汇表中的单词wi,其值为tfidf(wi),计算公式如下:(4) According to the vector space model, each accompanying text dj in the set D is expressed as an Nw dimensional vector, the i-th dimension corresponds to the word wi in the vocabulary, and its value is tfidf(wi ), the calculation formula is as follows:tfidf(wi)=freq(wi,dj)×idf(wi).    (3)。tfidf(wi )=freq(wi ,dj )×idf(wi ). (3).4.根据权利要求1所述的一种基于图像和文本相关性挖掘的Web图像聚类方法,其特征在于所述的计算词汇表中每个单词的可见度的方法是:词汇表VOL中每个单词wi的可见度值vis(wi)由公式(4)计算;4. a kind of Web image clustering method based on image and text correlation mining according to claim 1 is characterized in that the method for the visibility of each word in the described calculation vocabulary is: in the vocabulary VOL each The visibility value vis(wi ) of word wi is calculated by formula (4);visvis((wwii))==((((CC11++1010--99))//((CC22++1010--99))))--IDFIDFGoogleGoogle((wwii))..------((44))其中,C1是将单词wi作为查询提交给Google图片搜索返回的检索结果总数,C2是将单词wi作为查询提交给Google文本搜索返回的检索结果总数;指数因子IDFGoogle(wi)的计算公式如下:Among them, C1 is the total number of search results returned by submitting the word wi as a query to Google Image Search, and C2 is the total number of search results returned by submitting the word wi as a query to Google Text Search; the index factor IDFGoogle (wi ) The calculation formula is as follows:IDFGoogle(wi)=log(|DGoogle|/C2).    (5)IDFGoogle (wi )=log(|DGoogle |/C2 ). (5)其中,DGoogle是Google索引的所有Web页面集合,|DGoogle|表示集合DGoogle中的页面总数。Among them, DGoogle is the collection of all Web pages indexed by Google, and |DGoogle | represents the total number of pages in the collection DGoogle .5.根据权利要求1所述的一种基于图像和文本相关性挖掘的Web图像聚类方法,其特征在于所述的将单词的可见度与TF-IDF方法集成以计算单词和图像相关性关联的方法是:单词wi与图像Imagej的相关性关联r(wi,Imagei)由公式(6)计算,其中j=1,…,Nd,Nd是伴随文本总数;5. a kind of Web image clustering method based on image and text correlation mining according to claim 1, is characterized in that described visibility of word is integrated with TF-IDF method to calculate word and image correlation The method is: the correlation r(wi , Imagei ) of word wi and image Imagej is calculated by formula (6), wherein j=1,..., Nd , Nd is the total number of accompanying texts;r(wi,Imagej)=tfidf(wi)×vis(wi).    (6)。r(wi , Imagej )=tfidf(wi )×vis(wi ). (6).6.根据权利要求1所述的一种基于图像和文本相关性挖掘的Web图像聚类方法,其特征在于所述的根据主题模型对伴随文本集合进行分析,提取隐含主题概率分布以计算词汇表中任意两个单词间的主题相关度的步骤如下:6. A kind of Web image clustering method based on image and text correlation mining according to claim 1, it is characterized in that described according to topic model, accompanying text set is analyzed, and hidden topic probability distribution is extracted to calculate vocabulary The steps of topic relevance between any two words in the table are as follows:(1)以词汇表VOL、伴随文本集合D和集合D中的隐含主题数k作为主题模型隐含狄利克雷分配的输入,输出每个隐含主题zj的概率分布P(zj)和zj在每个单词wi上的概率分布P(wi|zj),其中j=1,…,k;(1) Take the vocabulary VOL, the accompanying text set D, and the number k of hidden topics in the set D as the input of the hidden Dirichlet distribution of the topic model, and output the probability distribution P(zj ) of each hidden topic zj and the probability distribution P(wi |zj ) of zj on each word wi , where j=1,...,k;(2)集合VOL中任意两个单词ws和wt之间的主题相关度Topic_r(ws,wt)由公式(7)所定义的主题相关度函数计算,其中σ是归一化常数,(2) The topic correlation Topic_r(ws , wt ) between any two words ws and wt in the set VOL is calculated by the topic correlation function defined by formula (7), where σ is a normalization constant ,TopicTopic__rr((wwsthe s,,wwtt))==maxmaxjjPP((zz==jj||wwsthe s))PP((zz==jj||wwtt))==maxmaxjjpp((wwsthe s||zz==jj))PP((zz==jj))PP((wwsthe s))·&Center Dot;pp((wwtt||zz==jj))PP((zz==jj))PP((wwtt))------((77))..==maxmaxjjpp((wwsthe s||zz==jj))pp((wwtt||zz==jj))PP((zz==jj))σσ..7.根据权利要求1所述的一种基于图像和文本相关性挖掘的Web图像聚类方法,其特征在于所述的利用复杂图模型对单词和图像相关性关联以及单词和单词主题相关性关联进行建模的方法是:复杂图模型包含图像结点和单词结点两种不同类型结点,单词和图像间的异构链接以及单词和单词间的同构链接作为结点间的边,单词和图像链接权重由公式(6)所定义的单词和图像相关性关联r(wi,Imagei)计算,单词和单词链接权重为公式(7)定义的单词和单词主题相关度函数Topic_r(ws,wt)计算,复杂图模型表示为如公式(8)所示的矩阵集合;7. a kind of Web image clustering method based on image and text correlation mining according to claim 1, is characterized in that described utilize complex graph model to word and image correlation and word and word subject correlation association The modeling method is as follows: the complex graph model contains two different types of nodes, image nodes and word nodes, heterogeneous links between words and images and homogeneous links between words and words as edges between nodes, and words The word and image link weight is calculated by the word and image correlation r(wi , Imagei ) defined by formula (6), and the word and word link weight is the word and word topic correlation function Topic_r(ws , wt ), the complex graphical model is expressed as a matrix set as shown in formula (8);{{SS∈∈RR++NNww××NNww,,AA∈∈RR++NNdd××NNdd}}..------((88))其中,对称矩阵S∈R+Nw×Nw表示单词和单词相关性矩阵,Nw是词汇表中单词总数,R+是正实数集合,矩阵元素Sij(i≠j)表示单词wi和wj之间的主题相关度,Sij=Topic_r(wi,wj),矩阵A∈R+Nw×Nd表示单词和图像相关性矩阵,Nd是图像总数,矩阵元素Aij表示单词wi和第j个图像Imagej之间的相关性关联,Aij=tfidf(wi)·vis(wi)。Among them, the symmetric matrix S ∈ R + N w × N w Represents word and word correlation matrix, Nw is the total number of words in the vocabulary, R+ is a set of positive real numbers, matrix element Sij (i≠j) represents the subject correlation between word wi and wj , Sij =Topic_r (wi , wj ), matrix A ∈ R + N w × N d Represents word and image correlation matrix, Nd is the total number of images, matrix element Aij represents the correlation between word wi and the jth image Imagej , Aij =tfidf(wi ) vis(wi) .8.根据权利要求1所述的一种基于图像和文本相关性挖掘的Web图像聚类方法,其特征在于所述的应用复杂图聚类算法对图像进行聚类的方法可表示为如公式(9)所定义的优化问题;8. a kind of Web image clustering method based on image and text correlation mining according to claim 1, is characterized in that described application complex graph clustering algorithm carries out the method for clustering image as formula ( 9) The defined optimization problem;minminCC((11)),,CC((22)),,DD.,,BB||||SS--CC((11))DD.((CC((11))))TT||||22++||||AA--CC((11))BB((CC((22))))TT||||22sthe s..tt..CC((11))∈∈{{0,10,1}}NNww××kk11,,CC((22))∈∈{{0,10,1}}NNdd××kk22,,CC((11))11==11,,CC((22))11==11..------((99))其中,向量1的每个分量都为1,k1和k2分别表示单词和图像的聚类个数,类属指示矩阵C(1)和C(2)是复杂图聚类算法的输出,矩阵元素Cpq(2)表示第p个图像Imagep属于第q类,对公式(9)所定义的优化问题进行求解的复杂图聚类算法如算法1所示:Among them, each component of vector 1 is 1, k1 and k2 represent the number of clusters of words and images respectively, and the category indicator matrices C(1) and C(2) are the output of the complex graph clustering algorithm, The matrix element Cpq(2) indicates that the p-th image Imagep belongs to the q-th class, and the complex graph clustering algorithm for solving the optimization problem defined by formula (9) is shown in Algorithm 1:算法1.复杂图G1的聚类算法CGC.Algorithm 1. Clustering algorithm CGC of complex graphG1 .输入:矩阵S和A;Input: matrices S and A;输出:类属指示矩阵C(1)和C(2),k1和k2分别是单词和图像的聚类个数;Output: generic indicator matrices C(1) and C(2) , k1 and k2 are the number of clusters of words and images respectively;步骤1.重复迭代步骤2-5直到收敛;Step 1. Repeat iteration steps 2-5 until convergence;步骤2.计算D=((C(1))TC(1))-1(C(1))TSC(1)(C(1))TC(1))-1Step 2. Calculate D=((C(1) )T C(1) )-1 (C(1) )T SC(1) (C(1) )T C(1) )-1 ;步骤3.计算B=((C(1))TC(1))-1(C(1))TAC(2)(C(2))TC(2))-1Step 3. Calculate B=((C(1) )T C(1) )-1 (C(1) )T AC(2) (C(2) )T C(2) )-1 ;步骤4.固定D,B和C(2),逐行更新C(1),使得最小化L,L计算如下:Step 4. Fix D, B and C(2) and update C(1) row by row such that L is minimized, L is calculated as follows:L=||S-C(1)D(C(1))T||2+||A-C(1)B(C(2))T||2L=||SC(1) D(C(1) )T ||2 +||AC(1) B(C(2) )T ||2 ;步骤5.固定D,B和C(1),逐行更新C(2),使得最小化L,L计算如下:Step 5. Fix D, B and C(1) and update C(2) row by row such that L is minimized, L is calculated as follows:L=||S-C(1)D(C(1))T||2+||A-C(1)B(C(2))T||2.L=||SC(1) D(C(1) )T ||2 +||AC(1) B(C(2) )T ||2 .根据算法1输出的类属指示矩阵C(2)对图像集合IMG进行聚类的方法是,如果矩阵元素Cpq(2)=1则把第p个图像Imagep归为第q类,其中p=1,…,Nd,Nd表示集合IMG中图像总数,q=1,…,k2,k2表示IMG中图像的聚类个数。According to the category indicator matrix C(2) output by Algorithm 1, the method of clustering the image set IMG is, if the matrix elements C pq ( 2 ) = 1 Then classify the pth image Imagep into the qth category, where p=1,..., Nd , Nd represent the total number of images in the set IMG, q=1,..., k2 , k2 represent the aggregation of images in the IMG number of classes.
CN2009101000718A2009-06-222009-06-22Web image clustering method based on image and text relevant miningExpired - Fee RelatedCN101582080B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN2009101000718ACN101582080B (en)2009-06-222009-06-22Web image clustering method based on image and text relevant mining

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN2009101000718ACN101582080B (en)2009-06-222009-06-22Web image clustering method based on image and text relevant mining

Publications (2)

Publication NumberPublication Date
CN101582080Atrue CN101582080A (en)2009-11-18
CN101582080B CN101582080B (en)2011-05-04

Family

ID=41364228

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN2009101000718AExpired - Fee RelatedCN101582080B (en)2009-06-222009-06-22Web image clustering method based on image and text relevant mining

Country Status (1)

CountryLink
CN (1)CN101582080B (en)

Cited By (25)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN102262642A (en)*2011-01-282011-11-30北京理工大学Web image search engine and realizing method thereof
CN102411626A (en)*2011-12-132012-04-11北京大学 A Method to Classify Query Intent Based on the Distribution of Relevance Scores
CN102496146A (en)*2011-11-282012-06-13南京大学Image segmentation method based on visual symbiosis
CN102609412A (en)*2011-01-072012-07-25华东师范大学RSS (Really Simple Syndication)-based multi-thread graphic information synchronization crawling control method and system
CN102750385A (en)*2012-06-292012-10-24南京邮电大学Correlation-quality sequencing image retrieval method based on tag retrieval
CN102750338A (en)*2012-06-042012-10-24天津大学Text processing method facing transfer learning and text feature extraction method thereof
CN103425644A (en)*2012-05-142013-12-04腾讯科技(深圳)有限公司Method and device for extracting pictures in webpage content
WO2014029257A1 (en)*2012-08-232014-02-27深圳市世纪光速信息技术有限公司Method and system for aggregating group pictures, and picture searching method and system
CN103631890A (en)*2013-11-152014-03-12北京奇虎科技有限公司Method and device for mining image principal information
CN104035997A (en)*2014-06-132014-09-10淮阴工学院Scientific and technical information acquisition and pushing method based on text classification and image deep mining
WO2014173108A1 (en)*2013-04-252014-10-30华为技术有限公司Data classification method and apparatus
CN104657375A (en)*2013-11-202015-05-27中国科学院深圳先进技术研究院Image-text theme description method, device and system
CN104866491A (en)*2014-02-242015-08-26华为技术有限公司Parallel topic mining method and device
CN105005616A (en)*2015-07-202015-10-28清华大学Text illustration method and system based on text image characteristics for interaction expansion
WO2015183735A1 (en)*2014-05-272015-12-03Thomson LicensingMethods and systems for image based searching
CN105426911A (en)*2015-11-132016-03-23浙江大学Dirichlet process mixture model based TAC clustering method
WO2017113592A1 (en)*2015-12-312017-07-06百度在线网络技术(北京)有限公司Model generation method, word weighting method, apparatus, device and computer storage medium
CN107248930A (en)*2017-06-052017-10-13桂林电子科技大学One kind is based on k2MDD web service composition method
CN108109696A (en)*2016-11-232018-06-01中国移动通信有限公司研究院A kind of data processing method and device
CN108228720A (en)*2017-12-072018-06-29北京字节跳动网络技术有限公司Identify method, system, device, terminal and the storage medium of target text content and artwork correlation
CN108228721A (en)*2017-12-082018-06-29复旦大学Fast text clustering method on large corpora
CN108509521A (en)*2018-03-122018-09-07华南理工大学A kind of image search method automatically generating text index
CN108647322A (en)*2018-05-112018-10-12四川师范大学The method that word-based net identifies a large amount of Web text messages similarities
CN110990617A (en)*2019-11-272020-04-10广东智媒云图科技股份有限公司Picture marking method, device, equipment and storage medium
CN112836488A (en)*2021-01-252021-05-25浙江工业大学 Semantic Mining Method of Web Service Description Document Based on TWE-NMF Model

Cited By (41)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN102609412A (en)*2011-01-072012-07-25华东师范大学RSS (Really Simple Syndication)-based multi-thread graphic information synchronization crawling control method and system
CN102262642B (en)*2011-01-282013-07-10北京理工大学Web image search engine and realizing method thereof
CN102262642A (en)*2011-01-282011-11-30北京理工大学Web image search engine and realizing method thereof
CN102496146A (en)*2011-11-282012-06-13南京大学Image segmentation method based on visual symbiosis
CN102496146B (en)*2011-11-282014-03-05南京大学 An Image Segmentation Method Based on Visual Symbiosis
CN102411626A (en)*2011-12-132012-04-11北京大学 A Method to Classify Query Intent Based on the Distribution of Relevance Scores
CN103425644A (en)*2012-05-142013-12-04腾讯科技(深圳)有限公司Method and device for extracting pictures in webpage content
CN103425644B (en)*2012-05-142016-04-06腾讯科技(深圳)有限公司The extracting method of picture and device in Web page text
CN102750338A (en)*2012-06-042012-10-24天津大学Text processing method facing transfer learning and text feature extraction method thereof
CN102750385A (en)*2012-06-292012-10-24南京邮电大学Correlation-quality sequencing image retrieval method based on tag retrieval
CN102750385B (en)*2012-06-292014-05-07南京邮电大学Correlation-quality sequencing image retrieval method based on tag retrieval
WO2014029257A1 (en)*2012-08-232014-02-27深圳市世纪光速信息技术有限公司Method and system for aggregating group pictures, and picture searching method and system
WO2014173108A1 (en)*2013-04-252014-10-30华为技术有限公司Data classification method and apparatus
CN103631890A (en)*2013-11-152014-03-12北京奇虎科技有限公司Method and device for mining image principal information
CN103631890B (en)*2013-11-152017-05-17北京奇虎科技有限公司Method and device for mining image principal information
CN104657375A (en)*2013-11-202015-05-27中国科学院深圳先进技术研究院Image-text theme description method, device and system
CN104657375B (en)*2013-11-202018-01-26中国科学院深圳先进技术研究院 A method, device and system for describing a graphic theme
CN104866491A (en)*2014-02-242015-08-26华为技术有限公司Parallel topic mining method and device
CN104866491B (en)*2014-02-242019-07-23华为技术有限公司A kind of parallel Topics Crawling method and device
WO2015183735A1 (en)*2014-05-272015-12-03Thomson LicensingMethods and systems for image based searching
CN104035997B (en)*2014-06-132017-05-10淮阴工学院Scientific and technical information acquisition and pushing method based on text classification and image deep mining
CN104035997A (en)*2014-06-132014-09-10淮阴工学院Scientific and technical information acquisition and pushing method based on text classification and image deep mining
CN105005616A (en)*2015-07-202015-10-28清华大学Text illustration method and system based on text image characteristics for interaction expansion
CN105426911B (en)*2015-11-132018-12-25浙江大学A kind of TAC clustering method based on Di Li Cray process mixed model
CN105426911A (en)*2015-11-132016-03-23浙江大学Dirichlet process mixture model based TAC clustering method
WO2017113592A1 (en)*2015-12-312017-07-06百度在线网络技术(北京)有限公司Model generation method, word weighting method, apparatus, device and computer storage medium
US10565253B2 (en)2015-12-312020-02-18Baidu Online Network Technology (Beijing) Co., Ltd.Model generation method, word weighting method, device, apparatus, and computer storage medium
CN108109696A (en)*2016-11-232018-06-01中国移动通信有限公司研究院A kind of data processing method and device
CN107248930A (en)*2017-06-052017-10-13桂林电子科技大学One kind is based on k2MDD web service composition method
CN107248930B (en)*2017-06-052020-07-28桂林电子科技大学Based on k2Web service composition method of MDD
CN108228720A (en)*2017-12-072018-06-29北京字节跳动网络技术有限公司Identify method, system, device, terminal and the storage medium of target text content and artwork correlation
CN108228720B (en)*2017-12-072019-11-08北京字节跳动网络技术有限公司Identify method, system, device, terminal and the storage medium of target text content and original image correlation
CN108228721A (en)*2017-12-082018-06-29复旦大学Fast text clustering method on large corpora
CN108228721B (en)*2017-12-082021-06-04复旦大学 Fast Text Clustering Methods on Large Corpus
CN108509521A (en)*2018-03-122018-09-07华南理工大学A kind of image search method automatically generating text index
CN108647322A (en)*2018-05-112018-10-12四川师范大学The method that word-based net identifies a large amount of Web text messages similarities
CN108647322B (en)*2018-05-112021-12-17四川师范大学Method for identifying similarity of mass Web text information based on word network
CN110990617A (en)*2019-11-272020-04-10广东智媒云图科技股份有限公司Picture marking method, device, equipment and storage medium
CN110990617B (en)*2019-11-272024-04-19广东智媒云图科技股份有限公司Picture marking method, device, equipment and storage medium
CN112836488A (en)*2021-01-252021-05-25浙江工业大学 Semantic Mining Method of Web Service Description Document Based on TWE-NMF Model
CN112836488B (en)*2021-01-252024-02-20浙江工业大学 Web service description document semantic mining method based on TWE-NMF model

Also Published As

Publication numberPublication date
CN101582080B (en)2011-05-04

Similar Documents

PublicationPublication DateTitle
CN101582080A (en)Web image clustering method based on image and text relevant mining
Ristoski et al.Mining the web of linked data with rapidminer
US9183281B2 (en)Context-based document unit recommendation for sensemaking tasks
US9418083B2 (en)System for computerized evaluation of patent-related information
CN101661513B (en)Detection method of network focus and public sentiment
CN101692223B (en)Refined Search space is inputted in response to user
Ghosh et al.A tutorial review on Text Mining Algorithms
Khare et al.Understanding deep web search interfaces: A survey
CN110083696B (en) Global citation recommendation method and recommendation system based on meta-structure technology
US20090119281A1 (en)Granular knowledge based search engine
CN106649260A (en)Product feature structure tree construction method based on comment text mining
CN105659225A (en)Query expansion and query-document matching using path-constrained random walks
CN115796181A (en)Text relation extraction method for chemical field
Li et al.Context-based diversification for keyword queries over XML data
KR101229401B1 (en)System for Integrating Heterogeneous Web Information and Method of The Same
CN101751439A (en)Image retrieval method based on hierarchical clustering
Gjorgjevska et al.Content Engineering for State-of-the-art SEO Digital Strategies by Using NLP and ML
Wei et al.DF-Miner: Domain-specific facet mining by leveraging the hyperlink structure of Wikipedia
CN104537280A (en)Protein interactive relationship identification method based on text relationship similarity
Asa et al.A comprehensive survey on extractive text summarization techniques
JP2000020538A (en) Information search method, information search device, and information search program storage medium
Rao et al.Enhancing multi-document summarization using concepts
Ibrahim et al.Exquisite: explaining quantities in text
Saenko et al.Filtering abstract senses from image search results
Wang et al.Knowledge graph-based semantic ranking for efficient semantic query

Legal Events

DateCodeTitleDescription
C06Publication
PB01Publication
C10Entry into substantive examination
SE01Entry into force of request for substantive examination
C14Grant of patent or utility model
GR01Patent grant
CF01Termination of patent right due to non-payment of annual fee

Granted publication date:20110504

Termination date:20140622

EXPYTermination of patent right or utility model

[8]ページ先頭

©2009-2025 Movatter.jp