CN109543755A - Ensemble learning remote sensing image classification method based on class weight vectors - Google Patents

Ensemble learning remote sensing image classification method based on class weight vectors

Info

Publication number
CN109543755A
Authority
CN
China
Prior art keywords
sample
classifier
classification
integrated
base classifier
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811414486.8A
Other languages
Chinese (zh)
Inventor
窦鹏
韩镇
王玉磊
张云
刘艳超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qingdao National Surveying Haiyao Information Technology Co Ltd
Original Assignee
Qingdao National Surveying Haiyao Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2018-11-26
Filing date
2018-11-26
Publication date
2019-03-29
Application filed by Qingdao National Surveying Haiyao Information Technology Co Ltd
Priority to CN201811414486.8A
Publication of CN109543755A

Abstract

The invention belongs to the field of remote sensing technology and discloses an ensemble learning method for remote sensing image classification based on class weight vectors. The extracted samples are first divided into training samples and test samples, and base classifiers are trained with different classification algorithms. These classifiers then classify the test samples, which yields class weight vectors that make each class adaptive to the different classifiers. The classifiers are combined into an integrated classifier on the basis of these vectors, so that classifiers built on different algorithms complement one another. On this basis, AdaBoost iteration is used to boost the integrated classifier and obtain a more accurate integrated classifier with a better ensemble result. The invention achieves mutual adaptation between classes and classifiers, increases the diversity of the base classifiers while integrating different classification algorithms, and thereby improves the accuracy of remote sensing image classification.

Description

Ensemble learning remote sensing image classification method based on class weight vectors
Technical field
The invention belongs to the field of remote sensing technology and relates to an ensemble learning method for remote sensing image classification based on class weight vectors.
Background art
Remote sensing gives people the most direct and effective way to observe the Earth's surface, but extracting LULC (Land Use and Land Cover) information from remote sensing data remains a long-standing scientific problem. With the help of computer science, traditional machine learning algorithms have performed well in automatic remote sensing image classification, for example maximum likelihood (ML), decision tree (DT), minimum distance (MD), support vector machine (SVM), naive Bayes (NB) and artificial neural network (ANN), and they allow LULC information to be extracted quickly from remote sensing images. Because ground objects have complex characteristics, however, these algorithms still suffer from misclassification, omission, poor discrimination of similar ground objects and low accuracy.
At present, the room for raising remote sensing image classification accuracy by improving a single classification algorithm is shrinking. After comparing and analysing different classification algorithms, researchers have found that no algorithm suits every classification problem. On the same problem, different algorithms produce different results, and the same algorithm performs differently on different classes. In remote sensing image classification, a given algorithm may separate some LULC classes well but others poorly. A multiple classifier system (MCS) combines the strengths of different classifiers, avoids the weaknesses of a single classifier and further improves classification accuracy. MCS has been widely adopted for the automatic interpretation of remote sensing images, and a body of theory, techniques and methods has formed around it.
However, current MCS methods still face many problems. For example, some MCS methods depend heavily on prior knowledge, and their ensemble performance is optimal only when the samples follow a certain probability distribution. These methods have poor stability and high complexity, overfit easily and offer limited ensemble accuracy, so they are hard to transfer and generalise.
Existing MCS methods either use a single classification algorithm and obtain diverse base classifiers by varying the number of samples and the feature space, or build diverse base classifiers from different classification algorithms. The former type easily generates many base classifiers that differ from one another, but the performance of the ensemble is constrained by the characteristics of the one algorithm. The latter type can combine the strengths of different classification algorithms, but because the number of algorithms is limited it produces few base classifiers, and combining the classifiers requires a certain sample distribution to be guaranteed, which complicates operation.
Summary of the invention
The object of the invention is to provide an ensemble learning method for remote sensing image classification based on class weight vectors that overcomes the shortcomings of existing MCS methods and achieves efficient, high-accuracy remote sensing image classification.
To achieve this object, the invention adopts the following technical solution:
An ensemble learning method for remote sensing image classification based on class weight vectors, comprising the following steps:
s1. Initialise the weights of the samples in the sample set, following the basic procedure of AdaBoost;
s2. Draw samples from the sample set by sampling with replacement, and divide the drawn samples into training samples and test samples;
s3. Train base classifiers of several different classification algorithms on the training samples;
s4. Classify the test samples with the base classifiers from step s3 to obtain the class weight vector of each base classifier;
s5. Integrate the base classifiers on the basis of their class weight vectors to obtain an integrated classifier;
s6. Classify the samples of the sample set in step s2 with the integrated classifier from step s5 to obtain the classification error; if the classification error is below the set error threshold, go to step s7, otherwise go to step s8;
s7. Update the sample weights in the sample set, and return the updated samples to the sample set of step s2;
s8. Return to step s1;
s9. Set the number of AdaBoost iterations and repeat steps s2 to s7 until the AdaBoost iteration ends, producing a series of integrated classifiers with voting weights;
s10. For a remote sensing image to be classified, classify it with each of the integrated classifiers with voting weights, and combine all classification results by weighted voting.
Preferably, the classification algorithms include C4.5 decision tree, support vector machine, artificial neural network, naive Bayes, K nearest neighbour, logistic regression, minimum distance, expectation maximisation, maximum likelihood, Mahalanobis distance and random forest;
in step s3, several different classification algorithms are chosen freely from the above for training the base classifiers.
Preferably, in step s4, the class weight vector is obtained as follows:
The sensitivity of each base classifier to the different classes is expressed by the class weights $W_{ij}$;
Let the set of base classifiers be $M = \{M_1, M_2, M_3, \dots, M_S\}$, where $S$ is the number of base classifiers; the sample set be $X = \{X_1, X_2, X_3, \dots, X_N\}$, where $N$ is the number of samples; and the set of classes be $\Omega = \{\omega_1, \omega_2, \omega_3, \dots, \omega_C\}$, where $C$ is the number of classes;
Further, let $W_i$ be the class weight vector of base classifier $M_i$, $1 \le i \le S$; let $t_{ij}$ be the number of test samples that $M_i$ assigns to class $\omega_j$, and $e_{ij}$ the number of test samples that $M_i$ wrongly assigns to class $\omega_j$, $1 \le j \le C$;
The weight of base classifier $M_i$ for class $\omega_j$ is then:
$W_{ij} = 1 - E_{ij}$ (1)
where $E_{ij}$ is the error rate of $M_i$ on class $\omega_j$, which follows from the definitions above:
$E_{ij} = e_{ij} / t_{ij}$ (2)
The class weight vector that base classifier $M_i$ uses when voting on test samples is therefore:
$W_i = (W_{i1}, W_{i2}, W_{i3}, \dots, W_{iC})$ (3).
Preferably, let $M'$ denote the integrated classifier and $x$ a sample; the classification result of $M'$ for sample $x$ is given by formula (4) (not reproduced here), in which:
$M'(x)$ is the classification result of the integrated classifier $M'$ for sample $x$;
$W_{ij}$ is the weight of base classifier $M_i$ for the $j$-th class;
$M_i(x)$ is the classification result of base classifier $M_i$ for sample $x$.
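The formula for the output of the integrated classifier does not survive in this text. One reading that is consistent with the symbol definitions above, offered as an assumption rather than as the patent's own formula, is a class-weighted vote: each base classifier adds its weight for the class it predicts, and the class with the largest total wins. The indicator function $I(\cdot)$ is notation introduced only for this sketch.

```latex
M'(x) = \arg\max_{\omega_j \in \Omega} \sum_{i=1}^{S} W_{ij} \, I\bigl(M_i(x) = \omega_j\bigr),
\qquad
I(\text{true}) = 1, \quad I(\text{false}) = 0
```

Under this reading, a base classifier that is unreliable for a class contributes little when it votes for that class, even if it is reliable elsewhere.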
Preferably, in step s7, the sample weights in the sample set are updated as follows:
The weights of misclassified samples are raised so that these samples are more likely to be selected as training samples in the next AdaBoost iteration.
The invention has the following advantages:
The invention first divides the extracted samples into training samples and test samples and trains classifiers with different classification algorithms. These classifiers then classify the test samples, which yields class weight vectors that make each class adaptive to the different classifiers. The classifiers are combined into an integrated classifier on the basis of these vectors, so that classifiers built on different algorithms complement one another. On this basis, AdaBoost iteration is used to boost the integrated classifier and obtain a more accurate integrated classifier with a better ensemble result. The method achieves mutual adaptation between classes and classifiers, increases the diversity of the base classifiers while integrating different classification algorithms, and thereby improves the accuracy of remote sensing image classification.
Description of the drawings
Fig. 1 is a block diagram of the principle of the ensemble learning remote sensing image classification method based on class weight vectors in an embodiment of the invention;
Fig. 2 is a flow chart of the ensemble learning remote sensing image classification method based on class weight vectors in an embodiment of the invention.
Detailed description of the embodiments
The invention is described in further detail below with reference to the drawings and a specific embodiment:
As shown in Fig. 1, the technical route of the invention is as follows:
The first and second layers generate base classifiers with different classification algorithms (a base classifier is the basic classifier of a multi-classifier ensemble and the main object integrated by the invention) and combine them into one integrated classifier by means of class weight vectors.
This integrated classifier is called the EL_WV classifier (Ensemble Learning based on Weight Vector).
In the integrated classifier, the sensitivity of each base classifier to the different classes is expressed by the class weights $W_{ij}$.
Let the set of base classifiers be $M = \{M_1, M_2, M_3, \dots, M_S\}$, where $S$ is the number of base classifiers; the sample set be $X = \{X_1, X_2, X_3, \dots, X_N\}$, where $N$ is the number of samples; and the set of classes be $\Omega = \{\omega_1, \omega_2, \omega_3, \dots, \omega_C\}$, where $C$ is the number of classes.
Further, let $W_i$ be the class weight vector of base classifier $M_i$, $1 \le i \le S$; let $t_{ij}$ be the number of test samples that $M_i$ assigns to class $\omega_j$, and $e_{ij}$ the number of test samples that $M_i$ wrongly assigns to class $\omega_j$, $1 \le j \le C$.
The weight of base classifier $M_i$ for class $\omega_j$ is then:
$W_{ij} = 1 - E_{ij}$ (1)
where $E_{ij}$ is the error rate of $M_i$ on class $\omega_j$, which follows from the definitions above:
$E_{ij} = e_{ij} / t_{ij}$ (2)
The class weight vector that base classifier $M_i$ uses when voting on test samples is therefore:
$W_i = (W_{i1}, W_{i2}, W_{i3}, \dots, W_{iC})$ (3).
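A small sketch of formulas (1) to (3) may help. It computes the class weight vector of one base classifier from its predictions on the test samples, taking the per-class error rate as $e_{ij}/t_{ij}$, which is what the definitions of $t_{ij}$ and $e_{ij}$ suggest. The function name `class_weight_vector`, the example labels and the rule that a never-predicted class gets weight 0 are assumptions made for illustration.

```python
from collections import Counter


def class_weight_vector(y_true, y_pred, classes):
    """Return [W_i1, ..., W_iC] for one base classifier on the test samples."""
    assigned = Counter(y_pred)                                    # t_ij
    wrong = Counter(p for t, p in zip(y_true, y_pred) if t != p)  # e_ij
    weights = []
    for c in classes:
        if assigned[c] == 0:
            # Assumption: a class the classifier never predicts gets weight 0;
            # the text does not say how to treat t_ij = 0.
            weights.append(0.0)
        else:
            weights.append(1.0 - wrong[c] / assigned[c])          # W_ij = 1 - E_ij
    return weights


# Hypothetical test labels: one forest sample is wrongly assigned to "urban".
y_true = ["water", "water", "forest", "forest", "urban", "urban"]
y_pred = ["water", "water", "forest", "urban", "urban", "urban"]
w = class_weight_vector(y_true, y_pred, ["water", "forest", "urban"])
print(w)
```

In this example three samples are assigned to "urban" and one of them is wrong, so the weight for "urban" is 1 - 1/3, while "water" and "forest" keep a weight of 1.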
From this, the integrated classifier is built as follows:
1. Draw a training sample set $D_{train}$ and a test sample set $D_{test}$ from the sample set $D$;
2. For $i$ from 1 to $S$, where $i$ is the index of base classifier $M_i$, perform steps 2-1 and 2-2;
2-1. Train base classifier $M_i$ on the training set $D_{train}$;
2-2. Compute the class weight vector $W_i$ of base classifier $M_i$ from the samples in the test set $D_{test}$;
3. Let $M'$ denote the integrated classifier and $x$ a sample; the classification result of $M'$ for sample $x$ is given by formula (4) (not reproduced here), in which:
$M'(x)$ is the classification result of the integrated classifier $M'$ for sample $x$;
$W_{ij}$ is the weight of base classifier $M_i$ for the $j$-th class;
$M_i(x)$ is the classification result of base classifier $M_i$ for sample $x$.
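The combination in step 3 can be sketched in code for a single sample. The rule used here, in which each base classifier adds its weight $W_{ij}$ to the class it predicts and the highest total wins, is an assumed reading of the integration formula, which is missing from this text. The function name `el_wv_predict`, the weight values and the tie-breaking rule are hypothetical.

```python
def el_wv_predict(base_predictions, weight_vectors, classes):
    """Combine base-classifier labels for one sample by class-weighted voting.

    base_predictions[i] is the label M_i(x); weight_vectors[i][j] is W_ij.
    """
    index = {c: j for j, c in enumerate(classes)}
    scores = [0.0] * len(classes)
    for label, weights in zip(base_predictions, weight_vectors):
        j = index[label]
        scores[j] += weights[j]  # M_i votes for class j with weight W_ij
    # Assumption: ties go to the class listed first in `classes`.
    best = max(range(len(classes)), key=lambda j: scores[j])
    return classes[best]


classes = ["water", "forest", "urban"]
W = [
    [0.9, 0.6, 0.5],  # hypothetical class weights of base classifier 1
    [0.8, 0.9, 0.4],  # base classifier 2
    [0.7, 0.5, 0.6],  # base classifier 3
]
label = el_wv_predict(["urban", "forest", "urban"], W, classes)
print(label)
```

Here two classifiers vote "urban" with weights 0.5 and 0.6, which together outweigh the single "forest" vote of 0.9. If the third classifier had voted "water" instead, the lone "forest" vote would win, because the one remaining "urban" vote carries only 0.5.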
The third layer generates multiple EL_WV classifiers by AdaBoost iteration and integrates them by weighted voting to obtain the ELM_CWV classifier (Ensemble Learning Method with the use of Class based Weight Vector).
Before the detailed steps of the embodiment are described, the AdaBoost method is introduced.
AdaBoost first assigns every sample the same weight; the weight determines the probability that the sample is drawn into the training set used to generate a base classifier. Training samples are then drawn from the sample set by weighted sampling and a weak classifier is generated. The weak classifier classifies all samples, the error of the current classifier is calculated, and the weight of each sample is recalculated from that error so that misclassified samples receive higher weights. In the next iteration the selected training samples therefore concentrate on the samples that are easily misclassified, and the newly generated classifier pays more attention to them.
Repeating this over several iterations produces a series of weak classifiers. For an unknown entity, all weak classifiers classify it, and their results are combined by weighted voting.
AdaBoost has many variants, of which the two most typical are AdaBoost.M1 and AdaBoost.M2. AdaBoost.M1 is mainly used for multi-class classification, whereas AdaBoost.M2 is mainly used for binary classification. Remote sensing image classification is generally a multi-class problem, so applications of AdaBoost in automatic remote sensing interpretation are mainly based on AdaBoost.M1. AdaBoost in the invention refers mainly to AdaBoost.M1, and the algorithm is as follows:
Let the sample set be $S = \{(x_1, y_1), (x_2, y_2), \dots, (x_n, y_n)\}$ with $x_p \in X$, $y_p \in Y$.
Here $n$ is the total number of samples, $X$ and $Y$ are the feature space and the class labels of the samples, $K$ is the number of iterations, and $W(p)$ is the weight of sample $p$, $p = 1, 2, \dots, n$. The AdaBoost algorithm proceeds as follows:
Input: $S$; $K$; weak classification algorithm WeakLearn;
Training:
1. Initialise the weight of each sample $p$ in $S$: $W(p) = 1/n$;
2. For $q = 1, \dots, K$, perform the following steps:
3. Draw training samples from $S$ by weighted sampling with replacement and call WeakLearn to train a classifier $h_q: X \to Y$; calculate the error $\varepsilon_q$ of $h_q$ according to formula (5) (not reproduced here);
4. If $\varepsilon_q > 0.5$, reset the weights of the samples in $S$ to $W(p) = 1/n$ and go to step 3;
5. Calculate $\beta_q = \varepsilon_q / (1 - \varepsilon_q)$ and update $W_{q+1}(p)$ according to formula (6) (not reproduced here), where $Z_q$ is the normalisation factor that makes the sample weights sum to 1;
6. Calculate $\lambda_q = \log(1/\beta_q)$ as the voting weight of $h_q$;
7. End.
Classification: the final prediction for an entity $x$ of unknown class is obtained by weighted voting according to formula (7) (not reproduced here).
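The training loop above matches the standard AdaBoost.M1 procedure, so one round of it can be sketched as follows. Formulas (5) to (7) are missing from this text; the sketch assumes their usual AdaBoost.M1 forms: the error is the summed weight of the misclassified samples, correctly classified samples have their weights multiplied by $\beta_q$ before normalisation by $Z_q$, and the final label is the one that collects the largest total of $\log(1/\beta_q)$ votes. The function names and the clamp applied to a zero error are illustrative choices, not part of the source.

```python
import math


def adaboost_m1_round(weights, y_true, y_pred):
    """One AdaBoost.M1 update: returns (new_weights, vote_weight) or None.

    None signals that the weighted error exceeded 0.5 (step 4 in the text),
    so the caller should reset all weights to 1/n and redraw the sample.
    """
    miss = [t != p for t, p in zip(y_true, y_pred)]
    eps = sum(w for w, m in zip(weights, miss) if m)   # assumed formula (5)
    if eps > 0.5:
        return None
    # Assumption: clamp eps away from 0 so log(1/beta) stays finite when the
    # weak classifier makes no mistakes; the text does not cover this case.
    eps = max(eps, 1e-10)
    beta = eps / (1.0 - eps)
    # Assumed formula (6): correct samples are scaled by beta (at most 1), so
    # after normalisation the misclassified ones carry relatively more weight.
    scaled = [w if m else w * beta for w, m in zip(weights, miss)]
    z = sum(scaled)                                    # normalisation Z_q
    return [w / z for w in scaled], math.log(1.0 / beta)


def weighted_vote(labels, vote_weights):
    """Assumed formula (7): the label with the largest summed vote weight."""
    totals = {}
    for label, lam in zip(labels, vote_weights):
        totals[label] = totals.get(label, 0.0) + lam
    return max(totals, key=totals.get)


n = 5
w0 = [1.0 / n] * n
new_w, vote = adaboost_m1_round(
    w0, ["a", "a", "b", "b", "b"], ["a", "a", "b", "b", "a"]
)
print(new_w, vote)
```

With one of five equally weighted samples misclassified, the error is 0.2 and $\beta_q$ is 0.25, so the misclassified sample ends up with half of the total weight and the classifier receives a voting weight of $\log 4$.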
As shown in Fig. 2, the ensemble learning remote sensing image classification method based on class weight vectors is described in detail below:
s1. Initialise the weights of the samples in sample set $D$, following the basic procedure of AdaBoost.
s2. Draw samples $D_i$ from sample set $D$ by sampling with replacement and divide them into training samples $D_{ia}$ and test samples $D_{ib}$, where $D_{ia}$ is used to train the base classifiers and $D_{ib}$ to determine the class weight vectors.
s3. Train base classifiers of several different classification algorithms on the training samples.
The classification algorithms include C4.5 decision tree, support vector machine, artificial neural network, naive Bayes, K nearest neighbour, logistic regression, minimum distance, expectation maximisation, maximum likelihood, Mahalanobis distance and random forest, among others.
Several different classification algorithms are chosen freely from the above for training the base classifiers.
In this embodiment the classification algorithms may be, for example, the four algorithms C4.5 (C4.5 decision tree), SVM (support vector machine), ANN (artificial neural network) and NB (naive Bayes).
$D_{ia}$ is used to train the C4.5, SVM, ANN and NB base classifiers $\mathrm{C4.5}_i$, $\mathrm{SVM}_i$, $\mathrm{ANN}_i$ and $\mathrm{NB}_i$.
s4. Classify the test samples $D_{ib}$ with the base classifiers $\mathrm{C4.5}_i$, $\mathrm{SVM}_i$, $\mathrm{ANN}_i$ and $\mathrm{NB}_i$ to obtain the class weight vector of each base classifier; the calculation is as described above.
s5. Integrate the base classifiers on the basis of their class weight vectors to obtain the integrated classifier $\mathrm{EL\_WV}_i$.
s6. Classify the samples of sample set $D$ in step s2 with the integrated classifier $\mathrm{EL\_WV}_i$ to obtain the classification error; if the classification error is below the set error threshold, go to step s7, otherwise go to step s8.
In this embodiment the error threshold is normally set to 0.5, so the condition is met when the classification error is below 0.5.
s7. Update the sample weights in the sample set, that is, raise the weights of misclassified samples so that they are more likely to be selected as training samples in the next AdaBoost iteration, and return the updated samples to the sample set of step s2.
s8. Return to step s1 and reinitialise the weights of the samples in sample set $D$.
s9. Set the number of AdaBoost iterations and repeat steps s2 to s7 until the AdaBoost iteration ends, producing a series of EL_WV integrated classifiers with voting weights.
s10. For a remote sensing image to be classified, classify it with each of the EL_WV integrated classifiers with voting weights, and combine all classification results by weighted voting.
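Steps s1 to s10 can be put together in a compact sketch that uses only the Python standard library. It is not the embodiment itself: two simple learners (minimum distance and 1-nearest neighbour, both from the list of admissible algorithms) stand in for C4.5, SVM, ANN and NB; the drawn sample is split in half into training and test parts; the weight update and voting weight follow the usual AdaBoost.M1 forms; and `max_resets` is a guard added here so that repeated resets in step s8 cannot loop forever. All function names and the toy two-class data are hypothetical.

```python
import math
import random
from collections import Counter


def train_min_distance(X, y):
    """Minimum distance method: assign the class of the nearest class mean."""
    counts, sums = Counter(y), {}
    for xi, yi in zip(X, y):
        acc = sums.setdefault(yi, [0.0] * len(xi))
        for k, v in enumerate(xi):
            acc[k] += v
    means = {c: [v / counts[c] for v in s] for c, s in sums.items()}
    return lambda x: min(means, key=lambda c: math.dist(x, means[c]))


def train_1nn(X, y):
    """K nearest neighbour with K = 1."""
    pairs = list(zip(X, y))
    return lambda x: min(pairs, key=lambda p: math.dist(x, p[0]))[1]


def class_weights(model, X, y, classes):
    """W_ij = 1 - e_ij / t_ij on the test part (0 if class j is never predicted)."""
    pred = [model(x) for x in X]
    t = Counter(pred)
    e = Counter(p for a, p in zip(y, pred) if a != p)
    return {c: (1.0 - e[c] / t[c]) if t[c] else 0.0 for c in classes}


def make_el_wv(models, weights, classes):
    """Integrated classifier: class-weighted vote of the base classifiers."""
    def predict(x):
        score = dict.fromkeys(classes, 0.0)
        for m, w in zip(models, weights):
            c = m(x)
            score[c] += w[c]
        return max(classes, key=score.get)
    return predict


def train_elm_cwv(X, y, rounds=5, seed=0, max_resets=20):
    rng = random.Random(seed)
    n, classes = len(X), sorted(set(y))
    w = [1.0 / n] * n                                            # s1
    ensemble, resets = [], 0
    while len(ensemble) < rounds and resets <= max_resets:       # s9
        idx = rng.choices(range(n), weights=w, k=n)              # s2
        tr, te = idx[: n // 2], idx[n // 2:]
        models = [f([X[i] for i in tr], [y[i] for i in tr])
                  for f in (train_min_distance, train_1nn)]      # s3
        cw = [class_weights(m, [X[i] for i in te], [y[i] for i in te], classes)
              for m in models]                                   # s4
        el_wv = make_el_wv(models, cw, classes)                  # s5
        miss = [el_wv(xi) != yi for xi, yi in zip(X, y)]
        eps = sum(wi for wi, bad in zip(w, miss) if bad)         # s6
        if eps >= 0.5:                                           # s8: reset
            w, resets = [1.0 / n] * n, resets + 1
            continue
        eps = max(eps, 1e-10)          # assumption: avoid log(1/0) on zero error
        beta = eps / (1.0 - eps)
        w = [wi if bad else wi * beta for wi, bad in zip(w, miss)]  # s7
        z = sum(w)
        w = [wi / z for wi in w]
        ensemble.append((el_wv, math.log(1.0 / beta)))
    return ensemble, classes


def predict_elm_cwv(ensemble, classes, x):                       # s10
    score = dict.fromkeys(classes, 0.0)
    for el_wv, lam in ensemble:
        score[el_wv(x)] += lam
    return max(classes, key=score.get)


# Toy data: two well-separated clusters standing in for two LULC classes.
data_rng = random.Random(1)
X = [(data_rng.gauss(0, 0.3), data_rng.gauss(0, 0.3)) for _ in range(30)] + \
    [(data_rng.gauss(3, 0.3), data_rng.gauss(3, 0.3)) for _ in range(30)]
y = ["water"] * 30 + ["forest"] * 30
ensemble, classes = train_elm_cwv(X, y)
label_a = predict_elm_cwv(ensemble, classes, (0.1, -0.2))
label_b = predict_elm_cwv(ensemble, classes, (2.9, 3.2))
print(label_a, label_b)
```

Swapping in other base learners only requires another training function that takes `(X, y)` and returns a callable predictor, which mirrors the free choice of algorithms in step s3.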
The classification algorithms in this embodiment are of course not limited to four; three, five or more may be used. Nor are they limited to the combination of the four algorithms above; other combinations of classification algorithms may be used.
By integrating base classifiers of different classification algorithms, the invention lets the algorithms complement one another.
Through AdaBoost iteration, the invention also enlarges the pool of base classifiers, increases their diversity and achieves mutual adaptation between classes and base classifiers, which yields efficient, high-accuracy remote sensing image classification.
The above is of course only a preferred embodiment of the invention, and the invention is not limited to it. It should be noted that all equivalent substitutions and obvious variants made by any person skilled in the art under the teaching of this specification fall within the essential scope of this specification and are protected by the invention.

Claims (5)

Priority Applications (1)

Application number | Priority date | Filing date | Title
CN201811414486.8A | 2018-11-26 | 2018-11-26 | Ensemble learning remote sensing image classification method based on class weight vectors

Publications (1)

Publication number | Publication date
CN109543755A (en) | 2019-03-29

Family ID: 65849947

Country Status (1)

Country | Link
CN (1) | CN109543755A (en)

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US20110103642A1 (en) * | 2009-10-30 | 2011-05-05 | Applied Signal Technology, Inc. | Multipass Data Integration For Automatic Detection And Classification Of Objects
CN104573013A (en) * | 2015-01-09 | 2015-04-29 | 上海大学 | Category weight combined integrated learning classifying method
CN105844300A (en) * | 2016-03-24 | 2016-08-10 | 河南师范大学 | Optimized classification method and optimized classification device based on random forest algorithm

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
窦鹏: "基于投票法的多分类器集成遥感影像分类技术", 《中国优秀硕士学位论文全文数据库 基础科学辑》*
陈洋波等: "基于Landsat的多分类器集成遥感影像分类", 《测绘科学》*

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN112380944A (en) * | 2020-11-06 | 2021-02-19 | 中国电力科学研究院有限公司 | Method and system for evaluating structural state of transmission tower
CN112380944B (en) * | 2020-11-06 | 2021-12-21 | 中国电力科学研究院有限公司 | A method and system for evaluating the structural state of transmission towers based on satellite remote sensing
CN116756484A (en) * | 2023-05-26 | 2023-09-15 | 浙江中烟工业有限责任公司 | Multi-signal fusion fault detection method, device and medium for cigarette equipment

Legal Events

Code | Title
PB01 | Publication
SE01 | Entry into force of request for substantive examination
RJ01 | Rejection of invention patent application after publication

Application publication date: 2019-03-29
