One group of gene for being used for lung cancer molecule parting and its applicationTechnical field
The present invention relates to cancer diagnosis and biology field, and the application of diagnostic techniques clinically.Specifically,The present invention relates to the gene that one group is used for lung cancer molecule parting, by the foundation of lung cancer molecular typing methods and by detecting lung cancerThe expression of specific gene in tissue, differentiates the most important hypotype of lung cancer:Adenocarcinoma of lung and lung squamous cancer.The invention further relates to completionThe kit of the lung cancer molecular typing methods.
Background technology
Lung cancer is to one of malignant tumour of population health and life threat maximum in world wide.Male lung cancer incidenceFirst of malignant tumour is accounted for the death rate, female lung cancer incidence accounts for the second of malignant tumour, is only second to breast cancer, extremelyThe rate of dying holds pride of place.According to《2012 tumour registration annual reports》, the incidence of national tumour registration area lung cancer is 53.57/10 ten thousand,Chinese population standardized rate is 25.34/10 ten thousand, and male is 1.94 times of women;The death rate of lung cancer is 45.57/10 ten thousand, Nan Xingwei2.05 times of women.
Lung cancer can be divided into non-small cell lung cancer and Small Cell Lung Cancer.Wherein, non-small cell lung cancer accounts for more than 80%, mainly byGland cancer (also known as adenocarcinoma of lung) and squamous carcinoma (also known as squamous cell carcinoma or lung squamous cancer) composition.In the work of current clinical practice, tissuePathological examination is pulmonary cancer diagnosis and the goldstandard of classification, and traditional histopathological methods are to pass through the pathological tissues of acquisitionFix, dewax, dyeing and etc., section is made and observes its morphological feature under the microscope.Detection method step is various, waitsTime is longer, and testing result has certain subjectivity.Particularly when Tumor Differentiation is poor, lack adenocarcinoma of lung and the shape of lung squamous cancerState feature, parting is with regard to relatively difficult.Therefore there is indefinite this concept of type non-small cell lung cancer of histological type.2009Year, one research found that in biopsy sample the indefinite type non-small cell lung cancer of histological type accounts for 25%, in cytologic specimenIn account for 40%.Therefore, it is necessary to a kind of adenocarcinoma of lung and the method for lung squamous cancer can be effectively distinguished than more objective.
In addition, lung cancer is the disease of the horizontal upper height heterogeneity of molecule, and the identical tumour of Histological Study, its moleculeScience of heredity change is not the same, so as to result in the difference of lung cancer therapy reaction and prognosis.More and more new chemicotherapy medicinesThe appearance requirement of thing and targeted drug is stringent to distinguish adenocarcinoma of lung and lung squamous cancer.Life cycle is notable after using pemetrexed such as adenocarcinoma of lungIncrease, risk of the Lung Squamous Carcinoma Patients using bevacizumab there are massive haemorrhage, albumin paclitaxel with cisplatin treatment late period lung squamaCancer curative effect is preferable, therefore the inevitable requirement that accurate parting is lung cancer individualized treatment is carried out to it.
In recent years, the understanding to lung cancer has been deepened in the research of tumor markers significantly so as to its further classification, diagnosisAnd Index for diagnosis is possibly realized.As carcinomebryonic antigen (CEA) is used for the detection of adenocarcinoma of lung, cytokeratin 19 fragment 21-1(CYFR21-1) it is used for the detection of lung squamous cancer, nerve specificity olefinic alcohol enzyme (NSE) is used for diagnosis of cellule adenocarcinoma of lung etc., butThe problem of being because of Sensitivity and Specificity, there is higher false positive and false negative, diagnosis efficiency is low.Gene molecule diagnosis toolThere are high sensitivity, high specificity, the clear and definite advantage of background mechanism, applied to various tumours more and more in recent years.Therefore,Diagnosed using gene molecule, find a kind of differentiation adenocarcinoma of lung and the method for lung squamous cancer, help to realize personalized medicine, to patientCarrying out precisely treatment has important clinical meaning.
The content of the invention
One of the technical problem to be solved in the present invention is to provide one group of gene for being used for lung cancer molecule parting, establishes lung cancer systemAnalysis model is counted, helps to realize individualized treatment.
The second technical problem to be solved by the present invention is to provide the purposes of one group of gene for being used for lung cancer molecule parting.
The third technical problem to be solved by the present invention is to provide a kind of kit for lung cancer molecule parting and application thereof.
The four of the technical problem to be solved in the present invention are to provide one group of gene for being used for lung cancer molecule parting and are used in preparationJudge the application in the genetic chip of lung cancer hypotype.
In order to solve the above technical problems, the present invention adopts the following technical scheme that:
In one aspect of the invention, there is provided one group of gene for being used for lung cancer molecule parting, including following 13 genes:COMPGene, FXYD3 genes, GABRP genes, HLA-DQA1 genes, MMP10 genes, NMU genes, NTS genes, RPS4Y1 genes,SERPINB4 genes, SERPINB5 genes, SPRR2A genes, SPRR3 genes, UPK1B genes.
As currently preferred technical solution, which is used for the gene of tumor cells parting, further includes following 17 basesBecause of (i.e. including 30 genes):CLCA2 genes, DSG3 genes, FGB genes, HSD17B2 genes, KRT14 genes, KRT17 basesCause, KRT5 genes, KRT6A genes, KRT6B genes, MIR205HG genes, MMP12 genes, S100A2 genes, SERPINB3 basesCause, SPINK1 genes, SPRR1B genes, TFPI2 genes, TRIM29 genes.
The present invention establishes gene marker by the use in conjunction of genetic test, marker combination and data mining algorithmBuilt-up pattern, the main hypotype of lung cancer, i.e. adenocarcinoma of lung and lung squamous cancer is distinguished using polygenes prediction model, mainly including followingStep:
(1) the clinical diagnosis data and gene expression profile data of lung cancer are collected, structure includes more than 20,000 a bases known to the mankindBecause of the lung cancer gene expression profile data storehouse of, 2000 samples;
(2) statistical analysis is carried out to gene expression pattern, filters out 30 genes closely related with lung cancer hypotype, respectivelyFor:CLCA2 genes, COMP genes, DSG3 genes, FGB genes, FXYD3 genes, GABRP genes, HLA-DQA1 genes,HSD17B2 genes, KRT14 genes, KRT17 genes, KRT5 genes, KRT6A genes, KRT6B genes, MIR205HG genes,MMP10 genes, MMP12 genes, NMU genes, NTS genes, RPS4Y1 genes, S100A2 genes, SERPINB3 genes,SERPINB4 genes, SERPINB5 genes, SPINK1 genes, SPRR1B genes, SPRR2A genes, SPRR3 genes, TFPI2 basesCause, TRIM29 genes, UPK1B genes.
(3) above-mentioned 30 gene expression patterns are calculated, lung cancer hypotype is evaluated by Statistic analysis models, calculate lifeThing imitates the similarity score (Similarity Score) of product and lung cancer hypotype.Judge rule according to similarity score is highestThen, hypotype is judged.
The present invention provides a kind of detection method for being used to judge lung cancer hypotype, comprise the following steps:
(1) biological sample for being derived from patients with lung cancer is contacted with biomarker, the biomarker includes above-mentioned30 genes;The biological sample is taken from the tumor tissue in vitro of the object, can be fresh sample or formalParaffin embedding (FFPE) sample that woods is fixed;
On this basis, lung cancer hypotype judgement is further carried out:
(2) expression pattern and expression of 30 genes in the biological sample, the expression based on 30 genes are detectedLevel judges the hypotype classification of the biological sample.Using data analysing method, the biological sample and lung cancer hypotype are calculatedSimilarity score (Similarity Score).According to the highest decision rule of similarity score, hypotype is judged.The detectionIncluding being used for PCR (PCR) from the sample preparation RNA, the RNA, the PCR is reverse transcription PCR (RT-PCR), optional real-time RT-PCR either genetic chip or high throughput sequencing technologies.
In another aspect of this invention, there is provided one group of gene for being used to judge lung cancer hypotype is being prepared for judging lung cancer AsiaApplication in the kit of type.
In another aspect of this invention, there is provided a kind of kit for being used to judge lung cancer hypotype, which includes as followsBiomarker, the biomarker is selected from above-mentioned one group any one being used in the gene of lung cancer molecule parting or moreKind.
As currently preferred technical solution, the biomarker is nucleic acid, oligonucleotide chain or PCR primer group.
As currently preferred technical solution, the PCR primer group includes:
COMP genes:Forward primer is as shown in SEQ ID NO.1, and reverse primer is as shown in SEQ ID NO.2;
FXYD3 genes:Forward primer is as shown in SEQ ID NO.3, and reverse primer is as shown in SEQ ID NO.4;
GABRP genes:Forward primer is as shown in SEQ ID NO.5, and reverse primer is as shown in SEQ ID NO.6;
HLA-DQA1 genes:Forward primer is as shown in SEQ ID NO.7, and reverse primer is as shown in SEQ ID NO.8;
MMP10 genes:Forward primer is as shown in SEQ ID NO.9, and reverse primer is as shown in SEQ ID NO.10;
NMU genes:Forward primer is as shown in SEQ ID NO.11, and reverse primer is as shown in SEQ ID NO.12;
NTS genes:Forward primer is as shown in SEQ ID NO.13, and reverse primer is as shown in SEQ ID NO.14;
RPS4Y1 genes:Forward primer is as shown in SEQ ID NO.15, and reverse primer is as shown in SEQ ID NO.16;
SERPINB4 genes:Forward primer is as shown in SEQ ID NO.17, and reverse primer is as shown in SEQ ID NO.18;
SERPINB5 genes:Forward primer is as shown in SEQ ID NO.19, and reverse primer is as shown in SEQ ID NO.20;
SPRR2A genes:Forward primer is as shown in SEQ ID NO.21, and reverse primer is as shown in SEQ ID NO.22;
SPRR3 genes:Forward primer is as shown in SEQ ID NO.23, and reverse primer is as shown in SEQ ID NO.24;
UPK1B genes:Forward primer is as shown in SEQ ID NO.25, and reverse primer is as shown in SEQ ID NO.26.
As currently preferred technical solution, the PCR primer group further includes:
CLCA2 genes:Forward primer is as shown in SEQ ID NO.27, and reverse primer is as shown in SEQ ID NO.28;
DSG3 genes:Forward primer is as shown in SEQ ID NO.29, and reverse primer is as shown in SEQ ID NO.30;
FGB genes:Forward primer is as shown in SEQ ID NO.31, and reverse primer is as shown in SEQ ID NO.32;
HSD17B2 genes:Forward primer is as shown in SEQ ID NO.33, and reverse primer is as shown in SEQ ID NO.34;
KRT14 genes:Forward primer is as shown in SEQ ID NO.35, and reverse primer is as shown in SEQ ID NO.36;
KRT17 genes:Forward primer is as shown in SEQ ID NO.37, and reverse primer is as shown in SEQ ID NO.38;
KRT5 genes:Forward primer is as shown in SEQ ID NO.39, and reverse primer is as shown in SEQ ID NO.40;
KRT6A genes:Forward primer is as shown in SEQ ID NO.41, and reverse primer is as shown in SEQ ID NO.42;
KRT6B genes:Forward primer is as shown in SEQ ID NO.43, and reverse primer is as shown in SEQ ID NO.44;
MIR205HG genes:Forward primer is as shown in SEQ ID NO.45, and reverse primer is as shown in SEQ ID NO.46;
MMP12 genes:Forward primer is as shown in SEQ ID NO.47, and reverse primer is as shown in SEQ ID NO.48;
S100A2 genes:Forward primer is as shown in SEQ ID NO.49, and reverse primer is as shown in SEQ ID NO.50;
SERPINB3 genes:Forward primer is as shown in SEQ ID NO.51, and reverse primer is as shown in SEQ ID NO.52;
SPINK1 genes:Forward primer is as shown in SEQ ID NO.53, and reverse primer is as shown in SEQ ID NO.54;
SPRR1B genes:Forward primer is as shown in SEQ ID NO.55, and reverse primer is as shown in SEQ ID NO.56;
TFPI2 genes:Forward primer is as shown in SEQ ID NO.57, and reverse primer is as shown in SEQ ID NO.58;
TRIM29 genes:Forward primer is as shown in SEQ ID NO.59, and reverse primer is as shown in SEQ ID NO.60.
The application method of mentioned reagent box comprises the following steps:
(1) biological sample comprising tumor tissues is contacted with biomarker;
(2) expression of the marker in the biological sample is measured;
(3) expression pattern of gene in biological sample is detected, and itself and lung cancer gene expression profile data storehouse are comparedIt is right.
The expression of kit detection can by real-time quantitative reverse transcriptase polymerase chain reaction (RT-PCR), orGenetic chip, or high throughput sequencing technologies.
The expression of the kit detection is mRNA expressions.
Example only as the above-mentioned subsidy of the present invention, for paraffin embedding tumor tissues, is gathered using real-time quantitative reverse transcriptionPolymerase chain reacts (RT-PCR), and the method for distinguishing lung cancer hypotype, comprises the steps of:
(1) paraffin-embedded tissue of tumor tissues is obtained;
(2) expression of 30 genes in the sample is detected with real-time quantitative reverse transcriptase polymerase chain reaction;
(3) expression pattern of 30 genes in the sample is detected, and itself and lung cancer gene expression profile data storehouse are comparedIt is right, distinguish lung cancer hypotype.
In another aspect of this invention, there is provided the kit is preparing the purposes in judging the preparation of lung cancer hypotype.
In another aspect of this invention, there is provided one group of gene for being used for lung cancer molecule parting is being prepared for judging lung cancer AsiaApplication in the genetic chip of type, the genetic chip include solid phase carrier and probe, the probe and 13 gene sequences to be measuredRow and/or its complementary series are hybridized, and 13 genes to be measured are:COMP genes, FXYD3 genes, GABRP genes, HLA-DQA1 genes, MMP10 genes, NMU genes, NTS genes, RPS4Y1 genes, SERPINB4 genes, SERPINB5 genes,SPRR2A genes, SPRR3 genes, UPK1B genes;The probe is sequence shown in SEQ ID No.61~SEQ ID No.73 respectivelyRow.
In another aspect of this invention, there is provided one group of gene for being used for lung cancer molecule parting is being prepared for judging lung cancer AsiaApplication in the genetic chip of type, the genetic chip include solid phase carrier and probe, the probe and above-mentioned 30 bases to be measuredBecause sequence and/or its complementary series are hybridized, the probe is sequence shown in SEQ ID No.61~SEQ ID No.90 respectivelyRow.
The present invention is used to distinguish adenocarcinoma of lung by detecting 30 and the relevant gene of lung cancer hypotype, structure Statistic analysis modelsAnd lung squamous cancer, so as to help doctor to carry out medication guide, realize precisely medical treatment, to improve the survival rate of patients with lung cancer, improve and suffer fromPerson's survival state.Experiment proves that kit of the present invention can accurately distinguish adenocarcinoma of lung and lung squamous cancer, its is applied widely, accurateRate is high, helps to realize personalized medicine, and precisely treatment is carried out to patient has important clinical meaning.
Brief description of the drawings
Fig. 1 is the differentiation result schematic diagram of 30 gene pulmonary cancer diagnosis in the embodiment of the present invention 5.
Embodiment
Following embodiments are merely to illustrate the present invention, rather than limit the scope of the invention.Tool is not specified in embodimentThe experimental method of concrete conditions in the establishment of a specific crime, according to the condition proposed by manufacture kit production company or according to conventional laboratory conditions, such asSambrook et al., molecular cloning:Laboratory manual (New York:Cold Spring Harbor LaboratoryPress, 1989) condition described in.Unless otherwise defined, all professional and scientific terms used in text and this area are ripeIt is identical to practice meaning known to personnel.In addition, any method similar or impartial to described content and material all can be applied toIn the present invention.The preferred methods and materials described herein are for illustrative purposes only.
Embodiment 1.
Training set sample collection and processing:
The present invention analyzes the patients with lung cancer clinical data and its biological sample data of large sample size, including 391Example lung gland, 237 lung squamous cancers, amount to the relevant clinical data and gene expression data of 628 patients, build lung cancer gene expressionModal data storehouse.
The screening of 13 specific genes:
According to the measured value of gene expression abundance, inventor is examined from being sieved more than 20,000 in a gene using statistical analysis technique TSelect 13 genes closely related with tumour original site.These genes having differences property in lung cancer hypotype is expressed, and is hadStatistical significance, is shown in Table 1.
Table 1:13 gene sets
| Gene | Annotation | P values |
| COMP | Cartilage Oligomeric Matrix Protein | <0.001 |
| FXYD3 | FXYD Domain Containing Ion Transport Regulator 3 | <0.001 |
| GABRP | Gamma-Aminobutyric Acid(GABA)A Receptor,Pi | <0.001 |
| HLA-DQA1 | Major Histocompatibility Complex,Class II,DQ Alpha 1 | <0.001 |
| MMP10 | Matrix Metallopeptidase 10 | <0.001 |
| NMU | Neuromedin U | <0.001 |
| NTS | Neurotensin | <0.001 |
| RPS4Y1 | Ribosomal Protein S4,Y-Linked 1 | <0.001 |
| SERPINB4 | Serpin Peptidase Inhibitor,Clade B(Ovalbumin),Member 4 | <0.001 |
| SERPINB5 | Serpin Peptidase Inhibitor,Clade B(Ovalbumin),Member 5 | <0.001 |
| SPRR2A | Small Proline-Rich Protein 2A | <0.001 |
| SPRR3 | Small Proline-Rich Protein 3 | <0.001 |
| UPK1B | Uroplakin 1B | <0.001 |
The structure of 13 gene Statistic analysis models:
Expression pattern based on 13 specific genes in 628 lung cancer samples, inventor use support vector machines(Support Vector Machines) algorithm, establishes Statistic analysis models and is used to differentiate lung cancer hypotype.It is to be measured for every an exampleSample, model calculate gland cancer and the similarity score of squamous carcinoma samples in the gene expression pattern and database of the sample, and according toSimilarity score maximum principle differentiates the hypotype classification of the sample.Since invention in 1992, algorithm of support vector machine is wideIt is applied to solve all kinds of pattern recognition problems, including finance data analysis, speech recognition and biological data analysis generally.This areaTechnical staff can by free analysis software of increasing income, such as:R, Rapidminer and WEKA uses algorithm of support vector machine.Algorithm of support vector machine is not limited solely to, other known data digging methods can all use, such as Nearest Neighbor with Weighted VotingThe closest value (K-nearest Neighbors) of (Weighted Voting), K-, random forest (Random Forest), phaseClose property coefficient (Correlation Coefficients) etc..
Embodiment 2.
Verification collection test:
In the present embodiment, inventor analyzes the high-flux sequence data for including 1130 lung cancer, wherein there is adenocarcinoma of lung576, lung squamous cancer 554.The differentiation of hypotype is carried out to each sample by 13 gene Statistic analysis models, and is examined with clinical pathologyDisconnected result compares, accuracy rate 82.7%.Using adenocarcinoma of lung as reference, the susceptibility of prediction is 93.0%, and specificity is72.0%, it is shown in Table 2.
Table 2:The differentiation result that 13 genetic models are concentrated in 1130 verifications
Embodiment 3.
The screening of 30 specific genes:
The T screening criterias examined are relaxed to P values less than 0.01 by inventor, further obtain 17 extra genes.ThisA little genes having differences property in adenocarcinoma of lung and lung squamous cancer is expressed, and difference has statistical significance.By 17 genes and embodiment 1In 13 genes merge, form 30 gene sets, be shown in Table 3.
Table 3:17 gene sets
| Gene | Annotation | P values |
| KRT5 | Keratin 5,Type II | <0.01 |
| S100A2 | S100A2 | <0.01 |
| KRT6A | Keratin 6A,Type II | <0.01 |
| MIR205HG | MIR205Host Gene | <0.01 |
| KRT17 | Keratin 17,Type I | <0.01 |
| CLCA2 | Chloride Channel Accessory 2 | <0.01 |
| SERPINB3 | Serpin Peptidase Inhibitor,Clade B(Ovalbumin),Member 3 | <0.01 |
| MMP12 | Matrix Metallopeptidase 12 | <0.01 |
| DSG3 | Desmoglein 3 | <0.01 |
| KRT6B | Keratin 6B,Type II | <0.01 |
| KRT14 | Keratin 14,Type I | <0.01 |
| SPINK1 | Serine Peptidase Inhibitor,Kazal Type 1 | <0.01 |
| HSD17B2 | Hydroxysteroid(17-Beta)Dehydrogenase 2 | <0.01 |
| SPRR1B | Small Proline-Rich Protein 1B | <0.01 |
| FGB | Fibrinogen Beta Chain | <0.01 |
| TFPI2 | Tissue Factor Pathway Inhibitor 2 | <0.01 |
| TRIM29 | Tripartite Motif Containing 29 | <0.01 |
The structure of 30 gene Statistic analysis models:
Expression pattern based on 30 specific genes in 628 lung cancer samples, inventor use algorithm of support vector machine,Statistic analysis models are established to be used to differentiate lung cancer hypotype.For every an example sample to be tested, model calculates the gene expression of the samplePattern and gland cancer and the similarity score of squamous carcinoma samples in database, and the sample is differentiated according to similarity score maximum principleHypotype classification.
Embodiment 4.
Verification collection test:
In the present embodiment, inventor analyzes the high-flux sequence data for including 1130 lung cancer, wherein gland cancer 576,Squamous carcinoma 554.By 30 gene Statistic analysis models to each sample carry out hypotype differentiation, and with Clinicopathologic Diagnosis resultCompare, accuracy rate 91.5%.Using adenocarcinoma of lung as reference, the susceptibility of prediction is 95.1%, and specificity 87.7%, is shown in Table4。
Table 4:The differentiation result that 30 genetic models are concentrated in 1130 verifications
Embodiment 5.
In the present embodiment, the paraffin-embedded sample of the cancerous lung tissue after surgery excision is collected, the biological sample is through pathologyLung cancer is confirmed as in diagnosis, and is known as adenocarcinoma of lung.Experimenter by hand scraping blade mode from formalin fix paraffin embeddingTissue block on lung carcinoma cell enrichment region collect and extract total serum IgE;Handled with DNase to ensure to remove genomic DNA completelyPollution;CDNA is obtained after reverse transcription, carries out the real-time quantitative PCR of 30 genes.Real-time quantitative PCR is reacted in ABI companiesTaqman is applied on 7500 type instrumentsTMTechnology is completed.PCR after reaction, according to the expression pattern of 30 genes of sample, unitesMeter analysis model calculates the similarity score of the sample and lung cancer hypotype.
The PCR primer group difference of 30 genes of design is as follows:
COMP genes:Forward primer is as shown in SEQ ID NO.1, and reverse primer is as shown in SEQ ID NO.2;
FXYD3 genes:Forward primer is as shown in SEQ ID NO.3, and reverse primer is as shown in SEQ ID NO.4;
GABRP genes:Forward primer is as shown in SEQ ID NO.5, and reverse primer is as shown in SEQ ID NO.6;
HLA-DQA1 genes:Forward primer is as shown in SEQ ID NO.7, and reverse primer is as shown in SEQ ID NO.8;
MMP10 genes:Forward primer is as shown in SEQ ID NO.9, and reverse primer is as shown in SEQ ID NO.10;
NMU genes:Forward primer is as shown in SEQ ID NO.11, and reverse primer is as shown in SEQ ID NO.12;
NTS genes:Forward primer is as shown in SEQ ID NO.13, and reverse primer is as shown in SEQ ID NO.14;
RPS4Y1 genes:Forward primer is as shown in SEQ ID NO.15, and reverse primer is as shown in SEQ ID NO.16;
SERPINB4 genes:Forward primer is as shown in SEQ ID NO.17, and reverse primer is as shown in SEQ ID NO.18;
SERPINB5 genes:Forward primer is as shown in SEQ ID NO.19, and reverse primer is as shown in SEQ ID NO.20;
SPRR2A genes:Forward primer is as shown in SEQ ID NO.21, and reverse primer is as shown in SEQ ID NO.22;
SPRR3 genes:Forward primer is as shown in SEQ ID NO.23, and reverse primer is as shown in SEQ ID NO.24;
UPK1B genes:Forward primer is as shown in SEQ ID NO.25, and reverse primer is as shown in SEQ ID NO.26.
CLCA2 genes:Forward primer is as shown in SEQ ID NO.27, and reverse primer is as shown in SEQ ID NO.28;
DSG3 genes:Forward primer is as shown in SEQ ID NO.29, and reverse primer is as shown in SEQ ID NO.30;
FGB genes:Forward primer is as shown in SEQ ID NO.31, and reverse primer is as shown in SEQ ID NO.32;
HSD17B2 genes:Forward primer is as shown in SEQ ID NO.33, and reverse primer is as shown in SEQ ID NO.34;
KRT14 genes:Forward primer is as shown in SEQ ID NO.35, and reverse primer is as shown in SEQ ID NO.36;
KRT17 genes:Forward primer is as shown in SEQ ID NO.37, and reverse primer is as shown in SEQ ID NO.38;
KRT5 genes:Forward primer is as shown in SEQ ID NO.39, and reverse primer is as shown in SEQ ID NO.40;
KRT6A genes:Forward primer is as shown in SEQ ID NO.41, and reverse primer is as shown in SEQ ID NO.42;
KRT6B genes:Forward primer is as shown in SEQ ID NO.43, and reverse primer is as shown in SEQ ID NO.44;
MIR205HG genes:Forward primer is as shown in SEQ ID NO.45, and reverse primer is as shown in SEQ ID NO.46;
MMP12 genes:Forward primer is as shown in SEQ ID NO.47, and reverse primer is as shown in SEQ ID NO.48;
S100A2 genes:Forward primer is as shown in SEQ ID NO.49, and reverse primer is as shown in SEQ ID NO.50;
SERPINB3 genes:Forward primer is as shown in SEQ ID NO.51, and reverse primer is as shown in SEQ ID NO.52;
SPINK1 genes:Forward primer is as shown in SEQ ID NO.53, and reverse primer is as shown in SEQ ID NO.54;
SPRR1B genes:Forward primer is as shown in SEQ ID NO.55, and reverse primer is as shown in SEQ ID NO.56;
TFPI2 genes:Forward primer is as shown in SEQ ID NO.57, and reverse primer is as shown in SEQ ID NO.58;
TRIM29 genes:Forward primer is as shown in SEQ ID NO.59, and reverse primer is as shown in SEQ ID NO.60.
The results are shown in Figure 1, and the similarity score of gland cancer and squamous carcinoma is respectively 91.7% and 8.3%.Due to the phase of gland cancerIt is higher than squamous carcinoma like degree fraction, therefore the sample is judged as adenocarcinoma of lung, is consistent with Clinicopathologic Diagnosis result.
Embodiment 6.
In the present embodiment, inventor carries out gene chip experiment, wherein gland cancer 125, squamous carcinoma 80 to 205 lung cancer samplesExample.Genetic chip is by the DNA fragmentation of a large amount of particular sequences or few core former times acid fragment by microelectric technique and micro-processing technologyBe fixed on by matrix high density on the carriers such as glass, silicon chip, after sample to be tested is marked with fluorescence molecule, with big DNA on chip orFew core former times acid fragment hybridization, by a kind of detection method that lots of genes information is obtained after fluorescent scanning and computer analysis.ItsOutstanding feature, can in that can carry out quick, accurate, high-throughout detection and analysis to the nucleic acid sequence information in micro-exampleTo obtain the gene expression abundance of thousands of a genes at the same time collection of illustrative plates is associated with expression pattern.
Experimenter by hand scraping blade mode from formalin fix paraffin embedding tissue block on lung carcinoma cell enrichmentCollect and extract total serum IgE in area;Handled with DNase to ensure to remove the pollution of genomic DNA completely;Obtained after reverse transcriptioncDNA;CDNA is mixed with genetic chip, hybridization reaction;The gene expression abundance of gene is detected by scanning fluorescence signal.Gene corePiece after reaction, extracts the expression pattern of 30 genes of sample, and Statistic analysis models calculate the sample and tumors subtypesSimilarity score.
By 30 gene Statistic analysis models to each sample carry out hypotype differentiation, and with Clinicopathologic Diagnosis result phaseCompare, accuracy rate 95.6%.Using adenocarcinoma of lung as reference, the susceptibility of prediction is 94.4%, and specificity 97.5%, is shown in Table 5.
Table 5:The differentiation result that 30 genetic models are concentrated in 205 verifications
The probe sequence difference of 30 genes is as shown in table 6 below in genetic chip:
Table 6
| Sequence number | Gene | Probe |
| 1 | COMP | SEQ ID NO.61 |
| 2 | FXYD3 | SEQ ID NO.62 |
| 3 | GABRP | SEQ ID NO.63 |
| 4 | HLA-DQA1 | SEQ ID NO.64 |
| 5 | MMP10 | SEQ ID NO.65 |
| 6 | NMU | SEQ ID NO.66 |
| 7 | NTS | SEQ ID NO.67 |
| 8 | RPS4Y1 | SEQ ID NO.68 |
| 9 | SERPINB4 | SEQ ID NO.69 |
| 10 | SERPINB5 | SEQ ID NO.70 |
| 11 | SPRR2A | SEQ ID NO.71 |
| 12 | SPRR3 | SEQ ID NO.72 |
| 13 | UPK1B | SEQ ID NO.73 |
| 14 | CLCA2 | SEQ ID NO.74 |
| 15 | DSG3 | SEQ ID NO.75 |
| 16 | FGB | SEQ ID NO.76 |
| 17 | HSD17B2 | SEQ ID NO.77 |
| 18 | KRT14 | SEQ ID NO.78 |
| 19 | KRT17 | SEQ ID NO.79 |
| 20 | KRT5 | SEQ ID NO.80 |
| 21 | KRT6A | SEQ ID NO.81 |
| 22 | KRT6B | SEQ ID NO.82 |
| 23 | MIR205HG | SEQ ID NO.83 |
| 24 | MMP12 | SEQ ID NO.84 |
| 25 | S100A2 | SEQ ID NO.85 |
| 26 | SERPINB3 | SEQ ID NO.86 |
| 27 | SPINK1 | SEQ ID NO.87 |
| 28 | SPRR1B | SEQ ID NO.88 |
| 29 | TFPI2 | SEQ ID NO.89 |
| 30 | TRIM29 | SEQ ID NO.90 |
Embodiment described above only expresses embodiments of the present invention, its description is more specific and detailed, but can notTherefore it is interpreted as the limitation to the scope of the claims of the present invention.It should be pointed out that for those of ordinary skill in the art,Without departing from the inventive concept of the premise, various modifications and improvements can be made, these belong to the protection model of the present inventionEnclose.Therefore, the protection domain of patent of the present invention should be determined by the appended claims.