Movatterモバイル変換


[0]ホーム

URL:


CN103114150B - The method that storehouse order-checking is identified is built with the mononucleotide polymorphism site of Bayesian statistic based on enzyme action - Google Patents

The method that storehouse order-checking is identified is built with the mononucleotide polymorphism site of Bayesian statistic based on enzyme action
Download PDF

Info

Publication number
CN103114150B
CN103114150BCN201310077509.1ACN201310077509ACN103114150BCN 103114150 BCN103114150 BCN 103114150BCN 201310077509 ACN201310077509 ACN 201310077509ACN 103114150 BCN103114150 BCN 103114150B
Authority
CN
China
Prior art keywords
sequence
checking
heap
sequencing
snp
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310077509.1A
Other languages
Chinese (zh)
Other versions
CN103114150A (en
Inventor
张祥林
张一桐
陶晔
郑泽群
胡秋萍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SHANGHAI MAJORBIO PHARM TECHNOLOGY Co Ltd
Original Assignee
SHANGHAI MAJORBIO PHARM TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHANGHAI MAJORBIO PHARM TECHNOLOGY Co LtdfiledCriticalSHANGHAI MAJORBIO PHARM TECHNOLOGY Co Ltd
Priority to CN201310077509.1ApriorityCriticalpatent/CN103114150B/en
Publication of CN103114150ApublicationCriticalpatent/CN103114150A/en
Application grantedgrantedCritical
Publication of CN103114150BpublicationCriticalpatent/CN103114150B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Landscapes

Abstract

The invention discloses and build, based on enzyme action, the new algorithm that storehouse order-checking is identified with the mononucleotide polymorphism site of Bayesian statistic.It is used for processing RAD sequencing data, finds the candidate SNP in RAD order-checking fragment, adopts the bioinformatic analysis method of Bayesian statistic to identify SNP reliability.It is biological with non-mode that the method may be used for pattern on the one hand, breaks away from a large amount of species and lacks the restriction of reference sequences, decreases order-checking cost;Solve on the other hand and carry out when SNP identifies the bottleneck without reliable statistics method currently with RAD data, make the SNP site accuracy significant increase of acquisition.

Description

The method that storehouse order-checking is identified is built with the mononucleotide polymorphism site of Bayesian statistic based on enzyme action
Technical field
The present invention relates to and a kind of build the method that storehouse order-checking is identified with the mononucleotide polymorphism site of Bayesian statistic based on enzyme action.It is specially and builds storehouse list end (single-end) order-checking based on enzyme action or single nucleotide polymorphism (SNP) site that double; two end (pair-end) order-checking obtains carries out a kind of special Bayesian statistic inspection, thus the genotypic method of accurate identification SNP;When lacking with reference to genome sequence, reliable statistics meaning can be provided for SNP inspection.The method belongs to bioinformatics technique field.This has great importance for the research of non-mode biology and the study on accuracy of genotype identification lacking reference sequences.
Background technology
SNP (SingleNucleotidePolymorphisms) single nucleotide polymorphism, refers to the variation of single core thuja acid on genome, and its quantity is a lot, and in human genome, probably every 1000 bases just have a SNP, rich polymorphism.SNP is by the ideal mark of all kinds of molecular biology research, as built genetic map, and gene type, molecular mark, disease forecasting, the field such as medication guide.
Nowadays, second filial generation DNA sequencing technology is the sequencing technologies of a kind of high flux low cost, and ultimate principle is the order-checking of synthesis limit, limit.For solexa sequence measurement, first with physical method, DNA is interrupted at random, then at fragment two ends plus given joint, joint has amplimer sequence.During order-checking, archaeal dna polymerase synthesizes the complementary strand of fragment to be measured, reads base sequence by the fluorescence signal detected entrained by newly synthesized base, thus obtaining the sequence of fragment to be measured.
Second filial generation sequencing technologies has been widely used for many fields of bioscience, particularly the polymorphism between one species Different Individual of research.It is checked order by individuality that tradition finds the method for SNP marker, obtains short reads, then passes through short sequence alignment program and these short reads comparisons are returned reference sequences, thus obtaining the SNP information that order-checking is individual.Common flow process has (general procedure is as shown in Figure 1): use BWA software that reads comparison is returned reference sequences, uses SAMtools software processes comparison result to find SNP site [1,2];Use SOAP software that reads comparison is returned reference sequences, use SOAPsnp software processes comparison result to find SNP site [3,4].For there being the species of reference sequences can carry out the lookup of SNP marker very easily, but for those non-mode biologies, substantially it is absent from reference sequences.And when without reference to sequence, tradition is found the method for SNP marker and be there is technical bottleneck.
RAD sequencing technologies have employed new storehouse mode of building (enzyme action builds storehouse), its order-checking detailed process is as shown in Figure 2, the specific site of DNA is cut off with restricted enzyme, with physical method, the DNA molecular after enzyme action is interrupted at random again, the DNA molecular of length-specific is selected by agarose gel DNA isolation technics, then specific amplification joint and sequence measuring joints are added at select DNA end, thus building upper machine library to carry out high-flux sequence.
Wherein RAD sequence measurement is method well known in the art, for instance be referred to document [5,6,7,8].SNP site obtains successfully in a lot of fields to utilize RAD sequencing technologies to identify, but till the present invention occurs, method used is typically all and utilizes empirical value to carry out screening and filter.Such as, by two kinds of base depth scales in the 0.25-0.75(low depth base degree of depth in document [6]: the high depth base degree of depth) between site judge into heterozygous sites, ratio judgement below 0.1 becomes site of isozygotying.This method is not statistically significant, and the impact being simultaneously subject to other extraneous factors is relatively larger, and such as order-checking total amount, the SNP genotype accuracy that qualification obtains cannot ensure.Document [9] improves authentication method on the basis of empirical value method, uses method of maximum likelihood to carry out the correction of loci gene type, but its greatest problem is in that to determine the error rate in statistical method.
Document:
1.Li,H.andR.Durbin,FastandaccurateshortreadalignmentwithBurrows-Wheelertransform.Bioinformatics,2009.25(14):p.1754-60.
2.Li,H.,etal.,TheSequenceAlignment/MapformatandSAMtools.Bioinformatics,2009.25(16):p.2078-9.
3.Li,R.,etal.,SNPdetectionformassivelyparallelwhole-genomeresequencing.GenomeRes,2009.19(6):p.1124-32.
4.Li,R.,etal.,SOAP:shortoligonucleotidealignmentprogram.Bioinformatics,2008.24(5):p.713-4.
5.Houston,R.D.,etal.,CharacterisationofQTL-linkedandgenome-widerestrictionsite-associatedDNA(RAD)markersinfarmedAtlanticsalmon.BMCGenomics,2012.13(1):p.244.
6.Scaglione,D.,etal.,RADtagsequencingasasourceofSNPmarkersinCynaracardunculusL.BMCGenomics,2012.13(1):p.3.
7.Davey,J.W.,etal.,SpecialfeaturesofRADSequencingdata:implicationsforgenotyping.MolEcol,2012.
8.Dasmahapatra,K.K.,etal.,Butterflygenomerevealspromiscuousexchangeofmimicryadaptationsamongspecies.Nature,2012.
9.Hohenlohe,P.A.,etal.,PopulationgenomicsofparalleladaptationinthreespinesticklebackusingsequencedRADtags.PLoSGenet.6(2):p.e1000862.
Summary of the invention
It is an object of the invention to provide and a kind of build the method that storehouse order-checking is identified with the mononucleotide polymorphism site of Bayesian statistic based on enzyme action;It is a kind of to build, based on enzyme action, the sequencing data that storehouse order-checking (RAD sequencing technologies) obtains by processing, searching mononucleotide polymorphism site between individual internal or individuality, and gives the technical scheme of statistical test.
The purpose of the present invention is achieved through the following technical solutions:
A kind of building the method that storehouse order-checking is identified with the mononucleotide polymorphism site of Bayesian statistic based on enzyme action, its step is as follows:
1) after the sequencing result obtaining RAD high throughput sequencing technologies, it is filtered removing underproof sequencing sequence to RAD enzyme action end sequencing sequence.
Wherein, RAD high throughput sequencing technologies can be IlluminaGA sequencing technologies, it is also possible to for other high throughput sequencing technologies existing.
Described underproof sequencing sequence is the sequence that sequencing quality exceedes the 50% of whole piece series number lower than the base number of predetermined low quality threshold value, and does not have the sequence of enzyme action feature.
2) sequencing sequence according to order-checking genes of individuals group enzyme action one end, utilizes the full same sex of sequence to generate the information of each individual heap, and calculates each sequence heap order-checking depth information.Such as, the sequencing sequence information of the enzyme action one end after being filtered by each individuality is as the key of Hash, and the value of Hash points to a chained list, for depositing the sequence information of the other end, and calculates order-checking depth information.Available any programming language realizes this process.
3) comparison between two will be carried out by internal all sequences heap one by one, and cluster heap to determine intraindividual candidate's heterozygosis SNP site.
For 3) cluster result, the cluster result of only one of which heap shows to check order in fragment in enzyme action one end to be absent from heterozygous sites, and only the cluster results of two heaps show there is heterozygous sites on fragment is checked order in enzyme action one end.
4) all sequences heap in Different Individual is carried out comparison between two, cluster heap to determine interindividual candidate SNP locus.
For 4) cluster result, only one of which heap cluster result show that two individualities are absent from SNP site, have two cluster results piled to show to exist between individuality SNP site.
Due to follow-up use Bayes statistical method, so each heap or each cluster result are not carried out depth-type filtration herein, it is one of the advantage of the method, retains more SNP site as far as possible.
5) utilize Bayes statistical method that genotypic depth informations various in each candidate SNP locus are analyzed, identify the accuracy of candidate SNP.Owing to lacking the prior probability that reference sequences occurs with various bases on each site, so the actually measured base error rate in each site cannot be obtained.Therefore, this Bayes statistical method uses the exhaustive all error rates being likely to occur of walking method, then selects so that genotype exists the probability the highest situation genotype as this SNP site.Concrete formula is as follows with calculation procedure:
For each candidate SNP locus, can there is base possible in 4, i.e. any in " ATCG " or multiple, the base type of the definition frequency of occurrences the highest (degree of depth is maximum) is G1, and the corresponding degree of depth is N1, all the other bases of definition of successively decreasing successively, i.e. genotype Gi(i=1,2,3,4), degree of depth Ni(i=1,2,3,4).Biologically, general species only there will be two kinds of base types on a SNP site, for instance sequencing data shows that A or T(N occurs in this SNP siteA≥NT), then this site must be isozygoty or two kinds of genotype of heterozygosis.Therefore this bayes method only detects both the above genotype and at the condition lower probability that error rate is ε is:
P(N1,N2,N3,N4|Giiϵ)=N!N1!N2!N3!N4!(1-0.75ϵ)Ni(0.25)N-Ni
P(N1,N2,N3,N4|Gijϵ)N!N1!N2!N3!N4!(0.5-0.25ϵ)(Ni+Nj)(0.25)N-Ni-Nj
Wherein N=N1+N2+N3+N4,
Various genotypic posterior probability are:
Because not having sequence and early-stage Study data, error rate ε cannot determine value, but document [10,11] report IlluminaGA checks order error rate about 1%.Herein, ε is set from 0.01%-5%, step pitch 0.01%.The posterior probability of final utilization is:
P(Nij)Final=max (P (Nij,ε)) i, j ∈ 1,2,3,4}, ε ∈ [0.01%, 5%].
If P is (Nij)FinalIt is not less than 0.95, it was shown that the genotype of this SNP site is ij, is otherwise defined as the data (missing data) that cannot judge.
Technical scheme have employed bioinformatic analysis method, process RAD(restriction-siteassociatedDNA) sequencing data, find the SNP site information in RAD order-checking fragment, bayes method is utilized to identify SNP genotype, lack the bottleneck of reference sequences breaking through non-mode biology, while reducing cost, obtain result accurately.Being firstly introduced Bayes statistical method when identifying SNP site genotype, compared with the method for empirical value before, statistical significance significantly improves, and accuracy also promotes accordingly.
Document:
10.Li,Y.,etal.,Stateoftheartdenovoassemblyofhumangenomesfrommassivelyparallelsequencingdata.HumGenomics,2010.4(4):p.271-7.
11.Xie,W.,etal.,Parent-independentgenotypingforconstructinganultrahigh-densitylinkagemapbasedonpopulationsequencing.ProcNatlAcadSciUSA.107(23):p.10578-83.
Accompanying drawing explanation
Fig. 1 is the Principle of Process figure that tradition finds SNP site method;
Fig. 2 is the order-checking detailed process schematic diagram of RAD sequencing technologies;In figure, (A) digestion with restriction enzyme genomic DNA, and plus P1 joint, each P1 joint contains different sequence labels;(B) with the sample mix of different P1 joints together, interrupt;(C) top connection P2 is added;(D) amplification enrichment RADtags;
Fig. 3 is the example figure of RAD order-checking;
The cluster process figure that Fig. 4 makes a living in heaps, uses EcorI restricted enzyme in legend;
Fig. 5 is enzyme action one end sequence information schematic diagram in heap;
Fig. 6 for internal in heap and individual between SNP site find schematic flow sheet;
Fig. 7 is the example figure of candidate SNP base type and depth information, figure has 20 candidate SNP locus the base type in 15 individualities and depth information respectively, " C | 9 " represents that this site C measures 9 times, " C | 9:T | 3 " represent this site C measure 9 times, T measure 3 times.
Fig. 8 is the example figure of the genotype results of SNP site after Bayesian statistic, in figure, " a " represents homozygous genotype two kinds different respectively with " b ", " h " represents heterozygous genotypes, such as " a " represents AA, and " b " represents CC, and " h " represents AC, in " x1:x2:x3:x4 ", x1 represents the posterior probability isozygotied, x2 represents error rate values when posterior probability of isozygotying is maximum, and x3 represents the posterior probability of heterozygosis, and x4 represents error rate values when heterozygosis posterior probability is maximum.
Fig. 9 utilizes the deletion condition of data after empirical value method and Bayes statistical method.
Figure 10 is the statistic result of empirical value method and Bayes statistical method different loci.
Figure 11 is the result that the site that random choose empirical value method is different from Bayes statistical method carries out utilizing sanger sequence verification.
Detailed description of the invention
The technical characterstic of the present invention is expanded on further below in conjunction with accompanying drawing and specific embodiment.
As in figure 2 it is shown, RAD order-checking is checked order the difference is that needing to utilize restricted enzyme complete degestion genome before adding joint with routine high-throughput, then plus the special joint of RAD, after continue storehouse process to build storehouse with routine identical.Fig. 3 is the example figure of RAD enzyme action end sequencing.Show in figure 3 and use restricted enzyme Ecor1, identify the palindrome of " G^AATTC " on DNA molecular, and between G and A, DNA molecular is cut off, DNA molecular physical method after enzyme action is broken into short sequence fragment, and add top connection at the DNA fragmentation two ends containing enzyme action end sequence, PCR is enriched with and carries out single end (single-end) order-checking, and sequencing reading length is generally 100nt, it is also possible to for 50nt.
Building, based on enzyme action, the method that storehouse order-checking is identified with the mononucleotide polymorphism site of Bayesian statistic, its step is as follows:
1) after the sequencing result obtaining RAD high throughput sequencing technologies, it is filtered removing underproof sequencing sequence to the double; two end sequencing sequence of RAD.
Wherein, RAD high throughput sequencing technologies can be IlluminaGA sequencing technologies, it is also possible to for other high throughput sequencing technologies existing.
Described underproof sequencing sequence is the sequence that sequencing quality exceedes the 50% of whole piece series number lower than the base number of predetermined low quality threshold value, and does not have the sequence of enzyme action feature.
Low quality threshold value is determined by concrete sequencing technologies and order-checking environment, for instance be set as that single base sequencing quality is lower than 20;In sequencing sequence, the uncertain base of sequencing result (N in IlluminaGA sequencing result) number exceedes 10% of whole piece sequencing sequence base number and thinks defective sequence;Except sample joint sequence, test, with other, the exogenous array comparison introduced, such as various terminal sequence.If sequence exists exogenous array, think defective sequence;In the sequencing sequence of enzyme action one end, if initial several bases are not enzyme action end sequences, filter out (such as restricted enzyme Ecor1, sequencing sequence starts if not " AATTC " then filters out whole sequencing sequence).
2) sequencing sequence according to order-checking genes of individuals group enzyme action one end, utilizes the full same sex of sequence to generate the information of each individual heap.Detailed process is as shown in Figure 4.In heap (Stack), the sequence information of enzyme action one end can preserve in the way of Fig. 5, and in Figure 5, what first row represented is the sequence information of enzyme action one end;What secondary series represented is number of times that enzyme action one terminal sequence is sequenced, namely check order depth information;3rd row are the ID of this heap, for uniquely determining heap.
3) individual internal heap compares, if the situation in Fig. 6 does not occur in cluster, it was shown that do not have heterozygosis SNP above individual these sequences internal;If depositing situation in figure 6, it was shown that individual inside exists heterozygosis SNP.
4) when between individuality, heap compares between two, if there is Fig. 6 (a) situation, it was shown that there is the SNP that isozygotys between two individualities;If there is Fig. 6 (b) situation, it was shown that there is heterozygosis SNP between two individualities.
5) base type and the respective depth information of each candidate SNP locus are obtained after piling cluster and compare end, as Fig. 7 represents that above-mentioned information collects.
6) utilize Bayes statistical method that base type and the degree of depth are analyzed, such as Fig. 8, obtain final SNP genotype results.
Embodiment data:
These 141 individualities, including 139 F2 plant and two parents, are carried out RAD-PE order-checking by calabash courd F2 informative population genetic map project.(illustrate: the offspring that male parent and hybridization of female parent generate makes the offspring that F1, F1 selfing generates be F2;Although using RAD-PE order-checking, but analyze and only use enzyme action terminal sequence)
Material source: Zhejiang Academy of Agricultural Science.
Embodiment concrete operations flow process:
Check order the sequencing data obtained by two parent RAD-PE, according to sequencing quality value, the content of N, and whether contain enzyme action end sequence and is filtered, removes underproof sequencing sequence, and the valid data statistics obtained is as shown in table 1.
Table 1: calabash courd RAD order-checking valid data statistics
TitleUse data volume (bp)TitleUse data volume (bp)TitleUse data volume (bp)
Male parent585,377,540F2-463,800,775F2-931,556,104 5 -->
Maternal423,794,746F2-472,522,407F2-941,651,259
F2-13,114,771F2-484,636,152F2-953,213,147
F2-22,302,730F2-493,737,623F2-962,202,354
F2-3537,822F2-50647,499F2-971,956,440
F2-41,650,925F2-513,678,334F2-981,112,431
F2-52,824,708F2-522,153,996F2-991,086,168
F2-6579,177F2-537,029,889F2-1001,705,836
F2-72,805,093F2-542,315,687F2-1012,311,919
F2-82,234,442F2-554,116,520F2-1021,445,671
F2-91,814,510F2-561,554,335F2-1035,292,536
F2-102,063,581F2-571,912,949F2-104371,736
F2-11437,393F2-584,513,981F2-1055,528,190
F2-121,114,627F2-594,660,158F2-1062,286,908
F2-13292,168F2-602,963,600F2-1072,977,378
F2-14L981.379F2-611.912.346F2-1081.113.885
F2-15L710.808F2-622.198.112F2-1092.358.577
F2-16L666.317F2-632.149.228F2-1102.014.988
F2-173.837.185F2-642.907.400F2-1115.021.837
F2-182.705.794F2-652565399F2-112L687.183
F2-19L641.718F2-661.802.757F2-1131.454.774
F2-204.181.837F2-676.136.789F2-114L187.993
F2-212.167.926F2-685.106.060F2-115917.204
F2-228.967F2-695.492.357F2-116673.176
F2-23L936.761F2-704.925.717F2-117903.357
F2-244.907.028F2-712.016.103F2-1181.252.469
F2-252.641.269F2-724.495.767F2-119L066.660
F2-26L344.809F2-73957.643F2-12083.426
F2-272.184.764F2-743.193.347F2-121624.005
F2-282.312.351F2-7530.335F2-1224.246.910
F2-29L318.322F2-762.067.906F2-123824.013
F2-30L830.247F2-77200.856F2-1243.322.863
F2-31358.911F2-786.978.303F2-12589.336
F2-32L450.039F2-795.309.200F2-126367.005
F2-33L767.194F2-803.081.537F2-1271.707.758
F2-34L587.589F2-813.071.09lF2-1282.385.919
F2-35L008.970F2-821.223.914F2-1292.786.068
F2-362.877.974F2-835.586.662F2-130890.661
F2-37L268211F2-841.880.660F2-1311.980.472 6-->
F2-38The skilful .973 of 6.OF2-852.620.672F2-1323.920.370
F2-393.219262F2-865.992.662F2-133500.349
F2-402.241.103F2-875.636.602F2-1342.150.765
F2-413.610.730F2-88544.490F2-135L115.366
F2-42L641.063F2-894.897.907F2-1361.306.934
F2-432.382.923F2-902.355.593F2-137616.728
F2-442.598.428F2-912.506.27lF2-138949.308
F2-45692.675F2-92841.285F2-1391.014.677
Sequencing sequence according to order-checking genes of individuals group enzyme action one end, utilizes the full same sex of sequence to generate the information of two parent's heaps respectively.Obtain 3913 candidate SNP labellings of F2 colony, base type and similar Fig. 7 of corresponding depth profile (data volume is too big, it is impossible to show).It is utilized respectively current conventional empirical value method (degree of depth is not less than 6, heterozygosis criterion 0.25-0.75) with Bayes statistical method, candidate SNP labelling to be identified.Result after identifying using two kinds of methods respectively carries out the statistics of missing data, such as Fig. 9, it was shown that both there are differences, but not notable.But each loci gene type being analyzed, it has been found that have 11l, 005 site is accredited as in bayes method isozygotys, but is accredited as heterozygosis in empirical value;Discovery has 79,401 sites to be accredited as heterozygosis in bayes method, but is accredited as in empirical value and isozygotys, and claiming both the above site is uncertain site.100 uncertain sites of random choose carry out sanger method order-checking, it has been found that what in uncertain site, bayes method was correct occupies 77%, are significantly higher than empirical value method (3 examples selected in Figure 11).
This result shows that utilizing RAD sequencing data to carry out bayes method when SNP identifies be a kind of statistical means reliably.

Claims (3)

CN201310077509.1A2013-03-112013-03-11The method that storehouse order-checking is identified is built with the mononucleotide polymorphism site of Bayesian statistic based on enzyme actionActiveCN103114150B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN201310077509.1ACN103114150B (en)2013-03-112013-03-11The method that storehouse order-checking is identified is built with the mononucleotide polymorphism site of Bayesian statistic based on enzyme action

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201310077509.1ACN103114150B (en)2013-03-112013-03-11The method that storehouse order-checking is identified is built with the mononucleotide polymorphism site of Bayesian statistic based on enzyme action

Publications (2)

Publication NumberPublication Date
CN103114150A CN103114150A (en)2013-05-22
CN103114150Btrue CN103114150B (en)2016-07-06

Family

ID=48412567

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201310077509.1AActiveCN103114150B (en)2013-03-112013-03-11The method that storehouse order-checking is identified is built with the mononucleotide polymorphism site of Bayesian statistic based on enzyme action

Country Status (1)

CountryLink
CN (1)CN103114150B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN105874460B (en)*2013-11-012018-10-02精赛恩公司Identify method, readable medium and the equipment of at least one base of target sequence
CN104156633B (en)*2014-08-122017-03-01上海美吉生物医药科技有限公司The method that SSR collection of illustrative plates is improved based on RAD collection of illustrative plates
CN104946765A (en)*2015-06-252015-09-30华中农业大学Somatic mutation site excavation method based on genomic sequencing
CN105525012B (en)*2016-01-272019-04-16山东省农业科学院生物技术研究中心A kind of method for identifying molecules of peanut hybridization kind
CN108277267B (en)*2016-12-292019-08-13安诺优达基因科技(北京)有限公司It detects the device of gene mutation and carries out the kit of parting for the genotype to pregnant woman and fetus
CN107273715B (en)*2017-05-102020-03-17安吉康尔(深圳)科技有限公司Detection method and device
WO2020023893A1 (en)*2018-07-272020-01-30Seekin, Inc.Reducing noise in sequencing data
CN114078568B (en)*2020-09-142022-07-05青岛欧易生物科技有限公司Metagenome sequencing data processing system and processing method based on IIB type restriction endonuclease characteristics
CN113718342A (en)*2021-05-062021-11-30安徽农业大学Construction method of high-density genetic map of recombinant inbred line population

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
基于表达谱芯片和新一代测序技术的高通量基因分型方法的开发;谢为博;《中国博士学位论文全文数据库 基础科学辑》;20110515(第05期);第87-90页第3.3.3节*
高通量测序技术在动植物研究领域中的应用;岳桂东等;《中国科学:生命科学》;20120229;第42卷(第2期);第110页右栏第二段,第113页右栏第二段、第114页左栏第二-五段以及图2*

Also Published As

Publication numberPublication date
CN103114150A (en)2013-05-22

Similar Documents

PublicationPublication DateTitle
CN103114150B (en)The method that storehouse order-checking is identified is built with the mononucleotide polymorphism site of Bayesian statistic based on enzyme action
CN111088382B (en) A kind of whole genome SNP chip of corn and its application
CN102682224B (en)Method and device for detecting copy number variations
CN104145028B (en)A kind of detect the micro-deleted method in chromosome STS region and device thereof
CN115052994B (en)Method for determining base type of preset site in embryo cell chromosome and application thereof
CN104794371B (en)The method and apparatus for detecting retrotransponsons insertion polymorphism
Hills et al.BAIT: organizing genomes and mapping rearrangements in single cells
CN105046105B (en)The Haplotype map and its construction method of chromosome span
CN110021351A (en)Analyze base linkage strength and methods of genotyping and system
CN115989544A (en)Method and system for visualizing short reads in repetitive regions of a genome
CN113564266B (en)SNP typing genetic marker combination, detection kit and application
CN105950707A (en)Method and system for determining nucleic acid sequence
CN105925664A (en)Method and system for determining nucleic acid sequence
CN104293941A (en)Method for constructing sequencing library and application of sequencing library
CN114171116B (en)Method for assessing fetal DNA concentration by maternal dissociation and intrinsic DNA and application thereof
CN113889189B (en) Method and application of estimating fetal DNA concentration using paternal and maternal DNA
CN119359841A (en) A method for intuitively displaying genetic differences between biological individuals through combined graphics and generating combined graphics
CN102831331B (en)Primer design developing method of length polymorphism sign based on restriction enzyme digestion database-establishing pair-end sequencing
CA3149056A1 (en)Methods for dna library generation to facilitate the detection and reporting of low frequency variants
SellAddressing challenges of ancient DNA sequence data obtained with next generation methods
CN113981070B (en)Method, device, equipment and storage medium for detecting embryo chromosome microdeletion
CN113999900B (en)Method for evaluating fetal DNA concentration by using free DNA of pregnant woman and application
US20140136121A1 (en)Method for assembling sequenced segments
US20130143746A1 (en)Method for detecting gene region features based on inter-alu polymerase chain reaction
EP3409788B1 (en)Method and system for nucleic acid sequencing

Legal Events

DateCodeTitleDescription
C06Publication
PB01Publication
C10Entry into substantive examination
SE01Entry into force of request for substantive examination
CB03Change of inventor or designer information

Inventor after:Zhang Xianglin

Inventor after:Zhang Yitong

Inventor after:Tao Ye

Inventor after:Zheng Zequn

Inventor after:Hu Qiuping

Inventor before:Tao Ye

Inventor before:Qian Gang

Inventor before:Zheng Zequn

Inventor before:Hu Qiuping

CORChange of bibliographic data
C14Grant of patent or utility model
GR01Patent grant

[8]ページ先頭

©2009-2025 Movatter.jp