Movatterモバイル変換


[0]ホーム

URL:


US20210020266A1 - Phase-aware determination of identity-by-descent dna segments - Google Patents

Phase-aware determination of identity-by-descent dna segments
Download PDF

Info

Publication number
US20210020266A1
US20210020266A1US16/947,107US202016947107AUS2021020266A1US 20210020266 A1US20210020266 A1US 20210020266A1US 202016947107 AUS202016947107 AUS 202016947107AUS 2021020266 A1US2021020266 A1US 2021020266A1
Authority
US
United States
Prior art keywords
ibd
haplotype
segments
sites
potential
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US16/947,107
Inventor
William A. Freyman
Kimberly F. McManus
Suyash S. Shringarpure
Ethan M. Jewett
Adam Auton
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
23andMe Inc
Original Assignee
23andMe Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 23andMe IncfiledCritical23andMe Inc
Priority to US16/947,107priorityCriticalpatent/US20210020266A1/en
Assigned to 23ANDME, INC.reassignment23ANDME, INC.ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS).Assignors: FREYMAN, WILLIAM A., JEWETT, ETHAN M., MCMANUS, KIMBERLY F., AUTON, ADAM, SHRINGARPURE, SUYASH S.
Publication of US20210020266A1publicationCriticalpatent/US20210020266A1/en
Priority to US17/249,520prioritypatent/US20210193257A1/en
Priority to US18/503,841prioritypatent/US12046327B1/en
Priority to US18/737,679prioritypatent/US12260936B2/en
Priority to US19/057,224prioritypatent/US20250191684A1/en
Abandonedlegal-statusCriticalCurrent

Links

Images

Classifications

Definitions

Landscapes

Abstract

The disclosed embodiments concern methods, apparatus, systems and computer program products for estimating IBD segments. Some implementations use a templated positional Burrows-Wheeler transform (PBWT) technique and a phase switch error heuristic to correct genotyping errors and phase switch errors to make fast and accurate phase aware IBD estimates. In some implementations a templated PBWT technique and a probabilistic hidden Markov model (HMM) are used to correct genotyping errors and phase switch errors.

Description

Claims (42)

What is claimed is:
1. A computer implemented method of processing haplotypes to reduce genotyping errors when determining identity by descent (IBD) segments between haplotypes, the method comprising:
providing a first digital template comprising a first arrangement of masked and unmasked sites in a window of consecutive haplotype sites;
providing a second digital template comprising a second arrangement of masked and unmasked sites in a window of consecutive haplotype sites, wherein the first and second arrangements are different;
providing two or more haplotypes strings for identification of IBD segments therebetween, each of the two or more haplotype strings representing a sequence of allele values at polymorphic sites in a haplotype of an organism; and
computationally identifying IBD segments between the two or more haplotype strings by (i) identifying first matches among alleles of the haplotype strings at unmasked sites produced by applying the first digital template to the two or more haplotype strings, (ii) identifying second matches among alleles of the haplotype at unmasked sites produced by applying the second digital template to the two or more haplotype strings, and (iii) merging the first and second matches among alleles to produce a merged set of IBD segments, wherein the merged set of IBD segments has reduced impact from genotyping errors compared to a set of IBD segments generated without applying the first and second digital templates.
2. The method ofclaim 1, wherein the first and second templates each have a size of at least four consecutive haplotype sites.
3. The method ofclaim 1, wherein identifying the first matches among alleles at unmasked sites comprises sequentially applying the first digital template to the two or more haplotype strings, each time moving to a next sequential section of the two or more haplotype strings.
4. The method ofclaim 1, wherein computationally identifying IBD segments between the two or more haplotype strings further comprises:
computationally identifying additional matches among alleles at unmasked sites produced by applying one or more additional digital templates to the two or more haplotype strings, wherein the one or more additional digital templates have additional arrangements of masked and unmasked sites in windows of consecutive haplotype sites, and each of the additional arrangements is different from both the first and the second arrangements, and
wherein merging the first and second matches among alleles to produce a merged set of IBD segments further comprises computationally merging the additional matches with the first and second matches to produce the merged set of IBD segments.
5. The method ofclaim 4, wherein computationally identifying additional matches among alleles at unmasked sites employs a third digital template, a fourth digital template, a fifth digital template, and a sixth digital template.
6. The method ofclaim 1, wherein the two or more haplotype strings comprise at least one thousand haplotype strings.
7. The method ofclaim 1, wherein computationally identifying IBD segments between the two or more haplotype strings comprises performing a positional Burrows-Wheeler transform (PBWT) on the unmasked sites produced by applying the first and second templates to the two or more haplotype strings.
8. The method ofclaim 7, wherein computationally merging the first and second matches among alleles is performed while considering individual polymorphic sites of the two or more haplotype strings using the PBWT.
9. The method ofclaim 1, wherein applying the first digital template comprises a deterministic process employing the first arrangement of masked and unmasked sites.
10. A system for processing haplotypes to reduce genotyping errors when determining identity by descent (IBD) segments between haplotypes, the system comprising:
(a) one or more processors and associated memory;
(b) computer readable instructions for:
providing a first digital template comprising a first arrangement of masked and unmasked sites in a window of consecutive haplotype sites;
providing a second digital template comprising a second arrangement of masked and unmasked sites in a window of consecutive haplotype sites, wherein the first and second arrangements are different;
providing two or more haplotype strings for identification of IBD segments therebetween, each of the two or more haplotype strings representing a sequence of allele values at polymorphic sites in a haplotype of an organism; and
identifying IBD segments between the two or more haplotype strings by (i) identifying first matches among alleles of the haplotype strings at unmasked sites produced by applying the first digital template to the two or more haplotype strings, (ii) identifying second matches among alleles of the haplotype at unmasked sites produced by applying the second digital template to the two or more haplotype strings, and (iii) merging the first and second matches among alleles to produce a merged set of IBD segments, wherein the merged set of IBD segments has reduced impact from genotyping errors compared to a set of IBD segments generated without applying the first and second digital templates.
11. The system ofclaim 10, wherein the first and second templates each have a size of at least four consecutive haplotype sites.
12. The system ofclaim 10, wherein the instructions for identifying the first matches among alleles at unmasked sites comprises instructions for sequentially applying the first digital template to the two or more haplotype strings, each time moving to a next sequential section of the two or more haplotype strings.
13. The system ofclaim 10, wherein the instructions for identifying IBD segments between the two or more haplotype strings further comprise instructions for:
computationally identifying additional matches among alleles at unmasked sites produced by applying one or more additional digital templates to the two or more haplotype strings, wherein the one or more additional digital templates have additional arrangements of masked and unmasked sites in windows of consecutive haplotype sites, and each of the additional arrangements is different from both the first and the second arrangements, and
wherein merging the first and second matches among alleles to produce a merged set of IBD segments further comprises computationally merging the additional matches with the first and second matches to produce the merged set of IBD segments.
14. The system ofclaim 13, wherein the instructions for identifying additional matches among alleles at unmasked sites employ a third digital template, a fourth digital template, a fifth digital template, and a sixth digital template.
15. The system ofclaim 14, wherein the first through sixth digital templates each comprise two masked sites and two unmasked sites.
16. The system ofclaim 10, wherein the first digital template and the second digital template each have a ratio of masked sites to unmasked sites of between about 2:1 to about 1:2.
17. The system ofclaim 10, wherein the two or more haplotype strings comprise at least one thousand haplotype strings.
18. The system ofclaim 10, wherein the two or more haplotype strings comprise at least one million haplotype strings.
19. The system ofclaim 10, wherein the instructions for identifying IBD segments between the two or more haplotype strings comprise instructions performing a positional Burrows-Wheeler transform (PBWT) on the unmasked sites produced by applying the first and second templates to the two or more haplotype strings.
20. The system ofclaim 19, wherein the instructions for merging the first and second matches among alleles comprise instructions for performing the merging while considering individual polymorphic sites of the two or more haplotype strings using the PBWT.
21. The system ofclaim 10, wherein the total number of digital templates is between 2 and k, where k is the number of haplotype sites in the window.
22. The system ofclaim 10, wherein the total number of digital templates is k!/(m!*(k−m)!), where k is the number of haplotype sites in the window and m is the number of masked sites in the window.
23. The system ofclaim 10, wherein applying the first digital template comprises a deterministic process employing the first arrangement of masked and unmasked sites.
24. A method of identifying IBD segments between two or more haplotype strings, each of the two or more haplotype strings representing a sequence of allele values at polymorphic sites in a haplotype of an organism, the method comprising:
(a) computationally identifying IBD segments between the two or more haplotype strings by (i) identifying first matches among alleles of two or more haplotype strings at unmasked sites produced by applying a first digital template to the two or more haplotype strings, (ii) identifying second matches among alleles of the haplotype at unmasked sites produced by applying a second digital template to the two or more haplotype strings, and (iii) merging the first and second matches among alleles to produce a merged set of IBD segments, wherein the first digital template comprises a first arrangement of masked and unmasked sites in a window of consecutive haplotype sites, wherein the second digital template comprises a second arrangement of masked and unmasked sites in the window of consecutive haplotype sites, and wherein the first and second arrangements are different; and
(b) identifying a potential phase switch error in at least one of the two or more haplotype strings; and
(c) correcting the phase switch error.
25. The method ofclaim 24, wherein identifying the potential phase switch error comprises identifying proximal IBD segments in at least one pair of the two or more haplotype strings.
26. A computer implemented method of processing haplotypes to reduce errors when determining identity by descent (IBD) segments between haplotypes, the method comprising:
providing two or more paired haplotypes strings for identification of IBD segments therebetween, each of the two or more paired haplotype strings representing a sequence of allele values at polymorphic sites in a haplotype of an organism; and
computationally iterating through the two or more paired haplotype strings by:
(i) identifying a first potential IBD segment between the two or more paired haplotype strings by identifying matches among alleles of the haplotype strings;
(ii) comparing the first site of the first potential IBD segment to the last site of a previously identified second potential IBD segment
(iii) determining that the last site of the second potential IBD segment and the first site of the first potential IBD segment are within a threshold number of sites of each other; and
(iv) merging the first potential IBD segment and the second potential IBD segment to form a combined potential IBD segment.
27. The method ofclaim 26, further comprising, after (a) and before (b), removing some initial IBD segments determined to belong to haplotypes having less than a threshold amount of initial IBD segments, wherein the initial IBD segments provided to the HEIM in (b) have had some initial IBD segments removed.
28. The method ofclaim 27, wherein the threshold amount of initial IBD segments is less than two initial IBD segments per chromosome.
29. A computer implemented method of determining identity by descent (IBD) segments, the method comprising:
determining first potential IBD segments among phased haplotype data for a plurality of individuals, wherein the first potential IBD segments have an end site;
determining second potential IBD segments among haplotype data for the plurality of individuals, wherein the second potential IBD segments have a start site;
determining that the end site of the first potential IBD segments and the start site of the second potential IBD segments are within a threshold number of sites of each other; and
merging the first potential IBD segments and the second potential IBD segments to form a combined potential IBD segment.
30. The method ofclaim 29, wherein the first potential IBD segments and the second potential IBD segments are on different haplotypes for an individual of the plurality of individuals, and the method further comprises:
determining that a phase switch error occurred at a site between the first potential IBD segment and the second potential IBD segment for the individual; and
swapping the haplotypes for the individual from the position of the phase switch error onward.
31. The method ofclaim 29, wherein the first potential IBD segment and the second potential IBD segment each span at least the threshold number of sites.
32. The method ofclaim 29, wherein the threshold number of sites is between about 0 and 500 SNPs.
33. A system for determining identity by descent (IBD) segments, the system comprising:
(a) one or more processors and associated memory; and
(b) computer readable instructions for:
determining first potential IBD segments among phased haplotype data for a plurality of individuals, wherein the first potential IBD segments have an end site;
determining second potential IBD segments among haplotype data for the plurality of individuals, wherein the second potential IBD segments have a start site;
determining that the end site of the first potential IBD segments and the start site of the second potential IBD segments are within a threshold number of sites of each other; and
merging the first potential IBD segments and the second potential IBD segments to form a combined potential IBD segment.
34. The system ofclaim 33, wherein the first potential IBD segments and the second potential IBD segments are on different haplotypes for an individual of the plurality of individuals, and the computer readable instructs further comprise instructions for:
determining that a phase switch error occurred at a site between the first potential IBD segment and the second potential IBD segment for the individual; and
swapping the haplotypes for the individual from the position of the phase switch error onward.
35. The system ofclaim 33, wherein the first potential IBD segments and the second potential IBD segments overlap for an individual of the plurality of individuals.
36. The system ofclaim 33, wherein the first potential IBD segment and the second potential IBD segment each span at least the threshold number of sites.
37. The system ofclaim 33, wherein the threshold number of sites is between about 0 and 500 SNPs.
38. The system ofclaim 33, wherein the plurality of individuals do not share a parent-child relationship.
39. The system ofclaim 33, wherein the computer readable instructions further comprise instructions for:
determining a third potential IBD segment among phased haplotype data for a plurality of individuals, wherein the third potential IBD segment has a start site;
determining that the end site of the combined potential IBD segment and the start site of the third potential IBD segments are within the threshold number of SNPs of each other; and
merging the combined potential IBD segments and the third potential IBD segments.
40. The system ofclaim 39, wherein the combined potential IBD segment and the third potential IBD segment are on different haplotypes for an individual of the plurality of individuals, and the computer readable instructions further comprise instructions for:
determining that a phase switch error occurred at a site between the combined potential IBD segment and the third potential IBD segment for the individual; and
swapping the haplotypes for the individual from the position of the phase switch error.
41. The system ofclaim 33, wherein the computer readable instructions further comprise instructions for determining that the combined potential IBD segments have a minimum length in centimorgans and storing the combined potential IBD segments as IBD segments for the plurality of individuals.
42. A computer implemented method of processing haplotypes to reduce errors when determining identity by descent (IBD) segments between haplotypes, the method comprising:
(a) computationally identifying initial IBD segments between two or more haplotype strings by identifying first matches among alleles of the haplotype strings using a plurality of templates, each comprising a unique arrangement of masked and unmasked sites in a window of consecutive haplotype sites; and
(b) providing information characterizing the initial IBD segments to a hidden Markov model (HMM) which removes potential phase switch errors to produce final IBD segment, wherein the HMM analyzes the information characterizing the initial IBD segments using distances between consecutive haplotype sites on a chromosome, one or more rates of recombination based on meiosis, and one or more rates of phase switch error based on a phasing method employed to phase the haplotypes.
US16/947,1072019-07-192020-07-17Phase-aware determination of identity-by-descent dna segmentsAbandonedUS20210020266A1 (en)

Priority Applications (5)

Application NumberPriority DateFiling DateTitle
US16/947,107US20210020266A1 (en)2019-07-192020-07-17Phase-aware determination of identity-by-descent dna segments
US17/249,520US20210193257A1 (en)2019-07-192021-03-04Phase-aware determination of identity-by-descent dna segments
US18/503,841US12046327B1 (en)2019-07-192023-11-07Identity-by-descent relatedness based on focal and reference segments
US18/737,679US12260936B2 (en)2019-07-192024-06-07Identity-by-descent relatedness based on focal and reference segments
US19/057,224US20250191684A1 (en)2019-07-192025-02-19Identity-by-descent relatedness based on focal and reference segments

Applications Claiming Priority (2)

Application NumberPriority DateFiling DateTitle
US201962876497P2019-07-192019-07-19
US16/947,107US20210020266A1 (en)2019-07-192020-07-17Phase-aware determination of identity-by-descent dna segments

Related Child Applications (2)

Application NumberTitlePriority DateFiling Date
US17/249,520ContinuationUS20210193257A1 (en)2019-07-192021-03-04Phase-aware determination of identity-by-descent dna segments
US18/503,841ContinuationUS12046327B1 (en)2019-07-192023-11-07Identity-by-descent relatedness based on focal and reference segments

Publications (1)

Publication NumberPublication Date
US20210020266A1true US20210020266A1 (en)2021-01-21

Family

ID=74192449

Family Applications (5)

Application NumberTitlePriority DateFiling Date
US16/947,107AbandonedUS20210020266A1 (en)2019-07-192020-07-17Phase-aware determination of identity-by-descent dna segments
US17/249,520AbandonedUS20210193257A1 (en)2019-07-192021-03-04Phase-aware determination of identity-by-descent dna segments
US18/503,841ActiveUS12046327B1 (en)2019-07-192023-11-07Identity-by-descent relatedness based on focal and reference segments
US18/737,679ActiveUS12260936B2 (en)2019-07-192024-06-07Identity-by-descent relatedness based on focal and reference segments
US19/057,224PendingUS20250191684A1 (en)2019-07-192025-02-19Identity-by-descent relatedness based on focal and reference segments

Family Applications After (4)

Application NumberTitlePriority DateFiling Date
US17/249,520AbandonedUS20210193257A1 (en)2019-07-192021-03-04Phase-aware determination of identity-by-descent dna segments
US18/503,841ActiveUS12046327B1 (en)2019-07-192023-11-07Identity-by-descent relatedness based on focal and reference segments
US18/737,679ActiveUS12260936B2 (en)2019-07-192024-06-07Identity-by-descent relatedness based on focal and reference segments
US19/057,224PendingUS20250191684A1 (en)2019-07-192025-02-19Identity-by-descent relatedness based on focal and reference segments

Country Status (4)

CountryLink
US (5)US20210020266A1 (en)
EP (1)EP4000070A4 (en)
CA (1)CA3147888A1 (en)
WO (1)WO2021016114A1 (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20200321073A1 (en)*2019-04-032020-10-08University Of Central Florida Research Foundation, Inc.Methods and system for efficient indexing for genetic genealogical discovery in large genotype databases
US11049589B2 (en)2008-12-312021-06-2923Andme, Inc.Finding relatives in a database
US11170873B2 (en)2007-10-152021-11-0923Andme, Inc.Genetic comparisons between grandparents and grandchildren
US11171962B2 (en)2007-10-152021-11-0923Andme, Inc.Genome sharing
US11170047B2 (en)2012-06-062021-11-0923Andme, Inc.Determining family connections of individuals in a database
US11514627B2 (en)2019-09-132022-11-2923Andme, Inc.Methods and systems for determining and displaying pedigrees
US11521708B1 (en)2012-11-082022-12-0623Andme, Inc.Scalable pipeline for local ancestry inference
US11531445B1 (en)2008-03-192022-12-2023Andme, Inc.Ancestry painting
US20230019141A1 (en)*2021-07-072023-01-19Mars, IncorporatedSystem, method, and apparatus for predicting genetic ancestry
US20230260608A1 (en)*2022-02-162023-08-17Ancestry.Com Dna, LlcRelationship prediction
US11783919B2 (en)2020-10-092023-10-1023Andme, Inc.Formatting and storage of genetic markers
US11817176B2 (en)2020-08-132023-11-1423Andme, Inc.Ancestry composition determination
US12046327B1 (en)*2019-07-192024-07-2323Andme, Inc.Identity-by-descent relatedness based on focal and reference segments

Citations (1)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20050191731A1 (en)*1999-06-252005-09-01Judson Richard S.Methods for obtaining and using haplotype data

Family Cites Families (135)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US4632724A (en)*1985-08-191986-12-30International Business Machines CorporationVisibility enhancement of first order alignment marks
US5692501A (en)1993-09-201997-12-02Minturn; PaulScientific wellness personal/clinical/laboratory assessments, profile and health risk managment system with insurability rankings on cross-correlated 10-point optical health/fitness/wellness scales
US20040229213A1 (en)1998-06-252004-11-18Institut PasteurExhaustive analysis of viral protein interactions by two-hybrid screens and selection of correctly folded viral interacting polypeptides
US6703228B1 (en)1998-09-252004-03-09Massachusetts Institute Of TechnologyMethods and products related to genotyping and DNA analysis
US20070150978A1 (en)*1999-05-172007-06-28Byrum Joseph RNucleic acid molecules and other molecules associated with plants
US20080008996A1 (en)*1999-06-292008-01-10Byrum Joseph RNucleic acid molecules and other molecules associated with plants
US6730023B1 (en)1999-10-152004-05-04HemopetAnimal genetic and health profile database management
US20070277267A1 (en)*1999-10-152007-11-29Byrum Joseph RNucleic acid molecules and other molecules associated with plants
US6640211B1 (en)1999-10-222003-10-28First Genetic Trust Inc.Genetic profiling and banking system and method
US20020133495A1 (en)2000-03-162002-09-19Rienhoff Hugh Y.Database system and method
US7142205B2 (en)2000-03-292006-11-28Autodesk, Inc.Single gesture map navigation graphical user interface for a personal digital assistant
US6570567B1 (en)2000-05-312003-05-27Alan EatonSystem and method for using a graphical interface for the presentation of genealogical information
AU3118602A (en)2000-10-182002-04-29Genomic Health IncGenomic profile information systems and methods
US20030130798A1 (en)2000-11-142003-07-10The Institute For Systems BiologyMultiparameter integration methods for the analysis of biological networks
US20030113727A1 (en)2000-12-062003-06-19Girn Kanwaljit SinghFamily history based genetic screening method and apparatus
US6988040B2 (en)2001-01-112006-01-17Affymetrix, Inc.System, method, and computer software for genotyping analysis and identification of allelic imbalance
US7957907B2 (en)2001-03-302011-06-07Sorenson Molecular Genealogy FoundationMethod for molecular genealogical research
US7005293B2 (en)2001-12-182006-02-28Agilent Technologies, Inc.Multiple axis printhead adjuster for non-contact fluid deposition devices
US20040002818A1 (en)2001-12-212004-01-01Affymetrix, Inc.Method, system and computer software for providing microarray probe data
US7343565B2 (en)2002-03-202008-03-11Mercurymd, Inc.Handheld device graphical user interfaces for displaying patient medical records
US7135286B2 (en)2002-03-262006-11-14Perlegen Sciences, Inc.Pharmaceutical and diagnostic business systems and methods
US8855935B2 (en)2006-10-022014-10-07Ancestry.Com Dna, LlcMethod and system for displaying genetic and genealogical data
US20080154566A1 (en)2006-10-022008-06-26Sorenson Molecular Genealogy FoundationMethod and system for displaying genetic and genealogical data
US20040012633A1 (en)2002-04-262004-01-22Affymetrix, Inc., A Corporation Organized Under The Laws Of DelawareSystem, method, and computer program product for dynamic display, and analysis of biological sequence data
US20040175700A1 (en)2002-05-152004-09-09Elixir Pharmaceuticals, Inc.Method for cohort selection
US20070037182A1 (en)2002-05-282007-02-15Gaskin James ZMultiplex assays for inferring ancestry
US20040229231A1 (en)2002-05-282004-11-18Frudakis Tony N.Compositions and methods for inferring ancestry
US7790867B2 (en)2002-12-052010-09-07Rosetta Genomics Inc.Vaccinia virus-related nucleic acids and microRNA
US20040146870A1 (en)2003-01-272004-07-29Guochun LiaoSystems and methods for predicting specific genetic loci that affect phenotypic traits
US20060257888A1 (en)2003-02-272006-11-16Methexis Genomics, N.V.Genetic diagnosis using multiple sequence variant analysis
WO2004090100A2 (en)2003-04-042004-10-21Agilent Technologies, Inc.Visualizing expression data on chromosomal graphic schemes
WO2004097712A2 (en)2003-04-282004-11-11Applera CorporationMethodology and graphical user interface to visualize genomic information
US7729863B2 (en)2003-12-172010-06-01Fred Hutchinson Cancer Research CenterMethods and materials for canine breed identification
US20060046256A1 (en)2004-01-202006-03-02Applera CorporationIdentification of informative genetic markers
US7629122B2 (en)2004-05-032009-12-08The Children's Hospital Of PhiladelphiaMethods and compositions for the diagnosis of Cornelia de Lange Syndrome
JP4514687B2 (en)2004-11-082010-07-28株式会社東芝 Pattern recognition device
US20060161460A1 (en)2004-12-152006-07-20Critical Connection Inc.System and method for a graphical user interface for healthcare data
US20060166224A1 (en)2005-01-242006-07-27Norviel Vernon AAssociations using genotypes and phenotypes
US7951078B2 (en)2005-02-032011-05-31Maren Theresa ScheunerMethod and apparatus for determining familial risk of disease
WO2006084195A2 (en)2005-02-032006-08-10The Government Of The United States Of America As Represented By The Secretary Of The Department Of Health And Human Services, Centers For Disease Control And PreventionPersonal assessment including familial risk analysis for personalized disease prevention plan
WO2006089238A2 (en)2005-02-182006-08-24Dna Print GenomicsMultiplex assays for inferring ancestry
US7681177B2 (en)2005-02-282010-03-16Skyler Technology, Inc.Method and/or system for transforming between trees and strings
US20060287876A1 (en)2005-06-202006-12-21Davor JedlickaComputer system and method for assessing family structures using affinographs
AU2006263703A1 (en)2005-06-282007-01-04Nokia CorporationUser interface for geographic search
US8285486B2 (en)2006-01-182012-10-09Dna Tribes LlcMethods of determining relative genetic likelihoods of an individual matching a population
US20070178500A1 (en)2006-01-182007-08-02Martin LucasMethods of determining relative genetic likelihoods of an individual matching a population
US8340950B2 (en)2006-02-102012-12-25Affymetrix, Inc.Direct to consumer genotype-based products and services
US7818281B2 (en)2006-02-142010-10-19Affymetrix, Inc.Computer software for visualizing recombination events in a group of individuals from recombination breakpoints and assignments in high density SNP genotyping data by generating a color-coded view for each individual chromosome and a whole genome view for the group
EP1864219A1 (en)2006-02-282007-12-12Mentor Graphics CorporationMonitoring physical parameters in an emulation environment
US20080189047A1 (en)2006-11-012008-08-070752004 B.C. Ltd.Method and system for genetic research using genetic sampling via an interactive online network
WO2008053493A2 (en)2006-11-022008-05-08Famillion Ltd.Method and system for computerized management of related data records
US20080131887A1 (en)2006-11-302008-06-05Stephan Dietrich AGenetic Analysis Systems and Methods
US20080228699A1 (en)2007-03-162008-09-18Expanse Networks, Inc.Creation of Attribute Combination Databases
WO2009002942A2 (en)2007-06-222008-12-31Apple Inc.Touch screen device, method, and graphical user interface for providing maps, directions, and location-based information
WO2009010948A1 (en)2007-07-182009-01-22Famillion Ltd.Method and system for use of a database of personal data records
US20090043752A1 (en)2007-08-082009-02-12Expanse Networks, Inc.Predicting Side Effect Attributes
CN105861664A (en)2007-09-262016-08-17生命科技临床服务实验室公司Methods and systems for genomic analysis using ancestral data
US8589437B1 (en)2007-10-152013-11-1923Andme, Inc.De-identification and sharing of genetic data
US9336177B2 (en)2007-10-152016-05-1023Andme, Inc.Genome sharing
US8510057B1 (en)2007-10-152013-08-1323Andme, Inc.Summarizing an aggregate contribution to a characteristic for an individual
US10275569B2 (en)2007-10-152019-04-3022andMe, Inc.Family inheritance
US7983893B2 (en)2008-01-082011-07-19Mentor Graphics CorporationFault support in an emulation environment
US20090182579A1 (en)2008-01-102009-07-16Edison LiuMethod of processing genomic information
US20090198519A1 (en)2008-01-312009-08-06Mcnamar Richard TimothySystem for gene testing and gene research while ensuring privacy
US8214192B2 (en)2008-02-272012-07-03Mentor Graphics CorporationResource remapping in a hardware emulation environment
US20170330358A1 (en)2008-03-192017-11-1623Andme, Inc.Ancestry painting
US20090307179A1 (en)2008-03-192009-12-10Brandon ColbyGenetic analysis
US8214195B2 (en)2008-03-212012-07-03Mentor Graphics CorporationTesting in a hardware emulation environment
WO2010017520A1 (en)2008-08-082010-02-11Navigenics, Inc.Methods and systems for personalized action plans
US9218451B2 (en)2008-08-262015-12-2223Andme, Inc.Processing data from genotyping chips
US8645343B2 (en)2008-08-262014-02-0423Andme, Inc.Processing data from genotyping chips
US20100063835A1 (en)2008-09-102010-03-11Expanse Networks, Inc.Method for Secure Mobile Healthcare Selection
US20100076988A1 (en)2008-09-102010-03-25Expanse Networks, Inc.Masked Data Service Profiling
US20100070292A1 (en)2008-09-102010-03-18Expanse Networks, Inc.Masked Data Transaction Database
US8200509B2 (en)2008-09-102012-06-12Expanse Networks, Inc.Masked data record access
US20100076950A1 (en)2008-09-102010-03-25Expanse Networks, Inc.Masked Data Service Selection
US20100063830A1 (en)2008-09-102010-03-11Expanse Networks, Inc.Masked Data Provider Selection
US20100063865A1 (en)2008-09-102010-03-11Expanse Networks, Inc.Masked Data Provider Profiling
US7917438B2 (en)2008-09-102011-03-29Expanse Networks, Inc.System for secure mobile healthcare selection
US20100070455A1 (en)2008-09-122010-03-18Navigenics, Inc.Methods and Systems for Incorporating Multiple Environmental and Genetic Risk Factors
US20100281401A1 (en)2008-11-102010-11-04Signature Genomic LabsInteractive Genome Browser
WO2010065139A1 (en)2008-12-052010-06-1023Andme, Inc.Gamete donor selection based on genetic calculations
US8386519B2 (en)2008-12-302013-02-26Expanse Networks, Inc.Pangenetic web item recommendation system
US20100169262A1 (en)2008-12-302010-07-01Expanse Networks, Inc.Mobile Device for Pangenetic Web
US8108406B2 (en)2008-12-302012-01-31Expanse Networks, Inc.Pangenetic web user behavior prediction system
US20100169313A1 (en)2008-12-302010-07-01Expanse Networks, Inc.Pangenetic Web Item Feedback System
US8255403B2 (en)2008-12-302012-08-28Expanse Networks, Inc.Pangenetic web satisfaction prediction system
US20100169338A1 (en)2008-12-302010-07-01Expanse Networks, Inc.Pangenetic Web Search System
WO2010077336A1 (en)2008-12-312010-07-0823Andme, Inc.Finding relatives in a database
US8473218B2 (en)2009-01-292013-06-25Microsoft CorporationRefining HLA data
US8731819B2 (en)2009-06-102014-05-20Ancestralhunt Partners, LlcSystem and method for the collaborative collection, assignment, visualization, analysis, and modification of probable genealogical relationships based on geo-spatial and temporal proximity
HUE034854T2 (en)2009-11-052018-03-28Univ Hong Kong Chinese Fetal genomic analysis from maternal biological samples
WO2011065982A2 (en)2009-11-302011-06-0323Andme, Inc.Polymorphisms associated with parkinson's disease
US20110257889A1 (en)2010-02-242011-10-20Pacific Biosciences Of California, Inc.Sequence assembly and consensus sequence determination
JP5517752B2 (en)2010-05-312014-06-11キヤノン株式会社 Image forming apparatus and computer program
WO2012099890A1 (en)2011-01-182012-07-26University Of Utah Research FoundationEstimation of recent shared ancestry
US8786603B2 (en)2011-02-252014-07-22Ancestry.Com Operations Inc.Ancestor-to-ancestor relationship linking methods and systems
US10127346B2 (en)2011-04-132018-11-13The Board Of Trustees Of The Leland Stanford Junior UniversitySystems and methods for interpreting a human genome using a synthetic reference sequence
CA2834375C (en)2011-04-272020-07-14Amyris, Inc.Methods for genomic modification
US9153142B2 (en)2011-05-262015-10-06International Business Machines CorporationUser interface for an evidence-based, hypothesis-generating decision support system
US20130297221A1 (en)2011-06-012013-11-07The Board Of Trustees Of The Leland Stanford Junior UniversityMethod and System for Accurate Construction Of Long Range Haplotype
US10790041B2 (en)2011-08-172020-09-2923Andme, Inc.Method for analyzing and displaying genetic information between family members
US9367663B2 (en)2011-10-062016-06-14Sequenom, Inc.Methods and processes for non-invasive assessment of genetic variations
US8990250B1 (en)2011-10-112015-03-2423Andme, Inc.Cohort selection with privacy protection
TWI619038B (en)2011-11-072018-03-21Admedec Co Ltd Safety box
US10437858B2 (en)2011-11-232019-10-0823Andme, Inc.Database and data processing system for use with a network-based personal genetics services platform
US10777302B2 (en)2012-06-042020-09-1523Andme, Inc.Identifying variants of interest by imputation
US10025877B2 (en)2012-06-062018-07-1723Andme, Inc.Determining family connections of individuals in a database
US9116882B1 (en)2012-08-022015-08-2523Andme, Inc.Identification of matrilineal or patrilineal relatives
US10847248B2 (en)2012-08-102020-11-24The Board Of Trustees Of The Leland Stanford Junior UniversityTechniques for determining haplotype by population genotype and sequence data
US9449143B2 (en)2012-08-282016-09-20Inova Health SystemAncestral-specific reference genomes and uses thereof
WO2014039875A1 (en)2012-09-062014-03-13Ancestry.Com Dna, LlcUsing haplotypes to infer ancestral origins for recently admixed individuals
US10114922B2 (en)2012-09-172018-10-30Ancestry.Com Dna, LlcIdentifying ancestral relationships using a continuous stream of input
US9977708B1 (en)2012-11-082018-05-2223Andme, Inc.Error correction in ancestry classification
US9213947B1 (en)2012-11-082015-12-1523Andme, Inc.Scalable pipeline for local ancestry inference
AU2014218418A1 (en)2013-03-152015-10-01Ancestry.Com Dna, LlcFamily networks
MX2017004989A (en)2014-10-142017-11-13Ancestry Com Dna LlcReducing error in predicted genetic relationships.
MX2017004980A (en)*2014-10-172017-12-07Ancestry Com Dna LlcAncestral human genomes.
US20170329899A1 (en)2014-10-292017-11-1623Andme, Inc.Display of estimated parental contribution to ancestry
US20170329902A1 (en)2014-10-292017-11-1623Andme, Inc.Estimation of admixture generation
US10867705B2 (en)2014-11-062020-12-15Ancestryhealth.Com, LlcPredicting health outcomes
MX2017015224A (en)*2015-05-302018-02-19Ancestry Com Dna LlcDiscovering population structure from patterns of identity-by-descent.
US10957422B2 (en)2015-07-072021-03-23Ancestry.Com Dna, LlcGenetic and genealogical analysis for identification of birth location and surname information
CA2992397A1 (en)2015-07-132017-01-19Ancestry.Com Dna, LlcLocal genetic ethnicity determination system
US20170329915A1 (en)2015-08-272017-11-1623Andme, Inc.Systems and methods for generating a modular web page template to display personal genetic and physiological condition information
EP3432198B1 (en)2017-07-192024-04-17Tata Consultancy Services LimitedCrowdsourcing and deep learning based segmenting and karyotyping of chromosomes
US11482306B2 (en)*2019-02-272022-10-25Ancestry.Com Dna, LlcGraphical user interface displaying relatedness based on shared DNA
US11848073B2 (en)*2019-04-032023-12-19University Of Central Florida Research Foundation, Inc.Methods and system for efficient indexing for genetic genealogical discovery in large genotype databases
US20210020266A1 (en)*2019-07-192021-01-2123Andme, Inc.Phase-aware determination of identity-by-descent dna segments
US11514627B2 (en)2019-09-132022-11-2923Andme, Inc.Methods and systems for determining and displaying pedigrees
WO2021243094A1 (en)2020-05-272021-12-0223Andme, Inc.Machine learning platform for generating risk models
US20220044761A1 (en)2020-05-272022-02-1023Andme, Inc.Machine learning platform for generating risk models
US11817176B2 (en)2020-08-132023-11-1423Andme, Inc.Ancestry composition determination
CA3194288A1 (en)2020-10-092022-04-1423Andme, Inc.Formatting and storage of genetic markers
WO2022087478A1 (en)2020-10-232022-04-2823Andme, Inc.Machine learning platform for generating risk models

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20050191731A1 (en)*1999-06-252005-09-01Judson Richard S.Methods for obtaining and using haplotype data

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
"Centimorgan." CentiMorgan - ISOGG Wiki, 10 July 2010, https://isogg.org/wiki/CentiMorgan. (Year: 2010)*
Browning, Brian L., and Sharon R. Browning. "Improving the accuracy and efficiency of identity-by-descent detection in population data." Genetics 194.2 (2013): 459-471. (Year: 2013)*
Browning, Sharon R., and Brian L. Browning. "High-resolution detection of identity by descent in unrelated individuals." The American Journal of Human Genetics 86.4 (2010): 526-539. (Year: 2011)*
Li, Hong, et al. "Relationship estimation from whole-genome sequence data." PLoS genetics 10.1 (2014): e1004144. (Year: 2014)*
Upton, Alex, et al. "High-performance computing to detect epistasis in genome scale data sets." Briefings in bioinformatics 17.3 (2016): 368-379. (Year: 2016)*

Cited By (27)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US12431221B2 (en)2007-10-152025-09-3023Andme, Inc.Window-based method for determining inherited segments
US11170873B2 (en)2007-10-152021-11-0923Andme, Inc.Genetic comparisons between grandparents and grandchildren
US11171962B2 (en)2007-10-152021-11-0923Andme, Inc.Genome sharing
US12327615B2 (en)2007-10-152025-06-1023Andme, Inc.Genetic comparisons between grandparents and grandchildren
US11683315B2 (en)2007-10-152023-06-2023Andme, Inc.Genome sharing
US11531445B1 (en)2008-03-192022-12-2023Andme, Inc.Ancestry painting
US11468971B2 (en)2008-12-312022-10-1123Andme, Inc.Ancestry finder
US11508461B2 (en)2008-12-312022-11-2223Andme, Inc.Finding relatives in a database
US11049589B2 (en)2008-12-312021-06-2923Andme, Inc.Finding relatives in a database
US11322227B2 (en)2008-12-312022-05-0323Andme, Inc.Finding relatives in a database
US11170047B2 (en)2012-06-062021-11-0923Andme, Inc.Determining family connections of individuals in a database
US11521708B1 (en)2012-11-082022-12-0623Andme, Inc.Scalable pipeline for local ancestry inference
US12354710B1 (en)2012-11-082025-07-0823Andme, Inc.Scalable pipeline for local ancestry inference
US20240087671A1 (en)*2019-04-032024-03-14University Of Central Florida Research Foundation, Inc.Methods and system for efficient indexing for genetic genealogical discovery in large genotype databases
US12237051B2 (en)*2019-04-032025-02-25University Of Central Florida Research Foundation, Inc.Methods and system for efficient indexing for genetic genealogical discovery in large genotype databases
US20200321073A1 (en)*2019-04-032020-10-08University Of Central Florida Research Foundation, Inc.Methods and system for efficient indexing for genetic genealogical discovery in large genotype databases
US11848073B2 (en)*2019-04-032023-12-19University Of Central Florida Research Foundation, Inc.Methods and system for efficient indexing for genetic genealogical discovery in large genotype databases
US12260936B2 (en)*2019-07-192025-03-2523Andme, Inc.Identity-by-descent relatedness based on focal and reference segments
US12046327B1 (en)*2019-07-192024-07-2323Andme, Inc.Identity-by-descent relatedness based on focal and reference segments
US20240321398A1 (en)*2019-07-192024-09-2623Andme, Inc.Identity-by-descent relatedness based on focal and reference segments
US12073495B2 (en)2019-09-132024-08-2723Andme, Inc.Methods and systems for determining and displaying pedigrees
US11514627B2 (en)2019-09-132022-11-2923Andme, Inc.Methods and systems for determining and displaying pedigrees
US12159690B2 (en)2020-08-132024-12-0323Andme, Inc.Ancestry composition determination
US11817176B2 (en)2020-08-132023-11-1423Andme, Inc.Ancestry composition determination
US11783919B2 (en)2020-10-092023-10-1023Andme, Inc.Formatting and storage of genetic markers
US20230019141A1 (en)*2021-07-072023-01-19Mars, IncorporatedSystem, method, and apparatus for predicting genetic ancestry
US20230260608A1 (en)*2022-02-162023-08-17Ancestry.Com Dna, LlcRelationship prediction

Also Published As

Publication numberPublication date
EP4000070A1 (en)2022-05-25
WO2021016114A1 (en)2021-01-28
EP4000070A4 (en)2023-08-09
US12260936B2 (en)2025-03-25
US20210193257A1 (en)2021-06-24
US20250191684A1 (en)2025-06-12
CA3147888A1 (en)2021-01-28
US20240321398A1 (en)2024-09-26
US12046327B1 (en)2024-07-23

Similar Documents

PublicationPublication DateTitle
US12260936B2 (en)Identity-by-descent relatedness based on focal and reference segments
US12217832B2 (en)Deep learning-based variant classifier
US20230402132A1 (en)Error Correction in Ancestry Classification
US10600217B2 (en)Methods for the graphical representation of genomic sequence data
Freyman et al.Fast and robust identity-by-descent inference with the templated positional burrows–wheeler transform
CA2964902C (en)Ancestral human genomes
US9367800B1 (en)Ancestry painting with local ancestry inference
Bzikadze et al.UniAligner: a parameter-free framework for fast sequence alignment
Llinares-López et al.Genome-wide genetic heterogeneity discovery with categorical covariates
Alkan et al.RedNemo: topology-based PPI network reconstruction via repeated diffusion with neighborhood modifications
Yang et al.Improved detection algorithm for copy number variations based on hidden Markov model
Wu et al.A practical algorithm based on particle swarm optimization for haplotype reconstruction
StramSNP Imputation for Association Studies
Choudhury et al.HAPI-Gen: Highly accurate phasing and imputation of genotype data
Liu et al.Genome-wide selection on transposable elements in maize
HK40062519A (en)Deep learning-based variant classifier
WangMissing SNP Genotype Imputation
HuangComputational Methods Using Large-Scale Population Whole-Genome Sequencing Data
NZ789499A (en)Deep learning-based variant classifier
NZ791625A (en)Variant classifier based on deep neural networks
HK40025635B (en)Deep learning-based variant classifier
HK40025635A (en)Deep learning-based variant classifier

Legal Events

DateCodeTitleDescription
ASAssignment

Owner name:23ANDME, INC., CALIFORNIA

Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:FREYMAN, WILLIAM A.;MCMANUS, KIMBERLY F.;SHRINGARPURE, SUYASH S.;AND OTHERS;SIGNING DATES FROM 20200806 TO 20200807;REEL/FRAME:053556/0220

STPPInformation on status: patent application and granting procedure in general

Free format text:APPLICATION DISPATCHED FROM PREEXAM, NOT YET DOCKETED

STPPInformation on status: patent application and granting procedure in general

Free format text:DOCKETED NEW CASE - READY FOR EXAMINATION

STPPInformation on status: patent application and granting procedure in general

Free format text:RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPPInformation on status: patent application and granting procedure in general

Free format text:FINAL REJECTION MAILED

STCBInformation on status: application discontinuation

Free format text:ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION


[8]ページ先頭

©2009-2025 Movatter.jp