Movatterモバイル変換


[0]ホーム

URL:


Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
Thehttps:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

NIH NLM Logo
Log inShow account info
Access keysNCBI HomepageMyNCBI HomepageMain ContentMain Navigation
pubmed logo
Advanced Clipboard
User Guide

Full text links

Nature Publishing Group full text link Nature Publishing Group Free PMC article
Full text links

Actions

Share

.2017 Apr;20(4):602-611.
doi: 10.1038/nn.4524. Epub 2017 Mar 6.

Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder

Ryan K C Yuen  1Daniele Merico  1  2Matt Bookman  3  4Jennifer L Howe  1Bhooma Thiruvahindrapuram  1Rohan V Patel  1Joe Whitney  1Nicole Deflaux  3  4Jonathan Bingham  3  4Zhuozhi Wang  1Giovanna Pellecchia  1Janet A Buchanan  1Susan Walker  1Christian R Marshall  1  5Mohammed Uddin  1Mehdi Zarrei  1Eric Deneault  1Lia D'Abate  1  6Ada J S Chan  1  6Stephanie Koyanagi  1Tara Paton  1Sergio L Pereira  1Ny Hoang  1  7Worrawat Engchuan  1Edward J Higginbotham  1Karen Ho  1Sylvia Lamoureux  1Weili Li  1Jeffrey R MacDonald  1Thomas Nalpathamkalam  1Wilson W L Sung  1Fiona J Tsoi  1John Wei  1Lizhen Xu  1Anne-Marie Tasse  8Emily Kirby  8William Van Etten  9Simon Twigger  9Wendy Roberts  7Irene Drmic  1  7Sanne Jilderda  1  7Bonnie MacKinnon Modi  1  7Barbara Kellam  1Michael Szego  1  10Cheryl Cytrynbaum  6  10  11  12Rosanna Weksberg  6  11  12Lonnie Zwaigenbaum  13Marc Woodbury-Smith  1  14Jessica Brian  15Lili Senman  15Alana Iaboni  15Krissy Doyle-Thomas  15Ann Thompson  14Christina Chrysler  14Jonathan Leef  15Tal Savion-Lemieux  16Isabel M Smith  17Xudong Liu  18Rob Nicolson  19  20Vicki Seifer  21Angie Fedele  21Edwin H Cook  22Stephen Dager  23Annette Estes  24Louise Gallagher  25Beth A Malow  26Jeremy R Parr  27Sarah J Spence  28Jacob Vorstman  29Brendan J Frey  2  30James T Robinson  31Lisa J Strug  1  32Bridget A Fernandez  33Mayada Elsabbagh  16Melissa T Carter  12  34Joachim Hallmayer  35Bartha M Knoppers  36Evdokia Anagnostou  15Peter Szatmari  37  38  39Robert H Ring  40David Glazer  3  4Mathew T Pletcher  21Stephen W Scherer  1  6  41
Affiliations

Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder

Ryan K C Yuen et al. Nat Neurosci.2017 Apr.

Abstract

We are performing whole-genome sequencing of families with autism spectrum disorder (ASD) to build a resource (MSSNG) for subcategorizing the phenotypes and underlying genetic factors involved. Here we report sequencing of 5,205 samples from families with ASD, accompanied by clinical information, creating a database accessible on a cloud platform and through a controlled-access internet portal. We found an average of 73.8 de novo single nucleotide variants and 12.6 de novo insertions and deletions or copy number variations per ASD subject. We identified 18 new candidate ASD-risk genes and found that participants bearing mutations in susceptibility genes had significantly lower adaptive ability (P = 6 × 10-4). In 294 of 2,620 (11.2%) of ASD cases, a molecular basis could be determined and 7.2% of these carried copy number variations and/or chromosomal abnormalities, emphasizing the importance of detecting all forms of genetic variation as diagnostic and therapeutic targets in ASD.

PubMed Disclaimer

Conflict of interest statement

Competing Financial Interests: The authors declare no competing financial interests.

Figures

Figure 1
Figure 1. Schematic of sample and data processing in MSSNG
An executive committee oversees the project. The parameters for DNA sample selection and (genetic and phenotypic) data are managed by the committee, including consenting and ethics protocols. Coded identifiers’ for samples selected for WGS are posted as they are identified at MSSNG portal (http://mss.ng/research), so the ASD research community can monitor progress. Phenome data include subject information (identity number, year of birth, sex), family code (proband, parent, sibling), results of diagnostic tests (e.g. ADOS, ADI-R, age at diagnosis, functional assessments, intelligence tests, body measurements and dysmorphic features). The database accommodates as much of this information as is available for each sample, but that varies widely. Future plans include incorporation of fields for co-morbidities, related conditions, exposures, extended family history, interventions, and other parameters that become apparent. WGS technologies were Complete Genomics and Illumina HiSeq (2000 and X). WGS data are transferred to Google Genomics for data processing through the Google Cloud. Ref-blocked gVCFs were generated and stored in Google Cloud Storage, which were also processed forde novo mutation detection in the local cluster (for Complete Genomics data using filtering method) and Google Compute Engine (for Illumina data using DenovoGear). The Ref-blocked gVCFs and thede novo mutations were annotated through the Google Compute Engine (using Annovar), which can be accessed through the BigQuery tables. Quality controls of the genomic data were performed in the local cluster and the Google Cloud. The processed genetic data and the phenotypic data are accessible through the MSSNG Portal interface. The MSSNG database is designed to support incremental addition of data without changes in architecture, scaling to at least tens of thousands. New WGS and phenotype data are continually added to MSSNG as new batches of 1,000 samples are processed. DACO: Data access committee; UPD: Uniparental disomy; Ti/Tv: Transition to transversion ratio; IBS: Identity by state.
Figure 2
Figure 2. Characteristics and quality of WGS from different sequencing platforms
(a) Number of SNVs detected per genome. (b) Number of indels detected per genome. (c) Number of rare coding SNVs detected per genome after quality filtering. (d) Number of rare coding indels detected per genome after quality filtering. Genomes sequenced by Complete Genomics with 2.0 pipeline version are colored in orange, by Complete Genomics with 2.4 pipeline version are colored in brown, by Complete Genomics with 2.5 pipeline version are colored in green, by Illumina HiSeq 2000 are colored in blue, and by Illumina HiSeq X are colored in purple. Details of quality for individual samples can be found in Supplementary Table 1.
Figure 3
Figure 3. ASD-susceptibility genes/loci
(a) ASD-risk genes with higher than expected mutation rate from MSSNG integrated with other large-scale high-throughput sequencing projects. ASD-risk genes are ranked in descending order of the number of mutations found for each gene. Other LOF mutations, including inherited LOF mutations and LOF mutations with unknown inheritance (where parents are unavailable for testing), and CNVs found in the MSSNG cohort are indicated (except for genes found by higher than expectedde novo missense mutation rate). MSSNG data are in green and published data are in yellow. Novel putative ASD-risk genes identified in this study carry an asterisk.Δ indicate genes with druggable protein domains identified (Supplementary Table 6). (b) Pathogenic chromosomal abnormalities and CNVs identified falling into one of four categories: Chromosomal abnormalities; DECIPHER loci and other genomic disorders associated with ASD; large rare CNVs between 3–10Mb and CNVs disrupting ASD candidate genes not described above in Figure 3a. Deletions are in red, duplications are in blue and complex variants are in purple. # indicate CNVs shared between affected siblings; ‡ indicates a CNV carried by an individual with a second pathogenic CNV; † indicates a CNV shared between individuals within an extended pedigree. Details can be found in Supplementary Table 8. Examples of CNVs affecting theNRXN1 andCHD8 genes, and thePTCHD1-AS non-coding gene identified from the WGS are shown in Figure 4.
Figure 4
Figure 4. CNV characterization via WGS reads in the MSSNG Portal
(a–c) Visualization of CNVs in WGS data. (a) A heterozygous 246kb deletion of three exons ofNRXN1 at chromosome 2p16.3 in subject 2-1428-003 (average 50% decrease in sequence read-depth); (b) a 31.1kb duplication withinCHD8 at chromosome 14q11.2 in subject 2-1375-003 (average of 50% increase in sequence read-depth) and (c) 125kb deletion of exon 3 of the non-coding genePTCHD1-AS at Xp22.11 in male subject 1-0277-003 (no reads apparent, other than a small stretch of likely mis-aligned repetitive sequences). Left and right panels show the proximal and distal breakpoints of the CNVs respectively. Aligned reads viewed from the BAM files in the MSSNG browser are shown indicating the read depth. Genome co-ordinates are shown above and impacted genes below. The predicted CNVs visible from the WGS data and high-resolution microarray are shown by the red (deletion) and blue (duplication) bars. For 32 CNVs described in Figure 3b plus 17 additional CNVs, we derived a more accurate estimate of the breakpoints by visual inspection of read depth from the BAM file in the MSSNG browser. On average, the size difference between the CNV predicted by microarray data and the estimated size from WGS data was 6.9kb and for 31/49 (63%) CNVs, the size of the CNV was smaller in the microarray data than WGS. For four CNVs, the WGS-resolved breakpoints altered the exons of genes being annotated as deleted or duplicated. In another case, this resulted in a CNV from microarray no longer being classified as pathogenic as the revised breakpoints no longer included coding sequence.
Figure 5
Figure 5. Phenotype comparison for the samples with and without identified mutations
Standard score of (a) IQ Full scale and (b) Vineland Adaptive Behavior were compared between samples with pathogenic CNVs,de novo LOF mutations, mutations in ASD-risk genes and other samples without any of these mutations.
Figure 6
Figure 6. Interaction similarity network of ASD-risk genes
Connections represent gene similarity based on physical protein interactions and pathway interactions. Connection thickness is proportional to the fraction of interaction partners shared by the connected genes. The size of the node for each gene is proportional to the total mutation count (Figure 3). Genes associated with LOF mutations are in circle shape, while genes associated with missense mutations are in diamond shape. The node color corresponds to the BrainSpan brain expression Principal Component 1 (prenatal in yellow, postnatal in blue, balanced in light blue, undetermined in grey). The labels of novel ASD-risk genes are displayed in red. The network was visualized using Cytoscape.
See this image and copyright information in PMC

Similar articles

See all similar articles

Cited by

See all "Cited by" articles

References

    1. Lai MC, Lombardo MV, Baron-Cohen S. Autism. Lancet. 2014;383:896–910. - PubMed
    1. Robinson EB, et al. Genetic risk for autism spectrum disorders and neuropsychiatric variation in the general population. Nat Genet. 2016;48:552–555. - PMC - PubMed
    1. Anagnostou E, et al. Autism spectrum disorder: advances in evidence-based practice. CMAJ. 2014;186:509–519. - PMC - PubMed
    1. Carter MT, Scherer SW. Autism spectrum disorder in the genetics clinic: a review. Clin Genet. 2013;83:399–407. - PubMed
    1. Miles JH. Complex Autism Spectrum Disorders and Cutting-Edge Molecular Diagnostic Tests. JAMA. 2015;314:879–880. - PubMed

MeSH terms

Related information

Grants and funding

LinkOut - more resources

Full text links
Nature Publishing Group full text link Nature Publishing Group Free PMC article
Cite
Send To

NCBI Literature Resources

MeSHPMCBookshelfDisclaimer

The PubMed wordmark and PubMed logo are registered trademarks of the U.S. Department of Health and Human Services (HHS). Unauthorized use of these marks is strictly prohibited.


[8]ページ先頭

©2009-2025 Movatter.jp