Movatterモバイル変換


[0]ホーム

URL:


Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
Thehttps:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

NIH NLM Logo
Log inShow account info
Access keysNCBI HomepageMyNCBI HomepageMain ContentMain Navigation
pubmed logo
Advanced Clipboard
User Guide

Full text links

Nature Publishing Group full text link Nature Publishing Group Free PMC article
Full text links

Actions

Share

.2018 Apr 26;50(5):727-736.
doi: 10.1038/s41588-018-0107-y.

An analytical framework for whole-genome sequence association studies and its implications for autism spectrum disorder

Donna M Werling #  1Harrison Brand #  2  3  4Joon-Yong An #  1Matthew R Stone #  2Lingxue Zhu #  5Joseph T Glessner  2  3  4Ryan L Collins  2  3  6Shan Dong  1Ryan M Layer  7  8Eirene Markenscoff-Papadimitriou  1Andrew Farrell  7  8Grace B Schwartz  1Harold Z Wang  2Benjamin B Currall  2  3  4Xuefang Zhao  2  3  4Jeanselle Dea  1Clif Duhn  1Carolyn A Erdman  1Michael C Gilson  1Rachita Yadav  2  3  4Robert E Handsaker  4  9Seva Kashin  4  9Lambertus Klei  10Jeffrey D Mandell  1Tomasz J Nowakowski  1  11  12Yuwen Liu  13Sirisha Pochareddy  14Louw Smith  1Michael F Walker  1Matthew J Waterman  15Xin He  13Arnold R Kriegstein  16John L Rubenstein  1Nenad Sestan  14Steven A McCarroll  4  9Benjamin M Neale  4  17  18Hilary Coon  19  20A Jeremy Willsey  1  21Joseph D Buxbaum  22  23  24  25Mark J Daly  4  17  18Matthew W State  1Aaron R Quinlan  7  8  20Gabor T Marth  7  8Kathryn Roeder  5  26Bernie Devlin  27Michael E Talkowski  28  29  30  31Stephan J Sanders  32
Affiliations

An analytical framework for whole-genome sequence association studies and its implications for autism spectrum disorder

Donna M Werling et al. Nat Genet..

Abstract

Genomic association studies of common or rare protein-coding variation have established robust statistical approaches to account for multiple testing. Here we present a comparable framework to evaluate rare and de novo noncoding single-nucleotide variants, insertion/deletions, and all classes of structural variation from whole-genome sequencing (WGS). Integrating genomic annotations at the level of nucleotides, genes, and regulatory regions, we define 51,801 annotation categories. Analyses of 519 autism spectrum disorder families did not identify association with any categories after correction for 4,123 effective tests. Without appropriate correction, biologically plausible associations are observed in both cases and controls. Despite excluding previously identified gene-disrupting mutations, coding regions still exhibited the strongest associations. Thus, in autism, the contribution of de novo noncoding variation is probably modest in comparison to that of de novo coding variants. Robust results from future WGS studies will require large cohorts and comprehensive analytical strategies that consider the substantial multiple-testing burden.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Burden analyses for gene-defined annotation categories
a) The observed relative risk ofde novo mutations in cases vs. controls is shown by the red line against grey violin plots representing the kernel density estimation of relative risk from 10,000 label-swapping permutations of case-control status for 11 gene-defined annotation categories. Box plots further illustrate the relative risk from permutations, including the median (center line), first and third quartiles (box), 1.5x interquartile range or the most extreme value (whiskers), and permuted relative risk observations beyond 1.5x interquartile range (outlier points). P-values from a case-control label-swapping permutation analysis and Bonferroni-corrected p-values (10 tests) ≤0.05 are shown. Loss-of-function variants were not analyzed as cases with such mutations were excluded from the cohort.b) The analysis in ‘a’ is repeated considering onlyde novo mutations in or near 179 ASD genes. Permutation p-values are Bonferroni-corrected for 7 tests. Considering SNVs and indels separately does not alter these findings (Supplementary Fig. 5).
Figure 2
Figure 2. Defining annotation categories
Five groups of annotations were defined: 1) Conservation across species; 2) Variant type; 3) GENCODE gene definitions; 4) Gene lists; and 5) Functional annotations. Picking one annotation from each group resulted in 66,402 possible combinations of which 51,801 were non-redundant (Supplementary Table 7). The 13,704 annotations categories that included at least seven observed mutations were considered in the category-wide association test.
Figure 3
Figure 3. Category-wide association study
a) The burden ofde novo SNVs and indels in n=519 cases vs. n=519 controls for 13,704 annotation categories with ≥7 observed variants are shown as points in the volcano plot (Supplementary Table 7). Permutation p-values were calculated by 10,000 label-swapping permutations of case-control status in each annotation category. No test survives Bonferroni correction for 4,123 effective tests (top horizontal red line).b) Correlations of p-values between annotation categories (small dots) in simulated data are shown by proximity in the first two t-SNE dimensions. The large circles show 200 independent clusters of annotation categories defined by k-means clustering. The circle size represents the degrees of freedom accounted for by the cluster using Eigenvalue decomposition. In total, 4,123 effective tests explain 99% of the variability in p-values (Supplementary Fig. 6).c–h) Six clusters from (b) are shown in greater detail, with cluster number in bold. The edges represent p-value correlation ≥0.4.i–k) The number of nominally significant annotation categories (p≤0.05 from two-sided binomial test) was calculated for cases, controls, and 10,000 permutations to assess whether more annotation categories are enriched forde novo variants in cases than expected in (a). Cases have a greater than expected number of nominally significant categories relating to coding mutations and noncoding indels, but not for all noncoding mutations, nor for noncoding mutations nearest to ASD genes. P-values were calculated as the proportion of permutations in which the same or a greater number of categories had a two-sided binomial test p-value ≤0.05 as in the observed data.
Figure 4
Figure 4. Structural variation in 519 ASD families
Structural variation (SV) analyses identified an average of 5,863 SVs per genome 171de novo SVs.a) We observed no difference in distribution of SV sizes between cases (n=519) and sibling controls (n=519) for any class of SV (cxSV = complex SV) at an unadjusted nominal significance threshold (two-tailed Wilcoxon rank-sum test; alpha = 0.05).b) We observed no differences in maternal/paternal transmission rates between cases and sibling controls for any class of SV or any range of variant frequencies (VF) (two-tailed binomial test). Mean paternal transmission rate (dot) and 95% binomial confidence intervals are shown in plot (error bars).c) We observed no significant enrichments for eitherde novo or rare inherited SV (VF < 0.1%) in genic or noncoding annotations after correcting for multiple comparisons in a two-sided sign test between case and control counts. Error bars represent the 95% confidence intervals.d) Analysis of balanced SV discovered ade novo reciprocal translocation in a case predicted to disruptGRIN2B (t(12q21.2;13p11.2)), a constrained gene previously implicated in ASD,.e) WGS revealed small CNVs undetected by previous analyses, including a 4,391bpde novo deletion of exons 8–10 ofCHD2 (GRCh37.63:chr15:g.93484245_93488636del), a gene previously implicated in ASD fromde novo coding mutations.f) Analysis of breakpoint sequences also classified 23de novo SVs that were predicted to be germline mosaic in the parents, including this 242.8kb paternally transmitted mosaic duplication at 8q24.23 that was previously characterized asde novo in the child (GRCh37.63:chr8:g.136681615_136924426dup). Bar plots represent the means and 95% confidence intervals of estimated copy number in the duplicated locus. All p-values were calculated with a two-tailed t-test of estimated copy numbers in sequential 36.4kb bins.
Figure 5
Figure 5. Effective number of tests in CWAS and power calculation
a) The green line shows the threshold to achieve 80% power at nominal significance across the range of relative risks of a category (log10 scaled x-axis) and number of de novo mutations per individual within the category (log10 scaled y-axis). The purple line shows the 80% power corrected for 4,123 effective tests. The grey dots represent the observed results forde novo mutation burden in 519 families for the 13,704 annotation categories with ≥7 mutations.b) The lines show the threshold of 80% power across the range of relative risks and category sizes as sample size increases (correcting for correspondingly more effective tests, see Supplementary Information). For reference, the relative location for six classes of variation are shown.
See this image and copyright information in PMC

Comment in

Similar articles

  • Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder.
    C Yuen RK, Merico D, Bookman M, L Howe J, Thiruvahindrapuram B, Patel RV, Whitney J, Deflaux N, Bingham J, Wang Z, Pellecchia G, Buchanan JA, Walker S, Marshall CR, Uddin M, Zarrei M, Deneault E, D'Abate L, Chan AJ, Koyanagi S, Paton T, Pereira SL, Hoang N, Engchuan W, Higginbotham EJ, Ho K, Lamoureux S, Li W, MacDonald JR, Nalpathamkalam T, Sung WW, Tsoi FJ, Wei J, Xu L, Tasse AM, Kirby E, Van Etten W, Twigger S, Roberts W, Drmic I, Jilderda S, Modi BM, Kellam B, Szego M, Cytrynbaum C, Weksberg R, Zwaigenbaum L, Woodbury-Smith M, Brian J, Senman L, Iaboni A, Doyle-Thomas K, Thompson A, Chrysler C, Leef J, Savion-Lemieux T, Smith IM, Liu X, Nicolson R, Seifer V, Fedele A, Cook EH, Dager S, Estes A, Gallagher L, Malow BA, Parr JR, Spence SJ, Vorstman J, Frey BJ, Robinson JT, Strug LJ, Fernandez BA, Elsabbagh M, Carter MT, Hallmayer J, Knoppers BM, Anagnostou E, Szatmari P, Ring RH, Glazer D, Pletcher MT, Scherer SW.C Yuen RK, et al.Nat Neurosci. 2017 Apr;20(4):602-611. doi: 10.1038/nn.4524. Epub 2017 Mar 6.Nat Neurosci. 2017.PMID:28263302Free PMC article.
  • Genome-wide de novo risk score implicates promoter variation in autism spectrum disorder.
    An JY, Lin K, Zhu L, Werling DM, Dong S, Brand H, Wang HZ, Zhao X, Schwartz GB, Collins RL, Currall BB, Dastmalchi C, Dea J, Duhn C, Gilson MC, Klei L, Liang L, Markenscoff-Papadimitriou E, Pochareddy S, Ahituv N, Buxbaum JD, Coon H, Daly MJ, Kim YS, Marth GT, Neale BM, Quinlan AR, Rubenstein JL, Sestan N, State MW, Willsey AJ, Talkowski ME, Devlin B, Roeder K, Sanders SJ.An JY, et al.Science. 2018 Dec 14;362(6420):eaat6576. doi: 10.1126/science.aat6576.Science. 2018.PMID:30545852Free PMC article.
  • An integrative analysis of non-coding regulatory DNA variations associated with autism spectrum disorder.
    Williams SM, An JY, Edson J, Watts M, Murigneux V, Whitehouse AJO, Jackson CJ, Bellgrove MA, Cristino AS, Claudianos C.Williams SM, et al.Mol Psychiatry. 2019 Nov;24(11):1707-1719. doi: 10.1038/s41380-018-0049-x. Epub 2018 Apr 27.Mol Psychiatry. 2019.PMID:29703944
  • Genetic architecture of autism spectrum disorder: Lessons from large-scale genomic studies.
    Choi L, An JY.Choi L, et al.Neurosci Biobehav Rev. 2021 Sep;128:244-257. doi: 10.1016/j.neubiorev.2021.06.028. Epub 2021 Jun 21.Neurosci Biobehav Rev. 2021.PMID:34166716Review.
  • Voltage-gated Calcium Channels and Autism Spectrum Disorders.
    Breitenkamp AF, Matthes J, Herzig S.Breitenkamp AF, et al.Curr Mol Pharmacol. 2015;8(2):123-32. doi: 10.2174/1874467208666150507105235.Curr Mol Pharmacol. 2015.PMID:25966693Review.
See all similar articles

Cited by

See all "Cited by" articles

References

    1. Schizophrenia Working Group of the Psychiatric Genomics, C. Biological insights from 108 schizophrenia-associated genetic loci. Nature. 2014;511:421–7. - PMC - PubMed
    1. Astle WJ, et al. The Allelic Landscape of Human Blood Cell Trait Variation and Links to Common Complex Disease. Cell. 2016;167:1415–1429. e19. - PMC - PubMed
    1. de Lange KM, et al. Genome-wide association study implicates immune activation of multiple integrin genes in inflammatory bowel disease. Nat Genet. 2017;49:256–261. - PMC - PubMed
    1. Sanders SJ, et al. Insights into Autism Spectrum Disorder Genomic Architecture and Biology from 71 Risk Loci. Neuron. 2015;87:1215–33. - PMC - PubMed
    1. Deciphering Developmental Disorders, S. Prevalence and architecture of de novo mutations in developmental disorders. Nature. 2017;542:433–438. - PMC - PubMed

Publication types

MeSH terms

Substances

Related information

Grants and funding

LinkOut - more resources

Full text links
Nature Publishing Group full text link Nature Publishing Group Free PMC article
Cite
Send To

NCBI Literature Resources

MeSHPMCBookshelfDisclaimer

The PubMed wordmark and PubMed logo are registered trademarks of the U.S. Department of Health and Human Services (HHS). Unauthorized use of these marks is strictly prohibited.


[8]ページ先頭

©2009-2025 Movatter.jp