NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.
Adam MP, Feldman J, Mirzaa GM, et al., editors. GeneReviews® [Internet]. Seattle (WA): University of Washington, Seattle; 1993-2025.
Stephanie E Wallace, MD andLora JH Bean, PhD.
Author Information and AffiliationsInitial Posting:March 14, 2017; Last Revision:June 18, 2020.
Estimated reading time: 30 minutes
Note: This information, provided by the editors ofGeneReviews, is intended both for individuals who have limited experience with comprehensive genetic testing (seeIntroductory Information) and for clinicians who routinely order comprehensive genetic testing (seeDetailed Information). – The Editors
This discussion addresses clinical tests available through CLIA-certified laboratories in the United States. Research testing is not discussed.
Many inherited disorders and phenotypes are genetically heterogeneous – that is, pathogenic variants in more than onegene can cause onephenotype (e.g., dilated cardiomyopathy, ataxia, hereditary hearing loss and deafness) or one genetic disorder (e.g., Noonan syndrome). Prior to the development of massively parallel sequencing (also known asnext-generation sequencing), the only cost-effective way to test more than one gene was serial single-gene testing (i.e., complete testing of one gene that might account for the phenotype before proceeding to testing of the next gene) ‒ an expensive and time-consuming approach with a potentially low yield. In the past ten years, improvements in massively parallel sequencing techniques have led to the development and widespread clinical use of multigene panels, which allow simultaneous testing of two to more than 150 genes. The methods used in multigene panels may includesequence analysis,deletion/duplication analysis, and/or other non-sequencing-based tests.
There are two types of multigene panels:
The humanexome includes all coding nuclear DNA sequences, approximately 180,000 exons that are transcribed into mature RNA. (Note that mitochondrial DNA is not included in the exome.) Comprising only 1%-2% of the human genome, the exome nonetheless contains the majority of currently recognized disease-causing variants.
Exome sequencing is a laboratory test designed to identify and analyze the sequence of all protein-coding nuclear genes in the genome. Approximately 95% of theexome can be sequenced with currently available techniques. The diagnostic utility ofexome sequencing has consistently been 20%-30% (i.e., a diagnosis is identified in 20%-30% of individuals who were previously undiagnosed but had features suggestive of a genetic condition) [Gahl et al 2012,Lazaridis et al 2016].
In the past five years,exome sequencing has increasingly become clinically available because:
The human genome includes all coding and noncoding nuclear and mitochondrial DNA sequences. Nuclear DNA encodes most of the more than 20,000 genes in humans; mitochondrial DNA encodes 37 genes. Most of the more than 3.2 billion base pairs that comprise the human genome are repetitive DNA or noncoding sequences – including noncoding RNAs, variants in which have been attributed to specific inherited disorders.
Genome sequencing is a laboratory test designed to identify and analyze the sequence of all coding and noncoding nuclear DNA. Mitochondrial DNA is part of the genome; however, mitochondrial sequencing is often ordered as a separate laboratory test.
Genome sequencing continues to be significantly more costly thanexome sequencing because of the high cost of data analysis. However, the diagnostic utility (20%-30%) is roughly the same for the two test methods: althoughgenome sequencing can identify variants outside of the coding regions, determination of pathogenicity of these variants is often not possible. Therefore, most confirmed pathogenic variants identified by genome sequencing are within exons [Taylor et al 2015].
Achromosomal microarray (CMA) is a molecular genetic test used to detect copy number variants (CNVs); CNVs are deletions (loss) or duplications (gain) ofchromosome material that range in size from approximately one kilobase (kb) to multiple megabases (Mb), with the largest CNVs resulting in a loss or gain of an entire chromosome. Depending on the size andgenomic location of a CNV, thedeletion orduplication may contain zero, one, or many genes. CNVs may be benign, pathogenic, or of uncertain clinical significance.
The most common types of CMA are oligonucleotide arraycomparative genomic hybridization (oligo aCGH), single-nucleotidepolymorphismgenotyping array (SNP array), and oligo aCGH / SNP combination array. CMA can be designed to identify deletions and duplications across the genome or in a targeted region(s) of the genome.
CMA is more sensitive at detecting CNVs thankaryotype analysis, which has largely been supplanted by CMA. High-resolution karyotype analysis can detect deletions as small as 3-5 Mb and duplications larger than ~5 Mb, whereas most CMA can detect CNVs as small as 100 kb. Oligo aCGH arrays, specifically, can be designed to detect CNVs as small as a singleexon.
CMA has been available as a clinical diagnostic test since 2004 and is recommended as a first-line test for individuals with developmental delay, intellectual disability, multiplecongenital anomalies, and/or autism spectrum disorder. For these disorders, CMA has a diagnostic yield of 15%-20%, compared to the 3% yield of a traditionalkaryotype [Manning et al 2010,Miller et al 2010].
Exome sequencing requires sequence enrichment to target exons and sequence eachexon. Genetic alterations that cannot reliably be detected byexome sequencing include alterations that:
Disorders resulting from genetic alterations that are not reliably identified byexome sequencing because of technical limitations:
Genetic disorders that may not be recognized during analysis ofexome sequencing due to unexpected inheritance pattern:
Disorders for which associated genes and/or pathogenic variants have not been reported:
Disorders that are not known to be genetic:
Laboratory-dependent variables. Read depth (sometimes calledexome coverage) and accuracy of base calling
Additional laboratory-dependent variables. Sequence enrichment method used to target exons, sequencing technique, and length of sequence generated
Laboratory-dependent variables introduced by analysis of data. Application of filters and analysis of remaining unfiltered variants
Because the expertise of the molecular geneticist(s) and the data available vary, variant classification, and therefore clinicalsensitivity, are laboratory dependent to some degree.
Laboratory-independent variables. Source of the DNA; GC (guanine-cytosine) content of the region
Genome sequencing is typically performed bynext-generation sequencing of shearedgenomic DNA. Genome sequencing techniques have nonstandardized, highly variable coverage.
Whilegenome sequencing is significantly more costly thanexome sequencing, it has distinct advantages:
The coverage of the genome is less than 100% and varies by laboratory.Telenti et al [2016] sequenced more than 10,000 genomes at a mean read depth of 30-40x (i.e., each DNA fragment was sequenced an average of 30 to 40 times); the authors reported that 91.5% of exons and 95.2% of knownpathogenic variant positions could be sequenced with high confidence. The clinicalsensitivity ofgenome sequencing is unknown.
Althoughgenome sequencing can identify variants outside of the coding regions, most of the confirmed pathogenic variants identified by genome sequencing are within theexome [Taylor et al 2015]. The diagnostic utility ofexome sequencing and genome sequencing (~20%-30%) remains similar. As more noncoding pathogenic variants are identified, the clinicalsensitivity and value of genome sequencing should increase.
Mendelian disorders caused by:
Alterations that:
Disorders caused by genetic alterations that are not identified ongenome sequencing because of technical limitations:
Disorders for which associated genes and/or pathogenic variants have not been reported:
Disorders that are not known to be genetic:
Clinicalsensitivity (the ability to identify pathogenic variants causative of known clinical disorders) can be superior.
Results can be easier to analyze. Because fewer genes are sequenced, fewer variants will be identified. Therefore, multigene panels often have the following advantages:
Exome sequencing andgenome sequencing do not require the clinician to determine which disorders (and, hence, which genes) are likely to be involved; thus, testing can be ordered earlier in a patient’s diagnostic evaluation because extensive clinical evaluations, laboratory testing, and radiographic evaluations are not needed to identify diagnostic clues that would lead the clinician to suspect a specific disorder(s).
Exome sequencing andgenome sequencing can detect the presence of two or more genetically distinct disorders (the phenotypic presentation of which may have complicated diagnosis) in the same individual [Yang et al 2013,Adams et al 2014].
Using amultigene panel forces the clinician to select the best panel for the patient. Selection is often difficult because:
Achromosomal microarray (CMA) is a molecular genetic test used to detect copy number variants (CNVs), i.e., deletions (loss) or duplications (gain) of chromosomal material. CNVs range in size from approximately one kilobase (kb) to multiple megabases (Mb), with the largest CNVs resulting in a loss or gain of an entirechromosome, or multiple chromosomes (such as ahaploid set of 23 chromosomes). Depending on the size andgenomic location of a CNV, thedeletion orduplication may include no known genes, onegene, or many genes. CNVs may be benign, pathogenic, or of uncertain clinical significance.
Prior to the development of CMA, detection of CNVs was limited to what could be seen on high-resolutionkaryotype analysis (i.e., deletions as small as three to five Mb and duplications larger than approximately five Mb). Identification of smaller CNVs usingfluorescent in situ hybridization (FISH) analysis required the clinician to determine thechromosome region of interest. In contrast, CMA can detect CNVs smaller than those identified with high-resolution karyotype analysis, and the clinician does not need to determine a region of interest. CMA, which has been available as a clinical diagnostic test since 2004, is recommended as a first-line test for individuals with developmental delay, intellectual disability, multiplecongenital anomalies, and/or autism spectrum disorder. For these disorders, the diagnostic yield of CMA (15%-20%) is greater than that of karyotype analysis (~3%) [Manning et al 2010,Miller et al 2010].
Thesensitivity of CMA depends on the following:
Advantages of CMA compared tokaryotype analysis
Limitations of CMA compared tokaryotype analysis
The American College of Medical Genetics and Genomics (ACMG) and the Clinical Genome Resource (ClinGen) have recently introduced a technical standard for interpretation of CNVs using an evidence-based scoring system [Riggs et al 2020]. This guidance considers several factors when evaluating the potential pathogenicity of a CNV detected by CMA [Miller et al 2010] including:
Common characteristics of pathogenic orlikely pathogenic CNVs
Common characteristics of CNVs ofuncertain significance
Common characteristics of benign orlikely benign CNVs
Syndromes Caused by Recurrent Deletions and Duplications
Deletion/Duplication Syndrome (Chromosome Locus) 1 | Approximate Size | ISCA ID 2 | Region Location 3 | Genes of Interest in This Region |
---|---|---|---|---|
1q21.1 recurrent microdeletion | 1.35 Mb | ISCA-37421 | GRCh38/hg38 chr1: 147,105,904-147,922,392 | GJA5 GJA8 |
3q29 recurrent deletion | 1.6 Mb | ISCA-37443 | GRCh38/hg38 chr3: 196,029,183-197,617,791 | DLG1 FBXO45 PAK2 RNF168 |
3q29duplication syndrome (OMIM611936) | ||||
Sotos syndrome (5q35deletion) | 1.9 Mb | ISCA-37425 | GRCh38/hg38 chr5: 176,301,976-177,620,792 | NSD1 |
Williams syndrome (7q11.23deletion) | 1.5-1.8 Mb | ISCA-37392 | GRCh38/hg38 chr7: 73,330,452-74,728,172 | LIMK1 FTF2I STX1A BAZ1B CLIP2 GTF2IRD1 NCF1 |
7q11.23 duplication syndrome | ELN GTF2I | |||
10q22.3-q23.2deletion syndrome (OMIM612242) | 7 Mb | ISCA-37424 | GRCh38/hg38 chr10: 79,923,087-86,979,631 | BMPR1A GRID1 NRG3 |
Angelman syndrome (15q11.2-q13 maternaldeletion) | 7 Mb 4 | ISCA-37404 | GRCh38/hg38 chr15: 22,782,170-28,134,728 | UBE3A |
5 Mb 5 | ISCA-37478 | GRCh38/hg38 chr15: 23,465,365-28,134,728 | ||
Prader-Willi syndrome (15q11.2-q13 paternaldeletion) | 7 Mb 4 | ISCA-37404 | GRCh38/hg38 chr15: 22,782,170-28,134,728 | SNURF-SNRPN OCA2 NECDIN MAGEL2 SNORD116 |
5 Mb 5 | ISCA-37478 | GRCh38/hg38 chr15: 23,465,365-28,134,728 | ||
15q duplication syndrome 6 (15q11.2-q13.1) | 7 Mb 4 | ISCA-37404 | GRCh38/hg38 chr15: 22,782,170-28,134,729 | UBE3A GABRB3 GABRA5 GABRG3 HERC2 |
5 Mb 5 | ISCA-37478 | GRCh38/hg38 chr15: 23,513,243-28,312,040 | ||
15q13.3 microdeletion | ~2.0 Mb 7 | ISCA-37411 | GRCh38/hg38 chr15: 30,900,686-32,153,204 | CHRNA7 OTUD7A |
16p12.2 recurrent deletion | 520 kb | NA | GRCh37/ hg19 chr 16: ~21,950,000-~22,470,000 | UQCRC2 CDR2 POLR3E EEF2K MOSMO |
16p11.2 recurrent microdeletion | 593 kb | ISCA-37400 | GRCh38/hg38 chr16: 29,638,676-30,188,531 | PRRT2 KCTD13 TBX6 |
16p11.2duplication syndrome (OMIM614671) | 593 kb | Unknown | ||
16p11.2deletion syndrome, 220-kb (OMIM613444) | 220 kb | ISCA-37486 | GRCh38/hg38 chr16: 28,811,314-29,035,178 | SH2B1 |
Hereditary neuropathy with liability to pressure palsies (17p12deletion) | 1.5 Mb | ISCA-37436 | GRCh38/hg38 chr17: 14,194,598-15,519,638 | PMP22 |
Charcot-Marie-Tooth neuropathy type 1A (17p12duplication; OMIM118220) | ||||
Smith-Magenis syndrome (17p11.2deletion) | 3.7 Mb | ISCA-37418 | GRCh38/hg38 chr17: 16,906,714-20,309,889 | RAI1 |
Potocki-Lupski syndrome (17p11.2duplication) | ||||
Neurofibromatosis 1 (17q11.2deletion) | 1.0-1.4 Mb | ISCA-37431 | GRCh38/hg38 chr17: 30,780,079-31,937,008 | NF1 |
17q11.2duplication syndrome (OMIM613675) | ||||
17q12 recurrent deletion syndrome | 1.4 Mb | ISCA-37432 | GRCh38/hg38 chr17: 36,458,167-37,854,616 | ACACA LHX1 HNF1B |
17q12 recurrent duplication | HNF1B | |||
Koolen-de Vries syndrome (17q21.31deletion) | 500 kb | ISCA-37420 | GRCh38/hg38 chr17: 45,627,800-46,087,514 | KANSL1 |
22q11.2 deletion syndrome | 1.5 Mb | ISCA-37433 | GRCh38/hg38 chr22: 18,924,718-20,299,685 | TBX1 |
3 Mb | ISCA-37446 | GRCh38/hg38 chr22: 18,924,718-21,111,383 | ||
22q11.2duplication (OMIM608363) | 1.5 Mb | ISCA-37433 | GRCh38/hg38 chr22: 18,924,718-20,299,685 | Unknown |
3 Mb | ISCA-37446 | GRCh38/hg38 chr22: 18,924,718-21,111,383 | ||
22q11.2deletion syndrome, distal (OMIM611867) | 1.1-2.1 Mb 9 | ISCA-37397 | GRCh38/hg38 chr22: 21,562,828-23,306,924 | TOP3B |
22q11.2duplication syndrome, distal 8 | Unknown | |||
Xq28duplication (OMIM300815) | 0.3 Mb | ISCA-37439 | GRCh38/hg38 chrX: 154,396,223-154,555,683 | GDI1 |
Xq28 duplication syndrome, Int22h1/Int22h2 mediated | 0.5 Mb | ISCA-37494 | GRCh38/hg38 chrX: 154,890,328-155,335,092 | CLIC2 RAB39B |
Data are compiled from the following standard references:chromosomelocus from OMIM, genes from HGNC.
Standardized clinical annotation and interpretation forgenomic variants from the Clinical Genome Resource (ClinGen) project (formerly the International Standards for Cytogenomic Arrays [ISCA] Consortium)
Genomic coordinates represent the minimumdeletion/duplication size as designated by ClinGen. Coordinates may vary slightly based on array design used by the testing laboratory. Note that the size of the microdeletion/microduplication as calculated from thesegenomic positions may differ from the expected size due to the presence of segmental duplications near breakpoints.
Class 1deletion/duplication, extending from BP1 to BP3
Class 2deletion/duplication, extending from BP2 to BP3
Approximately 80% of individuals have a maternal isodicentric 15q11.2-q13.1 supernumerarychromosome – idic(15) – that typically comprises two extra copies of 15q11.2-q13.1, resulting in tetrasomy for 15q11.2-q13.1. Approximately 20% of individuals have one extra copy.
Extending from BP4 to BP5
Includes deletions extending from LCR22-D to either LCR22-E or –F
Pathogenic variants that cannot be reliably detected regardless of CMA platform
Genome-wide CMAs interrogate the entire genome for CNVs as small as 100 kb. Probes cover the genome with an increased number and density overchromosome regions associated with known microdeletions/microduplications (Table 1).
Exon-focusedCMAs detect CNVs that involve one or more exons. A high number of densely spaced probes cover eachexon, increasing the resolution of the CMA for expressed genes.
Customized CMAs detect CNVs within a specific region of interest, either intra- or inter-genic, using a high number of densely spaced probes that cover a specificgenomic region.
Note: All CMAs have genome-wide probe coverage, although the coverage may be minimal, to allow for quality control and data analysis.
The most common CMA platforms:
Oligonucleotide arraycomparative genomic hybridization (oligo aCGH) detects differences in DNA content (copy number) between two individuals, usually an affected individual and a healthy control. DNA from an affected individual is fluorescently labeled with one dye; DNA from a healthy control is labeled with a different dye. The DNA samples are co-hybridized to an array in which hundreds of thousands of oligonucleotide probes are attached to the surface. After time to allow hybridization, the excess DNA is washed off, and the fluorescent signals are measured from each dye at each oligonucleotide. The fluorescence intensity of hybridized DNA from an affected individual and fluorescence intensity of hybridized DNA from the healthy control are measured, and the ratio of the fluorescence intensity for each dye is plotted on a log2 scale.
A log2 ratio of zero indicates a normal DNA copy number in the affected individual, whereas a reduced log2 ratio indicates adeletion, and an increased log2 ratio indicates aduplication (seeFigure 1). Because a single oligonucleotide probe with an aberrant log2 ratio may represent laboratory artifact, several adjacent oligonucleotide probes with the same deviation are required to confirm the presence of a CNV. The number of adjacent probes required to confirm the presence of a CNV is determined by the laboratory performing the test. Due to the comparative nature of the analysis, same-sex control are used and ploidy abnormalities cannot be detected.
Single-nucleotidepolymorphismgenotyping arrays (SNP arrays) determine thegenotype of an individual at selected singlebase pair sites in that person's genome. These single base pair sites are selected because they are likely to be polymorphic (i.e., the nucleotide at the site varies among individuals). Typically, at each polymorphic site (SNP), there are two possible alleles: the major (or reference)allele referred to as the "A" allele, and the minor (non-reference) allele referred to as the "B" allele. At each SNP, an individual may behomozygous for the reference allele (AA), a compoundheterozygote for the reference allele and the non-reference allele (AB), or homozygous for the non-reference allele (BB).
Similar to oligo aCGH, SNP arrays also rely on fluorescence-based visualization ofgenomic DNA bound to oligonucleotide probes fixed to an array. However, rather than the comparative hybridization of two samples used in oligo aCGH, SNP arrays hybridize DNA fragments from the affected individual's sample in anallele-specific manner. The "A" allele is labeled with one fluorescent signal and the "B" allele is labeled with a different fluorescent signal.
The two pieces of information gathered for each SNP (seeFigure 2):
Therelative fluorescence intensity for the two alleles at each site, which represents the allelic ratio, referred to asB-allele frequency. Most of the genome is diploid (eachchromosome and thus each allele is present in two copies); therefore, an individual will have an "AA," "AB," or "BB"genotype at each SNP, and the B-allele frequency will be 0, 0.5 or 1, respectively. If the SNP is in a deleted region, the genotype (i.e., "A" or "B") will appearhomozygous (i.e., "AA" or "BB"), with a B-allele frequency of 0 or 1, respectively. If the SNP is in a region of aduplication, the possible genotypes for each SNP include: "AAA," "AAB," "ABB," or "BBB" with a B-allele frequency of 0, 0.3, 0.6, or 1, respectively.
Thetotal fluorescence intensity indicates the number of alleles at a specific SNP in an individual. The data are normalized using external control samples and plotted on a log2 scale. An individual with two alleles at a specific SNP will have a log2 ratio of zero. An individual with aheterozygousdeletion that includes the SNP will have a reduced log2 ratio. An individual with a heterozygousduplication including the SNP will have an increased log2 ratio.
CNVs are identified using a combination of the B-allele frequency and total fluorescence intensity of adjacent SNPs; several adjacent SNPs with the same deviation are required to confirm the presence of a CNV.
Oligo/SNP combination arrays use an oligo aCGH platform that includes both oligonucleotide probes to identify CNVs and probes for select SNPs. The number (and therefore the density) of SNP probes is usually much lower in combination arrays than in SNPgenotyping arrays. However, the inclusion of SNP probes allows for the detection ofuniparental isodisomy and large stretches of copy-neutral homozygosity (seeOligo aCGH vs SNP Array).
SNP arrays can detect regions of copy-neutral homozygosity (regions of the genome that are diploid and identical). In stretches of copy-neutral homozygosity, all SNPs arehomozygous ("AA" or "BB"), but the total fluorescence intensity is at the same level as diploid regions of the genome (seeFigure 3). Identification of regions of copy-neutral homozygosity may help identify:
SNP arrays can detect a lower level ofmosaicism than oligo aCGH. SNP arrays can detect mosaicism for CNVs present in ≥5% of the cells tested including gain or loss of a wholechromosome (i.e.,aneuploidy) (seeFigure 4) [Conlin et al 2010]. Oligo aCGH is slightly less sensitive, detecting mosaicism present in ≥10%-20% of cells tested.
SNP array can detect polyploidy (e.g., triploidy) which cannot be reliably detected by oligo aCGH as the presence of three fluorescence intensity ratios are normalized.
SNP arrays are more difficult to customize than oligo arrays because some regions of the genome are less variable than others. For example, exons are less likely to have nucleotides that vary among individuals. In fact, some exons do not have any variable nucleotides. As oligo aCGH does not require variability for its probes, oligo probes can cover exons that are identical among most individuals.
Database of Genomic Variants:dgv.tcag.ca/dgv/app/home
ClinGen Dosage Sensitivity Map:www.ncbi.nlm.nih.gov/projects/dbvar/clingen
DECIPHER:decipher.sanger.ac.uk
ECARUCA:www.ecaruca.net
GeneReviews® chapters are owned by the University of Washington. Permission ishereby granted to reproduce, distribute, and translate copies of content materials fornoncommercial research purposes only, provided that (i) credit for source (http://www.genereviews.org/) and copyright (© 1993-2025 University ofWashington) are included with each copy; (ii) a link to the original material is providedwhenever the material is published elsewhere on the Web; and (iii) reproducers,distributors, and/or translators comply with theGeneReviews® Copyright Notice and UsageDisclaimer. No further modifications are allowed. For clarity, excerptsof GeneReviews chapters for use in lab reports and clinic notes are a permitteduse.
For more information, see theGeneReviews® Copyright Notice and UsageDisclaimer.
For questions regarding permissions or whether a specified use is allowed,contact:ude.wu@tssamda.
Your browsing activity is empty.
Activity recording is turned off.
See more...