Genetic point mutation that results in an amino acid change in a protein
Ingenetics, amissense mutation is apoint mutation in which a singlenucleotide change results in acodon that codes for a differentamino acid.[1] It is a type ofnonsynonymous substitution. Missense mutations change amino acids, which in turn alter proteins and may alter a protein's function or structure.[2] These mutations may arise spontaneously from mutagens like UV radiation,[3] tobacco smoke,[4] an error inDNA replication,[5] and other factors. Screening for missense mutations can be done by sequencing the genome of an organism and comparing the sequence to areference genome to analyze for differences.[6] Missense mutations can be repaired by the cell when there are errors in DNA replication by using mechanisms such as DNA proofreading andmismatch repair.[7][8] They can also be repaired by using genetic engineering technologies[9] or pharmaceuticals.[10][11] Some notable examples of human diseases caused by missense mutations areRett syndrome,[12]cystic fibrosis,[13] andsickle-cell disease.[14]
This image shows an example of missense mutation. One of the nucleotides (adenine) is replaced by another nucleotide (cytosine) in the DNA sequence. This results in an incorrect amino acid (proline) being incorporated into the protein sequence.
Missense mutation refers to a change in one amino acid in aprotein arising from apoint mutation in a single nucleotide.[1] Amino acids are the building blocks of proteins. Missense mutations are a type ofnonsynonymous substitution in a DNA sequence.[15] Two other types of nonsynonymous substitutions arenonsense mutations, in which a codon is changed to a prematurestop codon that results in the resulting protein being cut short,[16] andnonstop mutations, in which a stop codon deletion results in a longer but nonfunctional protein.[17] The latter two types are not considered to be missense mutations.
Point mutation categories. Missense mutations are a type of nonsynonymous point mutation.
Not all missense mutations lead to appreciable protein changes.[15][23] An amino acid may be replaced by a different amino acid of very similar chemical properties in which case the protein may still function normally; this is termed aconservative mutation.[23] Alternatively, the amino acid substitution could occur in a region of the protein which does not significantly affect the protein secondary structure or function.[15] Lastly, when more than one codon codes for the same amino acid (termed "degenerate coding"), the resulting mutation does not produce any change in translation and hence no change in protein is observed; degenerate coding would be classified as asynonymous substitution,[24] or a silent mutation, and not a missense mutation.[15]
Missense mutations may be inherited or arise spontaneously, termedde novo mutations.[25] Well studied diseases arising from inherited missense mutations include sickle cell anemia,[26] cystic fibrosis,[13] and early-onset Alzheimer's[27] and Parkinson's disease.[28] De novo mutations that increase or decrease the activity of synapses have been implicated in the development of neurological and developmental disorders,[29] such a Autism Spectrum Disorder[29] and intellectual delay.[25]
Environmental mutagens, such as tobacco smoke or UV radiation, may be a cause of spontaneous missense mutations.[4][3] Tobacco smoke has been implicated intransversion mutations in the K-ras gene, with a meta-analysis of lung carcinomas showing 25 tumours containing a G to T mutation causing an amino acid change from glycine to cysteine, and 11 tumours with a G to T mutation causing an amino acid change from glycine to valine.[4] Similarly, numerous studies have shown ultraviolet light induces missense mutations in the p53 gene,[3][30] which when unregulated, reduces the cell's ability to recognize DNA damage and engage inapoptosis, leading to cell proliferation and potential skin carcinogenesis.[3]
Spontaneous tautomerization of adenine, resulting in adenine mispairing with cytosine, rather than thymine, after replication. Future replications would lead to cytosine pairing with guanine, instead of an adenine-thymine pair in that location, resulting in a missense mutation.
DNA polymerase replication errors during cell division may lead to spontaneous missense mutations if DNA polymerase's proofreading ability does not detect and repair an error it makes.[25] Spontaneous DNA polymerase errors are estimated to occur at a frequency of 1/109 base pairs.[25]
Although rarer,tautomerization of bases also creates spontaneous missense mutations.[31] Tautomerization occurs when hydrogen atoms on DNA bases spontaneously change locations, impacting the structure of the base, and allowing it to pair with an incorrect base.[32] If this strand of DNA is replicated, the incorrect base will be the template for a new strand, leading to a mutation, possibly changing the amino acid and therefore, the protein.[5] For example, Wang et al., (2011) used X-ray cystallography to demonstrate that a de novo mutation was created when DNA repair mechanisms did not recognize a C-A base mismatch due to tautomerization allowing the base structures to be compatible.[33]
Next Generation Sequencing (NGS) has changed the world of sequencing by decreasing the cost of sequencing and increasing the throughput.[34] It does this by utilizingmassively parallel sequencing to sequence the genome. This involves clonally amplifiedDNA fragments that can be spatially separated into second generation sequencing (SGS) or third generation sequencing (TGS) platforms.[35] There is variation between these protocols, but the overall methods are similar. Using massively parallel sequencing allows the NGS platform to produce very large sequences in a single run.[36] The DNA fragments are typically separated by length using gel electrophoresis.
NGS consists of four main steps, DNA isolation, target enrichment, sequencing, and data analysis.[36] The DNA isolation step involves breaking the genomic DNA into many small fragments.[6] There are many different mechanisms that can be used to accomplish this such as mechanical methods, enzymatic digestion, and more.[6] This step also consists of adding adaptors to either end of the DNA fragments that are complementary to the flow cell oligos and include primer binding sites for the target DNA.[36] The target enrichment step amplifies the region of interest. This includes creating a complementary strand to the DNA fragments through hybridization to a flow cell oligo.[36] It then gets denatured and bridge amplification occurs before the reverse strand is finally washed and sequencing can occur. The sequencing step involves massive parallel sequencing of all DNA fragments simultaneously using a NGS sequencer. This information is saved and analyzed in the last step, data analysis, using bioinformatics software.[6] This compares the sequences to a reference genome to align the fragments and show mutations in the targeted area of the sequence.[6]
Newborn screening (NBS) for missense mutations is increasingly incorporating genomic technologies in addition to traditional biochemical methods to improve the detection of genetic disorders early in life. Traditional NBS primarily relies on biochemical assays, such astandem mass spectrometry,[37] to detect metabolic abnormalities indicative of conditions likephenylketonuria orcongenital hypothyroidism.[38] However, these methods may miss genetic causes or produce ambiguous results. To address these deficiencies,next-generation sequencing (NGS) is being added to NBS programs.[39] For instance, targeted gene panels andwhole-exome sequencing (WES) are used to identify disease causing missense mutations in genes associated with treatable conditions, such assevere combined immunodeficiency (SCID) andcystic fibrosis. Studies like the BabyDetect project have demonstrated the utility of genomic screening in identifying disorders missed by conventional methods, with actionable results for conditions affecting more than 400 genes.[40][41] In addition, genomic approaches allow for the detection of rare or recessive conditions that may not manifest biochemically at birth, significantly expanding the scope of diseases screened.[42] These advancements align with the established principles of NBS, which emphasize early detection and intervention to prevent morbidity and mortality.[43]
Three mechanisms of DNA repair are represented in simplified form. DNA proofreading and mismatch repair are used to fix missense mutations. Nucleotide excision repair is used to repair large DNA lesions, not missense mutations[44].
DNA polymerases, used inDNA replication, have a high specificity of 104 to 106-fold in base pairing.[7] They have proofreading abilities to correct incorrect matches, allowing 90-99.9% of mismatches to be excised and repaired.[7] The base mismatches that go unnoticed are repaired by the DNA mismatch repair pathway, also inherent in cells.[45][8] The DNA mismatch repair pathway usesexonucleases that move along the DNA strand and remove the incorrectly incorporated base in order for DNA polymerase to fill in the correct base.[45]Exonuclease1 is involved in many DNA repair systems and moves 5' to 3' on the DNA strand.[46]
More recently, research has explored the use ofgenetic engineering[9] and pharmaceuticals as potential treatments.[10][11] tRNA therapies have emerged in research studies as a potential missense mutation treatment, following evidence supporting their use in nonsense mutation correction.[47] Missense-correcting tRNAs are engineered to identify the mutated codon, but carry the correct charged amino acid which is inserted into the nascent protein.[9]
Pharmaceuticals that target specific proteins affected by missense mutations have also shown therapeutic potential.[10][11] Pharmaceutical studies have particularly focused on targeting the p53 mutant protein and Ca2+ channel abnormalities, both caused by gain of function missense mutations due to their high prevalence in a number of cancers and genetic diseases respectively.[11][47] In cystic fibrosis, most commonly caused by missense mutations,[48] drugs known as modulators target the defective Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) protein.[49] For example, to reduce the defects caused by class III CFTR mutations, Ivacaftor, part of the modulator Kalydeco, forces the chloride channel to remain in an open position.[50]
Gene therapy is being explored as a treatment for missense mutations. This involves inserting the correct sequence of DNA into an incorrect gene.[50] Artificial Intelligence programs, such as AlphaFold, are also being developed to predict the effect of missense mutations.[51] Identifying potential deleterious mutations can assist with disease diagnosis and treatment.[51]
Diverged nucleotide sequence demonstrating how sequences diverge over time. Red letters are nucleotides changed from the original sequence.
If a missense mutation is not deleterious, it will not be selected against and can contribute tospecies divergence.[52][53] Over time, mutations occur randomly in individuals and can becomefixed in populations if they are not selected against.[54] Missense mutations are a type of mutation that are notneutral, and therefore can be acted on by selection. Selection cannot act on synonymous mutations (mutations that do not change anything phenotypically).[55]
Tracking missense mutations, like nonsynonymous SNPs, in ancestral species populations allow genealogies and phylogenetic trees to be created and evolutionary connections to be made.[56] Missense mutation analysis is often used in evolutionary genetics to create relationships between species, as amino acid changes leading to protein changes are needed for species to diverge from each other.[57]
Wild type (left) and mutated (right) form of lamin A (pdb id: 1IFR). Normally, Arginine 527 (blue) formssalt bridge with glutamate 537 (magenta), but R527L substitution results in breaking this interaction (leucine has a nonpolar tail and therefore cannot form a static salt bridge).
LMNA missense mutation (c.1580G>T) introduced at LMNA gene – position 1580 (nt) in the DNA sequence (CGT) causing theguanine to be replaced with thethymine, yielding CTT in the DNA sequence. This results at the protein level in the replacement of thearginine by theleucine at the position 527.[58] This leads to destruction ofsalt bridge and structure destabilization. Atphenotype level this manifests with overlappingmandibuloacral dysplasia andprogeria syndrome.
Missense mutations in the MeCP2 protein can causeRett syndrome, otherwise known as the RTT phenotype.[59] This phenotype primarily effects females, as males do not live with this mutation past infancy.[59] T158M, R306C and R133C are the most common missense mutations causing RTT.[59] T158M is a mutation of anadenine being substituted for aguanine causing thethreonine at amino acid position 158 being substituted with amethionine.[60] R133C is a mutation of acytosine at base position 417 in the gene encoding theMeCP2 protein being substituted for athymine, causing an amino acid substitution at position 133 in the protein ofarginine withcysteine.[12]
(1) Normal red blood cells and (2) sickled-cell red blood cells
Sickle-cell disease changes the shape of red blood cells from round to sickle shaped.[61] In the most common variant of sickle-cell disease, the 20th nucleotide of the gene for thebeta chain ofhemoglobin is altered from thecodon GAG to GTG.[61] Thus, the 6th amino acid,glutamic acid, is substituted byvaline—notated as an "E6V" or a "Glu6Val" mutation—which causes the protein to be sufficiently altered with a sickle-cell phenotype.[62] The affected cells cause issues in the bloodstream as they can become sticky due to their improper ion transport leading to them being susceptible to water loss.[14] This can cause a buildup of blood cells that obstructs blood flow to any organ in the body.[14]
Other conditions that can be caused by missense mutations
^abAmir RE, Van den Veyver IB, Wan M, Tran CQ, Francke U, Zoghbi HY (October 1999). "Rett syndrome is caused by mutations in X-linked MECP2, encoding methyl-CpG-binding protein 2".Nature Genetics.23 (2):185–188.doi:10.1038/13810.PMID10508514.
^abcParker JC, Orringer EP (1990-04-27). "Sickle Cell Disease. Charles F. Whrrren and John F. Bertles, Eds. New York Academy of Sciences, New York, 1989. xiv, 477 pp., illus. $119. Annals of the New York Academy of Sciences, vol. 565. From a conference, Bethesda, MD, April 1988".Science.248 (4954): 502.doi:10.1126/science.248.4954.502.ISSN0036-8075.PMID17815604.
^abcdBrown TA (2002). "Chapter 14, Mutation, Repair and Recombination".Genomes. 2nd edition. Oxford: Wiley-Liss.
^Miura Y, Nakagomi S (September 2021). "Management of Cutaneous Manifestations of Genetic Epidermolysis Bullosa: A Multiple Case Series".Journal of Wound, Ostomy, and Continence Nursing.48 (5):453–459.doi:10.1097/WON.0000000000000784.PMID34495939.
^Piel FB, Steinberg MH, Rees DC (April 2017). Longo DL (ed.). "Sickle Cell Disease".The New England Journal of Medicine.376 (16):1561–1573.doi:10.1056/NEJMra1510865.PMID28423290.
^Xu J (2014).Next-generation sequencing: current technologies and applicaitons [sic?]. Norfolk: Caister academic press.ISBN978-1-908230-33-1.
^abcdValencia CA, Pervaiz MA, Husami A, Qian Y, Zhang K (2013).Next Generation Sequencing Technologies in Medical Genetics. SpringerBriefs in Genetics. New York, NY: Springer New York.doi:10.1007/978-1-4614-9032-6.ISBN978-1-4614-9031-9.