Addition of adenylic acids to 3' end of mature mRNA
Typical structure of a mature eukaryotic mRNA
Polyadenylation is the addition of apoly(A) tail to an RNA transcript, typically amessenger RNA (mRNA). The poly(A) tail consists of multipleadenosine monophosphates; in other words, it is a stretch of RNA that has onlyadenine bases. Ineukaryotes, polyadenylation is part of the process that produces mature mRNA fortranslation. In manybacteria, the poly(A) tail promotes degradation of the mRNA. It, therefore, forms part of the larger process ofgene expression.
The process of polyadenylation begins as thetranscription of ageneterminates. The3′-most segment of the newly made pre-mRNA is first cleaved off by aset of proteins; these proteins then synthesize the poly(A) tail at the RNA's 3′ end. In some genes these proteins add a poly(A) tail at one of several possible sites. Therefore, polyadenylation can produce more than one transcript from a single gene (alternative polyadenylation), similar toalternative splicing.[1]
The poly(A) tail is important for the nuclear export, translation and stability of mRNA. The tail is shortened over time, and, when it is short enough, the mRNA is enzymatically degraded.[2] However, in a few cell types, mRNAs with short poly(A) tails are stored for later activation by re-polyadenylation in the cytosol.[3] In contrast, when polyadenylation occurs in bacteria, it promotes RNA degradation.[4] This is also sometimes the case for eukaryoticnon-coding RNAs.[5][6]
mRNA molecules in both prokaryotes and eukaryotes have polyadenylated 3′-ends, with the prokaryotic poly(A) tails generally shorter and fewer mRNA molecules polyadenylated.[7]
Chemical structure of RNA. The sequence of bases differs between RNA molecules.
RNAs are a type of large biological molecules, whose individual building blocks are called nucleotides. The namepoly(A) tail (for polyadenylic acid tail)[8] reflects the way RNA nucleotides are abbreviated, with a letter for the base the nucleotide contains (A foradenine, C forcytosine, G forguanine and U foruracil). RNAs are produced (transcribed) from aDNA template. By convention, RNA sequences are written in a 5′ to 3′ direction. The 5′ end is the part of the RNA molecule that is transcribed first, and the 3′ end is transcribed last. The 3′ end is also where the poly(A) tail is found on polyadenylated RNAs.[1][9]
Messenger RNA (mRNA) is RNA that has a coding region that acts as a template for protein synthesis (translation). The rest of the mRNA, theuntranslated regions, tune how active the mRNA is.[10] There are also many RNAs that are not translated, called non-coding RNAs. Like the untranslated regions, many of these non-coding RNAs have regulatory roles.[11]
In nuclear polyadenylation, a poly(A) tail is added to an RNA at the end of transcription. On mRNAs, the poly(A) tail aids in transcription termination, export of the mRNA from the nucleus and translation by acting as a binding site ofPABPC1.[2][12] Almost all eukaryotic mRNAs are polyadenylated,[13] with the exception of animal replication-dependenthistone mRNAs.[14] These are the only mRNAs in eukaryotes that lack a poly(A) tail, ending instead in astem-loop structure followed by a purine-rich sequence, termed histone downstream element, that directs where the RNA is cut so that the 3′ end of the histone mRNA is formed.[15]
Many eukaryotic non-coding RNAs are always polyadenylated at the end of transcription. There are small RNAs where the poly(A) tail is seen only in intermediary forms and not in the mature RNA as the ends are removed during processing, the notable ones beingmicroRNAs.[16][17] But, for manylong noncoding RNAs – a seemingly large group ofregulatory RNAs that, for example, includes the RNAXist, which mediatesX chromosome inactivation – a poly(A) tail is part of the mature RNA.[18]
CPSF: cleavage/polyadenylation specificity factor CstF: cleavage stimulation factor PAP: polyadenylate polymerase PABII: polyadenylate binding protein 2 CFI: cleavage factor I CFII: cleavage factor II
Theprocessive polyadenylation complex in the nucleus of eukaryotes works on products ofRNA polymerase II, such asprecursor mRNA. Here, a multi-protein complex(see components on the right)[19] cleaves the 3′-most part of a newly produced RNA and polyadenylates the end produced by this cleavage. The cleavage is catalysed by the enzymeCPSF[14][19] and occurs 10–30 nucleotides downstream of its binding site.[20]This site often has thepolyadenylation signal sequence AAUAAA on the RNA, but variants of it that bind more weakly toCPSF exist.[19][21] Two other proteins add specificity to the binding to an RNA: CstF and CFI. CstF binds to a GU-rich region further downstream of CPSF's site.[22] CFI recognises a third site on the RNA (a set of UGUAA sequences in mammals[23][24][25]) and can recruit CPSF even if the AAUAAA sequence is missing.[26][27] The polyadenylation signal – the sequence motif recognised by the RNA cleavage complex – varies between groups of eukaryotes. Most human polyadenylation sites contain the AAUAAA sequence,[22] but this sequence is less common in plants and fungi.[28]
The RNA is typically cleaved before transcription termination, as CstF also binds to RNA polymerase II.[29] Through a poorly understood mechanism (as of 2002), it signals for RNA polymerase II to slip off of the transcript.[30] Cleavage also involves the protein CFII, though it is unknown how.[31] The cleavage site associated with a polyadenylation signal can vary up to some 50 nucleotides.[32]
When the RNA is cleaved, polyadenylation starts, catalysed by polyadenylate polymerase.Polyadenylate polymerase builds the poly(A) tail by addingadenosine monophosphate units fromadenosine triphosphate to the RNA, cleaving offpyrophosphate.[33] Another protein, PAB2, binds to the new, short poly(A) tail and increases the affinity of polyadenylate polymerase for the RNA. When the poly(A) tail is approximately 250nucleotides long the enzyme can no longer bind to CPSF and polyadenylation stops, thus determining the length of the poly(A) tail.[34][35] CPSF is in contact with RNA polymerase II, allowing it to signal the polymerase to terminate transcription.[36][37] When RNA polymerase II reaches a "termination sequence" (⁵'TTTATT3' on the DNA template and ⁵'AAUAAA3' on the primary transcript), the end of transcription is signaled.[38] The polyadenylation machinery is also physically linked to thespliceosome, a complex that removesintrons from RNAs.[27]
The poly(A) tail acts as the binding site forpoly(A)-binding protein. Poly(A)-binding protein promotes export from the nucleus and translation, and inhibits degradation.[39] This protein binds to the poly(A) tail prior to mRNA export from the nucleus and in yeast also recruits poly(A) nuclease, an enzyme that shortens the poly(A) tail and allows the export of the mRNA. Poly(A)-binding protein is exported to the cytoplasm with the RNA. mRNAs that are not exported are degraded by theexosome.[40][41] Poly(A)-binding protein also can bind to, and thus recruit, several proteins that affect translation,[40] one of these isinitiation factor-4G, which in turn recruits the40S ribosomal subunit.[42] However, a poly(A) tail is not required for the translation of all mRNAs.[43] Further, poly(A) tailing (oligo-adenylation) can determine the fate of RNA molecules that are usually not poly(A)-tailed (such as (small) non-coding (sn)RNAs etc.) and thereby induce their RNA decay.[44]
In eukaryoticsomatic cells, the poly(A) tails of most mRNAs in the cytoplasm gradually get shorter, and mRNAs with shorter poly(A) tail are translated less and degraded sooner.[45] However, it can take many hours before an mRNA is degraded.[46] This deadenylation and degradation process can be accelerated by microRNAs complementary to the3′ untranslated region of an mRNA.[47] Inimmature egg cells, mRNAs with shortened poly(A) tails are not degraded, but are instead stored and translationally inactive. These short tailed mRNAs are activated by cytoplasmic polyadenylation after fertilisation, duringegg activation.[48]
In animals, poly(A) ribonuclease (PARN) can bind to the5′ cap and remove nucleotides from the poly(A) tail. The level of access to the 5′ cap and poly(A) tail is important in controlling how soon the mRNA is degraded. PARN deadenylates less if the RNA is bound by the initiation factors 4E (at the 5′ cap) and 4G (at the poly(A) tail), which is why translation reduces deadenylation. The rate of deadenylation may also be regulated by RNA-binding proteins. Additionally, RNA triple helix structures and RNA motifs such as the poly(A) tail 3' end binding pocket retard deadenylation process and inhibit poly(A) tail removal.[49] Once the poly(A) tail is removed, the decapping complex removes the 5′ cap, leading to a degradation of the RNA. Several other proteins are involved in deadenylation inbudding yeast and human cells, most notably theCCR4-Not complex.[50]
There is polyadenylation in the cytosol of some animal cell types, namely in thegermline, during earlyembryogenesis and in post-synaptic sites ofnerve cells. This lengthens the poly(A) tail of an mRNA with a shortened poly(A) tail, so that the mRNA will betranslated.[45][51] These shortened poly(A) tails are often less than 20 nucleotides, and are lengthened to around 80–150 nucleotides.[3]
In the early mouse embryo, cytoplasmic polyadenylation of maternal RNAs from the egg cell allows the cell to survive and grow even though transcription does not start until the middle of the 2-cell stage (4-cell stage in human).[52][53] In the brain, cytoplasmic polyadenylation is active during learning and could play a role inlong-term potentiation, which is the strengthening of the signal transmission from a nerve cell to another in response to nerve impulses and is important for learning and memory formation.[3][54]
Cytoplasmic polyadenylation requires the RNA-binding proteinsCPSF andCPEB, and can involve other RNA-binding proteins likePumilio.[55] Depending on the cell type, the polymerase can be the same type of polyadenylate polymerase (PAP) that is used in the nuclear process, or the cytoplasmic polymeraseGLD-2.[56]
Results of using different polyadenylation sites on the same gene
Many protein-coding genes have more than one polyadenylation site, so a gene can code for several mRNAs that differ in their3′ end.[28][57][58] The 3' region of a transcript contains many polyadenylation signals (PAS). When more proximal (closer towards 5' end) PAS sites are utilized, this shortens the length of the 3' untranslated region (3' UTR) of a transcript.[59] Studies in both humans and flies have shown tissue specific APA. With neuronal tissues preferring distal PAS usage, leading to longer 3' UTRs and testis tissues preferring proximal PAS leading to shorter 3' UTRs.[60][61] Studies have shown there is a correlation between a gene's conservation level and its tendency to do alternative polyadenylation, with highly conserved genes exhibiting more APA. Similarly, highly expressed genes follow this same pattern.[62]Ribo-sequencing data (sequencing of only mRNAs inside ribosomes) has shown that mRNA isoforms with shorter 3' UTRs are more likely to be translated.[59]
Since alternative polyadenylation changes the length of the3' UTR,[63] it can also change which binding sites are available formicroRNAs in the 3′ UTR.[20][64] MicroRNAs tend to repress translation and promote degradation of the mRNAs they bind to, although there are examples of microRNAs that stabilise transcripts.[65][66] Alternative polyadenylation can also shorten the coding region, thus making the mRNA code for a different protein,[67][68] but this is much less common than just shortening the 3′ untranslated region.[28]
The choice of poly(A) site can be influenced by extracellular stimuli and depends on the expression of the proteins that take part in polyadenylation.[69][70] For example, the expression ofCstF-64, a subunit ofcleavage stimulatory factor (CstF), increases inmacrophages in response tolipopolysaccharides (a group of bacterial compounds that trigger an immune response). This results in the selection of weak poly(A) sites and thus shorter transcripts. This removes regulatory elements in the 3′ untranslated regions of mRNAs for defense-related products likelysozyme andTNF-α. These mRNAs then have longer half-lives and produce more of these proteins.[69] RNA-binding proteins other than those in the polyadenylation machinery can also affect whether a polyadenylation site is used,[71][72][73][74] as canDNA methylation near the polyadenylation signal.[75] In addition, numerous other components involved in transcription, splicing or other mechanisms regulating RNA biology can affect APA.[76]
For manynon-coding RNAs, includingtRNA,rRNA,snRNA, andsnoRNA, polyadenylation is a way of marking the RNA for degradation, at least inyeast.[77] This polyadenylation is done in the nucleus by theTRAMP complex, which maintains a tail that is around 4 nucleotides long to the 3′ end.[78][79] The RNA is then degraded by theexosome.[80] Poly(A) tails have also been found on human rRNA fragments, both the form of homopolymeric (A only) and heterpolymeric (mostly A) tails.[81]
Polyadenylation in bacteria helps polynucleotide phosphorylase degrade past secondary structure
In many bacteria, both mRNAs and non-coding RNAs can be polyadenylated. This poly(A) tail promotes degradation by thedegradosome, which contains two RNA-degrading enzymes:polynucleotide phosphorylase andRNase E. Polynucleotide phosphorylase binds to the 3′ end of RNAs and the 3′ extension provided by the poly(A) tail allows it to bind to the RNAs whosesecondary structure would otherwise block the 3′ end. Successive rounds of polyadenylation and degradation of the 3′ end by polynucleotide phosphorylase allows thedegradosome to overcome these secondary structures. The poly(A) tail can also recruit RNases that cut the RNA in two.[82] These bacterial poly(A) tails are about 30 nucleotides long.[83]
In as different groups as animals andtrypanosomes, themitochondria contain both stabilising and destabilising poly(A) tails. Destabilising polyadenylation targets both mRNA and noncoding RNAs. The poly(A) tails are 43 nucleotides long on average. The stabilising ones start at the stop codon, and without them the stop codon (UAA) is not complete as the genome only encodes the U or UA part. Plant mitochondria have only destabilising polyadenylation. Mitochondrial polyadenylation has never been observed in either budding or fission yeast.[84][85]
While many bacteria and mitochondria have polyadenylate polymerases, they also have another type of polyadenylation, performed bypolynucleotide phosphorylase itself. This enzyme is found in bacteria,[86] mitochondria,[87]plastids[88] and as a constituent of the archaeal exosome (in thosearchaea that have anexosome).[89] It can synthesise a 3′ extension where the vast majority of the bases are adenines. Like in bacteria, polyadenylation by polynucleotide phosphorylase promotes degradation of the RNA in plastids[90] and likely also archaea.[84]
Although polyadenylation is seen in almost all organisms, it is not universal.[7][91] However, the wide distribution of this modification and the fact that it is present in organisms from all threedomains of life implies that thelast universal common ancestor of all living organisms, it is presumed, had some form of polyadenylation system.[83] A few organisms do not polyadenylate mRNA, which implies that they have lost their polyadenylation machineries during evolution. Although no examples of eukaryotes that lack polyadenylation are known, mRNAs from the bacteriumMycoplasma gallisepticum and the salt-tolerant archaeanHaloferax volcanii lack this modification.[92][93]
The most ancient polyadenylating enzyme ispolynucleotide phosphorylase. This enzyme is part of both the bacterialdegradosome and the archaealexosome,[94] two closely related complexes that recycle RNA into nucleotides. This enzyme degrades RNA by attacking the bond between the 3′-most nucleotides with a phosphate, breaking off a diphosphate nucleotide. This reaction is reversible, and so the enzyme can also extend RNA with more nucleotides. The heteropolymeric tail added by polynucleotide phosphorylase is very rich in adenine. The choice of adenine is most likely the result of higherADP concentrations than other nucleotides as a result of usingATP as an energy currency, making it more likely to be incorporated in this tail in early lifeforms. It has been suggested that the involvement of adenine-rich tails in RNA degradation prompted the later evolution of polyadenylate polymerases (the enzymes that produce poly(A) tails with no other nucleotides in them).[95]
Polyadenylate polymerases are not as ancient. They have separately evolved in both bacteria and eukaryotes fromCCA-adding enzyme, which is the enzyme that completes the 3′ ends oftRNAs. Its catalytic domain is homologous to that of otherpolymerases.[80] It is presumed that the horizontal transfer of bacterial CCA-adding enzyme to eukaryotes allowed the archaeal-like CCA-adding enzyme to switch function to a poly(A) polymerase.[83] Some lineages, likearchaea andcyanobacteria, never evolved a polyadenylate polymerase.[95]
Poly(A)polymerase was first identified in 1960 as anenzymatic activity in extracts made from cell nuclei that could polymerise ATP, but not ADP, into polyadenine.[101][102] Although identified in many types of cells, this activity had no known function until 1971, when poly(A) sequences were found in mRNAs.[103][104] The only function of these sequences was thought at first to be protection of the 3′ end of the RNA from nucleases, but later the specific roles of polyadenylation in nuclear export and translation were identified. The polymerases responsible for polyadenylation were first purified and characterized in the 1960s and 1970s, but the large number of accessory proteins that control this process were discovered only in the early 1990s.[103]
^Marzluff WF, Gongidi P, Woods KR, Jin J, Maltais LJ (November 2002). "The human and mouse replication-dependent histone genes".Genomics.80 (5):487–98.doi:10.1016/S0888-7543(02)96850-3.PMID12408966.
^Lutz CS (October 2008). "Alternative polyadenylation: a twist on mRNA 3′ end formation".ACS Chemical Biology.3 (10):609–17.doi:10.1021/cb800138w.PMID18817380.
^Nag A, Narsinh K, Martinson HG (July 2007). "The poly(A)-dependent transcriptional pause is mediated by CPSF acting on the body of the polymerase".Nature Structural & Molecular Biology.14 (7):662–9.doi:10.1038/nsmb1253.PMID17572685.S2CID5777074.
^Tefferi A, Wieben ED, Dewald GW, Whiteman DA, Bernard ME, Spelsberg TC (August 2002). "Primer on medical genomics part II: Background principles and methods in molecular genetics".Mayo Clinic Proceedings.77 (8):785–808.doi:10.4065/77.8.785.PMID12173714.S2CID2237085.
^Vinciguerra P, Stutz F (June 2004). "mRNA export: an assembly line from genes to nuclear pores".Current Opinion in Cell Biology.16 (3):285–92.doi:10.1016/j.ceb.2004.03.013.PMID15145353.
^Sakurai T, Sato M, Kimura M (November 2005). "Diverse patterns of poly(A) tail elongation and shortening of murine maternal mRNAs from fully grown oocyte to 2-cell embryo stages".Biochemical and Biophysical Research Communications.336 (4):1181–9.Bibcode:2005BBRC..336.1181S.doi:10.1016/j.bbrc.2005.08.250.PMID16169522.
^Alt FW, Bothwell AL, Knapp M, Siden E, Mather E, Koshland M, Baltimore D (June 1980). "Synthesis of secreted and membrane-bound immunoglobulin mu heavy chains is directed by mRNAs that differ at their 3′ ends".Cell.20 (2):293–301.doi:10.1016/0092-8674(80)90615-7.PMID6771018.S2CID7448467.
^Slomovic S, Portnoy V, Schuster G (2008). "Chapter 24 Detection and Characterization of Polyadenylated RNA in Eukarya, Bacteria, Archaea, and Organelles".RNA Turnover in Bacteria, Archaea and Organelles. Methods in Enzymology. Vol. 447. pp. 501–20.doi:10.1016/S0076-6879(08)02224-6.ISBN978-0-12-374377-0.PMID19161858.
^Evguenieva-Hackenberg E, Roppelt V, Finsterseifer P, Klug G (December 2008). "Rrp4 and Csl4 are needed for efficient degradation but not for polyadenylation of synthetic and natural RNA by the archaeal exosome".Biochemistry.47 (50):13158–68.doi:10.1021/bi8012214.PMID19053279.
^abSlomovic S, Portnoy V, Yehudai-Resheff S, Bronshtein E, Schuster G (April 2008). "Polynucleotide phosphorylase and the archaeal exosome as poly(A)-polymerases".Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms.1779 (4):247–55.doi:10.1016/j.bbagrm.2007.12.004.PMID18177749.