Piwi-interacting RNA (piRNA) is the largest class of smallnon-codingRNA molecules expressed in animal cells.[1][2][3] piRNAs form RNA-protein complexes through interactions withpiwi-subfamilyArgonaute proteins. These piRNA complexes are mostly involved in theepigenetic andpost-transcriptional silencing oftransposable elements and other spurious or repeat-derived transcripts, but can also be involved in the regulation of other genetic elements ingerm line cells.[4][5][6]
piRNAs are mostly created from loci that function as transposon traps which provide a kind of RNA-mediatedadaptive immunity against transposon expansions and invasions.[7] They are distinct frommicroRNA (miRNA) in size (26–31 nucleotides as opposed to 21–24 nt), lack of sequence conservation, increased complexity, and independence ofDicer for biogenesis, at least in animals.[5][1][2] (PlantDcl2 may play a role in rasi/piRNA biogenesis.)[8][9]
Double-stranded RNAs capable of silencing repeat elements, then known asrepeat associated small interfering RNA (rasiRNA), were proposed inDrosophila in 2001.[10] By 2008, it was still unclear how piRNAs are generated, but potential methods had been suggested, and it was certain their biogenesis pathway is distinct frommiRNA andsiRNA, while rasiRNA is now considered a piRNA subspecies.[11]
Proposed piRNA structure, with the 3′ end 2′-O-methylation
piRNAs have been identified in bothvertebrates andinvertebrates, and althoughbiogenesis and modes of action do vary somewhat between species, a number of features are conserved. piRNAs have no clearsecondary structure motifs,[1][12] due to the fact that the length of a piRNA varies between species (from 21 to 31nucleotides), and the bias for a5’uridine is common to piRNAs in both vertebrates and invertebrates. piRNAs inCaenorhabditis elegans have a 5’ monophosphate and a 3’ modification that acts to block either the 2’ or 3’ oxygen;[13] this has also been confirmed to exist inDrosophila melanogaster,[14]zebrafish,[15]mice,[16] andrats.[15] This 3’ modification is a 2’-O-methylation; the reason for this modification is not clear, but it has been suggested that it increases piRNA stability.[15][17]
More than 50,000 unique piRNA sequences have been discovered in mice and more than 13,000 inD. melanogaster.[18] It is thought that there are many hundreds of thousands of different piRNA species inmammals.[19]
In the early 1980s, it was discovered that a single mutation in thefruit flygenome could specifically activate all copies of aretrovirus-like element calledGypsy in the femalegermline. The site of the mutations that made these Gypsies "dance" was thus called theflamenco locus. In 2001, Aravinet al. proposed that double-stranded (ds) RNA-mediated silencing is implicated in the control ofretrotransposons in the germline and by 2003 the idea had emerged that vestiges of transposons might produce dsRNAs required for the silencing of "live" transposons.[10] Sequencing of the 200,000-bp flamenco locus was difficult, as it turned out to be packed with transposable element fragments (104 insertions of 42 different transposons, including multiple Gypsies), all facing the same direction. Indeed, piRNAs are all found in clusters throughout animal genomes; these clusters may contain as few as ten or many thousands of piRNAs matching different, phased transposon fragments. This led to the idea in 2007 that in germlines a pool of primary piRNAs is processed from long single-stranded transcripts encoded by piRNA clusters in the opposite orientation of the transposons, so that the piRNAs can anneal to and complement the transposon-encoded transcripts, thereby triggering their degradation. Any transposon landing in the correct orientation in such a cluster will make the individual more or less immune to that transposon, and such an advantageous mutation will spread quickly through the population. The original mutations in the flamenco locus inhibited the transcription of the master transcript, thereby deactivating this defense system.[7][20][1][21][22]
A historical example of invasion and Piwi response is known: theP-element transposon invaded aDrosophila melanogaster genome in the mid-20th century, and, through interbreeding, within decades all wild fruit flies worldwide (though not the reproductively isolated lab strains) contained the same P-element. Repression of further P-element activity, spreading near-simultaneously, appears to have occurred by the Piwi-interacting RNA pathway.[23]
piRNA clusters in genomes can now readily be detected viabioinformatics methods.[24] WhileD. melanogaster and vertebrate piRNAs have been located in areas lacking any protein-codinggenes,[11][20] piRNAs inC. elegans have been identified amidst protein-coding genes.[13]
In mammals, piRNAs are found both intestes[25] andovaries,[26] although they only seem to be required in males.[4] In invertebrates, piRNAs have been detected in both the male and femalegermlines.[15][19]
At the cellular level, piRNAs have been found within both thenucleus andcytoplasm, suggesting that piRNA pathways may function in both of these areas[11] and, therefore, may have multiple effects.[27]
There are at least three Argonaute (Ago) subfamilies that have been found ineukaryotes. Unlike the Ago subfamily which is present in animals, plants, and fission yeast, the Piwi subfamily has only been found in animals.[28] RasiRNA has been observed inDrosophila and some unicellular eukaryotes but its presence in mammals has not been determined, unlike piRNA which has been observed in many species of invertebrates and vertebrates including mammals;[29] however, since proteins which associate with rasiRNA are found in both vertebrates and invertebrates, it is possible that active rasiRNA exist and have yet to be observed in other animals. RasiRNAs have been observed inSchizosaccharomyces pombe, a species of yeast, as well in some plants, neither of which have been observed to contain the Piwi subfamily of Argonaute proteins.[8] It has been observed that both rasiRNA and piRNA are maternally linked, but more specifically it is the Piwi protein subfamily that is maternally linked and therefore leads to the observation that rasiRNA and piRNA are maternally linked.[clarification needed][30]
The ping-pong mechanism for the biogenesis of the 5′ end of rasiRNA.
Thebiogenesis of piRNAs is not yet fully understood, although possible mechanisms have been proposed. piRNAs show a significant strand bias, that is, they are derived from one strand ofDNA only,[1] and this may indicate that they are the product of long single stranded precursor molecules.[2] A primary processing pathway is suggested to be the only pathway used to producepachytene piRNAs; in this mechanism, piRNA precursors aretranscribed resulting in piRNAs with a tendency to target 5’uridines.[31][32] Also proposed is a ‘Ping Pong’ mechanism wherein primary piRNAs recognise their complementary targets and cause the recruitment ofpiwi proteins. This results in thecleavage of the transcript at a point tennucleotides from the 5’ end of the primary piRNA, producing the secondary piRNA.[32] These secondary piRNAs are targeted toward sequences that possess anadenine at the tenth position.[31] Since the piRNA involved in the ping pong cycle directs its attacks on transposon transcripts, the ping pong cycle acts only at the level oftranscription.[22] One or both of these mechanisms may be acting in differentspecies;C. elegans, for instance, does have piRNAs, but does not appear to use the ping pong mechanism at all.[19]
A significant number of piRNAs identified inzebrafish andD. melanogaster contain adenine at their tenth position,[11] and this has been interpreted as possible evidence of aconservedbiosyntheticmechanism across species.[17] Ping-pong signatures have been identified in very primitive animals such as sponges and cnidarians, pointing to the existence of the ping-pong cycle already in the early branches of metazoans.[33]
The piRNA Ping-Pong pathway was first proposed from studies inDrosophila where the piRNA associated with the two cytoplasmic Piwi proteins, Aubergine (Aub) and Argonaute-3 (Ago3) exhibited a high frequency of sequence complementarity over exactly 10 nucleotides at their 5′ ends.[32][34] This relationship is known as the "ping-pong signature" and is also observed in associated piRNA from Mili and Miwi2 proteins isolated from mouse testes. The proposed function of Ping-Pong inDrosophila or in mouse remains to be understood, but a leading hypothesis is that the interaction between Aub and Ago3 allows for a cyclic refinement of piRNA that are best suited to target active transposon sequences. Aub piRNA are primarily antisense to transposable element transcripts and are believed to be the main factor in targeting deleterious transcripts through complementarity. Conversely, Ago3 piRNA sequences are predominantly of sense orientation to transposable element transcripts and are derived from the product of Aub cleavage of transposon mRNA. As such, Ago3 piRNA lack the ability to target transposable element transcripts directly. Therefore, it was proposed that Ago3 piRNA guide the production of piRNA that are loaded into Aub by targeting newly exported piRNA cluster transcripts. Several lines of evidence support the effect of Ago3 on the production of Aub piRNA, in particular from examining the piRNA repertoire inDrosophila ovaries that are mutant for Ago3 and the Tudor-domain protein Kumo/Qin.[35][36]
The molecular mechanism that underpins Ping-Pong likely involves several piRNA pathway associated factors.Qin was reported to coordinate the loading of Ago3 with piRNA, in addition to interacting with both Aub and Ago3.[36] However, theTudor proteinkrimper (A1ZAC4) was also shown to interact with both Aub and Ago3 through its Tudor domains while also binding itself through its N-terminal Krimper domain.[37] Specifically, Krimper interacts with Ago3 in its piRNA-unloaded state, while its interaction with Aub is dependent on the symmetrical dimethylation of arginine residues in the N-terminal region of Aub.[37][38] In Silkmoth germ cells, it was proposed thatVasa protein coordinates the Ping-Pong mechanism of Silkmoth Aub (Siwi) and Ago3.[39]
It is likely that the mechanism of Ping-Pong is primarily coordinated by Krimper but factors such as Kumo/Qin and Vasa, in addition to other factors have necessary functions in the Ping-Pong mechanism.
TheDrosophila piRNA pathway can be separated into two branches: the cytoplasmic branch consisting of Aub and Ago3 operating the Ping-Pong mechanism, and the nuclear branch, pertaining to the co-transcriptional silencing of genomic loci by Piwi in the nucleus. Through complementary strategies, two studies show that Aub and Ago3 target cleavage triggers the 'phased' loading of piRNA into Piwi.[40][41] Phasing begins with the targeting and cleavage of a complementary target by either Aub or Ago3 associated with a 'responder' piRNA. Once cleaved, the targeted transcript is then processed further by a mechanism believed to require the mitochondrial-associated endonuclease, Zucchini, which leads to the loading of Piwi protein with sequential fragments of the targeted transcript. In this way, the Aub or Ago3 'responder' piRNA sequence cleaves a complementary target that is then sliced at periodic intervals of approximately 27 nucleotides that are sequentially loaded into Piwi protein. Once loaded with piRNA, Piwi then enters the germ cell nucleus to co-transcriptionally silence nascent transcripts with complementarity to its piRNA guide.[42]It is currently unknown whether phasing occurs in other organisms.
The wide variation in piRNA sequences andpiwi function across species contributes to the difficulty in establishing the functionality of piRNAs.[43] However, like othersmall RNAs, piRNAs are thought to be involved ingene silencing,[1] specifically the silencing oftransposons.[44] The majority of piRNAs areantisense to transposon sequences,[3][22] suggesting that transposons are targets of the piRNAs. In mammals, it appears that the activity of piRNAs in transposon silencing is most important during the development of theembryo,[31] and in bothC. elegans and humans, piRNAs are necessary forspermatogenesis.[43]
piRNA has a role inRNA silencing via the formation of anRNA-induced silencing complex (RISC). piRNAs interact withpiwi proteins that are part of a family of proteins called theArgonautes. These are active in the testes of mammals and are required forgerm-cell andstem-cell development ininvertebrates. Three piwi subfamily proteins – MIWI, MIWI2, and MILI – have been found to be essential for spermatogenesis in mice. piRNAs direct the piwi proteins to their transposon targets.[31] A decrease or absence of PIWIgene expression is correlated with an increased expression of transposons.[11][31] Transposons have a high potential to cause deleterious effects on their hosts[21] and, in fact, mutations in piRNA pathways have been found to reducefertility inD. melanogaster.[20] Further, it is thought that piRNA andendogenoussmall interfering RNA (endo-siRNA) may have comparable and even redundant functionality in transposon control in mammalianoocytes.[22]
piRNAs appear to affect particularmethyltransferases that perform themethylations which are required to recognise and silence transposons,[31] but this relationship is not well understood.
In Dipterans viral-derived piRNAs derived from positive-sense RNA viruses were first identified inDrosophila ovarian somatic sheet (OSS) cells.[45] Subsequent experimental studies have demonstrated that the piRNA pathway is not required for antiviral defence inDrosophila melanogaster.[46] However, in mosquitoes the PIWI family of proteins has expanded[47] and some PIWI proteins have been identified as antiviral such as Piwi4.[48] As such virus infections in mosquitoes commonly produce virus-derived piRNAs in diverse positive-sense RNA,[49] negative-sense RNA[50][48] and single-stranded DNA viruses.[51]
piRNAs can be transmitted maternally,[15] and based on research inD. melanogaster, piRNAs may be involved in maternally derivedepigenetic effects.[20] The activity of specific piRNAs in the epigenetic process also requires interactions between piwi proteins and HP1a, as well as other factors.[18]
Genetic screens examining fertility defects identified a number of proteins that are not Piwi-clade Argonautes, yet produce the same sterility phenotypes as Piwi mutants.
Many factors required for the piRNA pathway inDrosophila containTudor domains that are known to bind symmetrically dimethylated arginine residues (sDMA) present in methylation motifs of Piwi proteins. Piwi proteins are symmetrically dimethylated by the PRMT5 methylosome complex, consisting of Valois (MEP50) and Capsulèen (dart5; PRMT5).[52][53]
Major advances in the study of piRNA have been achieved thanks to the use ofnext-generation sequencing techniques, such as Solexa, 454, andIllumina platform sequencing. These techniques allow analysis of highly complex and heterogeneous RNA populations like piRNAs. Due to their small size, expression and amplification of small RNAs can be challenging, so specialisedPCR-based methods have been developed in response to this difficulty.[54][55] However, research has also revealed that a number of annotated piRNAs may be false positives; for instance, a majority of piRNAs that were expressed in somatic non-gonadal tissues were considered to derive from non-coding RNA fragments.[56]
^abSiomi MC, Sato K, Pezic D, Aravin AA (April 2011). "PIWI-interacting small RNAs: the vanguard of genome defence".Nature Reviews Molecular Cell Biology.12 (4):246–258.doi:10.1038/nrm3089.PMID21427766.S2CID5710813.
^Carmen L, Michela B, Rosaria V, Gabriella M (2009). "Existence of snoRNA, microRNA, piRNA characteristics in a novel non-coding RNA: x-ncRNA and its biological implication in Homo sapiens".Journal of Bioinformatics and Sequence Analysis.1 (2):031–040.
^Kirino Y, Mourelatos Z (April 2007). "Mouse Piwi-interacting RNAs are 2′-O-methylated at their 3′ termini".Nature Structural & Molecular Biology.14 (4):347–348.doi:10.1038/nsmb1218.PMID17384647.S2CID31193964.
^abLin H, Yin H, Beyret E, Findley S, Deng W (2008). "The role of the piRNA pathway in stem cell self-renewal".Developmental Biology.319 (2): 479.doi:10.1016/j.ydbio.2008.05.048.
^Aravin A, Gaidatzis D, Pfeffer S, Lagos-Quintana M, Landgraf P, Iovino N, Morris P, Brownstein MJ, Kuramochi-Miyagawa S, Nakano T, Chien M, Russo JJ, Ju J, Sheridan R, Sander C, Zavolan M, Tuschl T (July 2006). "A novel class of small RNAs bind to MILI protein in mouse testes".Nature.442 (7099):203–207.Bibcode:2006Natur.442..203A.doi:10.1038/nature04916.PMID16751777.S2CID4379895.
^Anne J, Mechler BM (May 2005). "Valois, a component of the nuage and pole plasm, is involved in assembly of these structures, and binds to Tudor and the methyltransferase Capsuléen".Development.132 (9):2167–77.doi:10.1242/dev.01809.PMID15800004.S2CID6810484.