
Baltimore classification is a system used toclassify viruses by their routes of transferringgenetic information from thegenome tomessenger RNA (mRNA). Seven Baltimore groups, or classes, exist and are numbered inRoman numerals from I to VII. Groups are defined by whether theviral genome is made ofdeoxyribonucleic acid (DNA) orribonucleic acid (RNA), whether the genome is single- or double-stranded, whether a single-stranded RNA genome ispositive-sense (+) or negative-sense (–), and whether the virus makes DNA from RNA (reverse transcription (RT)). Viruses within Baltimore groups typically have the samereplication method, but other characteristics such asvirion structure are not directly related to Baltimore classification.
The seven Baltimore groups are for double-stranded DNA (dsDNA) viruses, single-stranded DNA (ssDNA) viruses, double-stranded RNA (dsRNA) viruses, positive-sense single-stranded RNA (+ssRNA) viruses, negative-sense single-stranded RNA (–ssRNA) viruses, ssRNA viruses that have a DNA intermediate in their life cycle (ssRNA-RT), and dsDNA viruses that have an RNA intermediate in their life cycle (dsDNA-RT). Only one class exists for ssDNA viruses because their genomes are converted to dsDNA beforetranscription regardless of sense. Some viruses belong to more than one Baltimore group, such as DNA viruses that have either dsDNA or ssDNA as their genome.
Many virus characteristics do not define which Baltimore group they belong to but do correlate to specific Baltimore groups. This includes the use ofRNA editing andalternative splicing, whether the virus's genome is segmented, the size and structure of the virus's genome, thehost range of viruses, whether the virus packages replication and transcription machinery intovirions, and unorthodox methods oftranslating mRNA into proteins. Furthermore, while Baltimore groups were not established based on evolutionary relationships, research in the 21st century has found that certain groups, such as dsRNA, +ssRNA, and many –ssRNA viruses, sharecommon ancestry.
Baltimore classification was created in 1971 by virologistDavid Baltimore and initially only included the first six groups. It was later expanded to include group VII after the discovery of dsDNA-RT viruses. Since then, it has become common among virologists to use Baltimore classification alongside virus taxonomy due to its utility. In 2018 and 2019, Baltimore classification was partially integrated into virus taxonomy based on evidence that certain groups were descended from common ancestors. Various taxa now correspond to specific Baltimore groups. An extension of Baltimore classification has been proposed by virologistVadim Agol to encompass all possible routes of genetic information transmission.
Baltimore classification groups viruses together by their routes of transferring genetic information from the genome to messenger RNA (mRNA). Characteristics that determine the Baltimore group of a virus include whether the genome is made ofdeoxyribonucleic acid (DNA) orribonucleic acid (RNA), the strandedness of the genome, which can be either single- or double-stranded, thesense of a single-stranded RNA genome, which can be either positive (+) or negative (–), and whether the virus synthesizes DNA from RNA (reverse transcription (RT)). There are seven Baltimore groups or classes, numbered withRoman numerals, listed hereafter.[1][2][3][note 1]
Baltimore classification is chiefly based on the path towardtranscription of the viral genome, and viruses within each group usually share the manner by which the mRNA synthesis occurs. While not the direct focus of Baltimore classification, groups are organized in such a manner that viruses in each group also typically have the same mechanisms of replicating the viral genome.[7][8] Structural characteristics of the extracellular virus particle, called avirion, such as the shape of the viral capsid, which stores the genome, and the presence of aviral envelope, alipid membrane that surrounds the capsid, have no direct relation to Baltimore groups, nor do the groups necessarily show genetic relation based on evolutionary history.[1]

The first Baltimore group contains viruses that have a double-stranded DNA (dsDNA) genome. All dsDNA viruses have their mRNA synthesized in a three-step process. First, atranscription preinitiation complex binds to the DNA upstream of the transcription site, recruiting ahostRNA polymeraseenzyme. Once the RNA polymerase is recruited, it uses the negative-sense strand as a template for synthesizing mRNA strands, which are positive sense. The RNA polymerase then terminates transcription upon reaching a specific signal, such as apolyadenylation site.[10][11]
dsDNA viruses make use of several mechanisms to replicate their genome. A widely used method is bidirectional replication, in which tworeplication forks are established at areplication origin site and move in opposite directions on a DNA molecule.[12] Arolling circle mechanism that produces linear strands while progressing in a loop around a circular genome is also common.[13] Many dsDNA viruses use a strand displacement method whereby one strand is synthesized from a template strand, and a complementary strand is then synthesized from the previously synthesized strand to form a dsDNA genome.[14] Lastly, some dsDNA viruses are replicated as part of a process calledreplicative transposition, whereby a viral genome that isintegrated into a host cell's genome is replicated to another part of the host cell's genome.[15]
dsDNA viruses can be divided informally into those that replicate in thenucleus, and as such are relatively dependent on host cell machinery for transcription and replication, and those that replicate incytoplasm, in which case they have obtained their own means of transcription and replication.[16] dsDNA viruses are also sometimes divided between tailed dsDNA viruses, which refers to members of therealmDuplodnaviria, specifically the head-tail of the classCaudoviricetes, and tailless or non-tailed (icosahedral) dsDNA viruses, which refers to viruses in the realmsSingelaviria andVaridnaviria.[17][18][19][note 3]
dsDNA viruses are classified into five realms and include many taxa that are unassigned to a realm:

The second Baltimore group contains viruses that have a single-stranded DNA (ssDNA) genome. ssDNA viruses have the same manner of transcription as dsDNA viruses. Because the genome is single-stranded, however, it is first made into a double-stranded form by aDNA polymerase enzyme upon entering a host cell. mRNA is then synthesized from the double-stranded form. The double-stranded form of ssDNA viruses may be produced either directly after entry into a cell or as a consequence of replicating the viral genome.[28][29]
Most ssDNA viruses contain circular genomes that are replicated byrolling circle replication (RCR). ssDNA RCR is initiated by anendonuclease enzyme that bonds to and cleaves the positive-sense strand, which allows a DNA polymerase to use the negative-sense strand as a template for replication. Replication progresses in a loop around the genome by extending the 3′-end ("three prime end") of the positive-sense strand, which displaces the prior positive-sense strand.[note 4] The endonuclease then cleaves the positive-sense strand again to create a standalone genome that is joined (ligated) into a circular loop. The new ssDNA genome may be packaged into virions or replicated by a DNA polymerase to create a double-stranded form for transcription or additional rounds of replication.[28][32]
Parvoviruses andbidnaviruses have linear ssDNA genomes that are replicated byrolling hairpin replication (RHR), which is similar to RCR. Their genomes havehairpin loops at each end of the genome that repeatedly unfold and refold during replication to change the direction of DNA synthesis to move back and forth along the linear genome, which produces numerous copies of the genome in a continuous process. Individual genomes are then excised from thismolecule by the endonuclease.[32][33][34]
Nearly all ssDNA viruses have positive-sense genomes, but a few exceptions and peculiarities exist.Anelloviruses are the only ssDNA viruses that have negative-sense genomes.[35] Parvoviruses may package either the positive- or negative-sense strand into capsids.[29] Lastly, bidnaviruses may package both the positive- and negative-sense strands of their bipartite genome.[34][36] In any case, the sense of ssDNA viruses, unlike that of ssRNA viruses, is not sufficient to separate ssDNA viruses into two Baltimore groups since all ssDNA viral genomes are converted to dsDNA forms before transcription and replication.[3][7]
ssDNA viruses are classified into two realms and include a few families that are unassigned to realms:

The third Baltimore group contains viruses that have a double-stranded RNA (dsRNA) genome. After entering a host cell, the viralRNA-dependent RNA polymerase (RdRp) synthesizes a positive-sense strand from the negative-sense strand of the dsRNA genome. This positive-sense strand may be used either as mRNA for translation or as a template for replication to form the dsRNA genome.[39][40]
dsRNA is not a molecule made by cells, so eukaryotes have evolved antiviral systems to detect and inactivate viral dsRNA. To counter this, many dsRNA viruses replicate their genomes inside of capsids, thereby avoiding detection inside of the host cell's cytoplasm. Positive-sense strands are then forced out from the capsid to be translated or translocated from the mature capsid to a progeny capsid.[39][40][41]
dsRNA viruses are classified into two phyla within the kingdomOrthornavirae, realmRiboviria:[42]

The fourth Baltimore group contains viruses that have a positive-sense single-stranded RNA (+ssRNA) genome. For +ssRNA viruses, the genome functions as mRNA, so no transcription is required for translation. +ssRNA viruses will, however, produce positive-sense copies of the genome from negative-sense strands of an intermediate dsRNA genome. This acts as both a transcription and replication process since the replicated +ssRNA is also mRNA.[6][44][45]
Many +ssRNA viruses are able to have only a portion of their genome transcribed. Typically, subgenomic RNA (sgRNA) strands are used for the translation of structural and movement proteins needed during intermediate and late stages of infection. sgRNA transcription may occur by commencing RNA synthesis within the genome rather than from the 5′-end ("five prime end"), by stopping RNA synthesis at specific sequences in the genome, or, as a part of both aforementioned methods, by synthesizingleader sequences from viral RNA that are then attached to sgRNA strands. During infection, the viral RdRp is always translated directly from the genome first because replication, performed by the RdRp, is required for sgRNA synthesis.[45][46]
Because the process of replicating the viral genome produces intermediate dsRNA molecules, +ssRNA viruses can be targeted by the host cell's immune system. To avoid detection, +ssRNA viruses replicate in membrane-associated vesicles that are used as replication factories. From there, only +ssRNA strands enter the main cytoplasmic area of the cell. These strands may be used as mRNA or as progeny genomes.[44][47]
+ssRNA viruses can be divided informally into those that have polycistronic mRNA, which encodes a polyprotein that is cleaved to form multiple mature proteins, and those that undergo multiple rounds of translation of the genome or subgenomic mRNAs to express proteins.[48][49] +ssRNA viruses are classified into three phyla in the kingdomOrthornavirae, realmRiboviria:[42]
The fifth Baltimore group contains viruses that have a negative-sense, single-stranded RNA (–ssRNA) genome. At least two lineages of –ssRNA viruses exist, which transcribe and replicate their genomes differently. The first are viruses of the phylumNegarnaviricota in the kingdomOrthornavirae, realmRiboviria. Negarnaviricots transcribe mRNA, which is positive sense, directly from the negative-sense genome. The first process for –ssRNA transcription involves the viral RdRp binding to a leader sequence on the 3′-end of the genome, transcribing a 5′ triphosphate-leader RNA sequence, then stopping and restarting on a transcription signal that iscapped, continuing until a stop signal is reached. There, the RdRp synthesizes apolyadenylated tail and releases the mRNA or, for polycistronic genomes, continues transcription.[51]
The second manner is similar, but instead of synthesizing a cap, the RdRp may use its endonuclease activity tosnatch a short sequence of nucleotides from host cell mRNA and use it as the 5′ cap of viral mRNA.[52] Genomic –ssRNA is replicated from the positive-sense antigenome in a manner similar to transcription, except in reverse using the antigenome as a template for the genome. The RdRp complex moves from the 3′-end to the 5′-end of the antigenome and ignores all transcription signals when synthesizing genomic –ssRNA.[40][53]
Various –ssRNA viruses use special mechanisms for transcription. The way of polyadenylating the end of an mRNA sequence may be throughpolymerase stuttering, during which the RdRp transcribes anadenine fromuracil and then moves back in the RNA sequence to transcribe it again, continuing this process until hundreds of adenines have been added to the 3′-end of the mRNA.[54] Some –ssRNA viruses are ambisense, as both the positive- and negative-sense strands separately encode viral proteins. These viruses produce one mRNA strand from the genome and one from a complementary strand.[55][56]
–ssRNA viruses inNegarnaviricota can be divided informally into those that have non-segmented and segmented genomes. Non-segmented –ssRNA viruses replicate in the cytoplasm, and segmented –ssRNA viruses replicate in the nucleus. For segmented viruses, the RdRp transcribes one monocistronic mRNA strand from each segment of the genome.[40][57][58] This distinction is closely followed withinNegarnaviricota, as viruses in the subphylumHaploviricotina usually have non-segmented genomes, and viruses in the subphylumPolyploviricotina have segmented genomes.[59] Moreover, –ssRNA viruses that synthesize a cap structure on viral mRNA are assigned toHaploviricotina, whereas –ssRNA viruses that snatch caps from host mRNA belong toPolyploviricotina.[60]

The second lineage of –ssRNA viruses is the realmRibozyviria, which includesHepatitis D virus (HDV) and its relatives. Ribozyvirians havecovalently-closed circular –ssRNA genomes that are covered in nucleocapsid proteins to form a ribonucleoprotein (RNP) complex. After entering a cell, the RNP complex migrates from thecytosol to the nucleus, where the genome is replicated by RCR by a hostRNA polymerase II enzyme. This process creates a long molecule with many copies of the genome, called aconcatemer, that has a series of positive-sense genomic strands.Ribozymes encoded in this antigenome catalyze cleavage of the concatemer to form individual strands that are either translated or ligated for replication through RCR to produce concatemers of –ssRNA antigenomic strands. Ribozymes encoded in the negative-sense strands then catalyze cleavage of the negative-sense concatemer to produce individual genomic –ssRNA strands.[62][63][64][note 5]
Lastly, there is a group of –ssRNA viruses assigned to the tentative phylumArctiviricota in the kingdomOrthornavirae. Arctiviricots inhabit the Arctic Ocean and are believed to represent a separate –ssRNA lineage inOrthornavirae fromNegarnaviricota. Their mechanisms of replication and transcription have not been described.[66] In summary, –ssRNA viruses belong to the following taxa:

The sixth Baltimore group contains viruses that have a (positive-sense) single-stranded RNA genome with a DNA intermediate ((+)ssRNA-RT) in their replication cycle. ssRNA-RT viruses are transcribed in the same manner asDNA viruses, but their genomes are first converted to a dsDNA form through a process calledreverse transcription (RT). The viralreverse transcriptase enzyme then synthesizes a DNA strand from the +ssRNA strand, and the RNA strand is degraded and replaced with a DNA strand to create a dsDNA copy of the genome. The viral enzymeintegrase then integrates the dsDNA molecule into the DNA of the host cell, where it is now called aprovirus. The host cell's RNA polymerase II then transcribes RNA in the nucleus from the proviral DNA. Some of this RNA becomes mRNA whereas other strands become copies of the viral genome for replication.[68][69][70][71]
ssRNA-RT viruses are all included in the classRevtraviricetes, the sole class in the kingdomPararnavirae, realmRiboviria. Excluding the familyCaulimoviridae, which belongs to group VII, all members of theRevtraviricetes orderOrtervirales are ssRNA-RT viruses.[42][72] ssRNA-RT viruses are sometimes called retroviruses,[73] a term shared with members of the ssRNA-RT familyRetroviridae.[67]

The seventh Baltimore group contains viruses that have a double-stranded DNA genome with an RNA intermediate (dsDNA-RT) in their replication cycle. dsDNA-RT viruses have gaps in their circular genomes so that parts of the genome are ssDNA. These gaps arerepaired to create a complete, covalently-closed circular dsDNA genome before transcription.[3][69][75] The host cell's RNA polymerase II then transcribes RNA strands from the genome in the nucleus. The viral enzyme reverse transcriptase then produces dsDNA from pregenomic RNA (pgRNA) strands by the same general mechanism as ssRNA-RT viruses, but with replication occurring in a loop around the circular genome. Replication occurs after pgRNA is packaged into capsids and before capsidsbud from the cell.[75][76][77]
dsDNA-RT viruses are, like ssRNA-RT viruses, all included in the classRevtraviricetes. Two families of dsDNA-RT viruses are recognized:Caulimoviridae, which belongs to the orderOrtervirales, andHepadnaviridae, which is the sole family in the orderBlubervirales.[42][72] The provisional familyNudnaviridae is considered to be a sister family to hepadnavirids.[74] dsDNA-RT viruses are often called pararetroviruses.[72][76]
Some viruses can be classified into two Baltimore groups.Pleolipoviruses, for example, encapsidate either ssDNA or dsDNA genomes. For betapleolipoviruses, an encapsidated genome contains ssDNA regions and dsDNA regions.[3][78][79] Similarly,bacilladnaviruses are ssDNA viruses with short dsDNA regions in their genomes.[80] Therefore, both pleolipoviruses and bacilladnaviruses can be considered both dsDNA and ssDNA viruses.[3] Ambisense RNA viruses also exist: certainbunyaviruses, such asarenaviruses, contain segmented genomes in which one segment is partly positive sense and partly negative sense.[3] Furthermore, RNA viruses of the phylumAmbiviricota have non-segmented genomes with at least twoopen reading frames (ORFs). One is encoded on the positive-sense strand and the other on the negative-sense strand.[3][81] These RNA viruses may constitute a new Baltimore group, or they can be considered both +ssRNA and –ssRNA viruses.[3]

Many characteristics of viruses do not define which Baltimore group a virus belongs to but still correspond to specific Baltimore groups. This includes the use of RNA editing, alternative splicing during transcription, whether the virus's genome is segmented, the size and structure of the virus's genome, the host range of viruses, whether the virus packages replication and transcription machinery into virions, and unorthodox methods of translating mRNA.
RNA editing is used by various ssRNA viruses to produce different proteins from a single gene. This can be done by polymerase slippage during transcription or by post-transcriptional editing. During polymerase slippage, the RNA polymerase slips one nucleotide back during transcription, which adds a nucleotide not included in the template strand to the mRNA strand. Editing of a genomic template would impair gene expression, so RNA editing is only done during and after transcription.[83][84] Forebola viruses, RNA editing is used to express three different proteins from a single gene, which increases their ability to adapt to their hosts.[82][83][84]
Alternative splicing is a mechanism by which different proteins can be produced from a single gene by using alternative splicing sites to produce different mRNA strands. It is used by various DNA, –ssRNA, and reverse transcribing viruses. Viruses may make use of alternative splicing solely to produce multiple proteins from a single pre-mRNA strand or for other specific purposes. For some viruses, such aspapillomaviruses, alternative splicing acts as a way to regulate early and lategene expression during different stages of infection.Herpesviruses use it as a potential anti-host defense mechanism to prevent synthesis of specific antiviral proteins.[85][86]
Alternative splicing differs from RNA editing in that alternative splicing does not change the mRNA sequence like RNA editing but instead changes the coding capacity of an mRNA sequence as a result of alternative splicing sites. The two processes otherwise have the same result: multiple proteins are expressed from a single gene.[85]

Viral genomes can exist as a single (monopartite) segment, a segmented genome, or amultipartite genome. For monopartite viruses, all genes are on a single genome segment. For segmented viruses, the genome is separated into at least two molecules that are packaged together into one virion. Multipartite viruses are segmented viruses that package their genome segments into separate virions. Monopartite and segmented viruses are found in all cellular life, whereas multipartite viruses mainly infect plants and fungi.[78][87][88] By Baltimore group, dsDNA and RT viruses are non-segmented, ssDNA and +ssRNA viruses are mostly non-segmented, dsRNA viruses are mostly segmented, and around half of –ssRNA viruses are segmented.[3] Pleolipoviruses vary as some have monopartite ssDNA genomes while others are bipartite with one ssDNA segment and one dsDNA segment.[78][79] Viruses in the ssDNAplant virus familyGeminiviridae likewise vary between being monopartite and bipartite.[78][89]
Viral genomes may be either linear with ends or circular in a loop. Whether a virus has a linear or circular genome varies from group to group. Most dsDNA and –ssRNA viruses have linear genomes, ssDNA viruses mainly have circular genomes, dsRNA, +ssRNA, and ssRNA-RT viruses have linear genomes, and dsDNA-RT viruses have circular genomes.[3][90][91] As all +ssRNA viruses have genomes that can act as mRNA, and circular mRNA does not exist in cellular life, all +ssRNA viruses have linear genomes.[92] Among –ssRNA viruses, those of the phylumNegarnaviricota have linear genomes,[93] and those of the realmRibozyviria have circular genomes.[62]

The size, or length, of a genome varies by Baltimore group. dsDNA viruses have genomes ranging from about 5 to 2,500 kilobases (kb) in length, ssDNA viruses 1–25 kb,[95] dsRNA viruses 4–30 kb, +ssRNA viruses 3.5–40 kb, –ssRNA viruses 1.7–20 kb, ssRNA-RT viruses 5–13 kb, and dsDNA-RT viruses 3–10 kb in length. The relatively small genomes of viruses that are not group I are likely due to physical limitations. For example, ssDNA has the potential to form extensive secondary structures, and ssRNA is relatively chemically unstable. dsDNA viruses have much more varied genome sizes, likely because they have the same genomic organization as cells. This enables them to either exploit cellular machinery or encode their own machinery. As such, dsDNA seems to be the only genomic organization that can support genomes that exceed about 50 kb.[3]
Different Baltimore groups tend to be found within different branches of cellular life. The vast majority of dsDNA viruses infect prokaryotes, but they also infectprotists, animals, and rarely fungi. ssDNA viruses infect bacteria and most eukaryotes but are rare in archaea. dsRNA viruses infect plants, protists, and animals, are rare in bacteria, but are not found in archaea. +ssRNA viruses are found in all eukaryotes, infect many bacteria, but do not infect archaea. –ssRNA viruses infect animals and plants, are rare in fungi, but are not found in prokaryotes. ssRNA-RT viruses infect all eukaryotes but do not infect prokaryotes. Lastly, dsDNA-RT viruses infect animals and plants but not prokaryotes. Whether dsDNA-RT viruses infect protists is unknown.[3]
By host, a large majority of prokaryotic viruses are dsDNA viruses, but a significant minority are ssDNA and +ssRNA viruses. There are a relatively small number of prokaryotic dsRNA viruses and no prokaryotic –ssRNA or RT viruses. Eukaryotic viruses, in contrast, are predominantly RNA viruses, though eukaryotic DNA viruses are common. Well-characterized eukaryoticviromes contain mostly +ssRNA viruses and, in some lineages such as fungi, dsRNA viruses. ssRNA-RT viruses are also common in eukaryotes, especially in animals.[3][87][96][97]
Biological factors influence host range. For example, dsDNA viruses do not infect plants because large dsDNA molecules are unable to pass throughplasmodesmata, intercellular channels that connectplant cells. The dominance of DNA viruses in prokaryotes may be because they outcompete RNA viruses. In eukaryotic cells, however, the nucleus is a barrier that requires adaptation by DNA viruses. They either have to evolve means to enter the nucleus to replicate or obtain their own replication and transcription machinery to replicate in virus factories in the cytosol. In contrast, theendomembranes of eukaryotic cells appear to be a beneficial environment for RNA virus replication.[3]

Viruses often package into the virion machineries necessary for replication and transcription, varying by Baltimore group. dsDNA viruses sometimes package transcription machinery, ssDNA and +ssRNA viruses do not package either replication or transcription machinery, dsRNA and +ssRNA-RT viruses package both, –ssRNA viruses package almost everything, and dsDNA-RT viruses package most components of their replication and transcription machinery.[3]
dsDNA viruses encode a broad range of proteins involved in replication and transcription. In some cases, they encode nearly complete systems that grant the virus partial autonomy from cells during genome expression and replication.[3] Most ssDNA viruses encode an endonuclease that initiates RCR or RHR while relying on host cell machinery for the rest of replication and transcription.[3][28][29] The endonuclease has to be encoded by these viruses because they use a DNA replication method not normally used by cells. Anelloviruses and bidnaviruses are the exceptions: anelloviruses encode proteins that aren'thomologous to known proteins, and bidnaviruses encode a protein-primed DNA polymerase.[3]
RNA replication and reverse transcription are usually discouraged by cells, which necessitates that all RNA and RT viruses encode their own RNA-dependent polymerase.[3]Satellite viruses, such as the viruses ofRibozyviria, are the only exception because they depend on other viruses for replication.[3][62] Almost all RNA and RT viruses incorporate their RNA-dependent polymerase into the virion because the enzyme is required to synthesize viral mRNA in infected cells. The exceptions are +ssRNA viruses and caulimoviruses, which are dsDNA-RT viruses. +ssRNA viruses do not do so because their genomes function as mRNA and are translated upon cell entry.[3][97] For caulimoviruses, the host enzyme RNA polymerase II transcribes the genome.[98][99]

Translation is the process by whichproteins are synthesized from mRNA byribosomes. Baltimore groups do not directly pertain to the translation of viral mRNA to proteins, but atypical types of translation used by viruses are usually found within specific Baltimore groups. For example:[7][101]


Excluding ribozyvirians, RNA viruses of groups III–V are believed to share common ancestry.[2][3] +ssRNA viruses form the basal, ancestral lineage of these viruses from which dsRNA viruses and –ssRNA viruses appear to have evolved from on multiple occasions.[3][66] The two orders of RT viruses in the classRevtraviricetes,Blubervirales andOrtervirales, are believed by virologists to have evolved from two different families ofretrotransposons on separate occasions.[114] ssRNA-RT viruses all belong toOrtervirales and thus share common ancestry.[3][72] dsDNA-RT viruses, on the other hand, are found in both orders and therefore represent two separate lineages of dsDNA-RT viruses.[72][98] Ribozyvirians constitute a lineage of –ssRNA viruses unrelated to other RNA viruses.[3]
Most ssDNA viruses likely originate fromplasmids that, on multiple occasions,recombined with other genomes to obtain the structural proteins needed to form virions.[2][3] The evolutionary history of dsDNA viruses is the most complex as they appear to have emerged independently on numerous occasions. Two major lineages of dsDNA viruses exist: the realmDuplodnaviria and the realmVaridnaviria, the latter of which also contains ssDNA viruses that are descended from dsDNA viruses. The opposite is true in the realmMonodnaviria, which contains dsDNA viruses descended from ssDNA viruses. There are also two minor realms,Adnaviria andSingelaviria,[22] that exclusively contain dsDNA viruses. Lastly, there are dsDNA virus families unassigned to higher taxa that are unique from existing realms and which likely constitute small realms.[2][3]
Of the replication-expression systems used by viruses, only Baltimore group I (dsDNA) is used by cells.[2][3] The other groups may be remnants of the primordial stage of life before the emergence of modern-like cells, during which the dsDNA system used by extant cells had not yet become uniform. The ancestors of RNA viruses in particular may have emerged during the time of theRNA world. And although virus realms are evolutionarily independent from each other, the replicative proteins encoded by viruses in the four major realms (Duplodnaviria,Monodnaviria,Riboviria, andVaridnaviria) are built on the coreRNA recognition motif, one of the most common nucleic acid-bindingdomains in nature. Therefore, the replication-expression cycles most likely diversified before the separation of large dsDNA replicators, which became the ancestors of cellular life, from other types of replicators, which becameselfish genetic elements and gave rise to viruses.[3]

Before Baltimore classification was created, a variety of classification systems for viruses had been proposed, classifying viruses by host, structure,biochemical properties, and other characteristics.[115] Baltimore classification was proposed in 1971 by virologistDavid Baltimore in a paper titled "Expression of Animal Virus Genomes" that was published in theacademic journalBacteriology Reviews (now namedMicrobiology and Molecular Biology Reviews). Baltimore focused on classifyinganimal viruses but effectively classified all viruses by their routes of information transmission from genomic nucleic acid to mRNA.[3][116] The system initially contained the first six groups but was later expanded to include group VII after the discovery of dsDNA-RT viruses.[3][117][118] Because of the utility of Baltimore classification, it came to be used alongside standard virus taxonomy, which is based on evolutionary relationships and governed by theInternational Committee on Taxonomy of Viruses (ICTV).[118]
Over time, the belief that Baltimore groups weremonophyletic spread among virologists. This was reflected in taxonomies published by the ICTV and theNational Center for Biotechnology Information, which for decades placed the Baltimore groups as informal higher ranks above official taxonomic ranks.[3] From 1991 to 2017, virus taxonomy used a five-rank system ranging from order to species, with Baltimore classification used in conjunction. Outside of official taxonomy, supergroups of viruses joining different taxa were created over time based on increasing evidence of deeper evolutionary relations.[118] The advancement of sequencing methods in the 21st century in particular made it possible to study virus evolution and diversity in greater detail. This enabled virologists to better understand the relationships between Baltimore groups and the evolutionary history of viruses.[3][116] Consequently, in 2016, the ICTV began to consider establishing ranks higher than order as well as how the Baltimore groups would be treated among higher taxa.[118]
In two votes in 2018 and 2019, the ICTV established a 15-rank system ranging from realm to species.[118] As part of this, the Baltimore groups for ssDNA, dsRNA, +ssRNA, –ssRNA, and RT viruses were incorporated into formal taxa. In 2019, the realmRiboviria was established and initially included all dsRNA, +ssRNA, and –ssRNA viruses.[119][120] A year later,Riboviria was expanded to also include RT viruses.[42][121] Within the realm, RT viruses are included in the kingdomPararnavirae and the three other Baltimore groups in the kingdomOrthornavirae as defining traits of the kingdom's phyla.[42] While –ssRNA viruses ofRibozyviria were initially classified inRiboviria, this was a clerical error that was fixed in 2020.[122][123] A year later,Ribozyviria was established for HDV and its relatives.[65][123] For ssDNA viruses, the realmMonodnaviria was established in 2020 to accommodate almost all ssDNA viruses, as well as dsDNA viruses descended from them.[23][124]
In 1974, virologistVadim Agol proposed an extension of Baltimore classification to encompass all possible means of genetic information transmission and describe the hierarchical routes of information transmission, including both expression and replication, rather than solely mRNA synthesis. In the expanded system, there are 35 classes, 17 superclasses, and six types of genetic information transfer. The system was revisited in 2021 byKoonin et al. in light of discoveries made since the 1970s. Known viruses occupy 13 classes, one of which is shared with cells, seven superclasses, and three types. A fourteenth class is occupied byF-like plasmids. Ambisense viruses occupy two classes simultaneously, though separate classes could be made for them. Most unoccupied classes are of DNA-RNA hybrids, which appear to be disfavored by evolution since it may be advantageous to convert such molecules to dsDNA, the molecule most suitable for genome replication. According to Koonin et al., viruses that belong to the unoccupied classes are unlikely to be discovered unless they are rare in nature.[3]