Region within a prokaryotic cell containing genetic material
Thenucleoid (meaningnucleus-like) is an irregularly shaped region within theprokaryotic cell that contains all or most of thegenetic material.[1][2][3] Thechromosome of a typical prokaryote iscircular, and its length is very large compared to the cell dimensions, so it needs to be compacted in order to fit. In contrast to thenucleus of aeukaryotic cell, it is not surrounded by anuclear membrane. Instead, the nucleoid forms by condensation and functional arrangement with the help of chromosomal architecturalproteins andRNA molecules as well asDNA supercoiling. The length of a genome widely varies (generally at least a few million base pairs) and a cell may contain multiple copies of it.
There is not yet a high-resolution structure known of a bacterial nucleoid, however key features have been researched inEscherichia coli as amodel organism. InE. coli, the chromosomal DNA is on averagenegatively supercoiled and folded intoplectonemic loops, which are confined to different physical regions, and rarely diffuse into each other. These loops spatially organize into megabase-sized regions called macrodomains, within which DNA sites frequently interact, but between which interactions are rare. The condensed and spatially organized DNA forms a helical ellipsoid that is radially confined in the cell. The 3D structure of the DNA in the nucleoid appears to vary depending on conditions and is linked togene expression so that the nucleoid architecture andgene transcription are tightly interdependent, influencing each other reciprocally.
Formation of theEscherichia coli nucleoidA. An illustration of an open conformation of the circular genome ofEscherichia coli. Arrows represent bi-directional DNA replication. The genetic position of the origin of bi-directional DNA replication (oriC) and the site of chromosome decatenation (dif) in the replication termination region (ter) are marked. Colors represent specific segments of DNA as discussed in C.B. An illustration of a random coil form adopted by the pure circular DNA ofEscherichia coli at thermal equilibrium without supercoils and additional stabilizing factors.[4][5]C. A cartoon of the chromosome of a newly bornEscherichia coli cell. The genomic DNA is not only condensed by 1000-fold compared to its pure random coil form but is also spatially organized.oriC anddif are localized in the mid-cell, and specific regions of the DNA indicated by colors in A organize into spatially distinct domains.
In many bacteria, thechromosome is a single covalently closed (circular) double-stranded DNA molecule that encodes the genetic information in ahaploid (monoploid)[6][7][8] form. The size of the DNA varies from 500,000 to several millionbase pairs (bp) encoding from 500 to several thousand genes depending on the organism.[2] The chromosomal DNA is present in cells in a highly compact, organized form called the nucleoid (meaningnucleus-like), which is not encased by anuclear membrane as in eukaryotic cells.[9] The isolated nucleoid contains 80% DNA, 10% protein, and 10% RNA by weight.[10][11]
Thegram-negative bacteriumEscherichia coli is a model system for nucleoid research into how chromosomal DNA becomes the nucleoid, the factors involved therein, what is known about its structure, and how some of the DNA structural aspects influencegene expression.[2][3]
There are two essential aspects of nucleoid formation; condensation of a large DNA into a small cellular space and functional organization of DNA in a three-dimensional form. The haploid circular chromosome inE. coli consists of ~ 4.6 x 106 bp. If DNA is relaxed in theB form, it would have a circumference of ~1.5 millimeters (0.332 nm x 4.6 x 106). However, a large DNA molecule such as theE. coli chromosomal DNA does not remain a straight rigid molecule in a suspension.[5]Brownian motion will generatecurvature and bends in DNA. The maximum length up to which a double-helical DNA remains straight by resisting the bending enforced by Brownian motion is ~50 nm or 150 bp, which is called thepersistence length. Thus, pure DNA becomes substantially condensed without any additional factors; at thermal equilibrium, it assumes arandom coil form.[4][5] The random coil ofE. coli chromosomal DNA would occupy a volume (4/3 π r3) of ~ 523 μm3, calculated from theradius of gyration (Rg = (√N a)/√6) wherea is theKuhn length (2 x persistence length), andN is the number of Kuhn length segments in the DNA (total length of the DNA divided bya).[5] Although DNA is already condensed in the random coil form, it still cannot assume the volume of the nucleoid which is less than a micron. Thus, the inherent property of DNA is not sufficient: additional factors must help condense DNA further on the order of ~103 (volume of the random coil divided by the nucleoid volume). The second essential aspect of nucleoid formation is the functional arrangement of DNA. Chromosomal DNA is not only condensed but also functionally organized in a way that is compatible with DNA transaction processes such asreplication,recombination,segregation, andtranscription.[12][13][14] Almost five decades of research beginning in 1971,[10] has shown that the final form of the nucleoid arises from a hierarchical organization of DNA. At the smallest scale (1 kb or less), nucleoid-associated DNA architectural proteins condense and organize DNA by bending, looping, bridging or wrapping DNA. At a larger scale (10 kb or larger), DNA forms plectonemic loops, a braided form of DNA induced by supercoiling. At the megabase scale, the plectonemic loops coalesce into six spatially organized domains (macrodomains), which are defined by more frequent physical interactions among DNA sites within the same macrodomain than between different macrodomains.[15] Long- and short-range DNA-DNA connections formed within and between the macrodomains contribute to condensation and functional organization. Finally, the nucleoid is a helicalellipsoid with regions of highly condensed DNA at the longitudinal axis.[16][17][18]
Nucleoid at ≥1 kb scale. DNA organization by nucleoid-associated proteins. DNA is depicted as a grey straight or curved line and the nucleoid-associated proteins are depicted as blue spheres.
In eukaryotes, genomic DNA is condensed in the form of a repeating array of DNA-protein particles callednucleosomes.[19][20][21]
A nucleosome consists of ~146 bp of DNA wrapped around an octameric complex of thehistone proteins. Although bacteria do not have histones, they possess a group of DNA binding proteins referred to as nucleoid-associated proteins (NAPs) that are functionally analogous to histones in a broad sense. NAPs are highly abundant and constitute a significant proportion of the protein component of nucleoid.[22]
A distinctive characteristic of NAPs is their ability to bind DNA in both a specific (either sequence- or structure-specific) and non-sequence specific manner. As a result, NAPs are dual function proteins.[23] The specific binding of NAPs is mostly involved in gene-specifictranscription,DNA replication,recombination, andrepair.[12][13][14] At the peak of their abundance, the number of molecules of many NAPs is several orders of magnitude higher than the number of specific binding sites in the genome.[23] Therefore, it is reasoned that NAPs bind to the chromosomal DNA mostly in the non-sequence specific mode and it is this mode that is crucial for chromosome compaction. Non-sequence specific binding of a NAP may not be completely random; there could be low-sequence specificity and or structural specificity due to sequence-dependent DNA conformation or DNA conformation created by other NAPs.[21]
Although molecular mechanisms of how NAPs condense DNAin vivo are not well understood, based on the extensivein vitro studies it appears that NAPs participate in chromosome compaction via the following mechanisms: NAPs induce and stabilize bends in DNA, thus aid inDNA condensation by reducing the persistence length.[23] NAPs condense DNA by bridging, wrapping, and bunching that could occur between nearby DNA segments or distant DNA segments of the chromosome. Another mechanism by which NAPs participate in chromosome compaction is by constrainingnegative supercoils in DNA thus contributing to the topological organization of the chromosome.[23]
There are at least 12 NAPs identified inE. coli,[23] the most extensively studied of which are HU, IHF, H-NS, and Fis. Their abundance and DNA binding properties and effect on DNA condensation and organization are summarized in the tables below.[23]
Properties and the abundance of major nucleoid-associated proteins ofE. coli
1 Abundance (molecules/cell) data were taken from;[24] The number in the parenthesis is micromolar concentration calculated using the following formula: (number of native functional units/Avogadro number) x (1/cell volume in liter) x 103. Cell volume in liter ( 2 x 10−15) was determined by assuming volume of theE. coli cell to be 2 μm3.[24]
DNA binding properties of nucleoid-associated DNA architectural protein ofE. coli
Protein
Binding motif
Specific DNA binding affinity1
Random DNA binding affinity1
HU
A structural motif defined by bends and kinks in DNA[25][26]
Histone-like protein fromE. coli strain U93 (HU) is an evolutionarily conserved protein in bacteria.[36][37] HU exists inE. coli as homo- and heterodimers of two subunits HUα and HUβ sharing 69% amino acid identity.[38] Although it is referred to as a histone-like protein, close functional relatives of HU in eukaryotes arehigh-mobility group (HMG) proteins, and not histones.[39][40] HU is a non-sequence specific DNA binding protein. It binds with low-affinity to any linear DNA. However, it preferentially binds with high-affinity to a structurally distorted DNA.[41][42][43][44][45][27] Examples of distorted DNA substrates includecruciform DNA, bulged DNA, dsDNA containing a single-stranded break such asnicks, gaps, orforks. Furthermore, HU specifically binds and stabilizes a protein-mediated DNA loop.[46] In the structurally specific DNA binding mode, HU recognizes a common structural motif defined by bends or kinks created by distortion,[25][47][26] whereas it binds to a linear DNA by locking the phosphate backbone.[48] While the high-affinity structurally-specific binding is required for specialized functions of HU such assite-specific recombination,DNA repair,DNA replication initiation, and gene regulation,[12][13][14] it appears that the low-affinity general binding is involved in DNA condensation.[48] In chromatin-immunoprecipitation coupled with DNA sequencing (ChIP-Seq), HU does not reveal any specific binding events.[49] Instead, it displays a uniform binding across the genome presumably reflecting its mostly weak, non-sequence specific binding, thus masking the high-affinity bindingin vivo.[49]
In strains lacking HU, the nucleoid is "decondensed", consistent with a role of HU in DNA compaction.[50] The followingin vitro studies suggest possible mechanisms of how HU might condense and organize DNAin vivo. Not only HU stably binds to distorted DNA with bends, it induces flexible bends even in a linear DNA at less than 100 nM concentration. In contrast, HU shows the opposite architectural effect on DNA at higher physiologically relevant concentrations.[48][12][13][14][50][51] It forms rigid nucleoprotein filaments causing the straitening of DNA and not the bending. The filaments can further form a DNA network (DNA bunching) expandable both laterally and medially because of the HU-HU multimerization triggered by the non-sequence-specific DNA binding.[48]
How are these behaviors of HU relevant inside the cell? The formation of filaments requires high-density binding of HU on DNA, one HU dimer per 9-20 bp DNA. But there is only one HU dimer every ~150 bp of the chromosomal DNA based on the estimated abundance of 30,000 HU dimers per cell (4600000 bp /30,000).[24] This indicates that the flexible bends are more likely to occurin vivo. The flexible bending would cause condensation due to a reduction in thepersistence length of DNA as shown bymagnetic tweezers experiments, which allow studying condensation of a single DNA molecule by a DNA binding protein.[51][52] However, because of thecooperativity, the rigid filaments and networks could form in some regions in the chromosome. The filament formation alone does not induce condensation,[51] but DNA networking or bunching can substantially contribute to condensation by bringing distant or nearby chromosome segments together.[48]
Genome-wide occupancy of nucleoid-associated proteins ofE. coli. A circular layout of theE. coli genome depicting genome-wide occupancy of NAPs Fis, H-NS, HU, and IHF in growth and stationary phases inE. coli. Histogram plots of the genome occupancy of NAPs as determined by chromatin-immunoprecipitation coupled with DNA sequencing (ChIP-seq) are shown outside the circular genome. The bin size of the histograms is 300 bp. Figure prepared in circos/0.69-6 using the ChIP-Seq data from.[49][53]
Integration host factor (IHF) is structurally almost identical to HU[54] but behaves differently from HU in many aspects. Unlike HU, which preferentially binds to a structural motif regardless of the sequence, IHF preferentially binds to a specific DNA sequence even though the specificity arises through the sequence-dependent DNA structure and deformability. The specific binding of IHF at cognate sites bends DNA sharply by >160-degree.[54] An occurrence of the cognate sequence motif is about 3000 in theE. coli genome.[49] The estimated abundance of IHF in the growth phase is about 6000 dimers per cell. Assuming that one IHF dimer binds to a single motif and nucleoid contains more than one genome equivalent during the exponential growth phase, most of the IHF molecules would occupy specific sites in the genome and likely only condense DNA by inducing sharp bending.[49]
Besides preferential binding to a specific DNA sequence, IHF also binds to DNA in a non-sequence specific manner with the affinities similar to HU. A role of the non-specific binding of IHF in DNA condensation appears to be critical in the stationary phase because the IHF abundance increases by five-fold in the stationary phase and the additional IHF dimers would likely bind the chromosomal DNA non-specifically.[24][55][56] Unlike HU, IHF does not form thick rigid filaments at higher concentrations. Instead, its non-specific binding also induces DNA bending albeit the degree of bending is much smaller than that at specific sites and is similar to the flexible bending induced by HU in a linear DNA at low concentrations.[57]In vitro, the bending induced by non-specific binding of IHF can cause DNA condensation and promotes the formation of higher-order nucleoprotein complexes depending on the concentrations of potassium chloride and magnesium chloride.[57] The higher-order DNA organization by IHFin vivo is as yet unclear.[57]
A distinguishable feature of histone-like or heat-stable nucleoid structuring protein (H-NS)[58][59][60][61] from other NAPs is the ability to switch from the homodimeric form at relatively low concentrations (<1 x 10−5 M) to an oligomeric state at higher levels.[62][63] Because of oligomerization properties, H-NS spreads laterally along AT-rich DNA in anucleation reaction, where high-affinity sites function as nucleation centers.[64][65][31] The spreading of H-NS on DNA results in two opposite outcomes depending on the magnesium concentration in the reaction. At low magnesium concentration (< 2 mM), H-NS forms rigid nucleoprotein filaments whereas it forms inter- and intra-molecular bridges at higher magnesium concentrations (> 5 mM).[66][67][68][69][70] The formation of rigid filaments results in straightening of DNA with no condensation whereas the bridging causes substantial DNA folding.[69] Analysis of H-NS binding in the genome byChIP-Seq assays provided indirect evidence for the spreading of H-NS on DNAin vivo. H-NS binds selectively to 458 regions in the genome.[53] Although H-NS has been demonstrated to prefer curved DNA formed by repeated A-tracks in DNA sequences[64][71] the basis of the selective binding is the presence of a conserved sequence motif found in AT-rich regions.[30] More importantly, the frequent occurrence of the sequence motif within an H-NS binding region that can re-enforce the cooperative protein-protein interactions, and the unusually long length of the binding region are consistent with the spreading of the protein. Whether the filament formation or DNA bridging is prevalentin vivo depends on the physiological concentration of magnesium inside the cell.[69][72] If the magnesium concentration is uniformly low (< 5 mM), H-NS would form rigid nucleoprotein filamentsin vivo.[69] Alternatively, if there is an uneven distribution of magnesium in the cell, it could promote both DNA bridging and stiffening but in different regions of the nucleoid.[69]
Furthermore, H-NS is best known as a global gene silencer that preferentially inhibits transcription of horizontally transferred genes and it is the rigid filament that leads to gene silencing.[73][74] Taken together, it appears that the formation of rigid filaments is the most likely outcome of H-NS-DNA interactionsin vivo that leads to gene silencing but does not induce DNA condensation. Consistently, the absence of H-NS does not change the nucleoid volume.[75] However, it is possible thatE. coli experiences high-magnesium concentration under some environmental conditions. In such conditions, H-NS can switch from its filament inducing form to the bridge inducing form that contributes to DNA condensation and organization.[69]
Factor for Inversion Stimulation (Fis) is a sequence specific DNA binding protein that binds to specific DNA sequences containing a 15-bp symmetric motif.[32][33][76] Like IHF, Fis induces DNA bending at cognate sites. The ability to bend DNA is apparent in the structure of Fis homodimer. A Fis homodimer possesses twohelix-turn-helix (HTH) motifs, one from each monomer. An HTH motif typically recognizes the DNA major groove. However, the distance between the DNA recognition helices of the two HTH motifs in the Fis homodimer is 25Å, that is ~ 8 Å shorter than the pitch of a canonicalB-DNA, indicating that the protein must bend or twist DNA to bind stably.[77][78] Consistently, thecrystal structure of Fis-DNA complexes shows that the distance between the recognition helices remains unchanged whereas DNA curves in the range of 60-75 degree.[33] There are 1464 Fis binding regions distributed across theE. coli genome and a binding motif, identified computationally, matches with the known 15-bp motif.[53][79] Specific binding of Fis at such sites would induce bends in DNA, thus contribute to DNA condensation by reducing persistence length of DNA. Furthermore, many Fis binding sites occur in tandem such as those in the stable RNA promoters, e.g.,P1 promoter of rRNAoperonrrnB. The coherent bending by Fis at the tandem sites is likely to create a DNA micro-loop that can further contribute to DNA condensation.[80]
Besides high-affinity specific binding to cognate sites, Fis can bind to a random DNA sequence. The non-specific DNA binding is significant because Fis is as abundant as HU in thegrowth phase. Therefore, most of Fis molecules are expected to bind DNA in a non-sequence specific manner.Magnetic tweezers experiments show that this non-specific binding of Fis can contribute to DNA condensation and organization.[81][82] Fis causes mild condensation of a single DNA molecule at <1 mM, but induces substantial folding through the formation of DNA loops of an average size of ~800 bp at >1 mM. The loops in magnetic tweezers experiments are distinct from the micro-loops created by coherent DNA bending at cognate sites, as they require the formation of high-density DNA-protein complexes achieved by sequence-independent binding. Although, occurrence of such loopsin vivo remains to be demonstrated, high-density binding of Fis may occurin vivo through concerted action of both specific and non-specific binding. The in-tandem occurrence of specific sites might initiate a nucleation reaction similar to that of H-NS, and then non-specific binding would lead to the formation of localized high-density Fis arrays. The bridging between these localized regions can create large DNA loops.[82] Fis is exclusively present in thegrowth phase and not in thestationary phase.[83][84] Thus, any role in chromosomal condensation by Fis must be specific to growing cells.[84]
Early studies examining the effect of RNase A treatment on isolated nucleoids indicated thatRNA participated in the stabilization of the nucleoid in the condensed state.[85] Moreover, treatment with RNase A disrupted the DNA fibers into thinner fibers, as observed by an atomic force microscopy of the nucleoid using the “on-substrate lysis procedure”.[86] These findings demonstrated the participation of RNA in the nucleoid structure, but the identity of the RNA molecule(s) remained unknown until recently.[50] Most of the studies on HU focused on its DNA binding.[86] However, HU also binds todsRNA and RNA-DNA hybrids with a lower affinity similar to that with a linear dsDNA.[87] Moreover, HU preferentially binds to RNA containing secondary structures and an RNA-DNA hybrid in which the RNA contains a nick or overhang.[87][88] The binding affinities of HU with these RNA substrates are similar to those with which it binds to distorted DNA. An immunoprecipitation of HU-bound RNA coupled to reverse transcription and microarray (RIP-Chip) study as well as an analysis of RNA from purified intact nucleoids identified nucleoid-associated RNA molecules that interact with HU.[50] Several of them are non-coding RNAs, and one such RNA named naRNA4 (nucleoid-associated RNA 4), is encoded in a repetitive extragenic palindrome (REP325). In a strain lackingREP325, the nucleoid is decondensed as it is in a strain lacking HU.[50] naRNA4 most likely participate in DNA condensation by connecting DNA segments in the presence of HU.[89] Recent studies provide insights into the molecular mechanism of how naRNA4 establishes DNA-DNA connections. The RNA targets regions of DNA containing cruciform structures and forms an RNA-DNA complex that is critical for establishing DNA-DNA connections.[90] Surprisingly, although HU helps in the formation of the complex, it is not present in the final complex, indicating its potential role as a catalyst (chaperone). The nature of the RNA-DNA complex remains puzzling because the formation of the complex does not involve extensive Watson/Crick base pairing but is sensitive to RNase H, which cleaves RNA in an RNA-DNA hybrid and the complex binds to an antibody specific to RNA-DNA hybrids.[50][86][87]
DNA supercoilingA. A linear double-stranded DNA becomes a topologically constrained molecule if the two ends are covalently joined, forming a circle. Rules of DNA topology are explained using such a molecule (ccc-DNA) in which a numerical parameter called the linking number (Lk) defines the topology. Lk is a mathematical sum of two geometric parameters, twist (Tw) and writhe (Wr). A twist is the crossing of two strands, and writhe is coiling of the DNA double helix on its axis that requires bending. Lk is always an integer and remains invariant no matter how much the two strands are deformed. It can only be changed by introducing a break in one or both DNA strands by DNA metabolic enzymes called topoisomerases.B. A torsional strain created by a change in Lk of a relaxed, topologically constrained DNA manifests in the form of DNA supercoiling. A decrease in Lk (Lk<Lk0) induces negative supercoiling whereas an increase in Lk (Lk>Lk0) induces positive supercoiling. Only negative supercoiling is depicted here. For example, if a cut is introduced into a ccc-DNA and four turns are removed before rejoining the two strands, the DNA becomes negatively supercoiled with a decrease in the number of twists or writhe or both. Writhe can adopt two types of geometric structures called plectoneme and toroid. Plectonemes are characterized by the interwinding of the DNA double helix and an apical loop, whereas spiraling of DNA double helix around an axis forms toroids.
Because of itshelical structure, a double-stranded DNA molecule becomes topologically constrained in the covalently closed circular form which eliminates the rotation of the free ends.[91] The number of times the two strands cross each other in a topologically constrained DNA is called thelinking number (Lk), which is equivalent to the number of helical turns or twists in a circular molecule.[92] The Lk of atopological DNA remains invariant, no matter how the DNA molecule is deformed, as long as neither strand is broken.[93][94]
The Lk of DNA in the relaxed form is defined as Lk0. For any DNA, Lk0 can be calculated by dividing the length (in bp) of the DNA by the number of bp per helical turn. This is equal to 10.4 bp for the relaxedB-form DNA. Any deviation from Lk0 causessupercoiling in DNA. A decrease in the linking number (Lk<Lk0) creates negative supercoiling whereas an increase in the linking number (Lk>Lk0) creates positive supercoiling.[95][93]
The supercoiled state (when Lk is not equal to Lk0) results in a transition in DNA structure that can manifest as a change in the number of twists (negative <10.4 bp/turn, positive >10.4 bp per turn) and/or in the formation ofwrithes, called supercoils. Thus, Lk is mathematically defined as a sign dependent sum of the two geometric parameters, twist and writhe. A quantitative measure of supercoiling that is independent of the size of DNA molecules is the supercoiling density (σ) where σ =∆Lk/Lk0.[94]
Writhes can adopt two structures; plectoneme andsolenoid or toroid. A plectonemic structure arises from the interwinding of the helical axis. Toroidal supercoils originate when DNA forms several spirals, around an axis and not intersecting with each other, like those in a telephone cord.[93] The writhes in the plectonemes form are right- and left-handed in positively or negatively supercoiled DNA, respectively. The handedness of the toroidal supercoils is opposite to those of plectonemes. Both plectonemes and toroidal supercoils can be either in a free form or restrained in a bound form with proteins. The best example of the bound toroidal supercoiling in biology is the eukaryoticnucleosome in which DNA wraps aroundhistones.[20]
Basic units of genomic organization in bacteria and eukaryotes Genomic DNA, depicted as a grey line, is negatively supercoiled in both bacteria and eukaryotes. However, the negatively supercoiled DNA is organized in the plectonemic form in bacteria, whereas it is organized in the toroidal form in eukaryotes. Nucleoid associated proteins (NAPs), shown as colored spheres, restrain half of the plectonemic supercoils, whereas almost all of the toroidal supercoils are induced as well as restrained by nucleosomes (colored orange), formed by wrapping of DNA around histones.
In most bacteria, DNA is present in supercoiled form. The circular nature of theE. coli chromosome makes it topologically constrained molecule that is mostly negatively supercoiled with an estimated average supercoiling density (σ) of -0.05.[96] In the eukaryoticchromatin, DNA is found mainly in the toroidal form that is restrained and defined by histones through the formation of nucleosomes. In contrast, in theE. coli nucleoid, about half of the chromosomal DNA is organized in the form of free, plectonemic supercoils.[97][98][99] The remaining DNA is restrained in either the plectonemic form or alternative forms, including but not limited to the toroidal form, by interaction with proteins such as NAPs. Thus, plectonemic supercoils represent effective supercoiling of theE. coli genome that is responsible for its condensation and organization. Both plectonemic and toroidal supercoiling aid in DNA condensation. Branching of plectonemic structures provides less DNA condensation than does the toroidal structure. A same size DNA molecule with equal supercoiling densities is more compact in a toroidal form than in a plectonemic form. In addition to condensing DNA, supercoiling aids in DNA organization. It promotes disentanglement of DNA by reducing the probability of catenation.[100] Supercoiling also helps bring two distant sites of DNA in proximity thereby promoting a potential functional interaction between different segments of DNA.[94]
Three factors contribute to generating and maintaining chromosomal DNA supercoiling inE. coli: (i) activities oftopoisomerases, (ii) the act oftranscription, and (iii) NAPs.[98]
Topoisomerases are a particular category of DNA metabolic enzymes that create or remove supercoiling by breaking and then re-ligating DNA strands.[101]E. coli possesses four topoisomerases.DNA gyrase introduces negative supercoiling in the presence of ATP and it removes positive supercoiling in the absence of ATP.[102] Across all forms of life, DNA gyrase is the only topoisomerase that can create negative supercoiling and it is because of this unique ability that bacterial genomes possess free negative supercoils; DNA gyrase is found in all bacteria but absent from higher eukaryotes. In contrast, Topo I opposes DNA gyrase by relaxing the negatively supercoiled DNA.[103][104] There is genetic evidence to suggest that a balance between the opposing activities of DNA gyrase and Topo I are responsible for maintaining a steady-state level of average negative superhelicity inE. coli.[103][105] Both enzymes are essential forE. coli survival. A null strain oftopA, the gene encoding Topo I, survives only because of the presence of suppressor mutations in the genes encoding DNA gyrase.[103][105] These mutations result in reduced gyrase activity, suggesting that excess negative supercoiling due to the absence of Topo I is compensated by reduced negative supercoiling activity of DNA gyrase. Topo III is dispensable inE. coli and is not known to have any role in supercoiling inE. coli.[106] The primary function of Topo IV is to resolve sister chromosomes. However, it has been shown to also contribute to the steady-state level of negative supercoiling by relaxing negative supercoiling together with Topo I.[107][108]
E. coli DNA topoisomerases
Topoisomerase
Type
Function
Single- or double-stranded cleavage
Topoisomerase I
IA
Removes (-) supercoiling
SS
Topoisomerase III
IA
Removes (-) supercoiling
SS
Topoisomerase IV
IIA
Removes (-) supercoiling
DS
DNA gyrase
IIA
Creates (-) supercoiling and removes (+) supercoiling
A twin supercoiling domain model proposed by Liu and Wang argued that unwinding ofDNA double helix during transcription induces supercoiling in DNA as shown in.[109] According to their model, transcribingRNA polymerase (RNAP) sliding along DNA forces the DNA to rotate on its helical axis. A hindrance in the free rotation of DNA might arise due to a topological constraint, causing the DNA in front of RNAP to become over-twisted (positively supercoiled) and the DNA behind RNAP would become under-twisted (negatively supercoiled). It has been found that a topological constraint is not needed because RNAP generates sufficient torque that causes supercoiling even in a linear DNA template.[110] If DNA is already negatively supercoiled, this action relaxes existing negative supercoils before causing a buildup of positive supercoils ahead of RNAP and introduces more negative supercoils behind RNAP. In principle, DNA gyrase and Topo I should remove excess positive and negative supercoils respectively but if the RNAP elongation rate exceeds the turnover of the two enzymes, transcription contributes to the steady-state level of supercoiling.[110]
Twin supercoiling domain model for transcription-induced supercoilingA. An example of topologically constrained DNA. A grey bar represents a topological constraint, e.g. a protein or a membrane anchor.B. Accommodation of RNA polymerase for transcription initiation results in the opening of the DNA double helix.C. An elongating RNA polymerase complex cannot rotate around the helical axis of DNA. Therefore, removal of helical turns by RNA polymerase causes overwinding of the topologically constrained DNA ahead and underwinding of the DNA behind, generating positively and negatively supercoiled DNA, respectively. Supercoiling can manifest as either change in the numbers of twists as shown in C or plectonemic writhe as shown in D.
In the eukaryotic chromatin, DNA is rarely present in the free supercoiled form because nucleosomes restrain almost all negative supercoiling through tight binding of DNA to histones. Similarly, inE. coli, nucleoprotein complexes formed by NAPs restrain half of the supercoiling density of the nucleoid.[96][99] In other words, if a NAP dissociates from anucleoprotein complex, the DNA would adopt the free, plectonemic form. DNA binding of HU, Fis, and H-NS has been experimentally shown to restrain negative supercoiling in a relaxed but topologically constrained DNA.[111][112][113][114][115] They can do so either by changing the helical pitch of DNA or generating toroidal writhes by DNA bending and wrapping. Alternatively, NAPs can preferentially bind to and stabilize other forms of the underwound DNA such as cruciform structures and branched plectonemes. Fis has been reported to organize branched plectonemes through its binding to cross-over regions and HU preferentially binds to cruciform structures.[115]
NAPs also regulate DNA supercoiling indirectly. Fis can modulate supercoiling by repressing the transcription of the genes encoding DNA gyrase.[116] There is genetic evidence to suggest that HU controls supercoiling levels by stimulating DNA gyrase and reducing the activity of Topo I.[117][118] In support of the genetic studies, HU was shown to stimulate DNA gyrase-catalyzed decatenation of DNAin vitro.[119] It is unclear mechanistically how HU modulates the activities of the gyrase and Topo I. HU might physically interact with DNA gyrase and Topo I or DNA organization activities of HU such as DNA bending may facilitate or inhibit the action of DNA gyrase and Topo I respectively.[117][119]
Plectonemic supercoils organize into multiple topological domains
One of the striking features of the nucleoid is that plectonemic supercoils are organized into multiple topological domains.[120] In other words, a single cut in one domain will only relax that domain and not the others. A topological domain forms because of a supercoiling-diffusion barrier. Independent studies employing different methods have reported that the topological domains are variable in size ranging from 10 to 400 kb.[98][120][121] A random placement of barriers commonly observed in these studies seems to explain the wide variability in the size of domains.[120]
Although identities of domain barriers remain to be established, possible mechanisms responsible for the formation of the barriers include: (i) A domain barrier could form when a protein with an ability to restrain supercoils simultaneously binds to two distinct sites on the chromosome forming a topologically isolated DNA loop or domain. It has been experimentally demonstrated that protein-mediated looping in supercoiled DNA can create a topological domain.[122][123] NAPs such as H-NS and Fis are potential candidates, based on their DNA looping abilities and the distribution of their binding sites. (ii) Bacterial interspersed mosaic elements (BIMEs) also appear as potential candidates for domain barriers. BIMEs are palindromic repeats sequences that are usually found between genes. A BIME has been shown to impede diffusion of supercoiling in a synthetically designed topological cassette inserted in theE. coli chromosome.[124] There are ~600 BIMEs distributed across the genome, possibly dividing the chromosome into 600 topological domains.[125] (iii) Barriers could also result from the attachment of DNA to the cell membrane through a protein which binds to both DNA and membrane or through nascent transcription and the translation of membrane-anchored proteins. (iv) Transcription activity can generate supercoiling-diffusion barriers. An actively transcribing RNAP has been shown to block dissipation of plectonemic supercoils, thereby forming a supercoiling-diffusion barrier.[126][127][128]
The chromosomal DNA within the nucleoid is segregated into independent supercoiled topological domainsA. An illustration of a single topological domain of a supercoiled DNA. A single double-stranded cut anywhere would be sufficient to relax the supercoiling tension of the entire domain.B. An illustration of multiple topological domains in a supercoiled DNA molecule. A presence of supercoiling-diffusion barriers segregates a supercoiled DNA molecule into multiple topological domains. Hypothetical supercoiling diffusion barriers are represented as green spheres. As a result, a single double-stranded cut will only relax one topological domain and not the others. Plectonemic supercoils of DNA within theE. coli nucleoid are organized into several topological domains, but only four domains with a different number of supercoils are shown for simplicity.
The nucleoid reorganizes in stationary phase cells suggesting that the nucleoid structure is highly dynamic, determined by the physiological state of cells. A comparison of high-resolution contact maps of the nucleoid revealed that the long-range contacts in the Ter macrodomain increased in thestationary phase, compared to thegrowth phase.[129] Furthermore, CID boundaries in the stationary phase were different from those found in the growth phase. Finally, nucleoid morphology undergoes massive transformation during prolonged stationary phase;[130] the nucleoid exhibits ordered, toroidal structures.[131]
Growth-phase specific changes in nucleoid structure could be brought about by a change in levels of nucleoid-associated DNA architectural proteins (the NAPs and the Muk subunits), supercoiling, and transcription activity. The abundance of NAPs and the Muk subunits changes according to the bacterial growth cycle. Fis and the starvation-induced DNA binding protein Dps, another NAP, are almost exclusively present in the growth phase and stationary phase respectively. Fis levels rise upon entry into exponential phase and then rapidly decline while cells are still in the exponential phase, reaching levels that are undetectable in stationary phase.[132] While Fis levels start to decline, levels of Dps start to rise and reach a maximum in the stationary phase.[24] A dramatic transition in the nucleoid structure observed in the prolonged stationary phase has been mainly attributed to Dps. It forms DNA/crystalline assemblies that act to protect the nucleoid from DNA damaging agents present during starvation.[131]
HU, IHF, and H-NS are present in both growth phase and stationary phase.[24] However, their abundance changes significantly such that HU and Fis are the most abundant NAPs in the growth phase, whereas IHF and Dps become the most abundant NAPs in the stationary phase.[24] HUαα is the predominant form in early exponential phase, whereas the heterodimeric form predominates in the stationary phase, with minor amounts of homodimers.[133] This transition has functional consequences regarding nucleoid structure, because the two forms appear to organize and condense DNA differently; both homo- and heterodimers form filaments, but only the homodimer can bring multiple DNA segments together to form a DNA network.[48] The copy number of MukB increases two-fold in stationary phase.[134][135] An increase in the number of MukB molecules could have influence on the processivity of the MukBEF complex as a DNA loop extruding factor resulting in larger or a greater number of the loops.[134][135]
Supercoiling can act in a concerted manner with DNA architectural proteins to reorganize the nucleoid. The overall supercoiling level decreases in the stationary phase, and supercoiling exhibits a different pattern at the regional level.[136] Changes in supercoiling can alter the topological organization of the nucleoid. Furthermore, because a chromosomal region of high transcription activity forms a CID boundary, changes in transcription activity during different growth phases could alter the formation of CID boundaries, and thus the spatial organization of the nucleoid. It is possible that changes in CID boundaries observed in the stationary phase could be due to the high expression of a different set of genes in the stationary phase compared to the growth phase.[129]
TheE. coli chromosome structure and gene expression appear to influence each other reciprocally. On the one hand, a correlation of a CID boundary with high transcription activity indicates that chromosome organization is driven by transcription. On the other hand, the 3D structure of DNA within nucleoid at every scale may be linked to gene expression. First, it has been shown that reorganization of the 3D architecture of the nucleoid inE. coli can dynamically modulate cellular transcription pattern.[137] A mutant of HUa made the nucleoid very much condensed by increased positive superhelicity of the chromosomal DNA. Consequently, many genes were repressed, and many quiescent genes were expressed. Besides, there are many specific cases in which protein-mediated local architectural changes alter gene transcription. For example, the formation of rigid nucleoprotein filaments by H-NS blocks RNAP access to the promoter thus prevent gene transcription.[138] Through gene silencing, H-NS acts as a global repressor preferentially inhibiting transcription of horizontally transferred genes.[53][30] In another example, specific binding of HU at thegal operon facilitates the formation of a DNA loop that keeps thegal operon repressed in the absence of the inducer.[139] The topologically distinct DNA micro-loop created by coherent bending of DNA by Fis at stable RNA promoters activates transcription.[80] DNA bending by IHF differentially controls transcription from the two tandem promoters of theilvGMEDA operon inE. coli.[140][141] Specific topological changes by NAPs not only regulate gene transcription, but are also involved in other processes such as DNA replication initiation, recombination, and transposition.[12][13][14] In contrast to specific gene regulation, how higher-order chromosome structure and its dynamics influences gene expression globally at the molecular level remains to be worked out.[142]
A two-way interconnectedness exists between DNA supercoiling and gene transcription.[142] Negative supercoiling of the promoter region can stimulate transcription by facilitating the promoter melting and by increasing the DNA binding affinity of a protein regulator. Stochastic bursts of transcription appear to be a general characteristic of highly expressed genes, and supercoiling levels of the DNA template contributes to transcriptional bursting.[143] According to the twin supercoiling domain model, transcription of a gene can influence transcription of other nearby genes through a supercoiling relay. One such example is the activation of theleu-500 promoter.[142] Supercoiling not only mediates gene-specific changes, but it also mediates large-scale changes in gene expression. Topological organization of the nucleoid could allow independent expression of supercoiling-sensitive genes in different topological domains. A genome-scale map of unrestrained supercoiling showed that genomic regions have different steady-state supercoiling densities, indicating that the level of supercoiling differs in individual topological domains.[136] As a result, a change in supercoiling can result in domain-specific gene expression, depending on the level of supercoiling in each domain.[136]
The effect of supercoiling on gene expression can be mediated by NAPs that directly or indirectly influence supercoiling. The effect of HU on gene expression appears to involve a change in supercoiling and perhaps a higher-order DNA organization. A positive correlation between DNA gyrase binding and upregulation of the genes caused by the absence of HU suggests that changes in supercoiling are responsible for differential expression. HU was also found to be responsible for a positional effect on gene expression by insulating transcriptional units by constraining transcription-induced supercoiling.[144] Point mutations in HUa dramatically changed the gene expression profile ofE. coli, altering itsmorphology,physiology, andmetabolism. As a result, the mutant strain was more invasive of mammalian cells.[137][145] This dramatic effect was concomitant with nucleoid compaction and increased positive supercoiling.[48][146] The mutant protein was an octamer, in contrast to the wild-type dimer. It wraps DNA on its surface in a right-handed manner, restraining positive supercoils as opposed to wild-type HU.[146] These studies show that amino acid substitutions in HU can have a dramatic effect on nucleoid structure, that in turn results in significant phenotypic changes.[146]
Since MukB and HU have emerged as critical players in long-range DNA interactions, it will be worthwhile to compare the effect of each of these two proteins on global gene expression.[147] Although HU appears to control gene expression by modulating supercoiling density, the exact molecular mechanism remains unknown and the impact of MukB on gene expression is yet to be analyzed.[147][148]
Nucleoid is spatially organized into chromosomal interactions domains (CIDs) and macrodomainsA. Chromosome conformation capture (3C) methods probe 3D genome organization by quantifying physical interactions between genomic loci that are nearby in 3D-space but may be far away in the linear genome. A genome is cross-linked with formaldehyde to preserve physical contacts between genomic loci. Subsequently, the genome is digested with a restriction enzyme. In the next step, a DNA ligation is carried out under diluted DNA concentrations to favor intra-molecular ligation (between cross-linked fragments that are brought into physical proximity by 3D genome organization). A frequency of ligation events between distant DNA sites reflects a physical interaction. In the 3C method, ligation junctions are detected by the semi-quantitative PCR amplification in which amplification efficiency is a rough estimate of pairwise physical contact between genomic regions of interests and its frequency. The 3C method probes a physical interaction between two specific regions identified a priori, whereas its Hi-C version detects physical interactions between all possible pairs of genomic regions simultaneously. In the Hi-C method, digested ends are filled in with a biotinylated adaptor before ligation. Ligated fragments are sheared and then enriched by a biotin-pull down. Ligation junctions are then detected and quantified by the paired-end next-generation sequencing methods.B. Hi-C data are typically represented in the form of a two-dimensional matrix in which the x-axis and y-axis represent the genomic coordinates. The genome is usually divided into bins of a fixed size, e.g., 5-kb. The size of bins essentially defines the contact resolution. Each entry in the matrix, mij, represents the number of chimeric sequencing reads mapped to genomic loci in bins i and j. A quantification of the reads (represented as a heatmap) denotes the relative frequency of contacts between genomic loci of bins i and j. A prominent feature of the heatmap is a diagonal line that appears due to more frequent physical interaction between loci that are very close to each other in the linear genome. The intensity further from the diagonal line represents the relative frequency of physical interaction between loci that are far away from each other in the linear genome. Triangles of high-intensity along the diagonal line represent highly self-interacting chromosomal interaction domains (CIDs) that are separated by a boundary region that consists of a smaller number of interactions.C. In many bacterial species includingE. coli, it appears that supercoiled topological domains organize as CIDs. Plectonemic supercoiling promotes a high level of interaction among genomic loci within a CID, and a plectoneme-free region (PFR), created due to high transcription activity, acts as a CID boundary. Nucleoid-associated proteins, depicted as closed circles, stabilize the supercoiling-mediated interactions. The actively transcribing RNA polymerase (depicted as a green sphere) in the PFR blocks dissipation of supercoiling between the two domains thus acts as a supercoiling diffusion barrier. The size of the CIDs ranges between 30 and 400 kb. Several triangles (CIDs) merge to form a bigger triangle that represents a macrodomain. In other words, CIDs of a macrodomain physically interact with each other more frequently than with CIDs of a neighboring macrodomain or with genomic loci outside of that macrodomain. A macrodomain may comprise several CIDs. For simplicity, a macrodomain comprising only two CIDs is shown.
In recent years, the advent of a molecular method calledchromosome conformation capture (3C) has allowed studying a high-resolution spatial organization of chromosomes in both bacteria and eukaryotes.[149] 3C and its version that is coupled withdeep sequencing (Hi-C)[150] determine physical proximity, if any, between any two genomic loci in 3D space. A high-resolution contact map of bacterial chromosomes including theE. coli chromosome has revealed that a bacterial chromosome is segmented into many highly self-interacting regions called chromosomal interaction domains (CIDs).[129][151][152] CIDs are equivalent totopologically associating domains (TADs) observed in many eukaryotic chromosomes,[153] suggesting that the formation of CIDs is a general phenomenon of genome organization. Two characteristics define CIDs or TADs. First, genomic regions of a CID physically interact with each other more frequently than with the genomic regions outside that CID or with those of a neighboring CID. Second, the presence of a boundary between CIDs that prevents physical interactions between genomic regions of two neighboring CIDs.[129]
TheE. coli chromosome was found to consist of 31 CIDs in the growth phase. The size of the CIDs ranged from 40 to ~300 kb. It appears that a supercoiling-diffusion barrier responsible for segregating plectonemic DNA loops into topological domains functions as a CID boundary inE. coli and many other bacteria. In other words, the presence of a supercoiling-diffusion barrier defines the formation of CIDs. Findings from the Hi-C probing of chromosomes inE. coli,Caulobacter crescentus, andBacillus subtilis converge on a model that CIDs form because plectonemic looping together with DNA organization activities of NAPs promotes physical interactions among genomic loci, and a CID boundary consists of a plectoneme-free region (PFR) that prevents these interactions. A PFR is created due to high transcription activity because the helical unwinding of DNA by actively transcribing RNAP restrains plectonemic supercoils. As a result, dissipation of supercoils is also blocked, creating a supercoiling-diffusion barrier. Indirect evidence for this model comes from an observation that CIDs of bacterial chromosomes including theE. coli chromosome display highly transcribed genes at their boundaries, indicating a role of transcription in the formation of a CID boundary.[129][151] More direct evidence came from a finding that the placement of a highly transcribed gene at a position where no boundary was present created a new CID boundary in theC. crescentus chromosome.[151] However, not all CID boundaries correlated with highly transcribed genes in theE. coli chromosome suggesting that other unknown factors are also responsible for the formation of CID boundaries and supercoiling diffusion barriers.[151]
Plectonemic DNA loops organized as topological domains or CIDs appear to coalesce further to form large spatially distinct domains called macrodomains (MDs). InE. coli, MDs were initially identified as large segments of the genome whose DNA markers localized together (co-localized) influorescence in situ hybridization (FISH) studies.[154][155] A large genomic region (~1-Mb) coveringoriC (origin of chromosome replication) locus co-localized and was called Ori macrodomain. Likewise, a large genomic region (~1-Mb) covering the replication terminus region (ter) co-localized and was called Ter macrodomain. MDs were later identified based on how frequently pairs of lambdaatt sites that were inserted at various distant locations in the chromosome recombined with each other. In this recombination-based method, an MD was defined as a large genomic region whose DNA sites can primarily recombine with each other, but not with those outside of that MD. The recombination-based method confirmed the Ori and Ter MDs that were identified in FISH studies and identified two additional MDs.[15][156]
The two additional MDs were formed by the additional ~1-Mb regions flanking the Ter and were referred to as Left and Right. These four MDs (Ori, Ter, Left, and Right) composed most of the genome, except for two genomic regions flanking the Ori. These two regions (NS-L and NS-R) were more flexible and non-structured compared to an MD as DNA sites in them recombined with DNA sites located in MDs on both sides. The genetic position oforiC appears to dictate the formation of MDs, because repositioning oforiC by genetic manipulation results in the reorganization of MDs. For example, genomic regions closest to theoriC always behave as an NS regardless of DNA sequence and regions further away always behave as MDs.[157]
The Hi-C technique further confirmed a hierarchical spatial organization of CIDs in the form of macrodomains.[129] In other words, CIDs of a macrodomain physically interacted with each other more frequently than with CIDs of a neighboring macrodomain or with genomic loci outside of that macrodomain. The Hi-C data showed that theE. coli chromosome was partitioning into two distinct domains. The region surroundingter formed an insulated domain that overlapped with the previously identified Ter MD. DNA-DNA contacts in this domain occurred only in the range of up to ~280 kb. The rest of the chromosome formed a single domain whose genomic loci exhibited contacts in the range of >280-kb.[129] While most of the contacts in this domain were restricted to a maximum distance of ~500 kb, there were two loose regions whose genomic loci formed contacts at even greater distances (up to ~1 Mb). These loose regions corresponded to the previously identified flexible and less-structured regions (NS). The boundaries of the insulated domain encompassingter and the two loose regions identified by the Hi-C method segmented the entire chromosome into six regions that correspond with the four MDs and two NS regions defined by recombination-based assays.[129]
Genome-wide occupancy of MatP and MukB ofE. coli A circular layout of theE. coli genome depicting genome-wide occupancy of MatP and MukB inE. coli. The innermost circle depicts theE. coli genome. The regions of the genome which organize as spatial domains(macrodomains) in the nucleoid are indicated as colored bands. Histogram plots of genome occupancy for MatP and MukB as determined by chromatin-immunoprecipitation coupled with DNA sequencing (ChIP-seq) are shown in outside circles. The bin size of the histograms is 300 bp. The figure was prepared in circos/0.69-6 using the processed ChIP-Seq data from.[158]
A search for protein(s) responsible for macrodomain formation led to identification of Macrodomain Ter protein (MatP). MatP almost exclusively binds in the Ter MD by recognizing a 13-bp motif called the macrodomainter sequence (matS).[35] There are 23matS sites present in the Ter domain, on average there is one site every 35-kb. Further evidence of MatP binding in the Ter domain comes from fluorescence imaging of MatP. Discrete MatP foci were observed that co-localized with Ter domain DNA markers.[35] A strong enrichment ofChIP-Seq signal in the Ter MD also corroborates the preferential binding of MatP to this domain.[35]
MatP condenses DNA in the Ter domain because the lack of MatP increased the distance between two fluorescent DNA markers located 100-kb apart in the Ter domain. Furthermore, MatP is a critical player in insulating the Ter domain from the rest of the chromosome.[129] It promotes DNA-DNA contacts within the Ter domain but prevents contacts between the DNA loci of Ter domain and those of flanking regions. How does MatP condense DNA and promote DNA-DNA contacts? The experimental results are conflicting. MatP can form a DNA loop between twomatS sitesin vitro and its DNA looping activity depends on MatP tetramerization. Tetramerization occurs via coiled-coil interactions between two MatP molecules bound to DNA.[159] One obvious model based onin vitro results is that MatP promotes DNA-DNA contactsin vivo by bridgingmatS sites. However, although MatP connected distant sites in Hi-C studies, it did not specifically connect thematS sites. Furthermore, a MatP mutant that was unable to form tetramers behaved like wild-type. These results argue against thematS bridging model for Ter organization, leaving the mechanism of MatP action elusive. One possibility is that MatP spreads to nearby DNA segments from its primarymatS binding site and bridge distant sites via a mechanism that does not depend on the tetramerization.[159]
Models for DNA organization by MatP and MukBEFA. AmatS-bridging model for DNA organization in the Ter macrodomain by MatP. MatP recognizes a 13-bp signature DNA sequence calledmatS that is present exclusively in the Ter macrodomain. There are 23matS sites separated by one another by an average of 35-kb. MatP binds to amatS site as a dimer, and the tetramerization of the DNA-bound dimers bridgesmatS sites forming large DNA loops.B. The architecture of theE. coli MukBEF complex. The complex is formed by protein-protein interactions between MukB (blue), MukF (dark orange) and MukE (light orange). MukB, which belongs to the family of structural maintenance of chromosomes (SMCs) proteins, forms a dimer (monomers are shown by dark and light blue colors) consisting of an ATPase head domain and a 100 nm long intramolecular coiled-coil with a hinge region in the middle. Because of the flexibility of the hinge region, MukB adopts a characteristic V-shape of the SMC family. MukF also tends to exist as a dimer because of the strong dimerization affinity between monomers.[160][161] The C-terminal domain of MukF can interact with the head domain of MukB while its central domain can interact with MukE. Two molecules of MukE and one molecule of MukF associate with each other independent of MukB to form a trimeric complex (MukE2F). Since MukF tends to exist in a dimeric form, the dimerization of MukF results in an elongated hexameric complex (MukE2F)2.[162] In the absence of ATP, the (MukE2F)2 complex binds to the MukB head domains through the C-terminal domain of MukF to form a symmetric MukBEF complex (shown on the left). The stoichiometry of the symmetric complex is B2(E2F)2. The ATP binding between the MukB head domains forces the detachment of one MukF molecule and two MukE molecules.[135][162] As a result, an asymmetric MukBEF complex of the stoichiometry B2(E2F)1 is formed. Since MukF readily dimerizes, the MukF dimerization can potentially join two ATP-bound asymmetric molecules resulting in the formation of a dimer of dimers with the stoichiometry of B4(E2F)2 (shown on the right). The stoichiometry of the MukBEF complexin vivo is estimated to be B4(E2F)2 suggesting that a dimer of dimers is the functional unitin vivo.[163]C. A model for loop extrusion by a MukBEF dimer of dimers. A dimer of dimer loads onto DNA (depicted as a grey line) through DNA binding domains of MukB. MukB has been shown to bind DNA via its hinge region and the top region of its head domain.[51][164] The translocation of the complex away from its loading site then extrudes DNA loops. The loops are extruded in a rock-climbing manner by the coordinated opening and closing of the MukBEF ring through the MukB head disengagement that occurs due to coordinated ATP hydrolysis in the two dimers.[163] Dark and light blue circles represent ATP binding and hydrolysis events respectively. MukE is not shown in the complex for simplicity.
MukB belongs to a family of ATPases calledstructural maintenance of chromosome proteins (SMCs), which participate in higher-order chromosome organization in eukaryotes.[148] Two MukB monomers associate via continuous antiparallel coiled-coil interaction forming a 100-nm long rigid rod. A flexible hinge region occurs in the middle of the rod.[165][166] Due to the flexibility of the hinge region, MukB adopts a characteristic V-shape of the SMC family. The non-SMC subunits associating with MukB are MukE and MukF. The association closes the V formation, resulting in large ring-like structures. MukE and MukF are encoded together with MukB in the same operon inE. coli.[167] Deletion of either subunit results in the same phenotype suggesting that the MukBEF complex is the functional unitin vivo.[163] DNA binding activities of the complex reside in the MukB subunit, whereas MukE and MukF modulate MukB activity.[167]
MukBEF complex, together with Topo IV, is required for decatenation and repositioning of newly replicatedoriCs.[168][169][170][171][158] The role of MukBEF is not restricted during DNA replication. It organizes and condenses DNA even in non-replicating cells.[134] The recent high-resolution chromosome conformation map of the MukB-depletedE. coli strain reveals that MukB participates in the formation of DNA-DNA interactions on the entire chromosome, except in the Ter domain.[129] How is MukB prevented from acting in the Ter domain? MatP physically interacts with MukB, thus preventing MukB from localizing to the Ter domain.[158] This is evident in the DNA binding of MatP and MukB in the Ter domain. DNA binding of MatP is enriched in the Ter domain, whereas DNA binding of MukB is reduced compared to the rest of the genome. Furthermore, in a strain already lacking MatP, the absence of MukB causes a reduction in DNA contacts throughout the chromosome, including the Ter domain.[129] This result agrees with the view that MatP displaces MukB from the Ter domain.[129]
How does the MukBEF complex function to organize theE. coli chromosome? According to the current view, SMC complexes organize chromosomes by extruding DNA loops.[172] SMC complexes translocate along DNA to extrude loops in a cis-manner (on the same DNA molecule), wherein the size of loops depends on processivity of the complex. SMC complexes from different organisms differ in the mechanism of loop extrusion.[172] Single molecule fluorescence microscopy of MukBEF inE. coli suggests that the minimum functional unitin vivo is a dimer of dimers.[163] This unit is formed by joining of two ATP-bound MukBEF complexes through MukF-mediated dimerization. MukBEF localizes in the cell as 1-3 clusters that are elongated parallel to the long axis of the cell. Each cluster contains an average ~ 8-10 dimers of dimers. According to the current model, the MukBEF extrudes DNA loops in a “rock-climbing” manner.[163][173] A dimer of the dimers releases one segment of DNA and capture a new DNA segment without dissociating from the chromosome. Besides DNA looping, a link between negative supercoiling andin vivo MukBEF function together with the ability of the MukB subunit to constrain negative supercoilsin vitro suggests that MukBEF organizes DNA by generating supercoils.[174][175][176]
In addition to contributing to the chromosome compaction by bending, bridging, and looping DNA at a smaller scale (~1-kb), NAPs participate in DNA condensation and organization by promoting long-rang DNA-DNA contacts. Two NAPs, Fis and HU, emerged as the key players in promoting long-range DNA-DNA contacts that occur throughout the chromosome.[129] It remains to be studied how DNA organization activities of Fis and HU that are well understood at a smaller scale (~1-kb) results in the formation of long-range DNA-DNA interactions. Nonetheless, some of the HU-mediated DNA interactions require the presence of naRNA4.[89] naRNA4 also participates in making long-range DNA contacts. HU catalyzes some of the contacts, not all, suggesting that RNA participates with other NAPs in forming DNA contacts. HU also appears to act together with MukB to promote long-range DNA-DNA interactions. This view is based on observations that the absence of either HU or MukB caused a reduction in the same DNA-DNA contacts. It is unclear how MukB and HU potentially act together in promoting DNA-DNA interactions. It is possible that the two proteins interact physically. Alternatively, while MukBEF extrudes large DNA loops, HU condenses and organizes those loops.[172][51]
There are reports that functionally-related genes ofE. coli are physically together in 3-D space within the chromosome even though they are far apart by genetic distance. Spatial proximity of functionally-related genes not only make the biological functions more compartmentalized and efficient but would also contribute to the folding and spatial organization of the nucleoid. A recent study using fluorescent markers for detection of specific DNA loci examined pairwise physical distances between the seven rRNA operons that are genetically separated from each other (by as much as two million bp). It reported that all of the operons, exceptrrnC, were in physical proximity.[177][178] Surprisingly, 3C-seq studies did not reveal the physical clustering ofrrn operons, contradicting the results of the fluorescence-based study.[129] Therefore, further investigation is required to resolve these contradicting observations. In another example, GalR, forms an interaction network of GalR binding sites that are scattered across the chromosome.[179] GalR is a transcriptional regulator of the galactose regulon composed of genes encoding enzymes for transport and metabolism of the sugar D-galactose.[180] GalR exists in only one to two foci in cells[179] and can self-assemble into large ordered structures.[181] Therefore, it appears that DNA-bound GalR multimerizes to form long-distance interactions.[179][181]
Conventionaltransmission electron microscopy (TEM) of chemically fixedE. coli cells portrayed the nucleoid as an irregularly shapedorganelle. However, wide-fieldfluorescence imaging of live nucleoids in 3D revealed a discrete, ellipsoid shape.[3][17][18] The overlay of a phase-contrast image of the cell and the fluorescent image of the nucleoid showed a close juxtaposition only in the radial dimension along its entire length of the nucleoid to the cell periphery. This finding indicates radial confinement of the nucleoid.[16] A detailed examination of the 3D fluorescence image after cross-sectioning perpendicular to its long axis further revealed two global features of the nucleoid:curvature and longitudinal, high-density regions. Examining thechirality of the centerline of the nucleoid by connecting the center of intensity of each cross-section showed that the overall nucleoid shape is curved.[18] The fluorescence intensity distribution in the cross-sections revealed a density substructure, consisting of curved, high-density regions or bundles at the central core, and low-density regions at the periphery.[16][17] One implication of the radial confinement is that it determines the curved shape of the nucleoid. According to one model, the nucleoid is forced to bend because it is confined into a cylindricalE. coli cell whose radius is smaller than its bendable length (persistence length).[16] This model was supported by observations that removal of the cell wall or inhibition of cell wall synthesis increased the radius of the cell and resulted in a concomitant increase in the helical radius and a decrease in the helical pitch in the nucleoid.[16]
Nucleoid as a helical ellipsoid with longitudinal high-density DNA regionsA. A cartoon ofE. coli cell with a curved nucleoid (dark grey). A curved centroids path, denoted by red and green dots, emphasizes the curved shape of the nucleoid[16]B. Cross-sectioning of theE. coli nucleoid visualized by HU-mCherry. Fluorescence intensity is taken as a proxy for DNA density and is represented by blue to red in increasing order.[17]
An expansion force due to DNA-membrane connections appears to function in opposition to condensation forces to maintain an optimal condensation level of the nucleoid.Cell-fractionation and electron microscopy studies first indicated the possibility of DNA-membrane connections.[182][183] There are now several known examples of DNA-membrane connections. Transertion is a mechanism of concurrent transcription, translation, and insertion of nascent membrane proteins that forms transient DNA-membrane contacts.[184] Transertion of two membrane proteins LacY and TetA has been demonstrated to cause the repositioning of chromosomal loci toward the membrane.[185] Another mechanism of nucleoid-membrane connections is through a direct contact between membrane-anchored transcription regulators and their target sites in the chromosome. One example of such as transcription regulator inE. coli is CadC. CadC contains a periplasmic sensory domain and a cytoplasmic DNA binding domain. Sensing of an acidic environment by its periplasmic sensory domain stimulates DNA binding activity of CadC, which then activates transcription of its target genes.[186] The membrane-localization of genes regulated by a membrane-anchored transcription regulator is yet to be demonstrated. Nonetheless, activation of target genes in the chromosome by these regulators is expected to result in a nucleoid-membrane contact albeit it would be a dynamic contact. Besides these examples, the chromosome is also specifically anchored to the cell membrane through protein-protein interaction between DNA-bound proteins, e.g., SlmA and MatP, and thedivisome.[187][188] Since membrane-protein encoding genes are distributed throughout the genome, dynamic DNA-membrane contacts through transertion can act as a nucleoid expansion force. This expansion force would function in opposition to condensation forces to maintain an optimal condensation level. The formation of highly condensed nucleoids upon the exposure ofE. coli cells to chloramphenicol, which blocks translation, provides support for the expansion force of transient DNA-membrane contacts formed through transertion.[189][190] The round shape of overly-condensed nucleoids after chloramphenicol treatment also suggests a role for transertion-mediated DNA-membrane contacts in defining the ellipsoid shape of the nucleoid.[190]
Changes in the structure of the nucleoid of bacteria and archaea are observed after exposure to DNA damaging conditions. The nucleoids of the bacteriaBacillus subtilis andEscherichia coli both become significantly more compact after UV irradiation.[193][194] Formation of the compact structure inE. coli requiresRecA activation through specific RecA-DNA interactions.[195] The RecA protein plays a key role in homologous recombinational repair of DNA damage.
Similar toB. subtilis andE. coli above, exposures of the archaeanHaloferax volcanii to stresses that damage DNA cause compaction and reorganization of the nucleoid.[196] Compaction depends on the Mre11-Rad50 protein complex that catalyzes an early step in homologous recombinational repair of double-strand breaks in DNA. It has been proposed that nucleoid compaction is part of a DNA damage response that accelerates cell recovery by helping DNA repair proteins to locate targets, and by facilitating the search for intact DNA sequences during homologous recombination.[196]
^abcDame RT, Tark-Dame M (June 2016). "Bacterial chromatin: converging views at different scales".Current Opinion in Cell Biology.40:60–65.doi:10.1016/j.ceb.2016.02.015.PMID26942688.
^Worcel A, Burgi E (November 1972). "On the structure of the folded chromosome of Escherichia coli".Journal of Molecular Biology.71 (2):127–147.doi:10.1016/0022-2836(72)90342-7.PMID4564477.
^abcdeKano Y, Goshima N, Wada M, Imamoto F (1989). "Participation of hup gene product in replicative transposition of Mu phage in Escherichia coli".Gene.76 (2):353–8.doi:10.1016/0378-1119(89)90175-3.PMID2666261.
^abcdeOgura T, Niki H, Kano Y, Imamoto F, Hiraga S (January 1990). "Maintenance of plasmids in HU and IHF mutants of Escherichia coli".Molecular & General Genetics.220 (2):197–203.doi:10.1007/bf00260482.PMID2183003.S2CID10701528.
^abcPinson V, Takahashi M, Rouviere-Yaniv J (April 1999). "Differential binding of the Escherichia coli HU, homodimeric forms and heterodimeric form to linear, gapped and cruciform DNA".Journal of Molecular Biology.287 (3):485–97.doi:10.1006/jmbi.1999.2631.PMID10092454.
^Suryanarayana T, Subramanian AR (September 1978). "Specific association of two homologous DNA-binding proteins to the native 30-S ribosomal subunits of Escherichia coli".Biochimica et Biophysica Acta (BBA) - Nucleic Acids and Protein Synthesis.520 (2):342–57.doi:10.1016/0005-2787(78)90232-0.PMID213117.
^Bonnefoy E, Takahashi M, Yaniv JR (September 1994). "DNA-binding parameters of the HU protein of Escherichia coli to cruciform DNA".Journal of Molecular Biology.242 (2):116–29.doi:10.1006/jmbi.1994.1563.PMID8089835.
^Murtin C, Engelhorn M, Geiselmann J, Boccard F (December 1998). "A quantitative UV laser footprinting analysis of the interaction of IHF with specific binding sites: re-evaluation of the effective concentration of IHF in the cell".Journal of Molecular Biology.284 (4):949–61.doi:10.1006/jmbi.1998.2256.PMID9837718.
^Jacquet M, Cukier-Kahn R, Pla J, Gros F (December 1971). "A thermostable protein factor acting on in vitro DNA transcription".Biochemical and Biophysical Research Communications.45 (6):1597–607.doi:10.1016/0006-291x(71)90204-x.PMID4942735.
^Falconi M, Gualtieri MT, La Teana A, Losso MA, Pon CL (May 1988). "Proteins from the prokaryotic nucleoid: primary and quaternary structure of the 15-kD Escherichia coli DNA binding protein H-NS".Molecular Microbiology.2 (3):323–9.doi:10.1111/j.1365-2958.1988.tb00035.x.PMID3135462.S2CID36215353.
^Ueguchi C, Suzuki T, Yoshida T, Tanaka K, Mizuno T (October 1996). "Systematic mutational analysis revealing the functional domain organization of Escherichia coli nucleoid protein H-NS".Journal of Molecular Biology.263 (2):149–62.doi:10.1006/jmbi.1996.0566.PMID8913298.
^Bouffartigues E, Buckle M, Badaut C, Travers A, Rimsky S (May 2007). "H-NS cooperative binding to high-affinity sites in a regulatory element results in transcriptional silencing".Nature Structural & Molecular Biology.14 (5):441–8.doi:10.1038/nsmb1233.PMID17435766.S2CID43768346.
^Yamada H, Muramatsu S, Mizuno T (September 1990). "An Escherichia coli protein that preferentially binds to sharply curved DNA".Journal of Biochemistry.108 (3):420–5.doi:10.1093/oxfordjournals.jbchem.a123216.PMID2126011.
^Kostrewa D, Granzin J, Stock D, Choe HW, Labahn J, Saenger W (July 1992). "Crystal structure of the factor for inversion stimulation FIS at 2.0 A resolution".Journal of Molecular Biology.226 (1):209–26.doi:10.1016/0022-2836(92)90134-6.PMID1619650.
^abTravers A, Muskhelishvili G (June 1998). "DNA microloops and microdomains: a general mechanism for transcription activation by torsional transmission".Journal of Molecular Biology.279 (5):1027–43.doi:10.1006/jmbi.1998.1834.PMID9642081.
^Johnson RC, Simon MI (July 1985). "Hin-mediated site-specific recombination requires two 26 bp recombination sites and a 60 bp recombinational enhancer".Cell.41 (3):781–91.doi:10.1016/s0092-8674(85)80059-3.PMID2988787.S2CID34572809.
^Pettijohn DE, Hecht R (1974). "RNA molecules bound to the folded bacterial genome stabilize DNA folds and segregate domains of supercoiling".Cold Spring Harbor Symposia on Quantitative Biology.38:31–41.doi:10.1101/sqb.1974.038.01.006.PMID4598638.
^abSinden RR, Carlson JO, Pettijohn DE (October 1980). "Torsional tension in the DNA double helix measured with trimethylpsoralen in living E. coli cells: analogous measurements in insect and human cells".Cell.21 (3):773–83.doi:10.1016/0092-8674(80)90440-7.PMID6254668.S2CID2503376.
^abBliska JB, Cozzarelli NR (March 1987). "Use of site-specific recombination as a probe of DNA structure and metabolism in vivo".Journal of Molecular Biology.194 (2):205–18.doi:10.1016/0022-2836(87)90369-x.PMID3039150.
^Dean F, Krasnow MA, Otter R, Matzuk MM, Spengler SJ, Cozzarelli NR (1983). "Escherichia coli type-1 topoisomerases: identification, mechanism, and role in recombination".Cold Spring Harbor Symposia on Quantitative Biology.47 (2):769–77.doi:10.1101/sqb.1983.047.01.088.PMID6305585.
^Rouvière-Yaniv J, Yaniv M, Germond JE (June 1979). "E. coli DNA binding protein HU forms nucleosomelike structure with circular double-stranded DNA".Cell.17 (2):265–74.doi:10.1016/0092-8674(79)90152-1.PMID222478.S2CID28092421.
^Broyles SS, Pettijohn DE (January 1986). "Interaction of the Escherichia coli HU protein with DNA. Evidence for formation of nucleosome-like structures with altered DNA helical pitch".Journal of Molecular Biology.187 (1):47–60.doi:10.1016/0022-2836(86)90405-5.PMID3514923.
^abBensaid A, Almeida A, Drlica K, Rouviere-Yaniv J (February 1996). "Cross-talk between topoisomerase I and HU in Escherichia coli".Journal of Molecular Biology.256 (2):292–300.doi:10.1006/jmbi.1996.0086.PMID8594197.
^Malik M, Bensaid A, Rouviere-Yaniv J, Drlica K (February 1996). "Histone-like protein HU and bacterial DNA topology: suppression of an HU deficiency by gyrase mutations".Journal of Molecular Biology.256 (1):66–76.doi:10.1006/jmbi.1996.0068.PMID8609614.
^Claret L, Rouviere-Yaniv J (October 1997). "Variation in HU composition during growth of Escherichia coli: the heterodimer is required for long term survival".Journal of Molecular Biology.273 (1):93–104.doi:10.1006/jmbi.1997.1310.PMID9367749.
^Pagel JM, Winkelman JW, Adams CW, Hatfield GW (April 1992). "DNA topology-mediated regulation of transcription initiation from the tandem promoters of the ilvGMEDA operon of Escherichia coli".Journal of Molecular Biology.224 (4):919–35.doi:10.1016/0022-2836(92)90460-2.PMID1569580.
^Wang S, Cosstick R, Gardner JF, Gumport RI (October 1995). "The specific binding of Escherichia coli integration host factor involves both major and minor grooves of DNA".Biochemistry.34 (40):13082–90.doi:10.1021/bi00040a020.PMID7548068.
^Jin DJ, Cabrera JE (November 2006). "Coupling the distribution of RNA polymerase to global gene regulation and the dynamic structure of the bacterial nucleoid in Escherichia coli".Journal of Structural Biology.156 (2):284–91.doi:10.1016/j.jsb.2006.07.005.PMID16934488.
^Kavenoff R, Ryder OA (March 1976). "Electron microscopy of membrane-associated folded chromosomes of Escherichia coli".Chromosoma.55 (1):13–25.doi:10.1007/bf00288323.PMID767075.S2CID31879250.
^Worcel A, Burgi E (January 1974). "Properties of a membrane-attached form of the folded chromosome of Escherichia coli".Journal of Molecular Biology.82 (1):91–105.doi:10.1016/0022-2836(74)90576-2.PMID4594427.
^Eltsov M, Zuber B (November 2006). "Transmission electron microscopy of the bacterial nucleoid".Journal of Structural Biology.156 (2):246–254.doi:10.1016/j.jsb.2006.07.007.PMID16978880.