Hubbry Logo
search
logo

Sigma factor

logo
Community Hub0 Subscribers
Read side by side
from Wikipedia

A sigma factor (σ factor or specificity factor) is a protein needed for initiation of transcription in bacteria.[1][2] It is a bacterial transcription initiation factor that enables specific binding of RNA polymerase (RNAP) to gene promoters. It is homologous to archaeal transcription factor B and to eukaryotic factor TFIIB.[3] The specific sigma factor used to initiate transcription of a given gene will vary, depending on the gene and on the environmental signals needed to initiate transcription of that gene. Selection of promoters by RNA polymerase is dependent on the sigma factor that associates with it.[4] They are also found in plant chloroplasts as a part of the bacteria-like plastid-encoded polymerase (PEP).[5]

The sigma factor, together with RNA polymerase, is known as the RNA polymerase holoenzyme. Every molecule of RNA polymerase holoenzyme contains exactly one sigma factor subunit, which in the model bacterium Escherichia coli is one of those listed below. The number of sigma factors varies between bacterial species.[1][6] E. coli has seven sigma factors. Sigma factors are distinguished by their characteristic molecular weights. For example, σ70 is the sigma factor with a molecular weight of 70 kDa.

The sigma factor in the RNA polymerase holoenzyme complex is required for the initiation of transcription, although once that stage is finished, it is dissociated from the complex and the RNAP continues elongation on its own.

Specialized sigma factors

[edit]

Different sigma factors are utilized under different environmental conditions. These specialized sigma factors bind the promoters of genes appropriate to the environmental conditions, increasing the transcription of those genes.

Sigma factors in E. coli:

  • σ70(RpoD) – σA – the "housekeeping" sigma factor or also called as primary sigma factor (Group 1), transcribes most genes in growing cells. Every cell has a "housekeeping" sigma factor that keeps essential genes and pathways operating.[1] In the case of E. coli and other gram-negative rod-shaped bacteria, the "housekeeping" sigma factor is σ70.[1] Genes recognized by σ70 all contain similar promoter consensus sequences consisting of two parts.[1] Relative to the DNA base corresponding to the start of the RNA transcript, the consensus promoter sequences are characteristically centered at 10 and 35 nucleotides before the start of transcription (−10 and −35).
  • σ19 (FecI) – the ferric citrate sigma factor, regulates the fec gene for iron transport and metabolism
  • σ24 (RpoE) – extreme heat stress response and the extracellular proteins sigma factor
  • σ28 (RpoF/FliA) – the flagellar synthesis and chemotaxis sigma factor
  • σ32 (RpoH) – the heat shock sigma factor, it is turned on when the bacteria are exposed to heat. Due to the higher expression, the factor will bind with a high probability to the polymerase-core-enzyme. Doing so, other heatshock proteins are expressed, which enable the cell to survive higher temperatures. Some of the enzymes that are expressed upon activation of σ32 are chaperones, proteases and DNA-repair enzymes.
  • σ38 (RpoS) – the starvation/stationary phase sigma factor
  • σ54 (RpoN) – the nitrogen-limitation sigma factor

There are also anti-sigma factors that inhibit the function of sigma factors and anti-anti-sigma factors that restore sigma factor function.

Structure

[edit]
Domain organization, promoter recognition and structural organization of the σ70 family. (a) The domain organization of σ factors from Groups 1, 3 and 4 are illustrated above σ70 consensus E. coli promoter DNA. (b) Organization of E. coli σ70 in an RNA polymerase transcription initiation complex. (PDB 4YLN).

By sequence similarity, most sigma factors are σ70-like (InterProIPR000943). They have four main regions (domains) that are generally conserved:

N-terminus --------------------- C-terminus
             1.1    2    3    4

The regions are further subdivided. For example, region 2 includes 1.2 and 2.1 through 2.4.

Domain 1.1 is found only in "primary sigma factors" (RpoD, RpoS in E.coli; "Group 1"). It is involved in ensuring the sigma factor will only bind the promoter when it is complexed with the RNA polymerase.[7] Domains 2-4 each interact with specific promoter elements and with RNAP. Region 2.4 recognizes and binds to the promoter −10 element (called the "Pribnow box"). Region 4.2 recognizes and binds to the promoter −35 element.[7]

Not every sigma factor of the σ70 family contains all the domains. Group 2, which includes RpoS, is very similar to Group 1 but lacks domain 1. Group 3 also lacks domain 1, and includes σ28. Group 4, also known as the Extracytoplasmic Function (ECF) group, lack both σ1.1 and σ3. RpoE is a member.[7]

Other known sigma factors are of the σ54/RpoN (InterProIPR000394) type. They are functional sigma factors, but they have significantly different primary amino acid sequences.[8]

Protein domain infoboxes
Sigma70 region 1.1
Identifiers
SymbolSigma70_r1_1
PfamPF03979
InterProIPR007127
Available protein structures:
Pfam  structures / ECOD  
PDBRCSB PDB; PDBe; PDBj
PDBsumstructure summary
Sigma70 region 1.2
Crystal structure of Thermus aquaticus RNA polymerase sigma subunit fragment containing regions 1.2 to 3.1
Identifiers
SymbolSigma70_r1_2
PfamPF00140
InterProIPR009042
PROSITEPDOC00592
SCOP21sig / SCOPe / SUPFAM
Available protein structures:
Pfam  structures / ECOD  
PDBRCSB PDB; PDBe; PDBj
PDBsumstructure summary
Sigma70 region 2
Crystal structure of a sigma70 subunit fragment from Escherichia coli RNA polymerase
Identifiers
SymbolSigma70_r2
PfamPF04542
Pfam clanCL0123
InterProIPR007627
PROSITEPDOC00592
SCOP21sig / SCOPe / SUPFAM
Available protein structures:
Pfam  structures / ECOD  
PDBRCSB PDB; PDBe; PDBj
PDBsumstructure summary
Sigma70 region 3
Solution structure of sigma70 region 3 from Thermotoga maritima
Identifiers
SymbolSigma70_r3
PfamPF04539
Pfam clanCL0123
InterProIPR007624
SCOP21ku2 / SCOPe / SUPFAM
Available protein structures:
Pfam  structures / ECOD  
PDBRCSB PDB; PDBe; PDBj
PDBsumstructure summary
Sigma70 region 4
Solution structure of sigma70 region 4 from Thermotoga maritima
Identifiers
SymbolSigma70_r4
PfamPF04545
Pfam clanCL0123
InterProIPR007630
SCOP21or7 / SCOPe / SUPFAM
Available protein structures:
Pfam  structures / ECOD  
PDBRCSB PDB; PDBe; PDBj
PDBsumstructure summary
Sigma70 region 4.2
Crystal structure of Escherichia coli sigma70 region 4 bound to its -35 element DNA
Identifiers
SymbolSigma70_r4_2
PfamPF08281
Pfam clanCL0123
InterProIPR013249
SCOP21or7 / SCOPe / SUPFAM
Available protein structures:
Pfam  structures / ECOD  
PDBRCSB PDB; PDBe; PDBj
PDBsumstructure summary

Retention during transcription elongation

[edit]

The core RNA polymerase (consisting of 2 alpha (α), 1 beta (β), 1 beta-prime (β'), and 1 omega (ω) subunits) binds a sigma factor to form a complex called the RNA polymerase holoenzyme. It was previously believed that the RNA polymerase holoenzyme initiates transcription, while the core RNA polymerase alone synthesizes RNA. Thus, the accepted view was that sigma factor must dissociate upon transition from transcription initiation to transcription elongation (this transition is called "promoter escape"). This view was based on analysis of purified complexes of RNA polymerase stalled at initiation and at elongation. Finally, structural models of RNA polymerase complexes predicted that, as the growing RNA product becomes longer than ~15 nucleotides, sigma must be "pushed out" of the holoenzyme, since there is a steric clash between RNA and a sigma domain. However, σ70 can remain attached in complex with the core RNA polymerase in early elongation[9] and sometimes throughout elongation.[10] Indeed, the phenomenon of promoter-proximal pausing indicates that sigma plays roles during early elongation. All studies are consistent with the assumption that promoter escape reduces the lifetime of the sigma-core interaction from very long at initiation (too long to be measured in a typical biochemical experiment) to a shorter, measurable lifetime upon transition to elongation.

Sigma cycle

[edit]

It had long been thought that the sigma factor obligatorily leaves the core enzyme once it has initiated transcription, allowing it to link to another core enzyme and initiate transcription at another site. Thus, the sigma factor would cycle from one core to another. However, fluorescence resonance energy transfer was used to show that the sigma factor does not obligatorily leave the core.[9] Instead, it changes its binding with the core during initiation and elongation. Therefore, the sigma factor cycles between a strongly bound state during initiation and a weakly bound state during elongation.

Sigma factor competition

[edit]

The number of RNAPs in bacterial cells (e.g., E. coli) have been shown to be smaller than the number of sigma factors. Consequently, if a certain sigma factor is overexpressed, not only will it increase the expression levels of genes whose promoters have preference for that sigma factor, but it will also reduce the probability that genes with promoters with preference for other sigma factors will be expressed.[11][12][13][14]

Meanwhile, transcription initiation has two major rate limiting steps: the closed and the open complex formation. However, only the dynamics of the first step depends on the concentration of sigma factors. Interestingly, the faster the closed complex formation relative to the open complex formation, the less responsive is a promoter to changes in sigma factors' concentration (see [14] for a model and empirical data of this phenomenon).

Genes with dual sigma factor preference

[edit]

While most genes of E. coli can be recognized by an RNAP with one and only one type of sigma factor (e.g. sigma 70), a few genes (~ 5%) have what is called a "dual sigma factor preference",[15] that is, they can respond to two different sigma factors, as reported in RegulonDB.[16] The most common ones are those promoters that can respond to both sigma 70 and to sigma 38 (illustrated in the figure). Studies of the dynamics of these genes showed that when the cells enter stationary growth they are almost as induced as those genes that have preference for σ38 alone. This induction level was shown to be predictable from their promoter sequence.[15] A model of their dynamics is shown in the figure. In the future, these promoters may become useful tools in synthetic genetic constructs in E. coli.

Left: illustration of genes whose promoters can be recognized by both sigma 70 (green) and sigma 38 (blue). Shown are the RNA polymerases, carrying the two different sigma factors, and either of them can bind to the promoter region (grey rectangle). Right: Model proposed in [15] of these genes. The model consists of a two-step process of gene expression (transcription followed by translation). The rate constant from transcription (kt) accounts for the possibility of binding by either RNAP (those carrying sigma 70, and those carrying sigma 38). The model also includes translation (rate constant kt), and RNA and protein degradation to "nothing" represented by the "slashed zero glyph".

See also

[edit]

References

[edit]
[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
Sigma factors are multi-domain proteins that function as dissociable subunits of bacterial RNA polymerase (RNAP), conferring specificity for promoter recognition and enabling the initiation of transcription at appropriate DNA sites.[1] By binding to the RNAP core enzyme—composed of subunits ββ'α₂ω—they form the holoenzyme, which interacts with conserved promoter elements such as the -10 and -35 boxes to unwind DNA and start RNA synthesis.[1] Once transcription begins, the sigma factor typically dissociates from the elongating complex, allowing the core enzyme to proceed.[1] This mechanism is fundamental to prokaryotic gene expression, with sigma factors absent in eukaryotic RNAP II but present in bacterial-like systems such as chloroplasts.[2] Structurally, sigma factors in the predominant σ70 family feature up to four conserved regions (σ1, σ2, σ3, and σ4) connected by flexible linkers, which span the surface of the RNAP core to contact promoter DNA.[1] Region 2.4 recognizes the -10 promoter element (extended -10 motif), while region 4.2 binds the -35 element, stabilizing the complex and promoting open complex formation.[1] A distinct σ54 family exists, characterized by three domains and requiring ATP hydrolysis by enhancer-binding proteins for activation, rather than direct promoter melting.[2] These structural features allow sigma factors to respond dynamically to cellular needs, with their activity often modulated by anti-sigma factors that sequester them until specific signals release them.[1] Bacterial genomes encode multiple sigma factors, classified primarily within the σ70 family into four groups based on sequence conservation and function.[2] Group 1 comprises essential housekeeping sigma factors, such as σ70 (RpoD) in Escherichia coli, which directs the transcription of most constitutive genes during exponential growth.[2] Group 2 includes primary-like factors similar to Group 1, such as σS (RpoS) for stationary phase and general stress responses.[2] Group 3 encompasses specialized factors like σ28 (FliA) for flagellar synthesis and σB for multiple stresses in Gram-positive bacteria.[2] The largest and most diverse is Group 4, the extracytoplasmic function (ECF) sigma factors, with over 40 subgroups that regulate responses to envelope stress, iron acquisition, and virulence; for instance, Pseudomonas aeruginosa encodes up to 20 ECF sigmas.[1] Bacteria like Streptomyces coelicolor possess over 60 sigma factors, highlighting extensive diversification.[3] The significance of sigma factors lies in their role as global regulators of bacterial adaptation, enabling rapid shifts in gene expression to cope with environmental challenges such as heat shock, oxidative stress, nutrient limitation, and host interactions.[4] Alternative sigma factors, in particular, control virulence determinants in pathogens; for example, σS in Salmonella enterica activates genes for intracellular survival, while σB in Listeria monocytogenes promotes invasion of host cells.[4] Competition among sigma factors for RNAP binding creates a hierarchical network that prioritizes essential responses, ensuring survival and pathogenesis.[5] Dysregulation of sigma factor activity has implications for antibiotic resistance and biofilm formation, making them potential targets for therapeutic intervention.[6]

Overview and distribution

Definition and primary function

Sigma factors are dissociable subunits of the bacterial RNA polymerase (RNAP) holoenzyme, which consists of the core enzyme (comprising β, β', α₂, and ω subunits) bound to a sigma factor, enabling the specific recognition of promoter DNA sequences to initiate transcription.[1] Without a sigma factor, the core RNAP lacks promoter specificity and cannot efficiently initiate transcription at precise genomic locations.[7] The primary function of sigma factors is to confer promoter specificity to the core RNAP, directing it to bind cognate promoter sites and facilitating the formation of the open promoter complex, which unwinds DNA to allow RNA synthesis from specific gene sets in response to cellular conditions.[1] This selectivity enables bacteria to regulate global gene expression, such as activating stress response genes or maintaining housekeeping functions.[8] In Escherichia coli, the housekeeping sigma factor σ⁷⁰ (encoded by rpoD) exemplifies this role, directing the initiation of transcription for most genes during exponential growth under normal conditions.[9] Sigma factors were first identified in the late 1960s through purification studies that separated the dissociable specificity factor from the core RNAP, revealing its essential role in promoter-directed transcription. Further characterization in the 1970s involved analyses of rifampicin-resistant mutants, which highlighted differences in initiation mechanisms, and promoter sequencing efforts that defined consensus elements recognized by sigma factors.[7] Bacterial sigma factors exhibit structural and functional homology to the archaeal transcription factor B (TFB) and the eukaryotic general transcription factor TFIIB, sharing conserved helix-turn-helix motifs in cyclin-like repeat domains that position them similarly in the transcription initiation complex.[10] This evolutionary relationship underscores a common ancestral mechanism for promoter recognition across domains of life.[11]

Occurrence across organisms

Sigma factors are essential components of bacterial transcription machinery and are present across all bacterial phyla, enabling promoter-specific initiation of RNA synthesis.[12] The number of sigma factors encoded in bacterial genomes varies significantly depending on the species and environmental adaptations; for instance, Escherichia coli possesses seven distinct sigma factors, including the housekeeping σ⁷⁰ and stress-responsive alternatives like σˢ and σᴱ.[13] In contrast, soil-dwelling actinobacteria such as Streptomyces coelicolor encode over 65 sigma factors, reflecting their complex developmental cycles and secondary metabolism.[14] Bacterial-like sigma factors are also found in the plastids of plants and algae, organelles that originated from an ancient cyanobacterial endosymbiont. In Arabidopsis thaliana, six nuclear-encoded plastid sigma factors (SIG1 through SIG6) direct the transcription of chloroplast genes by recognizing bacterial-type promoters, with each factor showing specialization—such as SIG1 for photosystem assembly genes like psaA and psbA, or SIG6 for early chloroplast biogenesis.[15] This conservation underscores the endosymbiotic transfer of cyanobacterial transcription systems into the eukaryotic lineage, where these factors integrate plastid gene expression with host nuclear control via signals like light and redox state.[15] True sigma factors are absent in archaea and eukaryotes, which instead rely on structurally analogous proteins for transcription initiation. In archaea, transcription factor B (TFB) facilitates promoter recognition by the RNA polymerase, while in eukaryotes, TFIIB performs a similar role within the multiprotein pre-initiation complex.[10] These proteins share homology with bacterial sigma factors, particularly in their helix-turn-helix motifs for DNA binding, suggesting a common evolutionary precursor.[10] From an evolutionary perspective, sigma factors are believed to have arisen specifically within the bacterial domain following the divergence of bacteria from the last universal common ancestor (LUCA), which likely possessed a primordial initiation factor ancestral to both sigma factors and TFB/TFIIB.[10] This post-LUCA innovation in bacteria allowed for diverse promoter specificities, contrasting with the more uniform archaeal and eukaryotic systems. Recent investigations into bacterial endosymbionts of eukaryotes, such as those in insects and amoebae, continue to reveal active sigma factor-mediated regulation of endosymbiont gene expression, affirming their bacterial identity amid host interactions, though no evidence has emerged for true sigma factors in eukaryotic nuclear genomes.

Molecular structure

Conserved domains

Bacterial sigma factors of the σ70 family exhibit a modular domain architecture characterized by four conserved regions (1.1 to 4.2), which are connected by flexible linkers and enable specific interactions with the RNA polymerase (RNAP) core enzyme and promoter DNA. Region 1, primarily found in primary (Group 1) sigma factors, functions in an inhibitory role by binding to the core RNAP and preventing promiscuous DNA interactions in the apo-sigma state; it includes subdomain 1.1, which is unique to housekeeping sigmas and adopts an autoinhibitory conformation that mimics DNA to block promoter access.[16] Region 2 is highly conserved across the family and is responsible for recognizing the -10 promoter element (Pribnow box) through its subdomain 2.4, which features a helix-turn-helix (HTH) motif that interacts with the major groove of double-stranded DNA and facilitates DNA melting during open complex formation. Region 3 serves as a central scaffold for core RNAP interactions and promoter stabilization, while Region 4, also highly conserved, binds the -35 promoter element via its subdomain 4.2 HTH motif, which contacts the DNA backbone and major groove to confer promoter specificity. The interactions between these regions and the RNAP core enzyme are mediated primarily through Regions 2, 3, and 4, forming an extensive interface that spans the surface of the β and β' subunits. Region 3 binds directly to the β' subunit of the core RNAP via a compact three-helix bundle structure, involving electrostatic interactions between charged residues and hydrophobic contacts that stabilize the holoenzyme assembly; for instance, conserved acidic residues in Region 3 form salt bridges with basic patches on β', while hydrophobic cores involving leucine and valine residues provide additional stability.[16] Region 2 contributes through an α-helix in subdomain 2.2 that docks into the β' coiled-coil domain, and Region 4 interfaces with the β flap via hydrophobic and van der Waals contacts, collectively ensuring tight sigma-core association during transcription initiation. Full-length σ70-family factors typically range from 70 to 100 kDa in molecular weight, reflecting their multi-domain structure, whereas extracytoplasmic function (ECF, Group 4) sigmas are smaller at approximately 20 kDa, as they lack Regions 1 and 3 and consist primarily of the core-binding and DNA-recognition elements from Regions 2 and 4.[16] These domains collectively enable promoter recognition, with Region 2.4 contacting the -10 box and Region 4.2 the -35 box to position the holoenzyme accurately. Early structural insights into this architecture came from crystal structures of σ70-RNAP complexes, such as the 2002 Thermus aquaticus holoenzyme model resolved at 4 Å, which revealed the positioning of Regions 2 and 4 atop the core enzyme and their proximity to the promoter cleft.[17] A contemporaneous Thermus thermophilus structure at 2.6 Å further delineated the Region 3-β' interactions, confirming the modular folding and linker flexibility essential for function.[18]

Structural variations and recent insights

While the canonical σ70-family sigma factors feature four conserved regions (1–4), structural variations among alternative sigma factors enable specialized promoter recognition and interactions with RNA polymerase (RNAP). Extracytoplasmic function (ECF) sigma factors, the largest group of alternatives, exhibit a compact architecture lacking regions 1.1, 1.2, non-conserved region (NCR), and region 3, retaining primarily regions 2 and 4 connected by a variable linker that partially substitutes for region 3.2; this streamlined ~200-amino-acid structure facilitates rapid, specific responses to environmental cues without the inhibitory elements of region 1 found in housekeeping sigmas.[19][16] The σ54 (RpoN) family diverges profoundly, comprising three distinct regions (RI–RIII) organized into four structural domains: RI (flexible and unresolved in many structures), RII (a β-hairpin "RII-finger" for DNA/RNA interaction), a central binding domain (CBD) at the RNA exit channel, and an RpoN box for upstream DNA recognition; unlike σ70, σ54 does not directly bind -35/-10 promoter elements but requires enhancer-binding proteins with AAA+ ATPase domains for activation and isomerization to the open complex.[20][21] Group 2 sigma factors, such as σB (stress response) and σF (early sporulation in Bacillus subtilis), closely resemble primary sigmas but lack subdomain 1.1, featuring conserved regions 2–4 with molecular weights around 29–30 kDa; these variations subtly alter RNAP affinity and promoter specificity, supporting compartment-specific expression during development. ECF subgroups further diversify with specialized anti-sigma sensors integrated near their genes, enhancing signal transduction for niche adaptations like osmolarity or oxidative stress.[22][16] Recent cryo-EM studies have illuminated these variations' impacts on transcription. In 2023, structures of open promoter complexes with distinct σI factors (SigI1 and SigI6) from Clostridium thermocellum at 3.0 Å and 3.3 Å resolutions revealed unique promoter melting mechanisms, including σ-RNAP rearrangements where region 4 inserts into the major groove for enhanced specificity and a flipped base at the -11 position aiding bubble formation.[23] Advancing into 2024–2025, high-resolution mapping via artificial promoters and deep sequencing identified over 64,000 distinct σ54 binding motifs, expanding known -24 GG and -12 GC elements and highlighting sequence variability that modulates enhancer dependence. In Bacillus velezensis, a sigX knockout mutant exhibited thinner pellicles and disorganized colonies, underscoring σX's role in biofilm architecture and matrix gene expression for surface adhesion.[24][25]

Mechanism of transcription initiation

Promoter recognition and binding

Sigma factors confer promoter specificity to bacterial RNA polymerase (RNAP) by recognizing conserved DNA sequence elements in promoter regions, enabling the holoenzyme to bind and initiate transcription. In Escherichia coli, the housekeeping sigma factor σ70 primarily recognizes the -10 (TATAAT) and -35 (TTGACA) consensus sequences, located approximately 10 and 35 base pairs upstream of the transcription start site, respectively. The optimal spacing between these elements is 17 ± 1 bp, which positions the recognition domains of σ70 correctly on the DNA helix for stable binding.[26][27] The binding mechanism involves specific interactions between conserved regions of the sigma factor and promoter DNA. Region 2.4 of σ70 contacts the -10 element through recognition helices and intercalation of aromatic residues, such as tryptophan, which flip out bases like A-11 and T-7 to facilitate initial DNA distortion. Meanwhile, region 4.2, forming a helix-turn-helix motif, binds the -35 element via both specific hydrogen bonds and nonspecific interactions with the DNA backbone. These contacts induce a conformational fit in the holoenzyme-DNA complex, stabilizing the initial closed complex and promoting isomerization toward a more open state. The process follows a kinetic model involving rapid association to form the closed complex, followed by slower isomerization steps that enhance specificity.[28][27] Promoter specificity is further tuned by variations in consensus sequences and additional motifs. Alternative sigma factors often rely on extended -10 motifs or unique elements for recognition; for instance, group IV extracytoplasmic function (ECF) sigmas target promoters with a conserved -10 motif (G/TGAA) and variable upstream regions, sometimes featuring homopolymeric T-tracts that induce DNA curvature to aid RNAP engagement. The σ32 factor, which directs heat-shock gene expression, binds promoters with a weaker consensus, including a -10 element (CCCCATNT) and less stringent -35 region, allowing activation under stress conditions. In contrast, σ54 (group III) forms a stable closed complex at promoters lacking typical -10/-35 elements but requires an enhancer-bound activator protein to hydrolyze ATP, driving isomerization and promoter opening.[26][29][30] The σ70 holoenzyme exhibits high binding affinity to consensus promoters, with dissociation constants in the nanomolar range (approximately 10-9 M), reflecting the stability of the initial complex and underscoring the precision of sigma-directed recognition.[31][27]

Holoenzyme assembly and open complex formation

The bacterial RNA polymerase (RNAP) holoenzyme is assembled through the transient association of a sigma (σ) factor with the core enzyme, which comprises two α subunits, β, β', and ω. This binding enables specific promoter recognition and is reversible, allowing σ factors to cycle between free and holoenzyme-bound states. The equilibrium dissociation constant (K_d) for the interaction between σ^{70} and the core RNAP is approximately 1.9 × 10^{-7} M, reflecting moderate affinity that supports dynamic assembly without permanent sequestration of cellular RNAP resources.[32][1] Following initial promoter binding directed by the σ factor, the holoenzyme forms a closed complex (RP_c) in which double-stranded promoter DNA is engaged but not unwound. Isomerization to the open complex (RP_o) involves σ-directed DNA melting, typically spanning about 14 base pairs from approximately -11 to +3 relative to the transcription start site, with nucleation at the -10 element. For housekeeping σ^{70}-dependent promoters, this transition occurs spontaneously, driven by conformational changes in the RNAP induced by σ regions 2 and 4 that load promoter DNA into the active site cleft. In contrast, for σ^{54}-dependent promoters, isomerization requires energy from NTP hydrolysis by enhancer-binding proteins (bEBPs), which remodel the closed complex to enable strand separation.[33][34][35] Key interactions stabilizing the open complex include those from σ region 3, which positions a flexible loop near the RNAP active center to clamp and orient promoter DNA, facilitating template strand access and preventing re-annealing. Recent cryo-EM structures of open complexes with distinct σ^I factors from Clostridium thermocellum (resolved at 3.0–3.3 Å) reveal how σ^I engages the -10 and -35 elements to propagate conformational changes, including opening of the RNAP β' jaw domain, which accommodates the unwound DNA bubble for melting. These structures highlight conserved mechanisms across σ families while showing variations in σ^I's C-terminal domain interactions with the flap-tip helix.[36][23] The resulting RP_o establishes a productive initiation site, with the melted DNA bubble positioning the template strand in the active center for NTP incorporation and RNA synthesis. For consensus promoters, open complex formation is efficient, with roughly half exhibiting rapid isomerization rates that support robust transcription output, though efficiency varies with sequence deviations and cellular conditions. This process links directly to the σ cycle, enabling σ recycling after initiation.[37][38]

Dynamics in the transcription process

Sigma factor release and retention

In the classical model of bacterial transcription, the sigma factor dissociates from the RNA polymerase holoenzyme shortly after initiation, typically after the synthesis of 8-10 nucleotides of nascent RNA, facilitating promoter clearance and the transition to elongation.[2] This release has been observed in vitro for the primary sigma factor σ70 in Escherichia coli, where the growing RNA chain sterically clashes with sigma region 1.1, promoting dissociation through competition for binding sites on the core enzyme.[39] However, in vivo studies reveal that sigma factors, particularly σ70, can be retained in a substantial fraction of early elongation complexes, often ~40-60% of complexes after promoter escape.[40] Retention persists stochastically during early elongation, with some complexes maintaining σ70 for over 100 nucleotides or even throughout the entire transcription unit, as evidenced by chromatin immunoprecipitation and single-molecule analyses.[40] Recent work as of November 2025 suggests long-range retention may be common for subsets of E. coli operons, influencing pausing and regulation.[41] This retention stabilizes promoter-proximal pausing by enabling recognition of -10-like elements downstream, and its extent depends on promoter strength: sequences with pause-inducing elements in the initial transcribed region increase retention, aiding escape from weaker promoters where rapid release might otherwise lead to abortive initiation.[40][42] Mechanisms of sigma release and retention involve dynamic interactions between sigma, the core RNA polymerase, and the nascent RNA. The nascent RNA competes with sigma for contacts in the RNA exit channel, favoring dissociation in strong promoters with efficient escape, while core enzyme interactions, such as those in sigma region 4 with the β flap, can stabilize retention during pausing.[39] For the alternative sigma factor σ54, release is more complete and activator-dependent; enhancer-binding proteins remodel the holoenzyme to displace σ54 after open complex formation, as revealed by cryo-EM structures showing scrunching and reduced σ54-DNA interactions during escape.[43][44] This prevents retention due to its inhibitory role in elongation without activation.[35] Examples include σ70 retention in E. coli at promoters with suboptimal -10 elements, where prolonged association enhances pausing to coordinate downstream gene regulation, and in stress-responsive contexts, such as with the alternative sigma RpoS (σS), where retention supports efficient expression of stationary-phase genes by modulating elongation dynamics.[40][45] These dynamics influence transcription efficiency by balancing promoter clearance and pause resolution, with retention promoting anti-termination at regulatory pauses to ensure full-length transcript production, particularly under stress conditions where rapid adaptation is critical.[40] As part of the broader sigma cycle, release enables sigma recycling for new initiations.[46]

The sigma cycle

The sigma cycle describes the dynamic, iterative process through which sigma factors associate with the bacterial RNA polymerase (RNAP) core enzyme to direct transcription initiation, followed by dissociation and reuse for subsequent rounds. In the initial step, a free sigma factor binds to the apo-core RNAP (comprising subunits α₂ββ′ω), forming the holoenzyme that confers promoter specificity. This holoenzyme then locates and binds to appropriate promoter sequences on DNA, enabling the unwinding of the DNA double helix to form the open promoter complex and initiate RNA synthesis. Once transcription proceeds beyond the promoter region (typically after synthesizing 10–15 nucleotides), the sigma factor dissociates from the elongating RNAP, freeing it to recycle and bind another core enzyme, thereby allowing the core to continue elongation without sigma while the sigma participates in new initiation events.[7] The kinetics of the sigma cycle are rapid and tightly regulated to support efficient cellular transcription. Sigma-core association and dissociation occur on timescales of seconds, with exchange rates influenced by cellular concentrations; in Escherichia coli, σ70 levels are ~700–1,000 molecules per cell (~1–2 μM), while core RNAP concentrations are ~2,000–3,000 molecules per cell (~3–5 μM), resulting in a slight excess of cores over sigmas in exponentially growing cells.[47][48] This ratio ensures availability for cycling without excess sequestration, though alternative sigmas are present at lower levels (e.g., ~300–1,000 molecules for σ38 (RpoS) under stress), promoting selective redirection of the RNAP pool.[13] The process is concentration-dependent, with higher free sigma accelerating holoenzyme formation and overall transcription throughput.[49][7] Variations in the sigma cycle exist across sigma factor classes to accommodate specialized functions. For the alternative σ54 (RpoN), the cycle incorporates ATP-dependent activators (enhancer-binding proteins) that bind upstream of the promoter and hydrolyze ATP to remodel the closed complex into an open one; post-initiation, these activators must separately dissociate and recycle, decoupling their turnover from sigma release and enabling responses to nutrient signals like nitrogen limitation. In contrast, extracytoplasmic function (ECF) sigma factors, which mediate acute stress responses (e.g., to envelope damage), feature accelerated cycling with faster dissociation rates, allowing transient pulses of target gene expression without prolonged RNAP commitment; this rapid turnover supports brief activation during environmental challenges, followed by quick return to housekeeping transcription.[50][19] The sigma cycle enhances transcriptional efficiency by preventing the long-term sequestration of core RNAP, ensuring a dynamic pool available for diverse promoters; disruptions, such as mutations impairing sigma-core interactions or release timing, result in growth defects, including reduced proliferation and impaired adaptation to antibiotics or stress in bacteria like E. coli. A September 2025 model proposes that sigma cycling coordinates with toxin-antitoxin systems via a nutrient-responsive cybernetic framework to optimize population-level fitness in fluctuating environments, balancing growth and survival across the bacterial life cycle.[51][52]

Classification of sigma factors

Housekeeping sigma factors

Housekeeping sigma factors are the primary sigma factors in bacteria, essential for cell viability and responsible for directing the transcription of the majority of housekeeping genes that maintain fundamental cellular processes under normal growth conditions.[53] These factors ensure constitutive expression of genes involved in routine cellular maintenance, distinguishing them from alternative sigma factors that respond to specific environmental cues.[54] Key characteristics of housekeeping sigma factors include their broad promoter recognition capabilities, allowing them to bind consensus promoter sequences such as the -10 (TATAAT) and -35 (TTGACA) boxes in a wide array of promoters.[1] They are typically the most abundant sigma factors in the cell; for instance, in Escherichia coli, the housekeeping sigma factor σ70 (encoded by rpoD) constitutes 60–95% of total sigma factors during exponential growth phase.[55] This high cellular concentration, estimated at approximately 700 copies per cell, enables efficient recruitment of RNA polymerase to housekeeping promoters.[56] Functionally, housekeeping sigma factors orchestrate the expression of genes critical for bacterial growth, metabolism, and DNA replication, supporting exponential proliferation in nutrient-rich environments.[57] In γ-proteobacteria such as E. coli, σ70 exemplifies this role by initiating transcription from promoters of core metabolic and replication genes.[57] Similarly, in Gram-positive bacteria like Bacillus subtilis, the housekeeping sigma factor σA (encoded by sigA) performs analogous functions, promoting RNA polymerase attachment to primary promoters across all growth phases.[58] [59] Recent advances in 2024 have explored engineering variants of housekeeping sigma factors, such as σ70, to enhance gene expression in industrial bacterial strains. These engineered σ factors enable tunable control of recombinant protein production without extensive genomic modifications, improving yields in applications like metabolic engineering.[60]

Alternative sigma factors

Alternative sigma factors are specialized subunits of bacterial RNA polymerase that enable the enzyme to recognize distinct promoter sequences, thereby directing transcription toward genes involved in adaptive responses to environmental stresses, developmental processes, or specific signals. Unlike the constitutive housekeeping sigma factors, alternative sigmas are typically present in low abundance under standard conditions and become activated in response to cues such as nutrient limitation, temperature shifts, or cellular damage, with bacterial genomes encoding between 5 and 50 such factors depending on the species.[61][1] Major classes of alternative sigma factors include those responsive to stationary phase and general stress (e.g., σS or RpoS), heat shock (σ32 or RpoH), nitrogen limitation (σ54 or RpoN), and extracytoplasmic function (ECF) factors that monitor envelope integrity (e.g., σE or RpoE for periplasmic stress). These classes allow bacteria to reprogram gene expression rapidly, prioritizing survival and adaptation over routine metabolism. For instance, σS in Escherichia coli upregulates genes for oxidative stress resistance and osmotic balance during nutrient scarcity.[62][63][64] In E. coli, seven sigma factors are encoded, with the six alternatives comprising σ19 (FecI, iron uptake), σ24 (RpoE, envelope stress), σ28 (FliA, flagellar synthesis), σ32 (heat shock), σ38 (RpoS, stationary phase), and σ54 (nitrogen regulation). Beyond model organisms, alternative sigmas play key roles in pathogenesis; for example, σB in Listeria monocytogenes enhances intracellular survival and virulence by coordinating stress tolerance during host infection. Similarly, a 2025 study revealed that σ54 (RpoN) in Clostridioides difficile acts as a global regulator, promoting antibiotic resistance through modulation of efflux pumps and metabolic pathways essential for persistence in the gut.[65][66][67] The primary function of alternative sigma factors is to redirect RNA polymerase from housekeeping promoters to adaptive gene sets, ensuring efficient resource allocation during challenges. In Bacillus velezensis, σX supports biofilm formation by activating matrix production genes, enhancing community stability and biocontrol against pathogens, as demonstrated in a 2025 analysis of environmental isolates. Recent advances highlight their diversity: in Streptomyces species, σI factors recognize unique promoters to fine-tune developmental transitions, indirectly influencing secondary metabolite pathways including antibiotic biosynthesis. A 2023 discovery identified plasmid-encoded "rogue" sigma factors, such as SigN in Bacillus subtilis, that trigger cell lysis upon DNA damage, potentially aiding plasmid dissemination in microbial populations. Additionally, in Chlamydia trachomatis, alternative sigmas σ28 and σ54 bind late-cycle genes in 2025 genomic studies, driving the switch to infectious elementary bodies essential for pathogenesis.[25][23][68][69]

Regulation and interactions

Sigma factor competition

Sigma factors in bacteria, such as those in Escherichia coli, compete for a limited pool of core RNA polymerase (RNAP) enzymes to form functional holoenzymes capable of promoter recognition and transcription initiation. This competition occurs primarily at the sigma-binding site on the core RNAP, where the relative affinities and cellular concentrations of different sigma factors determine which one associates with the core to direct transcription. For instance, the housekeeping sigma factor σ70 exhibits high affinity for core RNAP with a dissociation constant (Kd) of 0.26 nM, enabling it to dominate under normal growth conditions, while alternative sigmas like the heat shock factor σ32 (Kd = 1.24 nM) and the stationary phase factor σS (Kd = 4.26 nM) have lower affinities but are induced to higher levels during stress, allowing them to vie effectively for core binding.[13] The outcomes of this rivalry profoundly influence the bacterial transcriptome by reallocating RNAP resources toward context-specific gene expression. Under stress conditions, elevated concentrations of alternative sigmas reduce the availability of σ70-holoenzymes, leading to downregulation of growth-related genes and upregulation of stress-response genes. In E. coli exponential phase, approximately 70–80% of holoenzymes are σ70, supporting housekeeping functions; however, during stress responses like the stringent response, this fraction decreases as released core RNAP from rRNA transcription becomes available for alternative sigmas, resulting in hypersensitivity of gene expression shifts with response factors up to 3-fold.[70] A key example is the competition between σS and σ70 during stationary phase, where nutrient limitation and oxidative stress trigger σS accumulation to activate the general stress response regulon (~10% of the genome). Recent findings reveal multilayered regulation of σS (RpoS), including transcriptional control by systems like TorR/TorS under acid stress, translational enhancements via tRNA modifications (e.g., MiaA, TrmL), and post-translational stabilization by anti-adaptors (e.g., IraP, IraD) that prevent proteolysis, collectively amplifying σS competition against σ70 and promoting adaptive heterogeneity in cell populations.[71] Similarly, under heat shock, σ32 induction competes with σ70, redirecting RNAP to chaperones and proteases for protein homeostasis.[13] This competitive mechanism ensures coordinated physiological responses by prioritizing essential transcriptomes during environmental challenges; disruptions, such as mutations altering affinities, can dysregulate competition and impair adaptation, underscoring its role in bacterial fitness.[70] Within the broader sigma cycle, competition also influences sigma recycling post-initiation, further modulating holoenzyme availability.[72]

Anti-sigma factors and dual promoter specificity

Anti-sigma factors are proteins that bind to and sequester sigma factors, thereby inhibiting their association with the RNA polymerase core enzyme and preventing promoter-specific transcription initiation. These regulators are particularly prevalent among extracytoplasmic function (ECF) sigma factors and those involved in sporulation, where they enable rapid responses to environmental stresses by controlling sigma availability. Mechanisms of anti-sigma action include direct binding to sequester the sigma factor, often relieved by stimuli such as proteolysis, partner-switching, or phosphorylation changes.[16][73] A well-characterized example is RsbW, an anti-sigma factor that inhibits the stress-responsive sigma factor σ^B in Bacillus subtilis by forming a stable complex that blocks σ^B from binding the RNA polymerase core. RsbW's inhibitory effect is counteracted through an anti-anti-sigma factor, RsbV, whose dephosphorylation under stress conditions promotes partner-switching, releasing σ^B for transcription activation. This system exemplifies the partner-switching mechanism common in Gram-positive bacteria. In Clostridioides difficile, RsbW similarly regulates σ^B activity, influencing sporulation efficiency and stress tolerance; mutants lacking RsbW exhibit defects in spore formation and altered virulence, highlighting its role in pathogenesis.[74][75][76] Another prominent case is RseA, a transmembrane anti-sigma factor that sequesters the ECF sigma factor σ^E (RpoE) in Escherichia coli, maintaining it inactive under normal conditions. During envelope stress, such as outer membrane protein misfolding, RseA undergoes sequential proteolytic degradation by proteases like DegS and RseP, liberating σ^E to direct the transcription of stress response genes. This regulated proteolysis mechanism is a hallmark of ECF sigma regulation, ensuring precise activation in response to extracytoplasmic signals.[77][78] Dual promoter specificity refers to the ability of certain bacterial genes to be recognized by more than one sigma factor, allowing coordinated or conditional expression under varying physiological states. In E. coli, approximately 5% of genes possess promoters with dual specificity, primarily for the housekeeping sigma factor σ^70 (RpoD) and the stationary-phase/stress sigma factor σ^38 (RpoS), enabling fine-tuned regulation during transitions like nutrient limitation. This overlap facilitates additive or alternative activation, where σ^70 drives expression during exponential growth and σ^38 predominates under stress, with 84% of dual-specificity promoters responsive to both factors.[79] The molecular basis of dual specificity often involves overlapping consensus sequences in promoter regions, particularly shared -10 elements (TATAAT-like) that accommodate both sigma factors' recognition domains, while variations in the -35 region or extended motifs modulate affinity. A 2022 computational and experimental study analyzed these promoters using sequence logos from -41 to +1 relative to the transcription start site, revealing conserved motifs that predict dual responsiveness based on positional distances (p-distances) between key elements, such as Dσ38 and Dσ70 scores that correlate with expression strength under varying sigma levels. Representative examples include the pstS gene (phosphate starvation-inducible), aidB (DNA repair), and asr (anaerobic sulfite reduction), where dual recognition ensures robust expression across growth phases without requiring separate promoters.[79] Recent advances have leveraged dual-specificity motifs for synthetic biology applications, such as designing hybrid promoters that respond to multiple sigma factors for dynamic gene circuit control in E. coli. High-throughput mapping of sigma-binding sequences in 2025 has expanded promoter libraries, enabling the engineering of orthogonal systems for metabolic engineering and stress-responsive biosensors, building on earlier motif identifications to achieve precise, multi-input regulation.[79][24]

References

User Avatar
No comments yet.