Recent from talks
Nothing was collected or created yet.

| Part of a series on |
| Genetics |
|---|
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyribonucleic acid (DNA) are nucleic acids. The nucleic acids constitute one of the four major macromolecules essential for all known forms of life. RNA is assembled as a chain of nucleotides. Cellular organisms use messenger RNA (mRNA) to convey genetic information (using the nitrogenous bases of guanine, uracil, adenine, and cytosine, denoted by the letters G, U, A, and C) that directs synthesis of specific proteins. Many viruses encode their genetic information using an RNA genome.
Some RNA molecules play an active role within cells by catalyzing biological reactions, controlling gene expression, or sensing and communicating responses to cellular signals. One of these active processes is protein synthesis, a universal function in which RNA molecules direct the synthesis of proteins on ribosomes. This process uses transfer RNA (tRNA) molecules to deliver amino acids to the ribosome, where ribosomal RNA (rRNA) then links amino acids together to form coded proteins.
It has become widely accepted in science[1] that early in the history of life on Earth, prior to the evolution of DNA and possibly of protein-based enzymes as well, an "RNA world" existed in which RNA served as both living organisms' storage method for genetic information—a role fulfilled today by DNA, except in the case of RNA viruses—and potentially performed catalytic functions in cells—a function performed today by protein enzymes, with the notable and important exception of the ribosome, which is a ribozyme.
Chemical structure of RNA
[edit]Basic chemical composition
[edit]
Each nucleotide in RNA contains a ribose sugar, with carbons numbered 1' through 5'. A base is attached to the 1' position, in general, adenine (A), cytosine (C), guanine (G), or uracil (U). Adenine and guanine are purines, and cytosine and uracil are pyrimidines. A phosphate group is attached to the 3' position of one ribose and the 5' position of the next. The phosphate groups have a negative charge each, making RNA a charged molecule (polyanion). The bases form standard hydrogen bonds between cytosine and guanine and between adenine and uracil, while guanine and uracil can pair through a non-canonical G–U wobble base pair.[2] However, other interactions are possible, such as a group of adenine bases binding to each other in a bulge,[3] or the GNRA tetraloop that has a guanine–adenine base-pair.[2]
Differences between DNA and RNA
[edit]
Like DNA, most biologically active RNAs, including mRNA, tRNA, rRNA, snRNAs, and other non-coding RNAs, contain self-complementary sequences that allow parts of the RNA to fold[8] and pair with itself to form double helices. Analysis of these RNAs has revealed that they are highly structured. Unlike DNA, their structures do not consist of long double helices, but rather collections of short helices packed together into structures akin to proteins.
In this fashion, RNAs can achieve chemical catalysis (like enzymes).[9] For instance, determination of the structure of the ribosome—an RNA-protein complex that catalyzes the assembly of proteins—revealed that its active site is composed entirely of RNA.[10]
An important structural component of RNA that distinguishes it from DNA is the presence of a hydroxyl group at the 2' position of the ribose sugar. The presence of this functional group causes the helix to mostly take the A-form geometry,[11] although in single strand dinucleotide contexts, RNA can rarely also adopt the B-form most commonly observed in DNA.[12] The A-form geometry results in a very deep and narrow major groove and a shallow and wide minor groove.[13] A second consequence of the presence of the 2'-hydroxyl group is that in conformationally flexible regions of an RNA molecule (that is, not involved in formation of a double helix), it can chemically attack the adjacent phosphodiester bond to cleave the backbone.[14]
Secondary and tertiary structures
[edit]The functional form of single-stranded RNA molecules, just like proteins, frequently requires a specific spatial tertiary structure. The scaffold for this structure is provided by secondary structural elements that are hydrogen bonds within the molecule. This leads to several recognizable "domains" of secondary structure like hairpin loops, bulges, and internal loops.[15] In order to create, i.e., design, RNA for any given secondary structure, two or three bases would not be enough, but four bases are enough.[16] This is likely why nature has "chosen" a four base alphabet: fewer than four would not allow the creation of all structures, while more than four bases are not necessary to do so. Since RNA is charged, metal ions such as Mg2+ are needed to stabilise many secondary and tertiary structures.[17]
The naturally occurring enantiomer of RNA is D-RNA composed of D-ribonucleotides. All chirality centers are located in the D-ribose. By the use of L-ribose or rather L-ribonucleotides, L-RNA can be synthesized. L-RNA is much more stable against degradation by RNase.[18]
Like other structured biopolymers such as proteins, one can define topology of a folded RNA molecule. This is often done based on arrangement of intra-chain contacts within a folded RNA, termed as circuit topology.
Chemical modifications
[edit]RNA is transcribed with only four bases (adenine, cytosine, guanine and uracil),[19] but these bases and attached sugars can be modified in numerous ways as the RNAs mature. Pseudouridine (Ψ), in which the linkage between uracil and ribose is changed from a C–N bond to a C–C bond, and ribothymidine (T) are found in various places (the most notable ones being in the TΨC loop of tRNA).[20] Another notable modified base is hypoxanthine, a deaminated adenine base whose nucleoside is called inosine (I). Inosine plays a key role in the wobble hypothesis of the genetic code.[21]
There are more than 100 other naturally occurring modified nucleosides.[22] The greatest structural diversity of modifications can be found in tRNA,[23] while pseudouridine and nucleosides with 2'-O-methylribose often present in rRNA are the most common.[24] The specific roles of many of these modifications in RNA are not fully understood. However, it is notable that, in ribosomal RNA, many of the post-transcriptional modifications occur in highly functional regions, such as the peptidyl transferase center[25] and the subunit interface, implying that they are important for normal function.[26]
Types of RNA
[edit]
Messenger RNA (mRNA) is the type of RNA that carries information from DNA to the ribosome, the sites of protein synthesis (translation) in the cell cytoplasm. The coding sequence of the mRNA determines the amino acid sequence in the protein that is produced.[27] However, many RNAs do not code for protein (about 97% of the transcriptional output is non-protein-coding in eukaryotes[28][29][30][31]).
These so-called non-coding RNAs ("ncRNA") can be encoded by their own genes (RNA genes), but can also derive from mRNA introns.[32] The most prominent examples of non-coding RNAs are transfer RNA (tRNA) and ribosomal RNA (rRNA), both of which are involved in the process of translation.[7] There are also non-coding RNAs involved in gene regulation, RNA processing and other roles. Certain RNAs are able to catalyse chemical reactions such as cutting and ligating other RNA molecules,[33] and the catalysis of peptide bond formation in the ribosome;[10] these are known as ribozymes.
According to the length of RNA chain, RNA includes small RNA and long RNA.[34] Usually, small RNAs are shorter than 200 nt in length, and long RNAs are greater than 200 nt long.[35] Long RNAs, also called large RNAs, mainly include long non-coding RNA (lncRNA) and mRNA. Small RNAs mainly include 5.8S ribosomal RNA (rRNA), 5S rRNA, transfer RNA (tRNA), microRNA (miRNA), small interfering RNA (siRNA), small nucleolar RNA (snoRNAs), Piwi-interacting RNA (piRNA), tRNA-derived small RNA (tsRNA)[36] and small rDNA-derived RNA (srRNA).[37] There are certain exceptions as in the case of the 5S rRNA of the members of the genus Halococcus (Archaea), which have an insertion, thus increasing its size.[38][39][40]
RNAs involved in protein synthesis
[edit]Messenger RNA (mRNA) carries information about a protein sequence to the ribosomes, the protein synthesis factories in the cell. It is coded so that every three nucleotides (a codon) corresponds to one amino acid. In eukaryotic cells, once precursor mRNA (pre-mRNA) has been transcribed from DNA, it is processed to mature mRNA. This removes its introns—non-coding sections of the pre-mRNA. The mRNA is then exported from the nucleus to the cytoplasm, where it is bound to ribosomes and translated into its corresponding protein form with the help of tRNA. In prokaryotic cells, which do not have nucleus and cytoplasm compartments, mRNA can bind to ribosomes while it is being transcribed from DNA. After a certain amount of time, the message degrades into its component nucleotides with the assistance of ribonucleases.[27]
Transfer RNA (tRNA) is a small RNA chain of about 80 nucleotides that transfers a specific amino acid to a growing polypeptide chain at the ribosomal site of protein synthesis during translation. It has sites for amino acid attachment and an anticodon region for codon recognition that binds to a specific sequence on the messenger RNA chain through hydrogen bonding.[32]

Ribosomal RNA (rRNA) is the catalytic component of the ribosomes. The rRNA is the component of the ribosome that hosts translation. Eukaryotic ribosomes contain four different rRNA molecules: 18S, 5.8S, 28S and 5S rRNA. Three of the rRNA molecules are synthesized in the nucleolus, and one is synthesized elsewhere. In the cytoplasm, ribosomal RNA and protein combine to form a nucleoprotein called a ribosome. The ribosome binds mRNA and carries out protein synthesis. Several ribosomes may be attached to a single mRNA at any time.[27] Nearly all the RNA found in a typical eukaryotic cell is rRNA.
Transfer-messenger RNA (tmRNA) is found in many bacteria and plastids. It tags proteins encoded by mRNAs that lack stop codons for degradation and prevents the ribosome from stalling.[41]
Regulatory RNA
[edit]The earliest known regulators of gene expression were proteins known as repressors and activators – regulators with specific short binding sites within enhancer regions near the genes to be regulated.[42] Later studies have shown that RNAs also regulate genes. There are several kinds of RNA-dependent processes in eukaryotes regulating the expression of genes at various points, such as RNA interference repressing genes post-transcriptionally, long non-coding RNAs shutting down blocks of chromatin epigenetically, and enhancer RNAs inducing increased gene expression.[43] Bacteria and archaea have also been shown to use regulatory RNA systems such as bacterial small RNAs and CRISPR.[44] Fire and Mello were awarded the 2006 Nobel Prize in Physiology or Medicine for discovering microRNAs (miRNAs), specific short RNA molecules that can base-pair with mRNAs.[45]
MicroRNA (miRNA) and small interfering RNA (siRNA)
[edit]Post-transcriptional expression levels of many genes can be controlled by RNA interference, in which miRNAs, specific short RNA molecules, pair with mRNA regions and target them for repression.[46] RNA interference is carried out by the RNA-induced silencing complex, which uses a miRNA guide to regulate complementary mRNAs through base pairing interactions and results in either degradaing the mRNA or blocking its translation.[47]
Long non-coding RNAs
[edit]Next to be linked to regulation were Xist and other long noncoding RNAs associated with X chromosome inactivation. Their roles, at first mysterious, were shown by Jeannie T. Lee and others to be the silencing of blocks of chromatin via recruitment of Polycomb complex so that messenger RNA could not be transcribed from them.[48] Additional lncRNAs, currently defined as RNAs of more than 200 base pairs that do not appear to have coding potential,[49] have been found associated with regulation of stem cell pluripotency and cell division.[49]
Enhancer RNAs
[edit]The third major group of regulatory RNAs is called enhancer RNAs.[49] It is not clear at present whether they are a unique category of RNAs of various lengths or constitute a distinct subset of lncRNAs. In any case, they are transcribed from enhancers, which are known regulatory sites in the DNA near genes they regulate.[49][50] They up-regulate the transcription of the gene(s) under control of the enhancer from which they are transcribed.[49][51]
Small RNA in prokaryotes
[edit]Small RNA
[edit]At first, regulatory RNA was thought to be a eukaryotic phenomenon, a part of the explanation for why so much more transcription in higher organisms was seen than had been predicted. But as soon as researchers began to look for possible RNA regulators in bacteria, they turned up there as well, termed as small RNA (sRNA).[52][44] Currently, the ubiquitous nature of systems of RNA regulation of genes has been discussed as support for the RNA World theory.[43][53] There are indications that the enterobacterial sRNAs are involved in various cellular processes and seem to have significant role in stress responses such as membrane stress, starvation stress, phosphosugar stress and DNA damage. Also, it has been suggested that sRNAs have been evolved to have important role in stress responses because of their kinetic properties that allow for rapid response and stabilisation of the physiological state.[4] Bacterial small RNAs generally act via antisense pairing with mRNA to down-regulate its translation, either by affecting stability or affecting cis-binding ability.[43] Riboswitches have also been discovered. They are cis-acting regulatory RNA sequences acting allosterically. They change shape when they bind metabolites so that they gain or lose the ability to bind chromatin to regulate expression of genes.[54][55]
CRISPR RNA
[edit]Archaea also have systems of regulatory RNA.[56] The CRISPR system, recently being used to edit DNA in situ, acts via regulatory RNAs in archaea and bacteria to provide protection against virus invaders.[43][57]
RNA synthesis and processing
[edit]Synthesis
[edit]Synthesis of RNA typically occurs in the cell nucleus and is usually catalyzed by an enzyme—RNA polymerase—using DNA as a template, a process known as transcription. Initiation of transcription begins with the binding of the enzyme to a promoter sequence in the DNA (usually found "upstream" of a gene). The DNA double helix is unwound by the helicase activity of the enzyme. The enzyme then progresses along the template strand in the 3' to 5' direction, synthesizing a complementary RNA molecule with elongation occurring in the 5' to 3' direction. The DNA sequence also dictates where termination of RNA synthesis will occur.[58]
Primary transcript RNAs are often modified by enzymes after transcription. For example, a poly(A) tail and a 5' cap are added to eukaryotic pre-mRNA and introns are removed by the spliceosome.
There are also a number of RNA-dependent RNA polymerases that use RNA as their template for synthesis of a new strand of RNA. For instance, a number of RNA viruses (such as poliovirus) use this type of enzyme to replicate their genetic material.[59] Also, RNA-dependent RNA polymerase is part of the RNA interference pathway in many organisms.[60]
RNA processing
[edit]
Many RNAs are involved in modifying other RNAs. Introns are spliced out of pre-mRNA by spliceosomes, which contain several small nuclear RNAs (snRNA),[7] or the introns can be ribozymes that are spliced by themselves.[61] RNA can also be altered by having its nucleotides modified to nucleotides other than A, C, G and U. In eukaryotes, modifications of RNA nucleotides are in general directed by small nucleolar RNAs (snoRNA; 60–300 nt),[32] found in the nucleolus and cajal bodies. snoRNAs associate with enzymes and guide them to a spot on an RNA by basepairing to that RNA. These enzymes then perform the nucleotide modification. rRNAs and tRNAs are extensively modified, but snRNAs and mRNAs can also be the target of base modification.[62][63] RNA can also be methylated.[64][65]
RNA in genetics
[edit]RNA genomes
[edit]Like DNA, RNA can carry genetic information. RNA viruses have genomes composed of RNA that encodes a number of proteins. The viral genome is replicated by some of those proteins, while other proteins protect the genome as the virus particle moves to a new host cell. Viroids are another group of pathogens, but they consist only of RNA, do not encode any protein and are replicated by a host plant cell's polymerase.[66]
Reverse transcription
[edit]Reverse transcribing viruses replicate their genomes by reverse transcribing DNA copies from their RNA; these DNA copies are then transcribed to new RNA. Retrotransposons also spread by copying DNA and RNA from one another,[67] and telomerase contains an RNA that is used as template for building the ends of eukaryotic chromosomes.[68]
Double-stranded RNA
[edit]
Double-stranded RNA (dsRNA) is RNA with two complementary strands, similar to the DNA found in all cells, but with the replacement of thymine by uracil and the adding of one oxygen atom. dsRNA forms the genetic material of some viruses (double-stranded RNA viruses). Double-stranded RNA, such as viral RNA or siRNA, can trigger RNA interference in eukaryotes, as well as interferon response in vertebrates.[69][70][71][72] In eukaryotes, double-stranded RNA (dsRNA) plays a role in the activation of the innate immune system against viral infections.[73]
Circular RNA
[edit]In the late 1970s, it was shown that there is a single stranded covalently closed, i.e. circular form of RNA expressed throughout the animal and plant kingdom (see circRNA).[74] circRNAs are thought to arise via a "back-splice" reaction where the spliceosome joins a upstream 3' acceptor to a downstream 5' donor splice site. So far the function of circRNAs is largely unknown, although for few examples a microRNA sponging activity has been demonstrated.
Key discoveries in RNA biology
[edit]
Research on RNA has led to many important biological discoveries and numerous Nobel Prizes. Nucleic acids were discovered in 1868 by Friedrich Miescher, who called the material 'nuclein' since it was found in the nucleus.[75] It was later discovered that prokaryotic cells, which do not have a nucleus, also contain nucleic acids. The role of RNA in protein synthesis was suspected already in 1939.[76] Severo Ochoa won the 1959 Nobel Prize in Medicine (shared with Arthur Kornberg) after he discovered an enzyme that can synthesize RNA in the laboratory.[77] However, the enzyme discovered by Ochoa (polynucleotide phosphorylase) was later shown to be responsible for RNA degradation, not RNA synthesis. In 1956 Alex Rich and David Davies hybridized two separate strands of RNA to form the first crystal of RNA whose structure could be determined by X-ray crystallography.[78]
The sequence of the 77 nucleotides of a yeast tRNA was found by Robert W. Holley in 1965,[79] winning Holley the 1968 Nobel Prize in Medicine (shared with Har Gobind Khorana and Marshall Nirenberg).
In the early 1970s, retroviruses and reverse transcriptase were discovered, showing for the first time that enzymes could copy RNA into DNA (the opposite of the usual route for transmission of genetic information). For this work, David Baltimore, Renato Dulbecco and Howard Temin were awarded a Nobel Prize in 1975. In 1976, Walter Fiers and his team determined the first complete nucleotide sequence of an RNA virus genome, that of bacteriophage MS2.[80]
In 1977, introns and RNA splicing were discovered in both mammalian viruses and in cellular genes, resulting in a 1993 Nobel to Philip Sharp and Richard Roberts. Catalytic RNA molecules (ribozymes) were discovered in the early 1980s, leading to a 1989 Nobel award to Thomas Cech and Sidney Altman. In 1990, it was found in Petunia that introduced genes can silence similar genes of the plant's own, now known to be a result of RNA interference.[81][82]
At about the same time, 22 nt long RNAs, now called microRNAs, were found to have a role in the development of C. elegans.[83] Studies on RNA interference earned a Nobel Prize for Andrew Fire and Craig Mello in 2006, and another Nobel for studies on the transcription of RNA to Roger Kornberg in the same year. The discovery of gene regulatory RNAs has led to attempts to develop drugs made of RNA, such as siRNA, to silence genes.[84] Adding to the Nobel prizes for research on RNA, in 2009 it was awarded for the elucidation of the atomic structure of the ribosome to Venki Ramakrishnan, Thomas A. Steitz, and Ada Yonath. In 2023 the Nobel Prize in Physiology or Medicine was awarded to Katalin Karikó and Drew Weissman for their discoveries concerning modified nucleosides that enabled the development of effective mRNA vaccines against COVID-19.[85][86][87]
Relevance for prebiotic chemistry and abiogenesis
[edit]In 1968, Carl Woese hypothesized that RNA might be catalytic and suggested that the earliest forms of life (self-replicating molecules) could have relied on RNA both to carry genetic information and to catalyze biochemical reactions—an RNA world.[88][89] In May 2022, scientists discovered that RNA can form spontaneously on prebiotic basalt lava glass, presumed to have been abundant on the early Earth.[90][91]
In March 2015, DNA and RNA nucleobases, including uracil, cytosine and thymine, were reportedly formed in the laboratory under outer space conditions, using starter chemicals such as pyrimidine, an organic compound commonly found in meteorites. Pyrimidine, like polycyclic aromatic hydrocarbons (PAHs), is one of the most carbon-rich compounds found in the universe and may have been formed in red giants or in interstellar dust and gas clouds.[92] In July 2022, astronomers reported massive amounts of prebiotic molecules, including possible RNA precursors, in the galactic center of the Milky Way Galaxy.[93][94]
Medical applications
[edit]RNA, initially deemed unsuitable for therapeutics due to its short half-life, has been made useful through advances in stabilization. Therapeutic applications arise as RNA folds into complex conformations and binds proteins, nucleic acids, and small molecules to form catalytic centers.[95] RNA-based vaccines are thought to be easier to produce than traditional vaccines derived from killed or altered pathogens, because it can take months or years to grow and study a pathogen and determine which molecular parts to extract, inactivate, and use in a vaccine. Small molecules with conventional therapeutic properties can target RNA and DNA structures, thereby treating novel diseases. However, research is scarce on small molecules targeting RNA and approved drugs for human illness. Ribavirin, branaplam, and ataluren are currently available medications that stabilize double-stranded RNA structures and control splicing in a variety of disorders.[96][97]
Protein-coding mRNAs have emerged as new therapeutic candidates, with RNA replacement being particularly beneficial for brief but torrential protein expression.[98] In vitro transcribed mRNAs (IVT-mRNA) have been used to deliver proteins for bone regeneration, pluripotency, and heart function in animal models.[99][100][101][102][103] SiRNAs, short RNA molecules, play a crucial role in innate defense against viruses and chromatin structure. They can be artificially introduced to silence specific genes, making them valuable for gene function studies, therapeutic target validation, and drug development.[98]
mRNA vaccines have emerged as an important new class of vaccines, using mRNA to manufacture proteins which provoke an immune response. Their first successful large-scale application came in the form of COVID-19 vaccines during the COVID-19 pandemic.
See also
[edit]References
[edit]- ^ Copley SD, Smith E, Markowitz HJ (December 2007). "The origin of the RNA world: co-evolution of genes and metabolism". Bioorganic Chemistry. 35 (6): 430–443. doi:10.1016/j.bioorg.2007.08.001. PMID 17897696.
The proposal that life on Earth arose from an RNA World is widely accepted.
- ^ a b Lee JC, Gutell RR (December 2004). "Diversity of base-pair conformations and their occurrence in rRNA structure and RNA structural motifs". Journal of Molecular Biology. 344 (5): 1225–49. doi:10.1016/j.jmb.2004.09.072. PMID 15561141.
- ^ Barciszewski J, Frederic B, Clark C (1999). RNA biochemistry and biotechnology. Springer. pp. 73–87. ISBN 978-0-7923-5862-6. OCLC 52403776.
- ^ a b "RNA: The Versatile Molecule". University of Utah. 2015.
- ^ "Nucleotides and Nucleic Acids" (PDF). University of California, Los Angeles. Archived from the original (PDF) on 2015-09-23. Retrieved 2015-08-26.
- ^ Shukla RN (2014). Analysis of Chromosomes. Agrotech Press. ISBN 978-93-84568-17-7.[permanent dead link]
- ^ a b c Berg JM, Tymoczko JL, Stryer L (2002). Biochemistry (5th ed.). WH Freeman and Company. pp. 118–19, 781–808. ISBN 978-0-7167-4684-3. OCLC 179705944.
- ^ Tinoco I, Bustamante C (October 1999). "How RNA folds". Journal of Molecular Biology. 293 (2): 271–81. doi:10.1006/jmbi.1999.3001. PMID 10550208.
- ^ Higgs PG (August 2000). "RNA secondary structure: physical and computational aspects". Quarterly Reviews of Biophysics. 33 (3): 199–253. doi:10.1017/S0033583500003620. PMID 11191843. S2CID 37230785.
- ^ a b Nissen P, Hansen J, Ban N, Moore PB, Steitz TA (August 2000). "The structural basis of ribosome activity in peptide bond synthesis". Science. 289 (5481): 920–30. Bibcode:2000Sci...289..920N. doi:10.1126/science.289.5481.920. PMID 10937990.
- ^ Salazar M, Fedoroff OY, Miller JM, Ribeiro NS, Reid BR (April 1993). "The DNA strand in DNA.RNA hybrid duplexes is neither B-form nor A-form in solution". Biochemistry. 32 (16): 4207–15. doi:10.1021/bi00067a007. PMID 7682844.
- ^ Sedova A, Banavali NK (February 2016). "RNA approaches the B-form in stacked single strand dinucleotide contexts". Biopolymers. 105 (2): 65–82. doi:10.1002/bip.22750. PMID 26443416. S2CID 35949700.
- ^ Hermann T, Patel DJ (March 2000). "RNA bulges as architectural and recognition motifs". Structure. 8 (3): R47–54. doi:10.1016/S0969-2126(00)00110-6. PMID 10745015.
- ^ Mikkola S, Stenman E, Nurmi K, Yousefi-Salakdeh E, Strömberg R, Lönnberg H (1999). "The mechanism of the metal ion promoted cleavage of RNA phosphodiester bonds involves a general acid catalysis by the metal aquo ion on the departure of the leaving group". Journal of the Chemical Society, Perkin Transactions 2 (8): 1619–26. doi:10.1039/a903691a.
- ^ Mathews DH, Disney MD, Childs JL, Schroeder SJ, Zuker M, Turner DH (May 2004). "Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure". Proceedings of the National Academy of Sciences of the United States of America. 101 (19): 7287–92. Bibcode:2004PNAS..101.7287M. doi:10.1073/pnas.0401799101. PMC 409911. PMID 15123812.
- ^ Burghardt B, Hartmann AK (February 2007). "RNA secondary structure design". Physical Review E. 75 (2) 021920. arXiv:physics/0609135. Bibcode:2007PhRvE..75b1920B. doi:10.1103/PhysRevE.75.021920. PMID 17358380. S2CID 17574854.
- ^ Tan ZJ, Chen SJ (July 2008). "Salt dependence of nucleic acid hairpin stability". Biophysical Journal. 95 (2): 738–52. Bibcode:2008BpJ....95..738T. doi:10.1529/biophysj.108.131524. PMC 2440479. PMID 18424500.
- ^ Vater A, Klussmann S (January 2015). "Turning mirror-image oligonucleotides into drugs: the evolution of Spiegelmer(®) therapeutics". Drug Discovery Today. 20 (1): 147–55. doi:10.1016/j.drudis.2014.09.004. PMID 25236655.
- ^ Jankowski JA, Polak JM (1996). Clinical gene analysis and manipulation: Tools, techniques and troubleshooting. Cambridge University Press. p. 14. ISBN 978-0-521-47896-0. OCLC 33838261.
- ^ Yu Q, Morrow CD (May 2001). "Identification of critical elements in the tRNA acceptor stem and T(Psi)C loop necessary for human immunodeficiency virus type 1 infectivity". Journal of Virology. 75 (10): 4902–6. doi:10.1128/JVI.75.10.4902-4906.2001. PMC 114245. PMID 11312362.
- ^ Elliott MS, Trewyn RW (February 1984). "Inosine biosynthesis in transfer RNA by an enzymatic insertion of hypoxanthine". The Journal of Biological Chemistry. 259 (4): 2407–10. doi:10.1016/S0021-9258(17)43367-9. PMID 6365911.
- ^ Cantara WA, Crain PF, Rozenski J, McCloskey JA, Harris KA, Zhang X, Vendeix FA, Fabris D, Agris PF (January 2011). "The RNA Modification Database, RNAMDB: 2011 update". Nucleic Acids Research. 39 (Database issue): D195-201. doi:10.1093/nar/gkq1028. PMC 3013656. PMID 21071406.
- ^ Söll D, RajBhandary U (1995). TRNA: Structure, biosynthesis, and function. ASM Press. p. 165. ISBN 978-1-55581-073-3. OCLC 183036381.
- ^ Kiss T (July 2001). "Small nucleolar RNA-guided post-transcriptional modification of cellular RNAs". The EMBO Journal. 20 (14): 3617–22. doi:10.1093/emboj/20.14.3617. PMC 125535. PMID 11447102.
- ^ Tirumalai MR, Rivas M, Tran Q, Fox GE (November 2021). "The Peptidyl Transferase Center: a Window to the Past". Microbiol Mol Biol Rev. 85 (4) e00104-21: e0010421. Bibcode:2021MMBR...85...21T. doi:10.1128/MMBR.00104-21. PMC 8579967. PMID 34756086.
- ^ King TH, Liu B, McCully RR, Fournier MJ (February 2003). "Ribosome structure and activity are altered in cells lacking snoRNPs that form pseudouridines in the peptidyl transferase center". Molecular Cell. 11 (2): 425–35. doi:10.1016/S1097-2765(03)00040-6. PMID 12620230.
- ^ a b c Cooper GC, Hausman RE (2004). The Cell: A Molecular Approach (3rd ed.). Sinauer. pp. 261–76, 297, 339–44. ISBN 978-0-87893-214-6. OCLC 174924833.
- ^ Mattick JS, Gagen MJ (September 2001). "The evolution of controlled multitasked gene networks: the role of introns and other noncoding RNAs in the development of complex organisms". Molecular Biology and Evolution. 18 (9): 1611–30. doi:10.1093/oxfordjournals.molbev.a003951. PMID 11504843.
- ^ Mattick JS (November 2001). "Non-coding RNAs: the architects of eukaryotic complexity". EMBO Reports. 2 (11): 986–91. doi:10.1093/embo-reports/kve230. PMC 1084129. PMID 11713189.
- ^ Mattick JS (October 2003). "Challenging the dogma: the hidden layer of non-protein-coding RNAs in complex organisms" (PDF). BioEssays. 25 (10): 930–39. CiteSeerX 10.1.1.476.7561. doi:10.1002/bies.10332. PMID 14505360. Archived from the original (PDF) on 2009-03-06.
- ^ Mattick JS (October 2004). "The hidden genetic program of complex organisms". Scientific American. 291 (4): 60–67. Bibcode:2004SciAm.291d..60M. doi:10.1038/scientificamerican1004-60. PMID 15487671.[dead link]
- ^ a b c Wirta W (2006). Mining the transcriptome – methods and applications. Stockholm: School of Biotechnology, Royal Institute of Technology. ISBN 978-91-7178-436-0. OCLC 185406288.
- ^ Rossi JJ (July 2004). "Ribozyme diagnostics comes of age". Chemistry & Biology. 11 (7): 894–95. doi:10.1016/j.chembiol.2004.07.002. PMID 15271347.
- ^ Storz G (May 2002). "An expanding universe of noncoding RNAs". Science. 296 (5571): 1260–63. Bibcode:2002Sci...296.1260S. doi:10.1126/science.1072249. PMID 12016301. S2CID 35295924.
- ^ Fatica A, Bozzoni I (January 2014). "Long non-coding RNAs: new players in cell differentiation and development". Nature Reviews Genetics. 15 (1): 7–21. doi:10.1038/nrg3606. PMID 24296535. S2CID 12295847.[permanent dead link]
- ^ Chen Q, Yan M, Cao Z, Li X, Zhang Y, Shi J, et al. (January 2016). "Sperm tsRNAs contribute to intergenerational inheritance of an acquired metabolic disorder" (PDF). Science. 351 (6271): 397–400. Bibcode:2016Sci...351..397C. doi:10.1126/science.aad7977. PMID 26721680. S2CID 21738301.
- ^ Wei H, Zhou B, Zhang F, Tu Y, Hu Y, Zhang B, Zhai Q (2013). "Profiling and identification of small rDNA-derived RNAs and their potential biological functions". PLOS ONE. 8 (2) e56842. Bibcode:2013PLoSO...856842W. doi:10.1371/journal.pone.0056842. PMC 3572043. PMID 23418607.
- ^ Luehrsen KR, Nicholson DE, Eubanks DC, Fox GE (1981). "An archaebacterial 5S rRNA contains a long insertion sequence". Nature. 293 (5835): 755–756. Bibcode:1981Natur.293..755L. doi:10.1038/293755a0. PMID 6169998. S2CID 4341755.
- ^ Stan-Lotter H, McGenity TJ, Legat A, Denner EB, Glaser K, Stetter KO, Wanner G (1999). "Very similar strains of Halococcus salifodinae are found in geographically separated permo-triassic salt deposits". Microbiology. 145 (Pt 12): 3565–3574. doi:10.1099/00221287-145-12-3565. PMID 10627054.
- ^ Tirumalai MR, Kaelber JT, Park DR, Tran Q, Fox GE (August 2020). "Cryo-Electron Microscopy Visualization of a Large Insertion in the 5S ribosomal RNA of the Extremely Halophilic Archaeon Halococcus morrhuae". FEBS Open Bio. 10 (10): 1938–1946. Bibcode:2020FEBOB..10.1938T. doi:10.1002/2211-5463.12962. PMC 7530397. PMID 32865340.
- ^ Gueneau de Novoa P, Williams KP (January 2004). "The tmRNA website: reductive evolution of tmRNA in plastids and other endosymbionts". Nucleic Acids Research. 32 (Database issue): D104–08. doi:10.1093/nar/gkh102. PMC 308836. PMID 14681369.
- ^ Jacob F, Monod J (1961). "Genetic Regulatory Mechanisms in the Synthesis of Proteins". Journal of Molecular Biology. 3 (3): 318–56. doi:10.1016/s0022-2836(61)80072-7. PMID 13718526. S2CID 19804795.
- ^ a b c d Morris K, Mattick J (2014). "The rise of regulatory RNA". Nature Reviews Genetics. 15 (6): 423–37. doi:10.1038/nrg3722. PMC 4314111. PMID 24776770.
- ^ a b Gottesman S (2005). "Micros for microbes: non-coding regulatory RNAs in bacteria". Trends in Genetics. 21 (7): 399–404. doi:10.1016/j.tig.2005.05.008. PMID 15913835.
- ^ "The Nobel Prize in Physiology or Medicine 2006". Nobelprize.org. Nobel Media AB 2014. Web. 6 Aug 2018. http://www.nobelprize.org/nobel_prizes/medicine/laureates/2006
- ^ Fire, et al. (1998). "Potent and Specific Genetic Interference by double stranded RNA in Ceanorhabditis elegans". Nature. 391 (6669): 806–11. Bibcode:1998Natur.391..806F. doi:10.1038/35888. PMID 9486653. S2CID 4355692.
- ^ Pratt, Ashley J.; MacRae, Ian J. (2009-07-03). "The RNA-induced Silencing Complex: A Versatile Gene-silencing Machine *". Journal of Biological Chemistry. 284 (27): 17897–17901. doi:10.1074/jbc.R900012200. ISSN 0021-9258. PMC 2709356. PMID 19342379.
- ^ Zhao J, Sun BK, Erwin JA, Song JJ, Lee JT (2008). "Polycomb proteins targeted by a short repeat RNA to the mouse X chromosome". Science. 322 (5902): 750–56. Bibcode:2008Sci...322..750Z. doi:10.1126/science.1163045. PMC 2748911. PMID 18974356.
- ^ a b c d e Rinn JL, Chang HY (2012). "Genome regulation by long noncoding RNAs". Annu. Rev. Biochem. 81: 1–25. doi:10.1146/annurev-biochem-051410-092902. PMC 3858397. PMID 22663078.
- ^ Taft RJ, Kaplan CD, Simons C, Mattick JS (2009). "Evolution, biogenesis and function of promoter- associated RNAs". Cell Cycle. 8 (15): 2332–38. doi:10.4161/cc.8.15.9154. PMID 19597344.
- ^ Orom UA, Derrien T, Beringer M, Gumireddy K, Gardini A, et al. (2010). "'Long noncoding RNAs with enhancer-like function in human cells". Cell. 143 (1): 46–58. doi:10.1016/j.cell.2010.09.001. PMC 4108080. PMID 20887892.
- ^ EGH Wagner, P Romby. (2015). "Small RNAs in bacteria and archaea: who they are, what they do, and how they do it". Advances in genetics (Vol. 90, pp. 133–208).
- ^ J.W. Nelson, R.R. Breaker (2017) "The lost language of the RNA World."Sci. Signal.10, eaam8812 1–11.
- ^ Winklef WC (2005). "Riboswitches and the role of noncoding RNAs in bacterial metabolic control". Curr. Opin. Chem. Biol. 9 (6): 594–602. doi:10.1016/j.cbpa.2005.09.016. PMID 16226486.
- ^ Tucker BJ, Breaker RR (2005). "Riboswitches as versatile gene control elements". Curr. Opin. Struct. Biol. 15 (3): 342–48. doi:10.1016/j.sbi.2005.05.003. PMID 15919195.
- ^ Mojica FJ, Diez-Villasenor C, Soria E, Juez G (2000). "" "Biological significance of a family of regularly spaced repeats in the genomes of archaea, bacteria and mitochondria". Mol. Microbiol. 36 (1): 244–46. doi:10.1046/j.1365-2958.2000.01838.x. PMID 10760181. S2CID 22216574.
- ^ Brouns S, Jore MM, Lundgren M, Westra E, Slijkhuis R, Snijders A, Dickman M, Makarova K, Koonin E, Der Oost JV (2008). "Small CRISPR RNAs guide antiviral defense in prokaryotes". Science. 321 (5891): 960–64. Bibcode:2008Sci...321..960B. doi:10.1126/science.1159689. PMC 5898235. PMID 18703739.
- ^ Nudler E, Gottesman ME (August 2002). "Transcription termination and anti-termination in E. coli". Genes to Cells. 7 (8): 755–68. doi:10.1046/j.1365-2443.2002.00563.x. PMID 12167155. S2CID 23191624.
- ^ Hansen JL, Long AM, Schultz SC (August 1997). "Structure of the RNA-dependent RNA polymerase of poliovirus". Structure. 5 (8): 1109–22. doi:10.1016/S0969-2126(97)00261-X. PMID 9309225.
- ^ Ahlquist P (May 2002). "RNA-dependent RNA polymerases, viruses, and RNA silencing". Science. 296 (5571): 1270–73. Bibcode:2002Sci...296.1270A. doi:10.1126/science.1069132. PMID 12016304. S2CID 42526536.
- ^ Steitz TA, Steitz JA (July 1993). "A general two-metal-ion mechanism for catalytic RNA". Proceedings of the National Academy of Sciences of the United States of America. 90 (14): 6498–502. Bibcode:1993PNAS...90.6498S. doi:10.1073/pnas.90.14.6498. PMC 46959. PMID 8341661.
- ^ Xie J, Zhang M, Zhou T, Hua X, Tang L, Wu W (January 2007). "Sno/scaRNAbase: a curated database for small nucleolar RNAs and cajal body-specific RNAs". Nucleic Acids Research. 35 (Database issue): D183–87. doi:10.1093/nar/gkl873. PMC 1669756. PMID 17099227.
- ^ Omer AD, Ziesche S, Decatur WA, Fournier MJ, Dennis PP (May 2003). "RNA-modifying machines in archaea". Molecular Microbiology. 48 (3): 617–29. doi:10.1046/j.1365-2958.2003.03483.x. PMID 12694609. S2CID 20326977.
- ^ Cavaillé J, Nicoloso M, Bachellerie JP (October 1996). "Targeted ribose methylation of RNA in vivo directed by tailored antisense RNA guides". Nature. 383 (6602): 732–35. Bibcode:1996Natur.383..732C. doi:10.1038/383732a0. PMID 8878486. S2CID 4334683.
- ^ Kiss-László Z, Henry Y, Bachellerie JP, Caizergues-Ferrer M, Kiss T (June 1996). "Site-specific ribose methylation of preribosomal RNA: a novel function for small nucleolar RNAs". Cell. 85 (7): 1077–88. doi:10.1016/S0092-8674(00)81308-2. PMID 8674114. S2CID 10418885.
- ^ Daròs JA, Elena SF, Flores R (June 2006). "Viroids: an Ariadne's thread into the RNA labyrinth". EMBO Reports. 7 (6): 593–98. doi:10.1038/sj.embor.7400706. PMC 1479586. PMID 16741503.
- ^ Kalendar R, Vicient CM, Peleg O, Anamthawat-Jonsson K, Bolshoy A, Schulman AH (March 2004). "Large retrotransposon derivatives: abundant, conserved but nonautonomous retroelements of barley and related genomes". Genetics. 166 (3): 1437–50. doi:10.1534/genetics.166.3.1437. PMC 1470764. PMID 15082561.
- ^ Podlevsky JD, Bley CJ, Omana RV, Qi X, Chen JJ (January 2008). "The telomerase database". Nucleic Acids Research. 36 (Database issue): D339–43. doi:10.1093/nar/gkm700. PMC 2238860. PMID 18073191.
- ^ Blevins T, Rajeswaran R, Shivaprasad PV, Beknazariants D, Si-Ammour A, Park HS, Vazquez F, Robertson D, Meins F, Hohn T, Pooggin MM (2006). "Four plant Dicers mediate viral small RNA biogenesis and DNA virus induced silencing". Nucleic Acids Research. 34 (21): 6233–46. doi:10.1093/nar/gkl886. PMC 1669714. PMID 17090584.
- ^ Jana S, Chakraborty C, Nandi S, Deb JK (November 2004). "RNA interference: potential therapeutic targets". Applied Microbiology and Biotechnology. 65 (6): 649–57. doi:10.1007/s00253-004-1732-1. PMID 15372214. S2CID 20963666.
- ^ Virol, J (May 2006). "Double-Stranded RNA Is Produced by Positive-Strand RNA Viruses and DNA Viruses but Not in Detectable Amounts by Negative-Strand RNA Viruses". Journal of Virology. 80 (10): 5059–5064. doi:10.1128/JVI.80.10.5059-5064.2006. PMC 1472073. PMID 16641297.
- ^ Schultz U, Kaspers B, Staeheli P (May 2004). "The interferon system of non-mammalian vertebrates". Developmental and Comparative Immunology. 28 (5): 499–508. doi:10.1016/j.dci.2003.09.009. PMID 15062646.
- ^ Whitehead KA, Dahlman JE, Langer RS, Anderson DG (2011). "Silencing or stimulation? siRNA delivery and the immune system". Annual Review of Chemical and Biomolecular Engineering. 2: 77–96. doi:10.1146/annurev-chembioeng-061010-114133. PMID 22432611.
- ^ Hsu MT, Coca-Prados M (July 1979). "Electron microscopic evidence for the circular form of RNA in the cytoplasm of eukaryotic cells". Nature. 280 (5720): 339–40. Bibcode:1979Natur.280..339H. doi:10.1038/280339a0. PMID 460409. S2CID 19968869.
- ^ Dahm R (February 2005). "Friedrich Miescher and the discovery of DNA". Developmental Biology. 278 (2): 274–88. doi:10.1016/j.ydbio.2004.11.028. PMID 15680349.
- ^ Caspersson T, Schultz J (1939). "Pentose nucleotides in the cytoplasm of growing tissues". Nature. 143 (3623): 602–03. Bibcode:1939Natur.143..602C. doi:10.1038/143602c0. S2CID 4140563.
- ^ Ochoa S (1959). "Enzymatic synthesis of ribonucleic acid" (PDF). Nobel Lecture.
- ^ Rich A, Davies D (1956). "A New Two-Stranded Helical Structure: Polyadenylic Acid and Polyuridylic Acid". Journal of the American Chemical Society. 78 (14): 3548–49. Bibcode:1956JAChS..78R3548R. doi:10.1021/ja01595a086.
- ^ Holley RW, et al. (March 1965). "Structure of a ribonucleic acid". Science. 147 (3664): 1462–65. Bibcode:1965Sci...147.1462H. doi:10.1126/science.147.3664.1462. PMID 14263761. S2CID 40989800.
- ^ Fiers W, et al. (April 1976). "Complete nucleotide sequence of bacteriophage MS2 RNA: primary and secondary structure of the replicase gene". Nature. 260 (5551): 500–07. Bibcode:1976Natur.260..500F. doi:10.1038/260500a0. PMID 1264203. S2CID 4289674.
- ^ Napoli C, Lemieux C, Jorgensen R (April 1990). "Introduction of a Chimeric Chalcone Synthase Gene into Petunia Results in Reversible Co-Suppression of Homologous Genes in trans". The Plant Cell. 2 (4): 279–89. doi:10.1105/tpc.2.4.279. PMC 159885. PMID 12354959.
- ^ Dafny-Yelin M, Chung SM, Frankman EL, Tzfira T (December 2007). "pSAT RNA interference vectors: a modular series for multiple gene down-regulation in plants". Plant Physiology. 145 (4): 1272–81. doi:10.1104/pp.107.106062. PMC 2151715. PMID 17766396.
- ^ Ruvkun G (October 2001). "Molecular biology. Glimpses of a tiny RNA world". Science. 294 (5543): 797–99. doi:10.1126/science.1066315. PMID 11679654. S2CID 83506718.
- ^ Fichou Y, Férec C (December 2006). "The potential of oligonucleotides for therapeutic applications". Trends in Biotechnology. 24 (12): 563–70. doi:10.1016/j.tibtech.2006.10.003. PMID 17045686.
- ^ "The Nobel Prize in Physiology or Medicine 2023". NobelPrize.org. Retrieved 2023-10-03.
- ^ "Hungarian and US scientists win Nobel for COVID-19 vaccine discoveries". Reuters. 2023-10-02. Retrieved 2023-10-03.
- ^ "The Nobel Prize in Physiology or Medicine 2023". NobelPrize.org. Retrieved 2023-10-03.
- ^ Siebert S (2006). "Common sequence structure properties and stable regions in RNA secondary structures" (PDF). Dissertation, Albert-Ludwigs-Universität, Freiburg im Breisgau. p. 1. Archived from the original (PDF) on March 9, 2012.
- ^ Szathmáry E (June 1999). "The origin of the genetic code: amino acids as cofactors in an RNA world". Trends in Genetics. 15 (6): 223–29. doi:10.1016/S0168-9525(99)01730-8. PMID 10354582.
- ^ Jerome, Craig A.; et al. (19 May 2022). "Catalytic Synthesis of Polyribonucleic Acid on Prebiotic Rock Glasses". Astrobiology. 22 (6): 629–636. Bibcode:2022AsBio..22..629J. doi:10.1089/ast.2022.0027. PMC 9233534. PMID 35588195. S2CID 248917871.
- ^ Foundation for Applied Molecular Evolution (3 June 2022). "Scientists announce a breakthrough in determining life's origin on Earth—and maybe Mars". Phys.org. Retrieved 3 June 2022.
- ^ Marlaire R (3 March 2015). "NASA Ames Reproduces the Building Blocks of Life in Laboratory". NASA. Archived from the original on 5 March 2015. Retrieved 5 March 2015.
- ^ Starr, Michelle (8 July 2022). "Loads of Precursors For RNA Have Been Detected in The Center of Our Galaxy". ScienceAlert. Retrieved 9 July 2022.
- ^ Rivilla, Victor M.; et al. (8 July 2022). "Molecular Precursors of the RNA-World in Space: New Nitriles in the G+0.693–0.027 Molecular Cloud". Frontiers in Astronomy and Space Sciences. 9 876870. arXiv:2206.01053. Bibcode:2022FrASS...9.6870R. doi:10.3389/fspas.2022.876870.
- ^ Cech, Thomas R.; Steitz, Joan A. (March 2014). "The Noncoding RNA Revolution—Trashing Old Rules to Forge New Ones". Cell. 157 (1): 77–94. doi:10.1016/j.cell.2014.03.008. ISSN 0092-8674. PMID 24679528. S2CID 14852160.
- ^ Palacino, James; Swalley, Susanne E; Song, Cheng; Cheung, Atwood K; Shu, Lei; Zhang, Xiaolu; Van Hoosear, Mailin; Shin, Youngah; Chin, Donovan N; Keller, Caroline Gubser; Beibel, Martin; Renaud, Nicole A; Smith, Thomas M; Salcius, Michael; Shi, Xiaoying (2015-06-01). "SMN2 splice modulators enhance U1–pre-mRNA association and rescue SMA mice". Nature Chemical Biology. 11 (7): 511–517. doi:10.1038/nchembio.1837. ISSN 1552-4450. PMID 26030728.
- ^ Roy, Bijoyita; Friesen, Westley J.; Tomizawa, Yuki; Leszyk, John D.; Zhuo, Jin; Johnson, Briana; Dakka, Jumana; Trotta, Christopher R.; Xue, Xiaojiao; Mutyam, Venkateshwar; Keeling, Kim M.; Mobley, James A.; Rowe, Steven M.; Bedwell, David M.; Welch, Ellen M. (2016-10-04). "Ataluren stimulates ribosomal selection of near-cognate tRNAs to promote nonsense suppression". Proceedings of the National Academy of Sciences. 113 (44): 12508–12513. Bibcode:2016PNAS..11312508R. doi:10.1073/pnas.1605336113. ISSN 0027-8424. PMC 5098639. PMID 27702906.
- ^ a b Qadir, Muhammad Imran; Bukhat, Sherien; Rasul, Sumaira; Manzoor, Hamid; Manzoor, Majid (2019-09-03). "RNA therapeutics: Identification of novel targets leading to drug discovery". Journal of Cellular Biochemistry. 121 (2): 898–929. doi:10.1002/jcb.29364. ISSN 0730-2312. PMID 31478252. S2CID 201806158.
- ^ Balmayor, Elizabeth R.; Geiger, Johannes P.; Aneja, Manish K.; Berezhanskyy, Taras; Utzinger, Maximilian; Mykhaylyk, Olga; Rudolph, Carsten; Plank, Christian (May 2016). "Chemically modified RNA induces osteogenesis of stem cells and human tissue explants as well as accelerates bone healing in rats". Biomaterials. 87: 131–146. doi:10.1016/j.biomaterials.2016.02.018. ISSN 0142-9612. PMID 26923361.
- ^ Plews, Jordan R.; Li, JianLiang; Jones, Mark; Moore, Harry D.; Mason, Chris; Andrews, Peter W.; Na, Jie (2010-12-30). "Activation of Pluripotency Genes in Human Fibroblast Cells by a Novel mRNA Based Approach". PLOS ONE. 5 (12) e14397. Bibcode:2010PLoSO...514397P. doi:10.1371/journal.pone.0014397. ISSN 1932-6203. PMC 3012685. PMID 21209933.
- ^ Preskey, David; Allison, Thomas F.; Jones, Mark; Mamchaoui, Kamel; Unger, Christian (May 2016). "Synthetically modified mRNA for efficient and fast human iPS cell generation and direct transdifferentiation to myoblasts". Biochemical and Biophysical Research Communications. 473 (3): 743–751. Bibcode:2016BBRC..473..743P. doi:10.1016/j.bbrc.2015.09.102. ISSN 0006-291X. PMID 26449459.
- ^ Warren, Luigi; Manos, Philip D.; Ahfeldt, Tim; Loh, Yuin-Han; Li, Hu; Lau, Frank; Ebina, Wataru; Mandal, Pankaj K.; Smith, Zachary D.; Meissner, Alexander; Daley, George Q.; Brack, Andrew S.; Collins, James J.; Cowan, Chad; Schlaeger, Thorsten M. (November 2010). "Highly Efficient Reprogramming to Pluripotency and Directed Differentiation of Human Cells with Synthetic Modified mRNA". Cell Stem Cell. 7 (5): 618–630. doi:10.1016/j.stem.2010.08.012. ISSN 1934-5909. PMC 3656821. PMID 20888316.
- ^ Elangovan, Satheesh; Khorsand, Behnoush; Do, Anh-Vu; Hong, Liu; Dewerth, Alexander; Kormann, Michael; Ross, Ryan D.; Rick Sumner, D.; Allamargot, Chantal; Salem, Aliasger K. (November 2015). "Chemically modified RNA activated matrices enhance bone regeneration". Journal of Controlled Release. 218: 22–28. doi:10.1016/j.jconrel.2015.09.050. ISSN 0168-3659. PMC 4631704. PMID 26415855.
External links
[edit]- RNA World website Link collection (structures, sequences, tools, journals)
- Nucleic Acid Database Images of DNA, RNA, and complexes.
- Anna Marie Pyle's Seminar: RNA Structure, Function, and Recognition Archived 2018-06-21 at the Wayback Machine
Structure and Composition
Nucleotide Components
RNA is a linear polymer composed of repeating units known as ribonucleotides, which are linked together to form the nucleic acid chain. Each ribonucleotide consists of three primary components: a pentose sugar called ribose, a phosphate group, and one of four nitrogenous bases—adenine (A), guanine (G), cytosine (C), or uracil (U). The nitrogenous bases are heterocyclic aromatic compounds; adenine and guanine are purines with a fused double-ring structure, while cytosine and uracil are pyrimidines featuring a single six-membered ring. These bases attach to the ribose sugar via an N-glycosidic bond at the C1' position of the sugar and the N9 (for purines) or N1 (for pyrimidines) of the base.[4][5] The ribose sugar in RNA is specifically β-D-ribofuranose, existing in a five-membered furanose ring conformation with hydroxyl groups at the 2', 3', and 5' positions. This configuration arises from the furanose form of D-ribose, where the ring oxygen is between C1' and C4', and the β-anomer has the base attached above the plane of the ring. The distinctive 2'-hydroxyl (2'-OH) group on the ribose enhances RNA's chemical reactivity compared to deoxyribose in DNA, as it participates in nucleophilic attacks that facilitate processes like self-cleavage or enzymatic modifications. For instance, the chemical formula of adenosine monophosphate (AMP), the ribonucleotide containing adenine, is CHNOP, illustrating the integration of these components. Similar formulas apply to the other ribonucleotides: guanosine monophosphate (GMP, CHNOP), cytidine monophosphate (CMP, CHNOP), and uridine monophosphate (UMP, CHNOP).[6][7] The phosphodiester backbone of RNA forms through covalent bonds between the phosphate group of one ribonucleotide and the sugar of the adjacent one, creating a directional chain. Specifically, the 3'-OH of the ribose in one nucleotide reacts with the 5'-phosphate attached to the adjacent nucleotide, forming a phosphodiester linkage and resulting in a polymer with 5' to 3' polarity. This asymmetry imparts directionality to the RNA strand, with the 5' end typically bearing a phosphate or triphosphate group and the 3' end terminating in a hydroxyl. The 2'-OH group contributes to RNA's relative instability, rendering it susceptible to hydrolysis under basic conditions; the hydroxyl acts as a nucleophile to attack the adjacent phosphodiester bond, forming a 2',3'-cyclic phosphate intermediate that leads to chain cleavage. This reactivity contrasts with the stability of DNA and underscores RNA's transient role in cellular processes.[8][9]Differences from DNA
One key chemical difference between RNA and DNA lies in the substitution of uracil (U) for thymine (T) as one of the nitrogenous bases in RNA.[2] Uracil pairs with adenine (A) through two hydrogen bonds, similar to the A-T pairing in DNA, whereas guanine (G) pairs with cytosine (C) via three hydrogen bonds in both molecules, contributing to the overall stability of base pairing.[10] DNA uses thymine instead of uracil to allow detection and repair of cytosine deamination, which produces uracil; if DNA used uracil, such damage would be indistinguishable from normal bases. In RNA, uracil is used as it is energetically cheaper to synthesize (lacking the methyl group) and RNA's short lifespan reduces the impact of mutations.[11] Another fundamental distinction is the sugar component: RNA incorporates ribose, which has a hydroxyl group (-OH) at the 2' position of the sugar ring, whereas DNA uses deoxyribose lacking this group.[12] The 2'-OH group in RNA enhances the conformational flexibility of the single-stranded molecule, allowing it to adopt diverse shapes more readily than the more rigid DNA backbone.[13] However, this same group renders RNA more vulnerable to enzymatic degradation and chemical hydrolysis, as it can participate in nucleophilic attacks on the phosphodiester backbone, briefly tying back to the nucleotide's inherent reactivity.[14] RNA is typically single-stranded, in contrast to DNA's double-helical structure, which impacts their respective stability and compactness.[2] The absence of a complementary strand in RNA reduces base-pairing protection, making it less stable and more prone to unfolding or degradation, while DNA's double helix provides greater compactness and resistance to environmental damage for long-term genetic storage.[15] This single-stranded nature also facilitates RNA's role as a transient intermediary rather than a permanent archive. RNA molecules are generally much shorter in length than DNA, with most ranging from hundreds to thousands of nucleotides, compared to DNA's genome-spanning millions.[2] Consequently, RNA exhibits a higher turnover rate, with half-lives often ranging from minutes to hours depending on the type and cellular conditions, enabling rapid regulation of gene expression, whereas DNA persists stably across cell generations for archival purposes.[16] Physicochemically, RNA demonstrates higher solubility in water than DNA, attributable to the polar 2'-OH group increasing hydrophilicity.[17] This feature, combined with the single-stranded form, promotes RNA's tendency to form complex intramolecular folds through base pairing and stacking interactions, unlike the more uniform double helix of DNA.[17] Additionally, RNA's susceptibility to degradation by ubiquitous ribonucleases (RNases) far exceeds DNA's vulnerability to deoxyribonucleases (DNases), underscoring its ephemeral nature in cellular environments.[18]Folding and Higher-Order Structures
The primary structure of RNA, defined by its linear sequence of ribonucleotides, encodes the intrinsic potential for folding into functional three-dimensional conformations through specific base interactions. This sequence dictates the locations and types of complementary bases available for pairing, influencing the stability and architecture of higher-order structures. Seminal studies have established that variations in primary sequence can profoundly alter folding pathways and final structures, underscoring the sequence as the foundational determinant of RNA's conformational landscape.[19] RNA secondary structures emerge from intramolecular base pairing along the primary chain, forming double-stranded helices interspersed with single-stranded regions. Canonical Watson-Crick base pairs (A-U and G-C) provide the core stability via two or three hydrogen bonds, respectively, while non-canonical interactions, such as the G-U wobble pair, introduce flexibility and are nearly as thermodynamically stable as Watson-Crick pairs due to their isomorphic geometry and comparable hydrogen bonding. These pairings create characteristic motifs, including stem-loops (where a helical stem terminates in a loop), hairpins (short stems closed by loops of 3-7 nucleotides), bulges (unpaired bases on one side of a helix), and internal loops (unpaired regions on both sides). Such elements allow RNA to adopt compact, hierarchical architectures that serve as scaffolds for tertiary folding.00112-X)[20] Tertiary structures result from the spatial organization of multiple secondary elements through long-range contacts, yielding compact globular forms essential for RNA function. Key motifs include pseudoknots, in which a single-stranded loop from one stem pairs with a distant sequence to interlock helices; coaxial helices, where adjacent helical segments stack continuously without interruption; and kissing loops, involving reciprocal base pairing between the loops of two hairpins. These interactions enable complex topologies, as exemplified by the L-shaped tertiary fold in tRNA, which positions its acceptor and anticodon arms orthogonally, or the precisely organized active sites in ribozymes that catalyze phosphodiester bond formation.[21][22] The driving forces behind RNA folding encompass a balance of enthalpic and entropic contributions from non-covalent interactions. Hydrogen bonding between base edges stabilizes paired regions, while base stacking—arising from van der Waals and hydrophobic forces between adjacent aromatic bases—provides the dominant stabilization in helical segments, contributing up to 50% of the free energy in double-stranded RNA. Electrostatic repulsion from the negatively charged phosphate backbone is mitigated by divalent cations like Mg²⁺ ions, which bind specifically to facilitate tertiary contacts and neutralize charges, often increasing folding efficiency by orders of magnitude in physiological conditions.00112-X)[23] Predicting RNA folding computationally relies on thermodynamic models that approximate free energy minimization. Algorithms like mfold, introduced in seminal work using dynamic programming to enumerate suboptimal structures, and the ViennaRNA package, which implements partition function calculations for ensemble predictions, model secondary structures by scoring base pairs based on nearest-neighbor parameters derived from melting experiments. These tools excel for sequences up to several hundred nucleotides but face limitations from kinetic traps—local energy minima that trap RNA in metastable states during folding, as opposed to the global minimum—leading to discrepancies between predicted and native structures in vivo. Advanced extensions incorporate barrier-crossing heuristics to address these kinetic effects.[24][25]Chemical Modifications
RNA molecules undergo a diverse array of post-transcriptional chemical modifications that alter their structure and function, building upon the core nucleotide components of adenine, guanine, cytosine, uracil, and ribose. Over 170 distinct types of these modifications have been identified as of 2025, with the majority occurring in eukaryotic organisms and prominently in ribosomal RNA (rRNA), transfer RNA (tRNA), and messenger RNA (mRNA).30638-4) These modifications include base methylations, such as N6-methyladenosine (m6A) on adenine, pseudouridylation where uridine is isomerized to pseudouridine (Ψ), and ribose 2'-O-methylation on the sugar backbone.[26] Among these, m6A stands out as the most abundant internal modification in eukaryotic mRNA, often found in the consensus sequence DRACH (where D = A/G/U, R = A/G, H = A/C/U).[27] The installation, removal, and interpretation of these modifications are mediated by enzymatic complexes known as writers, erasers, and readers, respectively. For m6A, the primary writer is the METTL3-METTL14-WTAP methyltransferase complex, which catalyzes the addition of a methyl group to the N6 position of adenosine.[28] Erasers, such as the demethylase FTO, reverse this modification by oxidative demethylation, thereby dynamically regulating m6A levels.[28] Readers, including YTH-domain-containing proteins like YTHDF2, recognize and bind to modified sites to influence downstream RNA processes, such as directing m6A-marked transcripts to decay pathways.[28] Similar machinery exists for other modifications; for instance, pseudouridine synthases (e.g., PUS enzymes) act as writers for Ψ without requiring erasers, while fibrillarin catalyzes site-specific 2'-O-methylations in rRNA.[26] These chemical alterations profoundly impact RNA biology by enhancing stability against nuclease degradation, modulating base-pairing interactions, and fine-tuning processes like splicing and translation efficiency. For example, m6A promotes mRNA decay through YTHDF2-mediated recruitment to P-bodies, thereby reducing protein output, while 2'-O-methylation stabilizes RNA structures and improves translational fidelity in rRNA.[29] Pseudouridylation enhances RNA flexibility and stability, facilitating proper tRNA anticodon recognition during translation and influencing splice site selection in pre-mRNA.[30] Such modifications collectively regulate gene expression at multiple levels, with disruptions linked to diseases including cancer and neurological disorders.[26] RNA modifications exhibit evolutionary conservation, particularly in essential RNAs like tRNA and rRNA, where core sites such as m6A in stem cell transcripts are preserved from yeast to humans, underscoring their fundamental roles in cellular homeostasis.00451-2) Detection of these modifications has advanced through epitranscriptomics, employing techniques like mass spectrometry for quantitative profiling of abundant RNAs and sequencing-based methods, such as m6A-seq or Pseudo-seq, which use antibody pulldowns or chemical labeling to map modification sites genome-wide with single-nucleotide resolution.[31] These approaches have revealed dynamic, context-dependent modification patterns that respond to cellular stresses and developmental cues.00147-7)Synthesis and Processing
Transcription Mechanism
Transcription is the enzymatic process by which RNA is synthesized from a DNA template, involving the polymerization of ribonucleotides in the 5' to 3' direction to produce a complementary RNA strand.[32] This DNA-directed synthesis uses the DNA as a template, where the enzyme reads one strand (the template strand) and assembles RNA using nucleoside triphosphates (NTPs) that match the complementary bases.[33] In prokaryotes, a single RNA polymerase enzyme, composed of core subunits and a sigma factor for promoter recognition, catalyzes the transcription of all RNA types.[33] In eukaryotes, three distinct nuclear RNA polymerases perform specialized roles: RNA polymerase I (Pol I) transcribes most ribosomal RNAs (rRNAs), RNA polymerase II (Pol II) synthesizes messenger RNAs (mRNAs) and small nuclear RNAs (snRNAs), and RNA polymerase III (Pol III) produces transfer RNAs (tRNAs) and 5S rRNA.[32] The transcription process occurs in three main stages: initiation, elongation, and termination. Initiation begins with the binding of RNA polymerase to promoter elements on the DNA. In prokaryotes, the core promoter includes the -35 box (TTGACA consensus) and -10 box (TATAAT consensus), recognized by the sigma factor to unwind DNA and form the open complex.[33] In eukaryotes, Pol II initiation involves the TATA box (TATAAA consensus, located 25-35 bases upstream of the start site), bound by the TATA-binding protein (TBP) as part of the transcription factor IID (TFIID) complex, which recruits additional factors and the polymerase.[32] Eukaryotic promoters may also include enhancers, distal regulatory sequences that boost transcription rates.[33] During elongation, the RNA polymerase moves along the DNA template, incorporating NTPs (ATP, GTP, CTP, UTP) complementary to the template bases, extending the RNA chain in the 5' to 3' direction at rates of about 20-50 nucleotides per second in prokaryotes and 22-25 nucleotides per second for Pol II in eukaryotes.[32] The energy for this polymerization comes from the hydrolysis of the high-energy phosphoanhydride bonds in NTPs, releasing pyrophosphate (PPi) and driving the irreversible addition of each nucleotide.[33] Fidelity is maintained through base-pairing selectivity and proofreading mechanisms; the initial misincorporation error rate is approximately 1 in 10^4 nucleotides, improved by intrinsic cleavage activity in some polymerases, such as Pol III, which removes mismatched 3' termini via hydrolytic proofreading, enhancing accuracy by up to 10^3-fold. Overall transcription error rates reach about 10^{-5} per nucleotide in bacteria like E. coli.[34] Termination signals the end of RNA synthesis and release of the transcript. In prokaryotes, intrinsic termination involves the formation of a GC-rich hairpin loop in the RNA followed by a run of uracils, causing polymerase pausing and dissociation, while rho-dependent termination uses the Rho helicase protein to unwind the RNA-DNA hybrid.[33] Eukaryotic termination for Pol II occurs downstream of the polyadenylation signal, involving cleavage and polymerase release, though mechanisms vary by polymerase type.[32]Post-Transcriptional Processing
Post-transcriptional processing encompasses a series of modifications that transform the primary RNA transcript, known as pre-mRNA in eukaryotes, into mature, functional RNA molecules. These steps occur in the nucleus and are crucial for RNA stability, export to the cytoplasm, and proper translation. In prokaryotes, processing is minimal due to coupled transcription and translation, whereas eukaryotic processing is more elaborate to accommodate larger genomes and regulatory complexity. One of the initial modifications is 5' capping, which involves the addition of a 7-methylguanosine cap to the 5' end of the nascent pre-mRNA shortly after transcription initiation. This cap is covalently linked via a 5'-5' triphosphate bridge by the enzyme guanylyltransferase, followed by methylation. The cap protects the RNA from 5' exonucleases, facilitates nuclear export through interactions with export factors like NXF1, and enhances translation initiation by recruiting the eukaryotic initiation factor eIF4E. At the 3' end, polyadenylation occurs after cleavage of the pre-mRNA at a specific site defined by the AAUAAA signal, followed by the addition of a poly-A tail consisting of 200-250 adenine residues in eukaryotes. This process is catalyzed by a multiprotein complex including cleavage and polyadenylation specificity factor (CPSF) and poly-A polymerase (PAP). The poly-A tail increases mRNA stability by preventing degradation from 3' exonucleases and promotes export and translation efficiency via binding to poly-A binding proteins (PABPs). Splicing removes non-coding introns and joins coding exons to form mature mRNA, a process mediated by the spliceosome in eukaryotes, which assembles from small nuclear ribonucleoproteins (snRNPs) U1 through U6. The spliceosome recognizes conserved splice sites (GU at the 5' end and AG at the 3' end of introns) and catalyzes two transesterification reactions to excise introns. Some introns, such as group I and II, can self-splice without proteins, relying on RNA catalysis. Alternative splicing, where different exon combinations are selected, generates multiple protein isoforms from a single gene, expanding proteomic diversity. RNA editing introduces base changes post-transcriptionally, with adenosine-to-inosine (A-to-I) editing being prevalent in eukaryotes, performed by ADAR enzymes that deaminate adenosine to inosine, which is read as guanosine during translation. This can alter codons, potentially changing amino acids or creating stop codons, thus modulating protein function and diversity. For instance, editing in glutamate receptor transcripts affects calcium permeability in neurons. Quality control mechanisms ensure only properly processed RNAs proceed, with nonsense-mediated decay (NMD) targeting transcripts containing premature termination codons for degradation. NMD involves recognition by factors like UPF1, UPF2, and UPF3 during the pioneer round of translation, preventing accumulation of truncated proteins. This pathway degrades about 5-30% of human transcripts, highlighting its role in regulating gene expression. Secondary structures in the RNA can influence transcription termination signals, thereby affecting the substrate for these processing events.Major Types and Functions
Protein-Coding RNAs
Protein-coding RNAs, primarily messenger RNAs (mRNAs), serve as the intermediary molecules that convey genetic information from DNA to ribosomes for protein synthesis, embodying a core aspect of the central dogma of molecular biology. In eukaryotic cells, mRNAs are typically monocistronic, encoding a single protein from one open reading frame, whereas prokaryotic mRNAs are often polycistronic, allowing multiple proteins to be translated from a single transcript organized into operons. The structure of mRNA includes a 5' untranslated region (UTR) that regulates translation initiation, a central coding sequence composed of nucleotide triplets known as codons that specify amino acid sequences, and a 3' UTR that influences mRNA stability, localization, and translation efficiency, often ending with a poly-A tail in eukaryotes.[35] The translation of mRNA into proteins occurs in three main stages: initiation, elongation, and termination. During initiation in eukaryotes, the small ribosomal subunit binds to the 5' cap of the mRNA with assistance from eukaryotic initiation factors (eIFs), scanning to the start codon (AUG) recognized via the Kozak consensus sequence for efficient assembly of the full ribosome.[36] Elongation follows as transfer RNAs (tRNAs) match their anticodons to successive mRNA codons in the ribosome's A site, facilitating peptide bond formation and translocation along the mRNA. Termination is triggered by stop codons (UAA, UAG, UGA) in the A site, prompting release factors to disassemble the ribosome and liberate the nascent polypeptide. Following transcription in the nucleus, mature eukaryotic mRNAs are exported to the cytoplasm through nuclear pore complexes, where they localize to specific cellular compartments for targeted translation, such as dendrites in neurons.[37] mRNA stability is tightly regulated, with degradation initiated by deadenylation (shortening of the poly-A tail) followed by decapping and exonucleolytic digestion, ensuring rapid turnover of transcripts in response to cellular needs.[38] In terms of abundance, mRNAs constitute approximately 1-5% of total cellular RNA in eukaryotes, with half-lives ranging from minutes for short-lived transcripts like the proto-oncogene c-fos to several hours for more stable ones, reflecting their role in dynamic gene expression control.[2][39] Evolutionarily, protein-coding RNAs are thought to trace back to an ancient RNA world, where self-replicating RNA molecules encoded rudimentary peptides via a primitive genetic code, laying the foundation for modern translation systems. Prior to translation, mRNA precursors undergo processing steps like capping, splicing, and polyadenylation to generate functional transcripts.Regulatory Non-Coding RNAs
Regulatory non-coding RNAs (ncRNAs) are a diverse class of RNA molecules that do not encode proteins but play crucial roles in modulating gene expression at transcriptional, post-transcriptional, and epigenetic levels. These RNAs, ranging from short 20-30 nucleotide species to long transcripts exceeding 200 nucleotides, interact with DNA, RNA, or proteins to fine-tune cellular processes such as development, differentiation, and response to stress. Unlike protein-coding RNAs, their primary function lies in regulation rather than translation, enabling precise control over genome activity without altering the genetic code directly.[40] MicroRNAs (miRNAs) are small endogenous ncRNAs approximately 21-25 nucleotides in length that primarily repress gene expression post-transcriptionally. They are initially transcribed as primary miRNAs (pri-miRNAs) with stem-loop structures, which are processed in the nucleus by the microprocessor complex containing Drosha and DGCR8 to form precursor miRNAs (pre-miRNAs). These precursors are then exported to the cytoplasm and cleaved by Dicer into mature miRNAs, which are loaded into the RNA-induced silencing complex (RISC) containing Argonaute proteins.[40] Within RISC, miRNAs typically bind to the 3' untranslated regions (UTRs) of target mRNAs through partial base-pairing, leading to translational repression or mRNA destabilization and decay. This mechanism allows a single miRNA to regulate hundreds of targets, influencing processes like cell proliferation and apoptosis; for instance, the founding miRNA lin-4 was discovered in C. elegans where it negatively regulates LIN-14 protein levels during development.[41] Small interfering RNAs (siRNAs) are structurally similar to miRNAs, also 20-25 nucleotides long and processed by Dicer, but they arise primarily from exogenous double-stranded RNA (dsRNA) precursors and mediate sequence-specific RNA interference (RNAi) for gene silencing. Unlike miRNAs, siRNAs often exhibit perfect complementarity to their targets, triggering direct cleavage by Argonaute-2 in the RISC complex rather than translational repression. This pathway was first demonstrated in C. elegans, where injection of dsRNA corresponding to specific genes led to potent and heritable silencing, far more effective than single-stranded RNA. siRNAs play key roles in antiviral defense and transposon suppression, with applications in experimental gene knockdown across eukaryotes.[42] Long non-coding RNAs (lncRNAs), defined as transcripts longer than 200 nucleotides, exhibit diverse regulatory functions including chromatin modification, transcriptional interference, and post-transcriptional modulation. Many lncRNAs act as scaffolds for protein complexes, recruiting histone modifiers like Polycomb repressive complex 2 (PRC2) to specific genomic loci. A prominent example is Xist, a 17-19 kb lncRNA essential for X-chromosome inactivation in female mammals, where it coats the inactive X chromosome in cis, leading to epigenetic silencing through recruitment of silencing factors and chromatin compaction.[43] Another well-studied lncRNA, HOTAIR (HOX transcript antisense RNA), is a 2.2 kb transcript from the HOXC locus that represses HOXD genes in trans by interacting with PRC2 and LSD1 to promote H3K27 methylation and H3K4 demethylation, respectively, thereby establishing repressive chromatin domains. lncRNAs like these are implicated in developmental patterning and cancer progression when dysregulated.[44] Enhancer RNAs (eRNAs) are short, often bidirectional ncRNAs transcribed from enhancer regions, typically 50-2000 nucleotides long, that facilitate enhancer-promoter interactions to activate transcription. eRNAs promote gene expression by stabilizing chromatin loops, recruiting Mediator and cohesin complexes, or interacting with transcription factors like YY1 to enhance RNA polymerase II activity at target promoters. Their discovery stemmed from genome-wide mapping of nascent transcripts, revealing pervasive enhancer transcription in active cell states. For example, eRNAs from the β-globin locus enhancers loop to interact with the promoter, boosting hemoglobin expression during erythropoiesis. eRNA levels correlate with enhancer activity, providing a dynamic readout of regulatory potential.[45] Piwi-interacting RNAs (piRNAs), 24-31 nucleotides long, form complexes with Piwi proteins to silence transposons primarily in germline cells, protecting genome integrity from mutagenic insertions. Unlike miRNAs and siRNAs, piRNAs are generated from long single-stranded precursors via a Dicer-independent pathway involving Zucchini endonuclease, and they exhibit a bias for uridine at the 5' end. In animals, piRNAs guide Piwi to transposon loci, inducing heterochromatin formation through H3K9 methylation or transcriptional repression. The ping-pong amplification cycle, where primary piRNAs direct cleavage of sense transcripts to produce secondary piRNAs, amplifies the response. Seminal studies in mice identified piRNAs bound to MIWI and MILI, clustered in germline-specific loci, underscoring their role in fertility and transposon control.[46]Structural and Catalytic RNAs
Structural and catalytic RNAs encompass a diverse class of non-coding RNAs that provide essential scaffold and enzymatic functions within the cell, most prominently in the translation machinery and RNA processing pathways. These molecules, including ribosomal RNAs (rRNAs) and transfer RNAs (tRNAs), form the core structural components of the ribosome, facilitating protein synthesis, while ribozymes demonstrate RNA's capacity for catalysis independent of proteins. Unlike protein-coding or regulatory RNAs, structural and catalytic RNAs primarily enable constitutive cellular processes through their architectural and reactive properties.[47] Ribosomal RNA (rRNA) constitutes the majority of the ribosome's mass and serves as its structural and functional backbone. In prokaryotes, the small ribosomal subunit contains 16S rRNA, while the large subunit includes 23S and 5S rRNAs; in eukaryotes, these correspond to 18S rRNA in the small subunit and 28S, 5.8S, and 5S rRNAs in the large subunit.[48] These rRNAs fold into complex three-dimensional structures that position ribosomal proteins and create functional sites for translation. A highly conserved core region within the 23S/28S rRNA forms the peptidyl transferase center (PTC), the site responsible for catalyzing peptide bond formation during protein synthesis.[49] Transfer RNA (tRNA) molecules act as adapters in translation, linking amino acids to their corresponding codons on messenger RNA through specific structural features. The canonical tRNA secondary structure adopts a cloverleaf conformation, characterized by an acceptor stem, D-arm, anticodon arm, and T-arm, which folds into an L-shaped tertiary structure.[50] The anticodon loop, located at one end of the L-shape, contains a three-nucleotide anticodon sequence that base-pairs with mRNA codons to ensure accurate amino acid selection. At the opposite end, the 3' CCA terminus serves as the attachment site for the cognate amino acid, a process catalyzed by aminoacyl-tRNA synthetases that recognize specific tRNA identity elements to achieve high-fidelity charging.[51] Ribozymes represent RNA molecules with intrinsic catalytic activity, exemplified by self-splicing introns and RNase P. Group I self-splicing introns excise themselves from precursor RNAs using a guanosine nucleotide or its derivatives as a cofactor, initiating transesterification reactions that join the flanking exons without protein assistance.[52] In contrast, group II introns undergo self-splicing via two transesterification steps, resulting in a lariat intermediate where the intron's 5' end branches to a bulged adenosine, mirroring the mechanism of spliceosomal introns.[53] RNase P, a ribonucleoprotein complex, processes the 5' leader sequence of precursor tRNAs to generate mature tRNAs; its RNA subunit alone exhibits catalytic activity in vitro, cleaving pre-tRNA substrates in the presence of monovalent and divalent cations.[54] The ribosome itself functions as a ribozyme, with its peptidyl transferase activity residing entirely within the rRNA component of the large subunit. Biochemical and structural studies have shown that the PTC, composed of rRNA nucleotides without direct involvement of ribosomal proteins, catalyzes the nucleophilic attack of the aminoacyl-tRNA's alpha-amino group on the peptidyl-tRNA's ester linkage to form a peptide bond.[55] This RNA-based catalysis underscores the ancient evolutionary origins of the ribosome, predating protein synthesis machinery. Post-transcriptional modifications enhance the stability and functionality of structural RNAs like tRNAs and rRNAs. In tRNAs, hypermodifications such as wybutosine at position 37 (adjacent to the anticodon) stabilize the codon-anticodon interaction through base-stacking, preventing frameshifting and promoting efficient translation fidelity.[56] Similar modifications in rRNAs, including pseudouridylation and 2'-O-methylation in the PTC and decoding regions, fine-tune ribosomal structure and catalytic efficiency.[49]Genetic and Evolutionary Roles
RNA Genomes and Viruses
RNA serves as the hereditary material in numerous viruses, where it functions as the genome rather than DNA, enabling rapid replication and evolution in host cells. This contrasts with the DNA-based genomes of cellular organisms and most organelles, highlighting RNA's versatility in genetic systems. The Baltimore classification system categorizes viruses into seven groups based on their genome type (DNA or RNA, single- or double-stranded, sense or antisense) and replication strategy, with groups III through VI encompassing those with RNA genomes. Developed in 1971, this framework underscores how RNA viruses exploit host machinery while relying on virus-encoded polymerases for genome replication, often leading to high genetic diversity.[57] Positive-sense single-stranded RNA (+ssRNA) viruses, classified in Baltimore group IV, possess genomes that directly serve as messenger RNA (mRNA) upon entry into host cells, allowing immediate translation of viral proteins including RNA-dependent RNA polymerase (RdRp). For example, poliovirus (a picornavirus) uses its ~7.5 kb +ssRNA genome to produce RdRp, which then synthesizes a complementary negative-sense strand as a template for new +ssRNA genomes, facilitating efficient replication in cytoplasmic membrane-bound compartments. Negative-sense single-stranded RNA (-ssRNA) viruses, in group V, carry their genome in an antisense orientation and package RdRp within the virion to first transcribe positive-sense mRNAs for protein synthesis before full genome replication can occur. Influenza A virus, with its segmented ~13.5 kb -ssRNA genome, exemplifies this: the viral polymerase complex initiates transcription in the nucleus, producing mRNAs capped by host factors, which are translated to support subsequent replication of full-length antigenomic intermediates.[58] Double-stranded RNA (dsRNA) viruses, grouped in Baltimore class III, feature genomes composed of 10–12 linear segments encapsidated within the virion, providing stability against host nucleases. Reoviruses, such as mammalian orthoreovirus, have a ~24 kb dsRNA genome divided into large, medium, and small segments, each encoding specific proteins; replication occurs in cytoplasmic viral factories where endogenous RdRp transcribes and replicates segments without free RNA intermediates, enabling genetic reassortment during co-infection.[59] Retroviruses, in group VI, maintain single-stranded RNA genomes (~9 kb) that are reverse-transcribed into DNA proviruses, which integrate into the host genome as stable hereditary elements. Human immunodeficiency virus (HIV-1), for instance, forms a double-stranded DNA provirus via its RNA template, allowing persistent infection and propagation with host DNA during cell division.[60] A key challenge in RNA virus replication stems from the error-prone nature of RdRp enzymes, which lack proofreading mechanisms found in DNA polymerases, resulting in mutation rates of approximately 10^{-4} to 10^{-5} errors per nucleotide per replication cycle—orders of magnitude higher than DNA-based systems.[61] This quasispecies diversity drives rapid viral evolution, immune evasion, and adaptation to antiviral therapies, but also imposes fitness costs on progeny virions. In organelles like mitochondria and chloroplasts, genetic material is primarily circular DNA (e.g., ~16 kb mitochondrial DNA in humans), yet in animal cells, RNA transcripts from mitochondrial genomes can constitute a significant portion (up to 30%) of the cellular mRNA pool in metabolically active tissues like heart muscle, supporting organelle function and indirectly influencing hereditary traits through post-transcriptional regulation.[62] Evolutionarily, RNA genomes in viruses suggest ancient origins, potentially mirroring primordial RNA-world scenarios, and their high mutability facilitates diversification across host species, contributing to zoonotic emergences.[63]Reverse Transcription and Retroelements
Reverse transcription is the process by which DNA is synthesized from an RNA template, a mechanism first discovered in 1970 by Howard Temin and Satoshi Mizutani, who identified an RNA-dependent DNA polymerase in virions of Rous sarcoma virus, and independently by David Baltimore in RNA tumor viruses, challenging the central dogma of molecular biology. This enzyme, known as reverse transcriptase (RT), was recognized for its role in retroviral replication, earning Temin and Baltimore the 1975 Nobel Prize in Physiology or Medicine.[64] Reverse transcriptase is a multifunctional enzyme encoded by the pol gene in retroviruses, such as HIV-1, where it is produced as part of the Gag-Pol polyprotein and exhibits both DNA polymerase activity for RNA-templated synthesis and RNase H activity to degrade the RNA strand in RNA-DNA hybrids.[65] In HIV-1, RT forms a heterodimer consisting of p66 (catalytic subunit) and p51 (structural subunit), with the polymerase domain in p66 responsible for nucleotide addition and the RNase H domain cleaving RNA in hybrid duplexes.[66] The enzyme's error-prone nature, with a fidelity rate of approximately 1 error per 10,000 nucleotides incorporated, contributes to high mutation rates in retroviral genomes, driving viral diversity and evolution. The mechanism of reverse transcription begins upon retroviral entry into the host cell, where RT uses a tRNA primer annealed to the viral RNA genome's primer binding site to synthesize a complementary DNA strand, forming an RNA-DNA hybrid.[68] RNase H activity then partially degrades the RNA template, allowing the newly synthesized DNA to fold back via repeat sequences to prime second-strand synthesis, ultimately yielding a double-stranded DNA (dsDNA) provirus flanked by long terminal repeats (LTRs) generated through template switching during the process.[68] These LTRs, identical direct repeats at both ends of the proviral DNA, contain promoter and enhancer elements essential for viral gene expression after integration into the host genome.[69] Retroelements are mobile genetic elements that propagate via RNA intermediates and reverse transcription, comprising a significant portion of eukaryotic genomes. Endogenous retroviruses (ERVs), ancient integrations of retroviral proviruses, account for about 8% of the human genome, with many retaining LTRs but lacking functional gag, pol, and env genes due to mutations over evolutionary time.[70] Non-LTR retrotransposons include long interspersed nuclear elements (LINEs), such as LINE-1, which encode their own RT and are autonomously mobile, and short interspersed nuclear elements (SINEs), like Alu elements, which are non-autonomous and rely on LINE-1 machinery for retrotransposition via RNA intermediates.[71] Alu elements, the most abundant SINEs in primates with over one million copies, amplify through transcription into RNA, reverse transcription, and reintegration, influencing genome structure and sometimes contributing to genetic disorders.[72] Telomerase represents a cellular application of reverse transcription, consisting of the telomerase reverse transcriptase (TERT) protein subunit with RT activity and the telomerase RNA component (TERC), which provides the template sequence for adding telomeric repeats to chromosome ends.[73] In humans, TERC's 3'-CAC-5' template directs TERT to extend the 3' overhang of telomeres by adding TTAGGG repeats, counteracting replicative shortening and maintaining genomic stability in stem and cancer cells.[74] This ribonucleoprotein complex exemplifies how reverse transcription mechanisms, originally identified in viruses, are co-opted for essential eukaryotic functions.[75]Double-Stranded and Circular RNAs
Double-stranded RNA (dsRNA) serves as a critical intermediate in the replication of many RNA viruses, where it is generated by viral RNA-dependent RNA polymerases (RdRps) during the synthesis of complementary strands from positive-sense RNA templates.[76] This biogenesis process is essential for viral genome amplification, as RdRps use the dsRNA as a template to produce progeny genomic RNA. In cellular contexts, dsRNA also arises during RNA interference (RNAi) pathways, where double-stranded precursors are processed into small interfering RNAs (siRNAs) to silence gene expression.[77] dsRNA plays a pivotal role in triggering innate immune responses, primarily through activation of the protein kinase R (PKR) pathway, which phosphorylates eukaryotic initiation factor 2α (eIF2α) to inhibit global protein translation and induce stress responses. This activation leads to the production of type I interferons (IFNs) via downstream signaling, enhancing antiviral defenses by upregulating interferon-stimulated genes.[78] Additionally, dsRNA serves as a trigger for RNAi-mediated antiviral immunity, where Dicer enzymes cleave it into siRNAs that guide Argonaute proteins to degrade viral RNA.[77] In viruses with RNA genomes, these dsRNA intermediates are key targets for host recognition, linking them to broader antiviral mechanisms. Evolutionarily, dsRNA structures are conserved in innate immune pathways across eukaryotes, suggesting ancient origins in antiviral defense. Circular RNAs (circRNAs) are a class of non-coding RNAs formed through back-splicing, a process where a downstream splice donor joins an upstream splice acceptor, often facilitated by complementary sequences in flanking introns or exon skipping.[79] Biogenesis can involve lariat intermediates from canonical splicing, where introns are removed and the lariat's branch point enables exon ligation to form the circular structure.[80] This covalent closure renders circRNAs highly resistant to exonuclease degradation, conferring greater stability compared to linear RNAs.[81] circRNAs exert regulatory functions, notably as microRNA (miRNA) sponges that sequester miRNAs and prevent their interaction with target mRNAs; a prominent example is ciRS-7 (also known as CDR1as), which harbors over 70 binding sites for miR-7 and modulates neuronal gene expression. In select cases, circRNAs undergo translation to produce proteins, particularly when containing internal ribosome entry sites (IRES) or undergoing cap-independent initiation, as observed in some viral and cellular circRNAs.[82] Detection of circRNAs typically involves enrichment with RNase R, an exonuclease that digests linear RNAs but spares circular forms, followed by RNA sequencing to identify back-spliced junctions.[83] In certain cell types, such as neurons, circRNAs can comprise 10-20% of the transcriptome, reflecting their high abundance and stability.[84] Recent studies since 2020 have highlighted circRNAs' involvement in neurodegeneration, where dysregulated circRNAs in Alzheimer's and Parkinson's diseases influence synaptic function and amyloid-beta accumulation through miRNA sponging and protein interactions.[85] For dsRNA, post-2020 research has elucidated its role in enhancing antiviral transcriptional responses independent of sequence-specific recognition, bolstering innate immunity against emerging pathogens like SARS-CoV-2.[86] Evolutionarily, circular RNAs, emerging from alternative splicing, represent an evolutionary innovation in higher organisms, with over 100,000 identified in humans, potentially enhancing regulatory diversity without genomic expansion.[87]Historical and Fundamental Discoveries
Early Identification and Characterization
In 1869, Swiss biochemist Friedrich Miescher isolated a phosphorus-rich substance he termed "nuclein" from the nuclei of white blood cells obtained from discarded surgical bandages, marking the first identification of nucleic acids, which include both DNA and RNA.[88] Miescher's extraction involved treating the cells with pepsin to remove proteins, followed by alkali to precipitate the nuclein, revealing its acidic nature and high phosphorus content, distinct from known proteins or lipids.[89] Although Miescher did not distinguish between DNA and RNA at the time, his work laid the groundwork for recognizing nucleic acids as fundamental cellular components.[90] The identification of RNA as a distinct nucleic acid emerged in the late 1930s through studies on the tobacco mosaic virus (TMV). In 1936, Roy Markham and Northrop demonstrated the presence of a nucleic acid in purified TMV preparations, and by 1937, Bawden and Pirie confirmed it contained phosphorus, indicating a nucleoprotein composition.[91] Further analysis in 1939 by Bawden and Pirie established that the nucleic acid was ribonucleic acid (RNA), not deoxyribonucleic acid, based on its susceptibility to alkali hydrolysis and base composition.[92] Wendell Stanley, who had crystallized TMV in 1935, collaborated with Bawden and Pirie, and by 1944, their collective work proposed RNA's potential role in viral heredity, challenging the protein-centric views of inheritance prevalent at the time.[91] In the 1950s, the emerging field of molecular biology began elucidating RNA's functional roles in protein synthesis. Francis Crick proposed the "adaptor hypothesis" in 1955, suggesting that small RNA molecules act as intermediaries to translate the nucleotide sequences of a genetic template into amino acid chains, addressing the mismatch between the four-letter nucleic acid code and the twenty amino acids.[93] This idea, detailed in Crick's 1958 paper "On Protein Synthesis," posited that these adaptors—later identified as transfer RNAs (tRNAs)—recognize specific codons via base-pairing while carrying attached amino acids, thus serving as the bridge in the central dogma of molecular biology. The hypothesis provided a conceptual framework for RNA's intermediary function, influencing subsequent experiments on genetic coding.[94] The concept of messenger RNA (mRNA) was experimentally validated in 1961 through studies by Sydney Brenner, François Jacob, and Matthew Meselson using T4 bacteriophage infection in Escherichia coli. Their pulse-labeling experiments with radioactive uracil demonstrated the existence of a short-lived, unstable RNA species that rapidly incorporates genetic information from newly synthesized DNA and directs protein synthesis at ribosomes. Published in Nature as "An Unstable Intermediate Carrying Information from Genes to Ribosomes for Protein Synthesis," the work showed that this RNA turns over quickly, with a half-life of about 2-3 minutes, confirming its role as a transient messenger between DNA and ribosomes.[95] This discovery resolved debates about how genetic information flows in bacteria and established mRNA as the key intermediary in gene expression.[96] The 1970s brought structural insights into RNA components and the revelation of gene organization complexities. In 1974, Alexander Rich and Sung-Hou Kim, along with colleagues, determined the three-dimensional crystal structure of yeast phenylalanine tRNA at 3.0 Å resolution, revealing its L-shaped tertiary fold with stacked helices and a cloverleaf secondary structure stabilized by modified bases and magnesium ions. This structure, resolved using X-ray diffraction on crystals grown from purified tRNA, confirmed Crick's adaptor hypothesis by showing the anticodon loop positioned to interact with mRNA and the acceptor stem for amino acid attachment.[97] Concurrently, in 1977, Phillip Sharp and Richard Roberts independently discovered introns—non-coding sequences interrupting eukaryotic genes—through electron microscopy of adenovirus RNA hybrids, revealing looped-out regions where introns are transcribed but spliced out during mRNA maturation. Sharp's team at MIT and Roberts' at Cold Spring Harbor used heteroduplex mapping to show that the beta-globin and histone genes contain intervening sequences, fundamentally altering views of gene continuity. These findings, awarded the 1993 Nobel Prize in Physiology or Medicine, highlighted RNA's role in post-transcriptional processing.[98]Milestones in RNA Function and Regulation
In 1982, Thomas Cech's laboratory discovered the self-splicing capability of the ribosomal RNA intron from Tetrahymena thermophila, demonstrating that RNA could catalyze its own excision without protein assistance, thus identifying the first ribozyme. This breakthrough challenged the prevailing view that only proteins function as enzymes and paved the way for understanding RNA's catalytic potential. Independently, in 1983, Sidney Altman's group showed that the RNA component of RNase P from Escherichia coli performs the catalytic cleavage of tRNA precursors, confirming RNA's enzymatic role in vivo. Their discoveries, recognized with the 1989 Nobel Prize in Chemistry, established ribozymes as key players in RNA processing and regulation, influencing fields from splicing mechanisms to synthetic biology.[99] The late 1990s brought further revelations in RNA-mediated gene silencing with the 1998 identification of RNA interference (RNAi) by Andrew Fire and Craig Mello, who demonstrated that double-stranded RNA triggers potent, sequence-specific degradation of homologous mRNAs in Caenorhabditis elegans. This work elucidated the core RNAi pathway involving small interfering RNAs (siRNAs) and microRNAs (miRNAs), which guide Argonaute proteins to target transcripts for cleavage or translational repression, thereby regulating gene expression at the post-transcriptional level. Awarded the 2006 Nobel Prize in Physiology or Medicine, RNAi revolutionized functional genomics, enabling targeted gene knockdown and inspiring therapeutic applications like siRNA drugs for viral infections and genetic disorders.[100] The 2000s and 2010s marked an explosion in recognizing regulatory non-coding RNAs, particularly long non-coding RNAs (lncRNAs), with early examples like Xist—first sequenced in 1991 but functionally characterized in the mid-2000s for its role in X-chromosome inactivation through chromatin coating and silencing. By 2012, the ENCODE project's GENCODE consortium cataloged over 9,000 human lncRNA loci, revealing their widespread expression and diverse regulatory functions, such as epigenetic modulation and transcriptional interference, far beyond initial annotations. This systematic annotation, building on computational pipelines from the late 2000s, highlighted lncRNAs' prevalence—comprising up to 80% of the non-coding transcriptome—and spurred genome-wide studies into their roles in development and disease. Advancements in epitranscriptomics emerged in the 2010s, with Kate D. Meyer's 2012 development of MeRIP-seq enabling transcriptome-wide mapping of N⁶-methyladenosine (m⁶A), the most abundant internal mRNA modification, enriched near stop codons and in 3' UTRs to influence splicing, stability, and translation.[101] Concurrently, Julia Salzman's 2012 analysis uncovered circular RNAs (circRNAs) as predominant isoforms from thousands of human genes, formed via back-splicing and functioning as miRNA sponges or regulators of parental gene expression, challenging linear RNA paradigms. These discoveries expanded RNA regulation to include chemical modifications and alternative splicing topologies, with m⁶A "writers" like METTL3 and circRNA abundance in neural tissues underscoring their tissue-specific impacts. From 2020 onward, research has illuminated RNA's role in biomolecular condensates, particularly phase separation within stress granules—cytoplasmic assemblies that sequester mRNAs during cellular stress to halt translation and promote survival. A pivotal 2020 study revealed G3BP1 as a core driver, where RNA binding induces its conformational switch to trigger liquid-liquid phase separation, dynamically partitioning RNAs for selective protection or degradation. Single-cell RNA sequencing (scRNA-seq) has further resolved regulatory networks, with tools like IReNA (2022) integrating scRNA-seq and scATAC-seq to infer cell-type-specific interactions, uncovering dynamic transcription factor modules in heterogeneous tissues like tumors. The COVID-19 pandemic accelerated RNA research through mRNA vaccines, which by 2021 demonstrated scalable production and immune efficacy, spurring over 200 clinical trials for non-viral applications like cancer immunotherapies and boosting lipid nanoparticle delivery innovations. These developments, from 2020 to 2025, have integrated phase-separated RNA dynamics with high-resolution profiling, transforming regulatory insights and therapeutic pipelines.RNA in Abiogenesis and Prebiotic Chemistry
The RNA world hypothesis posits that RNA served as both the genetic material and catalyst in the earliest stages of life on Earth, preceding the emergence of DNA and proteins. Proposed by Walter Gilbert in 1986, this model suggests that self-replicating RNA molecules capable of catalyzing their own replication and basic metabolic reactions formed the foundation of prebiotic evolution. In this scenario, RNA's dual functionality—storing information like DNA and performing enzymatic roles like proteins—allowed it to bootstrap the complexity of life without requiring more sophisticated biopolymers initially. Prebiotic synthesis pathways for RNA components remain a central focus, with research exploring plausible geochemical environments. Nucleotides, the building blocks of RNA, could have formed in settings such as formamide-rich pools or hydrothermal vents, where simple precursors like hydrogen cyanide and formaldehyde react under mild conditions to yield ribose sugars and nucleobases. A landmark achievement came in 2009, when Matthew Powner and colleagues demonstrated the synthesis of pyrimidine ribonucleotides—such as cytidine and uridine derivatives—from simple prebiotic molecules like cyanamide, glycolaldehyde, and phosphate, bypassing the unstable free ribose and requiring only wet-dry cycles for activation.[102] This pathway, conducted under conditions mimicking early Earth, produced activated nucleotides with 3'-5' phosphodiester linkages, addressing a key hurdle in RNA polymerization. Hydrothermal vents provide another proposed site, where mineral surfaces catalyze nucleotide assembly from CO2 and H2 under high-temperature gradients. Despite these advances, significant challenges persist in reconstructing a fully RNA-based prebiotic system. One major issue is the preferential formation of non-standard 2'-5' phosphodiester linkages during non-enzymatic polymerization, which destabilize RNA duplexes and hinder template-directed replication compared to the biologically relevant 3'-5' bonds. These aberrant linkages arise because prebiotic reactions often activate the 2'-hydroxyl group on ribose, leading to branched polymers that are less stable and prone to hydrolysis. In vitro evolution experiments, such as Sol Spiegelman's 1967 work with Qβ phage RNA replicase, illustrate the dynamics of RNA simplification under selective pressure; serial transfer in test tubes produced "Spiegelman's monster," a truncated 218-nucleotide RNA that replicated rapidly but lost non-essential genetic information, highlighting the ease of evolutionary regression without stabilizing mechanisms. Supporting evidence for the RNA world includes ribozymes that mimic primitive metabolic functions, demonstrating RNA's catalytic versatility. For instance, in vitro-selected ribozymes have been engineered to perform reactions akin to glycolysis intermediates, such as carbon-carbon bond formation, suggesting that early RNA networks could sustain basic metabolism without proteins.[103] Extraterrestrial delivery of RNA precursors further bolsters the hypothesis; the Murchison meteorite, which fell in 1969, contains a suite of nucleobases including adenine, guanine, cytosine, uracil, and thymine, with isotopic signatures indicating abiotic synthesis in space.[104] These compounds, detected at concentrations up to 70 parts per billion, could have seeded Earth's prebiotic soups with ready-made building blocks.[104] Recent computational and experimental studies from 2023 to 2025 have refined models of prebiotic RNA pathways using artificial intelligence and co-evolution simulations. In 2024, studies on vesicle-RNA co-evolution demonstrated that fatty acid vesicles encapsulate short RNA oligomers, enhancing their stability and enabling template-directed ligation in dilute prebiotic conditions, with encapsulated RNAs showing up to 10-fold faster replication rates compared to free molecules.[105] These protocell models suggest that lipid membranes and RNA co-emerged, facilitating the transition from abiotic chemistry to Darwinian evolution. Ribozyme catalysis, as seen in modern structural RNAs, provides a brief analog for such ancient functions, where self-splicing introns hint at primordial RNA processing capabilities.Applications in Medicine and Biotechnology
Therapeutic RNA Molecules
Therapeutic RNA molecules represent a rapidly advancing class of pharmaceuticals that leverage RNA's natural roles in gene expression and regulation to treat diseases. These include messenger RNA (mRNA) vaccines, antisense oligonucleotides (ASOs), small interfering RNAs (siRNAs), and aptamers, each designed to modulate specific biological processes such as protein production, gene silencing, or protein binding. Unlike traditional small-molecule drugs, RNA therapeutics offer high specificity and the potential for rapid development, particularly in response to emerging threats like infectious diseases or genetic disorders. Their clinical success has been enabled by innovations in chemical modifications and delivery systems to overcome inherent RNA vulnerabilities. mRNA vaccines, a breakthrough in prophylactic and therapeutic applications, instruct host cells to produce antigenic proteins that trigger immune responses. The Pfizer-BioNTech vaccine (BNT162b2), approved by the U.S. Food and Drug Administration (FDA) in December 2020 under emergency use authorization, and the Moderna vaccine (mRNA-1273), authorized shortly thereafter, both encode the SARS-CoV-2 spike protein within nucleoside-modified mRNA encapsulated in lipid nanoparticles (LNPs) for efficient cellular uptake and protection from degradation. Upon delivery, the mRNA is translated by ribosomes into the spike protein, eliciting neutralizing antibodies and T-cell immunity without using live virus. These vaccines demonstrated over 90% efficacy in preventing symptomatic COVID-19 in phase 3 trials, marking the first widespread deployment of mRNA technology in humans.[106][107][108] Antisense oligonucleotides (ASOs) function by hybridizing to target RNA sequences to alter splicing, block translation, or induce degradation, providing precise control over gene expression. Nusinersen (Spinraza), an ASO approved by the FDA in December 2016 for spinal muscular atrophy (SMA), binds to an intronic splicing silencer site in SMN2 pre-mRNA, promoting inclusion of exon 7 to increase full-length survival motor neuron (SMN) protein production. Administered intrathecally, it has shown significant improvements in motor function for infants and children with SMA in clinical trials, with sustained benefits observed over multiple years. Similarly, siRNA therapeutics exploit RNA interference to silence disease-causing genes. Patisiran (Onpattro), approved by the FDA in August 2018 for hereditary transthyretin-mediated (hATTR) amyloidosis, is a lipid nanoparticle-formulated siRNA conjugated to N-acetylgalactosamine (GalNAc) for hepatocyte-specific targeting via the asialoglycoprotein receptor. It reduces hepatic transthyretin (TTR) production by over 80% in patients, alleviating polyneuropathy symptoms as evidenced in the APOLLO phase 3 trial.[109][110][111][112] Aptamers, single-stranded RNA or DNA ligands selected for high-affinity binding to target proteins, offer a non-immunogenic alternative for inhibiting protein function. Pegaptanib (Macugen), the first FDA-approved aptamer in December 2004 for neovascular (wet) age-related macular degeneration (AMD), is a 27-nucleotide RNA molecule pegylated for stability that specifically binds the vascular endothelial growth factor (VEGF165) isoform, preventing its interaction with receptors and reducing pathological angiogenesis in the retina. Intravitreal injections slowed vision loss in about 70% of treated patients in pivotal trials, establishing aptamers as viable therapeutics despite later competition from protein-based anti-VEGFs. Key challenges in RNA therapeutics include rapid enzymatic degradation and innate immune activation, which can limit efficacy and cause adverse reactions. To enhance stability, nucleoside modifications such as pseudouridine (Ψ) are incorporated, as in mRNA vaccines, where Ψ substitution reduces Toll-like receptor recognition, lowers immunogenicity, and boosts translation efficiency by up to 10-fold compared to unmodified RNA. LNPs and GalNAc conjugates further address delivery barriers by facilitating endosomal escape and tissue-specific uptake, though off-target effects and manufacturing scalability remain hurdles. As of 2025, mRNA platforms are expanding into oncology, with BioNTech's individualized neoantigen-specific mRNA vaccines (e.g., autogene cevumeran) showing promising immune activation and tumor reduction in phase 2 trials for pancreatic and melanoma cancers, with phase 3 studies planned or initiated in 2025. Recent approvals include donidalorsen in August 2025 for hereditary angioedema, further expanding the portfolio of RNA-based treatments.[113] These developments underscore RNA's potential for personalized medicine, with over a dozen RNA drugs now FDA-approved.[114][115][116][117]Diagnostic and Research Tools
Reverse transcription polymerase chain reaction (RT-PCR) and quantitative PCR (qPCR) are foundational techniques for RNA detection in diagnostics, involving the conversion of RNA to complementary DNA followed by amplification to quantify viral or gene expression levels. These methods gained prominence during the COVID-19 pandemic for SARS-CoV-2 detection, where RT-qPCR served as the gold standard due to its high sensitivity (detecting as few as 10-100 viral RNA copies) and specificity exceeding 99%, enabling rapid identification of infected individuals from nasopharyngeal swabs.[118] Limitations include potential false negatives from low viral loads or sample degradation, but optimizations like one-step RT-qPCR have improved throughput for large-scale testing.[119] RNA sequencing (RNA-seq) represents a high-throughput approach for comprehensive transcriptome analysis, capturing the full spectrum of RNA molecules to profile gene expression, alternative splicing, and novel transcripts in research and disease diagnostics. By sequencing cDNA libraries from RNA samples, RNA-seq provides quantitative data on thousands of genes simultaneously, outperforming earlier methods in dynamic range and resolution, with applications in identifying biomarkers for cancers like leukemia through differential expression patterns.[120] Single-cell RNA-seq (scRNA-seq) extends this by resolving cellular heterogeneity, isolating transcripts from individual cells to map rare subpopulations, such as tumor-infiltrating immune cells, which bulk methods obscure; for instance, scRNA-seq has revealed subtype-specific gene signatures in breast cancer heterogeneity with over 10,000 cells profiled per sample.[121] Microarrays and Northern blots offer targeted tools for RNA expression profiling, though they have been largely supplanted by sequencing in modern workflows. DNA microarrays hybridize labeled RNA or cDNA to immobilized probes on a chip, enabling parallel assessment of up to 50,000 genes to detect expression changes, as validated in studies of inflammatory responses where fold-changes correlated with clinical outcomes.[122] Northern blots, a classical gel-based method, separate RNA by size via electrophoresis, transfer it to a membrane, and detect specific transcripts using radiolabeled or chemiluminescent probes, providing size confirmation and quantification for validation, such as confirming microRNA levels in developmental tissues with sensitivity down to 1-5 pg of target RNA.[123] In situ hybridization (ISH) enables spatial visualization of RNA localization within intact tissues, using labeled probes to bind target sequences and reveal expression patterns at cellular resolution. RNA-ISH, often fluorescent (FISH), has been pivotal in neuroscience for mapping mRNA distribution in brain sections, identifying localized transcripts like those for neurotransmitters with single-molecule sensitivity, and in pathology for diagnosing viral infections or oncogenic fusions in tumor biopsies.[124] Advances like branched DNA amplification in ISH platforms enhance signal detection in formalin-fixed tissues, achieving multiplexed analysis of up to 48 RNA targets simultaneously.[125] Spatial transcriptomics, exemplified by the Visium platform, integrates RNA-seq with tissue imaging to map gene expression in situ at near-single-cell resolution, addressing limitations of dissociated samples by preserving spatial context. Introduced in 2019 and refined by 2024 with high-definition versions capturing 2-micron pixels across 1 cm² sections, Visium has elucidated tumor microenvironments in prostate cancer, identifying spatially segregated immune niches with over 18,000 genes profiled per spot.[126] These methods support research into tissue architecture in diseases like fibrosis, where zonal gene gradients inform pathogenesis.[127] Post-2023 integrations of artificial intelligence (AI) with RNA diagnostics have enhanced analysis of complex datasets, such as using machine learning on RNA-seq outputs for predictive modeling in oncology. AI algorithms, like deep neural networks trained on scRNA-seq data, classify cancer subtypes with 95% accuracy by detecting subtle expression patterns missed by traditional statistics, as demonstrated in pancreatic ductal adenocarcinoma diagnostics.[128] In viral diagnostics, AI-optimized RT-qPCR interpretation reduces false positives by 20% through pattern recognition in amplification curves, accelerating outbreak responses.[129]Emerging Synthetic and Editing Technologies
Emerging advancements in RNA synthetic and editing technologies have revolutionized genome engineering and synthetic biology by enabling precise manipulation of genetic material at the RNA level. The CRISPR-Cas systems, originally derived from bacterial immune defenses, utilize single-guide RNAs (sgRNAs) to direct Cas9 nucleases for targeted DNA cleavage and editing, allowing for insertions, deletions, or base substitutions in eukaryotic genomes. In parallel, Cas13 variants target RNA directly, facilitating transient knockdown or cleavage without altering the underlying DNA sequence, which is particularly useful for studying gene function or degrading viral RNAs. RNA editing technologies have advanced through the integration of CRISPR components with endogenous enzymes like ADAR, which naturally catalyze adenosine-to-inosine (A-to-I) deamination in RNA transcripts. For instance, CRISPRoff employs a catalytically dead Cas9 fused to a KRAB repressor domain to achieve epigenetic silencing of DNA targets via methylation, offering reversible gene repression without permanent mutations.31502-6) Building on ADAR's mechanism, the REPAIR system, developed in 2017, uses a catalytically inactive Cas13 fused to an evolved ADAR deaminase to enable programmable A-to-I edits in target RNAs, demonstrating up to 30% editing efficiency in cellular transcripts with minimal off-target effects. In synthetic biology, RNA molecules serve as programmable building blocks for regulatory circuits and nanostructures. Toehold switches, short RNA sequences that form metastable hairpins, act as riboregulators by activating translation upon binding complementary trigger RNAs, enabling logic-gated gene expression in bacteria and mammalian cells with sensitivities rivaling protein-based sensors.00864-0) RNA nanostructures, such as those created via RNA origami techniques, fold into complex two- and three-dimensional shapes through computational design of base-pairing motifs, achieving nanoscale assemblies stable under physiological conditions for potential use in molecular machines. Aptamer evolution through the Systematic Evolution of Ligands by EXponential enrichment (SELEX) process generates high-affinity RNA ligands that bind specific targets, such as proteins or small molecules, with dissociation constants in the nanomolar range. For vaccine development, mRNA engineering incorporates nucleoside modifications like pseudouridine to reduce immunogenicity and enhance stability, as exemplified in the rapid deployment of SARS-CoV-2 mRNA vaccines that elicited robust immune responses in clinical trials. As of 2025, frontiers in RNA nanotechnology emphasize self-assembling RNA particles for targeted drug delivery, where lipid-RNA nanoparticles encapsulate therapeutics to improve bioavailability and reduce systemic toxicity in cancer therapies. Recent Cas13-based antiviral trials in 2024 demonstrated prophylactic efficacy against influenza in animal models by degrading viral RNA in vivo, paving the way for RNA-targeted antivirals. Additionally, quantum dot-RNA hybrid sensors have emerged for real-time detection of RNA biomarkers, leveraging fluorescence resonance energy transfer to achieve single-molecule sensitivity in diagnostic applications.References
- https://www.thermofisher.com/us/en/home/life-science/[cloning](/page/Cloning)/cloning-learning-center/invitrogen-school-of-molecular-biology/rt-education/reverse-transcriptase-attributes.html