Hubbry Logo
Glossary of cellular and molecular biology (M–Z)Glossary of cellular and molecular biology (M–Z)Main
Open search
Glossary of cellular and molecular biology (M–Z)
Community hub
Glossary of cellular and molecular biology (M–Z)
logo
7 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Glossary of cellular and molecular biology (M–Z)
Glossary of cellular and molecular biology (M–Z)
from Wikipedia

This glossary of cellular and molecular biology is a list of definitions of terms and concepts commonly used in the study of cell biology, molecular biology, and related disciplines, including molecular genetics, biochemistry, and microbiology.[1] It is split across two articles:

  • Glossary of cellular and molecular biology (0–L) lists terms beginning with numbers and those beginning with the letters A through L.
  • Glossary of cellular and molecular biology (M–Z) (this page) lists terms beginning with the letters M through Z.

This glossary is intended as introductory material for novices (for more specific and technical detail, see the article corresponding to each term). It has been designed as a companion to Glossary of genetics and evolutionary biology, which contains many overlapping and related terms; other related glossaries include Glossary of virology and Glossary of chemistry.


M

[edit]
M phase
See mitosis.
macromolecule
Any very large molecule composed of dozens, hundreds, or thousands of covalently bonded atoms, especially one with biological significance. Many important biomolecules, such as nucleic acids and proteins, are polymers consisting of a repeated series of smaller monomers; others such as lipids and carbohydrates may not be polymeric but are nevertheless large and complex molecules.
macronucleus

Also meganucleus.

The larger of the two types of nuclei which occur in pairs in the cells of some ciliated protozoa. Macronuclei are highly polyploid and responsible for directing vegetative reproduction, in contrast to the diploid micronuclei, which have important functions during conjugation.[2]
macrophage
Any of a class of relatively long-lived phagocytic cells of the mammalian immune system which are activated in response to the presence of foreign materials in certain tissues and subsequently play important roles in antigen presentation, stimulating other types of immune cells, and killing or engulfing parasitic microorganisms, diseased cells, or tumor cells.[3]
major groove
map-based cloning
See positional cloning.
massively parallel sequencing

Also next-generation sequencing (NGS) and second-generation sequencing.

medical genetics
The branch of medicine and medical science that involves the study, diagnosis, and management of hereditary disorders, and more broadly the application of knowledge about human genetics to medical care.
megabase (Mb)
A unit of nucleic acid length equal to one million (1×106) bases in single-stranded molecules or one million base pairs in duplex molecules such as double-stranded DNA.
meiosis
A specialized type of cell division that occurs exclusively in sexually reproducing eukaryotes, during which DNA replication is followed by two consecutive rounds of division to ultimately produce four genetically unique haploid daughter cells, each with half the number of chromosomes as the original diploid parent cell. Meiosis only occurs in cells of the sex organs, and serves the purpose of generating haploid gametes such as sperm, eggs, or spores, which are later fused during fertilization. The two meiotic divisions, known as Meiosis I and Meiosis II, may also include various genetic recombination events between homologous chromosomes.
meiotic spindle
See spindle apparatus.
melting
The denaturation of a double-stranded nucleic acid into two single strands, especially in the context of the polymerase chain reaction.
membrane
A supramolecular aggregate of amphipathic lipid molecules which when suspended in a polar solvent tend to arrange themselves into structures which minimize the exposure of their hydrophobic tails by sheltering them within a ball created by their own hydrophilic heads (i.e. a micelle). Certain types of lipids, specifically phospholipids and other membrane lipids, commonly occur as double-layered sheets of molecules when immersed in an aqueous environment, which can themselves assume approximately spherical shapes, acting as semipermeable barriers surrounding a water-filled interior space. This is the basic structure of the biological membranes enclosing all cells, vesicles, and membrane-bound organelles.
membrane lipid
membrane potential
membrane protein
Any protein that is closely associated either transiently or permanently with the lipid bilayer membrane surrounding a cell, organelle, or vesicle.[4]
membrane-bound organelle
An organelle or cellular compartment enclosed by its own dedicated lipid membrane, separating its interior from the rest of the cytoplasm.
messenger RNA (mRNA)
Any of a class of single-stranded RNA molecules which function as molecular messengers, carrying sequence information encoded in the DNA genome to the ribosomes where protein synthesis occurs. The primary products of transcription, mRNAs are synthesized by RNA polymerase, which builds a chain of ribonucleotides that complement the deoxyribonucleotides of a DNA template; in this way, the DNA sequence of a protein-coding gene is effectively preserved in the raw transcript, which is subsequently processed into a mature mRNA by a series of post-transcriptional modifications.
The structure of a typical mature protein-coding messenger RNA or mRNA, drawn approximately to scale. The coding sequence (green) is bounded by untranslated regions at both the 5'-end (yellow) and the 3'-end (pink). Prior to export from the nucleus, a 5' cap (red) and a 3' poly(A) tail (black) are added to help stabilize the mRNA and prevent its degradation by ribonucleases.
metabolic pathway
A stepwise series of biochemical reactions occurring within a cell, often but not necessarily catalyzed by specific enzymes, that fulfills some activity or process related to metabolism. The reactions are linked by the sharing of reactants, products, or intermediate compounds in consecutive steps, such that the product of one reaction is used as a reactant in a subsequent reaction. Byproducts are often removed from the cell as metabolic waste. The overall pathway may be anabolic, catabolic, or amphibolic in nature.[5] In any actively metabolizing cell, an elaborate network of interconnected metabolic pathways is required to maintain homeostasis, with degradative catabolic processes providing the energy necessary to conduct anabolic biosynthesis; for example, glycolysis, the electron transport chain, and oxidative phosphorylation provide the ATP used in fatty acid synthesis. The flux of metabolites through each pathway is regulated by the needs of the cell and the availability of substrates.
metabolic waste
metabolism
The complete set of chemical reactions which sustain and account for the basic processes of life in all living cells,[2] especially those involving: 1) the conversion of energy from food into energy available for cellular activities; 2) the breakdown of food into simpler compounds which can then be used as substrates to build complex biomolecules such as proteins, lipids, and nucleic acids; and 3) the degradation and excretion of toxins, byproducts, and other unusable compounds known as metabolic wastes. In a broader sense the term may include all chemical reactions occurring in living organisms, even those which are not strictly necessary for life but instead serve accessory functions. Many specific cellular activities are accomplished by metabolic pathways in which one chemical is ultimately transformed through a stepwise series of reactions into another chemical, with each reaction catalyzed by a specific enzyme. Most metabolic reactions can be subclassified as catabolic or anabolic.
metabolite
An intermediate or end product of metabolism, especially degradative metabolism (catabolism);[2] or any substance produced by or taking part in a metabolic reaction. Metabolites include a huge variety of small molecules generated by cells from various pathways and having various functions, including as inputs to other pathways and reactions, as signaling molecules, and as stimulators, inhibitors, and cofactors of enzymes. Metabolites may result from the degradation and elimination of naturally occurring compounds as well as of synthetic compounds such as pharmaceuticals.
metabolome
The complete set of small-molecule chemical compounds within a cell, organelle, or any other biological sample, including both endogenous molecules (e.g. individual amino acids and nucleotides, fatty acids, organic acids, amines, simple sugars, vitamins, antibiotics, etc.) and exogenous molecules (e.g. drugs, toxins, environmental contaminants, and other xenobiotics).
metacentric
(of a linear chromosome or chromosome fragment) Having a centromere positioned in the middle of the chromosome, resulting in chromatid arms of approximately equal length.[6]
metaphase
The stage of mitosis and meiosis that occurs after prometaphase and before anaphase, during which the centromeres of the replicated chromosomes align along the equator of the cell, with each kinetochore attached to the mitotic spindle.
methylation
The covalent attachment of a methyl group (–CH
3
) to a chemical compound, protein, or other biomolecule, either spontaneously or by enzymatic catalysis. Methylation is one of the most widespread natural mechanisms by which nucleic acids and proteins are labelled. The methylation of nucleobases in a DNA molecule inhibits recognition of the methylated sequence by DNA-binding proteins, which can effectively silence the expression of genes. Specific residues within histones are also commonly methylated, which can change nucleosome positioning and similarly activate or repress nearby loci. The opposite reaction is demethylation.
methyltransferase
Any of a class of transferase enzymes which catalyze the covalent bonding of a methyl group (–CH
3
) to another compound, protein, or biomolecule, a process known as methylation.
MicroArray and Gene Expression (MAGE)
A group that "aims to provide a standard for the representation of DNA microarray gene expression data that would facilitate the exchange of microarray information between different data systems".[7]
microbody
Any of a diverse class of small membrane-bound organelles or vesicles found in the cells of many eukaryotes, especially plants and animals, usually having some specific metabolic function and occurring in great numbers in certain specialized cell types. Peroxisomes, glyoxysomes, glycosomes, and hydrogenosomes are often considered microbodies.
microchromosome
A type of very small chromosome, generally less than 20,000 base pairs in size, present in the karyotypes of some organisms.
microdeletion
A chromosomal deletion that is too short to cause any apparent change in morphology under a light microscope, though it may still be detectable with other methods such as sequencing.
microfilament
A long, thin, flexible, rod-like structure composed of polymeric strands of proteins, usually actins, that occurs in abundance in the cytoplasm of eukaryotic cells, forming part of the cytoskeleton. Microfilaments comprise the cell's structural framework. They are modified by and interact with numerous other cytoplasmic proteins, playing important roles in cell stability, motility, contractility, and facilitating changes in cell shape, as well as in cytokinesis.
micronucleus
The smaller of the two types of nuclei that occur in pairs in the cells of some ciliated protozoa. Whereas the larger macronucleus is polyploid, the micronucleus is diploid and generally transcriptionally inactive except for the purpose of sexual reproduction, where it has important functions during conjugation.[2]
microRNA (miRNA)
A type of small, single-stranded, non-coding RNA molecule that functions in post-transcriptional regulation of gene expression, particularly RNA silencing, by base-pairing with complementary sequences in mRNA transcripts, which typically results in the cleavage or destabilization of the transcript or inhibits its translation by ribosomes.
microsatellite

Also short tandem repeat (STR) or simple sequence repeat (SSR).

A type of satellite DNA consisting of a relatively short sequence of tandem repeats, in which certain motifs (ranging in length from one to six or more bases) are repeated, typically 5–50 times. Microsatellites are widespread throughout most organisms' genomes and tend to have higher mutation rates than other regions. They are classified as variable number tandem repeat (VNTR) DNA, along with longer minisatellites.
microsome
A small intracellular vesicle derived from fragments of endoplasmic reticulum observed in cells which have been homogenized.[4]
microspike
See filopodium.
microtome
An instrument used to cut extremely thin slices of material, known as microsections or simply sections, preparatory to observation under a microscope.[5] Sections of tissues and cells are usually 50 nanometres (nm) to 100 micrometres (μm) in width. The process of cutting them is known as microtomy.
microtrabecula

(pl.) microtrabeculae

A fine protein filament of the cytoskeleton. Multiple filaments form the microtrabecular network.[2]
microtubule
Any of the long, generally straight, hollow tubes, about 24 nanometers in diameter and composed of interwoven polymeric filaments of the protein tubulin, found in the cytoplasm of many eukaryotic cells, where they are involved in maintaining the cell's shape and structural integrity as well as in force generation for cellular or organellar locomotion (as with cilia and flagella). They also comprise the spindle apparatus critical to mitosis and meiosis. Microtubules are rigid but transient all-purpose structural members which can be rapidly assembled and disassembled at the cell's needs. Many different microtubule-associated proteins interact with them.[5] See also microfilament.
microtubule-organizing center (MTOC)
A region near the center of a eukaryotic cell typically consisting of two centrioles oriented at right angles to each other and surrounded by a complex of associated proteins, which functions as the site of initiation for the assembly of microtubules.[5]
microvesicle

Also ectosome and microparticle.

A type of extracellular vesicle released when an evagination of the cell membrane "buds off" into the extracellular space. Microvesicles vary in size from 30–1,000 nanometres in diameter and are thought to play roles in many physiological processes, including intercellular communication by shuttling molecules such as RNA and proteins between cells.[8]
microvillus
A small, slender, tubular cytoplasmic projection, generally 0.2–4 micrometres long and 0.1 micrometres in diameter,[9] protruding from the surface of some animal cells and supported by a central core of microfilaments. When present in large numbers, such as on epithelial cells lining the respiratory and alimentary tracts, they form a dense brush border which presumably serves to increase each cell's absorptive surface area.[2][3]
mid body
The centrally constricted region that forms across the central axis of a cell during cytokinesis, constricted by the closing of the contractile ring until the daughter cells are finally separated,[2] but occasionally persisting as a tether between the two cells for as long as a complete cell cycle.[9]
middle lamella
In plant cells, the outermost layer of the cell wall; a continuous, unified layer of extracellular pectins which is the first layer deposited by the cell during cytokinesis and which serves to cement together the primary cell walls of adjacent cells.[4]
Minimal information about a high-throughput sequencing experiment (MINSEQE)
A commercial standard developed by FGED for the storage and sharing of high-throughput sequencing data.[10]
Minimum information about a microarray experiment (MIAME)
A commercial standard developed by FGED and based on MAGE in order to facilitate the storage and sharing of gene expression data.[11][12]
minisatellite
A region of repetitive, non-coding genomic DNA in which certain DNA motifs (typically 10–60 bases in length) are tandemly repeated (typically 5–50 times). In the human genome, minisatellites occur at more than 1,000 loci, especially in centromeres and telomeres, and exhibit high mutation rates and high variability between individuals. Like the shorter microsatellites, they are classified as variable number tandem repeats (VNTRs) and are a type of satellite DNA.
minor groove
minus-strand
See template strand.
miRNA
See microRNA.
mismatch

Also mispairing.

An incorrect pairing of nucleobases on complementary strands of DNA or RNA; i.e. the presence in one strand of a duplex molecule of a base that is not complementary (by Watson–Crick pairing rules) to the base occupying the corresponding position in the other strand, which prevents normal hydrogen bonding between the bases. For example, a guanine paired with a thymine would be a mismatch, as guanine normally pairs with cytosine.[13]
mismatch repair (MMR)
missense mutation
A type of point mutation which results in a codon that codes for a different amino acid than in the unmutated sequence. Compare nonsense mutation.
mistranslation
The insertion of an incorrect amino acid in a growing peptide chain during translation, i.e. the inclusion of any amino acid that is not the one specified by a particular codon in an mRNA transcript. Mistranslation may originate from a mischarged transfer RNA or from a malfunctioning ribosome.[13]
mitochondrial DNA (mtDNA)
The set of DNA molecules contained within mitochondria, usually one or more circular plasmids representing a semi-autonomous genome which is physically separate from and functionally independent of the chromosomal DNA in the cell's nucleus. The mitochondrial genome encodes many unique enzymes found only in mitochondria.
mitochondrial fusion
mitochondrion

(pl.) mitochondria; also formerly chondriosome.

A highly pleiomorphic membrane-bound organelle found in the cytoplasm of nearly all eukaryotic cells, usually in large numbers in the form of sausage-shaped structures 5–10 micrometres in length,[9] enclosed by a double membrane, with the inner membrane infolded in an elaborate series of cristae so as to maximize surface area. Mitochondria are the primary sites of ATP synthesis, where ATP is regenerated from ADP via oxidative phosphorylation, as well as many supporting pathways, including the citric acid cycle and the electron transport chain.[3] Like other plastids, mitochondria contain their own genome encoded in circular DNA molecules which replicate independently of the nuclear genome, as well as their own unique set of transcription factors, polymerases, ribosomes, transfer RNAs, and aminoacyl-tRNA synthetases with which to direct transcription and translation of their genes. The majority of the structural proteins found in mitochondria are encoded by nuclear genes, however, such that mitochondria are only partially autonomous.[2] These observations suggest mitochondria evolved from symbiotic prokaryotes living inside eukaryotic cells.
Diagram of a mitochondrion found in an animal cell
mitogen
Any substance or stimulus that promotes or induces mitosis, or more generally which causes cells to re-enter the cell cycle.[3]
mitophagy
The selective degradation of mitochondria by means of autophagy; i.e. the mitochondrion initiates its own degradation. Mitophagy is a regular process in healthy populations of cells by which defective or damaged mitochondria are recycled, preventing their accumulation. It may also occur in response to the changing metabolic needs of the cell, e.g. during certain developmental stages.
mitosis

Also M phase.

In eukaryotic cells, the part of the cell cycle during which the division of the nucleus takes place and replicated chromosomes are separated into two distinct nuclei. Mitosis is generally preceded by the S phase of interphase, when the cell's DNA is replicated, and either occurs simultaneously with or is followed by cytokinesis, when the cytoplasm and plasma membrane are divided into two new daughter cells. Colloquially, the term "mitosis" is often used to refer to the entire process of cell division, not just the division of the nucleus.
mitotic index (MI)
The proportion of cells within a sample which are undergoing mitosis at the time of observation, typically expressed as a percentage or as a value between 0 and 1. The number of cells dividing by mitosis at any given time can vary widely depending on organism, tissue, developmental stage, and culture media, among other factors.[2]
mitotic recombination

Also somatic crossing over.

The abnormal exchange of genetic material between homologous chromosomes during mitosis (as opposed to meiosis, where it occurs normally). Homologous recombination during mitosis is relatively uncommon; in the laboratory, it can be induced by exposing dividing cells to high-energy electromagnetic radiation such as X rays. As in meiosis, it can separate heterozygous alleles and thereby propagate potentially significant changes in zygosity to daughter cells, though unless it occurs very early in development this often has little or no phenotypic effect, since any phenotypic variance shown by mutant lineages arising in terminally differentiated cells is generally masked or compensated for by neighboring wild-type cells.[2]
mitotic rounding
The process by which most animal cells undergo an overall change in shape during or preceding mitosis, abandoning the various complex or elongated shapes characteristic of interphase and rapidly contracting into a rounded or spherical morphology that is more conducive to cell division. This phenomenon has been observed both in vivo and in vitro.
mitotic segregation
mitotic spindle
See spindle apparatus.
mixoploidy
The presence of more than one different ploidy level, i.e. more than one number of sets of chromosomes, in different cells of the same cellular population.[13]
mobile genetic element (MGE)
Any genetic material that can move between different parts of a genome or be transferred from one species or replicon to another within a single generation. The many types of MGEs include transposable elements, bacterial plasmids, bacteriophage elements which integrate into host genomes by viral transduction, and self-splicing introns.
mobilome
The complete set of mobile genetic elements within a particular genome, cell, species, or other taxon, including all transposons, plasmids, prophages, and other self-splicing nucleic acid molecules.
molecular biology
The branch of biology that studies biological activity at the molecular level, in particular the various mechanisms underlying the biological processes that occur in and between cells, including the structures, properties, synthesis, and modification of biomolecules such as proteins and nucleic acids, their interactions with the chemical environment and with other biomolecules, and how these interactions explain the observations of classical biology (which in contrast studies biological systems at much larger scales).[14] Molecular biology relies largely on laboratory techniques of physics and chemistry to manipulate and measure microscopic phenomena. It is closely related to and overlaps with the fields of cell biology, biochemistry, and molecular genetics.
molecular cloning
Any of various molecular biology methods designed to replicate a particular molecule, usually a DNA sequence or a protein, many times inside the cells of a natural host. Commonly, a recombinant DNA fragment containing a gene of interest is ligated into a plasmid vector, which competent bacterial cells are then induced to uptake in a process known as transformation. The bacteria, carrying the recombinant plasmid, are then allowed to proliferate naturally in cell culture, so that each time the bacterial cells divide, the plasmids are replicated along with the rest of the bacterial genome. Any functioning gene of interest within the plasmid will be expressed by the bacterial cells, and thereby its gene products will also be cloned. The plasmids or gene products, which now exist in many copies, may then be extracted from the bacteria and purified. Molecular cloning is a fundamental tool of genetic engineering employed for a wide variety of purposes, often to study gene expression, to amplify a specific gene product, or to generate a selectable phenotype.
An outline of how molecular cloning works
molecular genetics
A branch of genetics that employs methods and techniques of molecular biology to study the structure and function of genes and gene products at the molecular level. Contrast classical genetics.
monad
A haploid set of chromosomes as it exists inside the nucleus of an immature gametic cell such as an ootid or spermatid, i.e. a cell which is a product of meiosis but is not yet a mature gamete.[13]
monocentric
(of a linear chromosome or chromosome fragment) Having only one centromere. Contrast dicentric and holocentric.
monoclonal
Describing cells, proteins, or molecules descended or derived from a single clone (i.e. from the same genome or genetic lineage) or made in response to a single unique compound. Monoclonal antibodies are raised against only one antigen or can only recognize one unique epitope on the same antigen. Similarly, the cells of some tissues and neoplasms may be described as monoclonal if they are all the asexual progeny of one original parent cell.[2] Contrast polyclonal.
monocyte
A type of large leukocyte of the mononuclear phagocyte system in mammals, characterized by pale-staining cytoplasm and a kidney-shaped or horseshoe-shaped nucleus. Monocytes are derived from pluripotent stem cells in bone marrow and become macrophages in other tissues.[5]
monokaryotic
(of a cell) Having a single nucleus, as opposed to no nucleus or multiple nuclei.
monomer
A molecule or compound which can exist individually or serve as a building block or subunit of a larger macromolecular aggregate known as a polymer.[4] Polymers form when multiple monomers of the same or similar molecular species are connected to each other by chemical bonds, either in a linear chain or a non-linear conglomeration. Examples include the individual nucleotides which form nucleic acid polymers, the individual amino acids which form polypeptides, and the individual proteins which form protein complexes.
monoploid
monosaccharide
Any of a class of organic compounds which are the simplest forms of carbohydrates and the most basic structural subunits or monomers from which larger carbohydrate polymers such as disaccharides, oligosaccharides, and polysaccharides are composed. With few exceptions, all monosaccharides are variations on the empirical formula (CH
2
O)
n
, where n typically ranges from 3 (trioses) to 7 (heptoses).[3] Common examples include glucose, ribose, and deoxyribose.
monosomy
The abnormal and frequently pathological presence of only one chromosome of a normal diploid pair. It is a type of aneuploidy.
Morpholino

Also phosphorodiamidate Morpholino oligomer.

A synthetic nucleic acid analogue connecting a short sequence of nucleobases into an artificial antisense oligomer, used in genetic engineering to knockdown gene expression by pairing with complementary sequences in naturally occurring RNA or DNA molecules, especially mRNA transcripts, thereby inhibiting interactions with other biomolecules such as proteins and ribosomes. Morpholino oligomers are not themselves translated, and neither they nor their hybrid duplexes with RNA are attacked by nucleases; also, unlike the negatively charged phosphates of normal nucleic acids, the synthetic backbones of Morpholinos are electrically neutral, making them less likely to interact non-selectively with a host cell's charged proteins. These properties make them useful and reliable tools for artificially generating mutant phenotypes in living cells.[13]
mosaicism
The presence of two or more populations of cells with different genotypes in an individual organism which has developed from a single fertilized egg. A mosaic organism can result from many kinds of genetic phenomena, including nondisjunction of chromosomes, endoreduplication, or mutations in individual stem cell lineages during the early development of the embryo. Mosaicism is similar to but distinct from chimerism.
motif
Any distinctive or recurring sequence of nucleotides in a nucleic acid or of amino acids in a peptide that is or is conjectured to be biologically significant, especially one that is reliably recognized by other biomolecules or which has a three-dimensional structure that permits unique or characteristic chemical interactions such as DNA binding.[13] In nucleic acids, motifs are often short (three to ten nucleotides in length), highly conserved sequences which act as recognition sites for DNA-binding proteins or RNAs involved in the regulation of gene expression.
motor protein
Any protein which converts chemical energy derived from the hydrolysis of nucleoside triphosphates such as ATP and GTP into mechanical work in order to effect its own locomotion, by propelling itself along a filament or through the cytoplasm.[4]
mRNA
See messenger RNA.
mtDNA
See mitochondrial DNA.
multicellular
Composed of more than one cell. The term is used especially to describe organisms or tissues consisting of many cells descendant from the same original parent cell which work together in an organized way, but may also describe groups of nominally single-celled organisms such as protists and bacteria which live symbiotically with each other in large colonies. Contrast unicellular.
multimer
An aggregate of two or more molecular entities, identical or non-identical, held together by non-covalent forces;[5] e.g. a protein complex.
multinucleate
(of a cell) Having more than one nucleus within a single cell; i.e. having multiple nuclei occupying the same cytoplasm.
multiomics
The integration of data from multiple "omics" technologies (e.g. data from the genome, epigenome, transcriptome, proteome, metabolome, etc.) in order to study complex biological relationships, discover novel associations between biological entities, pinpoint relevant biomarkers, or build elaborate models of physiology and disease.
multiple cloning site (MCS)

Also polylinker.

A locus or sequence within a plasmid vector which contains multiple unique restriction sites recognized by various restriction endonucleases, which makes it possible for scientists to target the insertion of a DNA fragment (often a gene cassette) specifically to that locus and in the desired orientation, by digesting the insert and the vector with the same endonuclease(s) and then ligating them together via compatible restriction ends, a technique known as restriction cloning.[5] Commercial plasmids designed for cloning commonly incorporate one or more multiple cloning sites.
mutagen
Any physical or chemical agent that changes the genetic material (usually DNA) of an organism and thereby increases the frequency of mutations above natural background levels.
mutagenesis
1. The process by which the genetic information of an organism is changed, resulting in a mutation. Mutagenesis may occur spontaneously or as a result of exposure to a mutagen.
2. In molecular biology, any laboratory technique by which one or more genetic mutations are deliberately engineered in order to produce a mutant gene, regulatory element, gene product, or genetically modified organism so that the functions of a genetic locus, process, or product can be studied in detail.
mutant
An organism, gene product, or phenotypic trait resulting from a mutation, of a type that would not be observed naturally in wild-type specimens.
mutation
Any permanent change in the nucleotide sequence of a strand of DNA or RNA, or in the amino acid sequence of a peptide. Mutations play a role in both normal and abnormal biological processes; their natural occurrence is integral to the process of evolution. They can result from errors in replication, chemical damage, exposure to high-energy radiation, or manipulations by mobile genetic elements. Repair mechanisms have evolved in many organisms to correct them. By understanding the effect that a mutation has on phenotype, it is possible to establish the function of the gene or sequence in which it occurs.
mutator gene
Any mutant gene or sequence that increases the spontaneous mutation rate of one or more other genes or sequences. Mutators are often transposable elements, or may be mutant housekeeping genes such as those that encode helicases or proteins involved in proofreading.[13]
mutein
A mutant protein, i.e. a protein whose amino acid sequence differs from that of the normal because of a mutation.
muton
The smallest unit of a DNA molecule in which a physical or chemical change can result in a mutation (conventionally a single nucleotide).[13]


N

[edit]
n orientation
One of two possible orientations by which a linear DNA fragment can be inserted into a vector, specifically the one in which the gene maps of both fragment and vector have the same orientation.[13] Contrast u orientation.
NAD
See nicotinamide-adenine dinucleotide.
NADP
See nicotinamide-adenine dinucleotide phosphate.
nanoinjection
A laboratory technique involving the use of a microscopic lance or nanopipette (typically about 100 nanometres in diameter) in the presence of an electric field in order to deliver DNA or RNA directly into a cell, often a zygote or early embryo, via an electrophoretic mechanism. While submerged in a pH-buffered solution, a positive electric charge is applied to the lance, attracting negatively charged nucleic acids to its surface; the lance then penetrates the cell membrane and the electric field is reversed, applying a negative charge which repels the accumulated nucleic acids away from the lance and thus into the cell. Compare microinjection.
nascent
In the process of being synthesized; incomplete; not yet fully processed or mature. The term is commonly used to describe strands of DNA or RNA which are actively undergoing synthesis during replication or transcription, or sometimes a complete, fully transcribed RNA molecule before any post-transcriptional modifications have been made (e.g. polyadenylation or RNA editing), or a peptide chain actively undergoing translation by a ribosome.[13]
ncAA
See non-canonical amino acid.
ncDNA
See non-coding DNA.
ncRNA
See non-coding RNA.
negative (-) sense strand
See template strand.
negative control

Also negative regulation.

The inhibition or deactivation of some biological process caused by the presence of a specific molecular entity (e.g. a repressor), in the absence of which the process is not inhibited and thus can proceed normally.[5] In gene regulation, for example, a repressor may bind to an operator upstream from a coding sequence and prevent access by transcription factors and/or RNA polymerase, thereby blocking the gene's transcription. This is contrasted with positive control, in which the presence of an inducer is necessary to switch on transcription.[9]
negative supercoiling
The supercoiling of a double-stranded DNA molecule in the direction opposite to the turn of the double helix itself (e.g. a left-handed coiling of a helix with a right-handed turn).[9] Contrast positive supercoiling.
next-generation sequencing (NGS)
See massively parallel sequencing.
nick
A break or discontinuity in the phosphate backbone of one strand of a double-stranded DNA molecule, i.e. where a phosphodiester bond is hydrolyzed but no nucleotides are removed; such a molecule is said to be nicked. A nick is a single-strand break, where despite the break the DNA molecule is not ultimately broken into multiple fragments, which contrasts with a cut, where both strands are broken. Nicks may be caused by DNA damage or by dedicated nucleases known as nicking enzymes, which nick DNA at random or specific sites. Nicks are frequently placed by the cell as markers identifying target sites for enzyme activity, including in DNA replication, transcription, and mismatch repair, and also to release torsional stress from overwound DNA molecules, making them important in manipulating DNA topology.[9]
nick translation
nickase
Another name for a nicking enzyme, especially one that has been artificially engineered to create single-stranded breaks (i.e. nicks) by altering the cleavage activity of an endonuclease that normally creates double-stranded breaks, e.g. Cas9 nickase (nCas9).[15]
nicking enzyme

Also nicking endonuclease and nickase.

Any of a class of endonuclease enzymes capable of generating a single-stranded break in a double-stranded DNA molecule, i.e. a nick, either at random or at a specific recognition sequence, by breaking a phosphodiester bond linking adjacent nucleotides.
nicotinamide adenine dinucleotide (NAD)
nicotinamide adenine dinucleotide phosphate (NADP+, NADP)
nitrogenous base

Sometimes used interchangeably with nucleobase or simply base.

Any organic compound containing a nitrogen atom that has the chemical properties of a base. Five particular nitrogenous basesadenine (A), guanine (G), cytosine (C), thymine (T), and uracil (U) – are especially relevant to biology because they are components of nucleotides, which are the primary monomers that make up nucleic acids.
non-canonical amino acid (ncAA)

Also non-standard amino acid.

Any amino acid, natural or artificial, that is not one of the 20 or 21 proteinogenic amino acids encoded by the standard genetic code. There are hundreds of such amino acids, many of which have biological functions and are specified by alternative codes or incorporated into proteins accidentally by errors in translation. Many of the best known naturally occurring ncAAs occur as intermediates in the metabolic pathways leading to the standard amino acids, while others have been made synthetically in the laboratory.[16]
non-coding DNA (ncDNA)
Any segment of DNA that does not encode a sequence that may ultimately be transcribed and translated into a protein. In most organisms, only a small fraction of the genome consists of protein-coding DNA, though the proportion varies greatly between species. Some non-coding DNA may still be transcribed into functional non-coding RNA (as with transfer RNAs) or may serve important developmental or regulatory purposes; other regions (as with so-called "junk DNA") appear to have no known biological function.
non-coding RNA (ncRNA)
Any molecule of RNA that is not ultimately translated into a protein. The DNA sequence from which a functional non-coding RNA is transcribed is often referred to as an "RNA gene". Numerous types of non-coding RNAs essential to normal genome function are produced constitutively, including transfer RNA (tRNA), ribosomal RNA (rRNA), microRNA (miRNA), and small interfering RNA (siRNA); other non-coding RNAs (sometimes described as "junk RNA") have no known function and are likely the product of spurious transcription.
non-coding strand
See template strand.
nondisjunction
The failure of homologous chromosomes or sister chromatids to segregate properly during cell division. Nondisjunction results in daughter cells that are aneuploid, containing abnormal numbers of one or more specific chromosomes. It may be caused by a variety of factors.
non-homologous end joining (NHEJ)
nonrepetitive sequence
Broadly, any nucleotide sequence or region of a genome that does not contain repeated sequences, or in which repeats do not comprise a majority; or any segment of DNA exhibiting the reassociation kinetics expected of a unique sequence.[13]
nonsense mutation

Also point-nonsense mutation.

A type of point mutation which results in a premature stop codon in the transcribed mRNA sequence, thereby causing the premature termination of translation, which results in a truncated, incomplete, and often non-functional protein.
nonsense suppressor
A factor which can inhibit the effects of a nonsense mutation (i.e. a premature stop codon) by any mechanism, usually either a mutated transfer RNA which can bind the mutated stop codon or some kind of ribosomal mutation.[17]
nonsynonymous mutation

Also nonsynonymous substitution or replacement mutation.

A type of mutation in which the substitution of one nucleotide base for another results, after transcription and translation, in an amino acid sequence that is different from that produced by the original unmutated gene. Because nonsynonymous mutations always result in a biological change in the organism, they are often subject to strong selection pressure. Contrast synonymous mutation.
non-transcribed spacer (NTS)
See spacer.
northern blotting
A blotting method in molecular biology used to detect RNA in a sample. Compare Southern blotting, western blotting, and eastern blotting.
nRNA
See nuclear RNA.
N-terminus

Also amine terminus and amino terminus.

The end of a linear chain of amino acids (i.e. a peptide) that is terminated by the free amine group (–NH
2
) of the first amino acid added to the chain during translation. This amino acid is said to be N-terminal. By convention, sequences, domains, active sites, or any other structure positioned nearer to the N-terminus of the polypeptide or the folded protein it forms relative to others are described as upstream. Contrast C-terminus.
nuclear DNA
Any DNA molecule contained within the nucleus of a eukaryotic cell, most prominently the DNA in chromosomes. It is sometimes used interchangeably with genomic DNA.
nuclear envelope
A sub-cellular barrier consisting of two concentric lipid bilayer membranes that surrounds the nucleus in eukaryotic cells. The nuclear envelope is sometimes simply called the "nuclear membrane", though the structure is actually composed of two distinct membranes, an inner membrane and an outer membrane.
nuclear equivalence
The principle that the nuclei of essentially all differentiated cells of a mature multicellular organism are genetically identical to each other and to the nucleus of the zygote from which they descended; i.e. they all contain the same genetic information on the same chromosomes, having been replicated from the original zygotic set with extremely high fidelity. Even though all adult somatic cells have the same set of genes, cells can nonetheless differentiate into distinct cell types by expressing different subsets of these genes. Though this principle generally holds true, the reality is slightly more complex, as mutations such as insertions, deletions, duplications, and translocations as well as chimerism, mosaicism, and various types of genetic recombination can all cause different somatic lineages within the same organism to be genetically non-identical.
nuclear export signal (NES)
nuclear lamina
A fibrous network of proteins lining the inner, nucleoplasmic surface of the nuclear envelope, composed of filaments similar to those that make up the cytoskeleton. It may function as a scaffold for the various contents of the nucleus including nuclear proteins and chromosomes.[3]
nuclear localization signal (NLS)

Also nuclear localization sequence.

An amino acid sequence within a protein which serves as a molecular signal marking the protein for transport into the nucleus, typically consisting of one or more short motifs containing positively charged amino acid residues exposed on the mature protein's surface (especially lysines and arginines). Though all proteins are translated in the cytoplasm, many whose primary biological activities occur inside the nucleus (e.g. transcription factors) require nuclear localization signals identifiable by molecular chaperones in order to cross the nuclear envelope. Contrast nuclear export signal.
nuclear matrix

Also nucleoskeleton.

A mesh-like latticework of protein polymers and microfilaments suspended in the nucleoplasm in the nuclei of eukaryotic cells, akin to the cytoskeleton in the cytoplasm. The nuclear matrix functions as a scaffold and an anchor for large DNA molecules such as chromosomes and for the macromolecular complexes that perform essential nuclear activities such as transcription and DNA replication.[3]
nuclear membrane
See nuclear envelope.
nuclear pore
A complex of membrane proteins that creates an opening in the nuclear envelope through which certain molecules and ions are permitted to pass and thereby enter or exit the nucleus (analogous to the channel proteins in the cell membrane). The nuclear envelope typically has thousands of pores to selectively regulate the exchange of specific materials between the nucleoplasm and the cytoplasm, including messenger RNAs, which are transcribed in the nucleus but must be translated in the cytoplasm, as well as nuclear proteins, which are synthesized in the cytoplasm but must return to the nucleus to serve their functions.[4][3]
nuclear protein
Any protein that is naturally found in or localizes to the cell's nucleus (as opposed to the cytoplasm or elsewhere).
nuclear RNA (nRNA)
Any RNA molecule located within a cell's nucleus, whether associated with chromosomes or existing freely in the nucleoplasm, including small nuclear RNA (snRNA), enhancer RNA (eRNA), and all newly transcribed immature RNAs, coding or non-coding, prior to their export to the cytosol (hnRNA).
nuclear transfer
nuclear transport
The mechanisms by which molecules cross the nuclear envelope surrounding a cell's nucleus. Though small molecules and ions can cross the membrane freely, the entry and exit of larger molecules is tightly regulated by nuclear pores, so that most macromolecules such as RNAs and proteins require association with transport factors in order to be chaperoned across.
nuclease
Any of a class of enzymes capable of cleaving phosphodiester bonds connecting adjacent nucleotides in a nucleic acid molecule (the opposite of a ligase). Nucleases may nick one strand or cut both strands of a duplex molecule, and may cleave randomly or at specific recognition sequences. They are ubiquitous and imperative for normal cellular function, and are also widely employed in laboratory techniques.
nucleic acid
A long, polymeric macromolecule made up of smaller monomers called nucleotides which are chemically linked to one another in a chain. Two specific types of nucleic acid, DNA and RNA, are common to all living organisms, serving to encode the genetic information governing the construction, development, and ordinary processes of all biological systems. This information, contained within the order or sequence of the nucleotides, is translated into proteins, which direct all of the chemical reactions necessary for life.
nucleic acid sequence
The precise order of consecutively linked nucleotides in a nucleic acid molecule such as DNA or RNA. Long sequences of nucleotides are the principal means by which biological systems store genetic information, and therefore the accurate replication, transcription, and translation of such sequences is of the utmost importance, lest the information be lost or corrupted. Nucleic acid sequences may be equivalently referred to as sequences of nucleotides, nitrogenous bases, nucleobases, or, in duplex molecules, base pairs, and they correspond directly to sequences of codons and amino acids.
nucleobase

Sometimes used interchangeably with nitrogenous base or simply base.

Any of the five primary or canonical nitrogenous basesadenine (A), guanine (G), cytosine (C), thymine (T), and uracil (U) – that form nucleosides and nucleotides, the latter of which are the fundamental building blocks of nucleic acids. The ability of these bases to form base pairs via hydrogen bonding, as well as their flat, compact three-dimensional profiles, allows them to "stack" one upon another and leads directly to the long-chain structures of DNA and RNA. When writing sequences in shorthand notation, the letter N is often used to represent a nucleotide containing a generic or unidentified nucleobase.
nucleoid

Also prokaryon.

An irregularly shaped region which contains most or all of the genetic material in prokaryotic cells such as bacteria, but is not enclosed by a nuclear membrane as in eukaryotes.
nucleolin
The primary protein of which the eukaryotic nucleolus is composed, thought to play important roles in chromatin decondensation, transcription of ribosomal RNA, and ribosome assembly.
nucleolonema
The central region of the nucleolus, composed of dense, convoluted fibrillar material.[2]
nucleolus
An organelle within the nucleus of eukaryotic cells which is composed of proteins, DNA, and RNA and serves as the site of ribosome synthesis.
nucleoplasm

Also karyoplasm.

All of the material enclosed within the nucleus of a cell by the nuclear envelope, analogous to the cytoplasm enclosed by the main cell membrane. Like the cytoplasm, the nucleoplasm is composed of a gel-like substance (the nucleosol) in which various organelles, nuclear proteins, and other biomolecules are suspended, including nuclear DNA in the form of chromosomes, the nucleolus, nuclear bodies, and free nucleotides.
nucleoprotein
Any protein that is chemically bonded to or conjugated with a nucleic acid. Examples include ribosomes, nucleosomes, and many enzymes.
nucleosidase
Any of a class of enzymes which catalyze the decomposition of nucleosides into their component nitrogenous bases and pentose sugars.[13]
nucleoside
An organic molecule composed of a nitrogenous base bonded to a five-carbon sugar (either ribose or deoxyribose). A nucleotide additionally includes one or more phosphate groups.
nucleosol

Also karyolymph or nuclear hyaloplasm.

The soluble, liquid portion of the nucleoplasm (analogous to the cytosol of the cytoplasm).
nucleosome
The basic structural subunit of chromatin used in packaging nuclear DNA such as chromosomes, consisting of a core particle of eight histone proteins around which double-stranded DNA is wrapped in a manner akin to thread wound around a spool. The technical definition of a nucleosome includes a segment of DNA about 146 base pairs in length which makes 1.67 left-handed turns as it coils around the histone core, as well as a stretch of linker DNA (generally 38–80 bp) connecting it to an adjacent core particle, though the term is often used to refer to the core particle alone. Long series of nucleosomes are further condensed by association with histone H1 into higher-order structures such as 30-nm fibers and ultimately supercoiled chromatids. Because the histone–DNA interaction limits access to the DNA molecule by other proteins and RNAs, the precise positioning of nucleosomes along the DNA sequence plays a fundamental role in controlling whether or not genes are transcribed and expressed, and hence mechanisms for moving and ejecting nucleosomes have evolved as a means of regulating the expression of particular loci.
nucleosome-depleted region (NDR)
A region of a genome or chromosome in which long segments of DNA are bound by few or no nucleosomes, and thus exposed to manipulation by other proteins and molecules, especially implying that the region is transcriptionally active.
nucleotidase
Any of a class of phosphoric monoester hydrolase enzymes which catalyze the hydrolysis of a nucleotide into a nucleoside and orthophosphate. Such enzymes may or may not distinguish ribonucleotides from deoxyribonucleotides, i.e. their function may not be sugar-specific.[5]
nucleotide

Also nucleoside monophosphate (NMP).

An organic molecule that serves as the fundamental monomer or subunit of nucleic acid polymers, including RNA and DNA. Each nucleotide is composed of three connected functional groups: a nitrogenous base, a five-carbon sugar (either ribose or deoxyribose), and a single phosphate group. Though technically distinct, the term "nucleotide" is often used interchangeably with nitrogenous base, nucleobase, and base pair when referring to the sequences that make up nucleic acids. Compare nucleoside.
The nucleobases (blue) are the five specific nitrogenous bases canonically used in DNA and RNA. A nucleobase bonded to a pentose sugar (either ribose or deoxyribose; yellow) is known as a nucleoside (yellow + blue). A nucleoside bonded to a single phosphate group (red) is known as a nucleoside monophosphate (NMP) or a nucleotide (red + yellow + blue). When not incorporated into a nucleic acid chain, free nucleosides can bind multiple phosphate groups: two phosphates yields a nucleoside diphosphate (NDP), and three yields a nucleoside triphosphate (NTP).
nucleotide sequence
See nucleic acid sequence.
nucleus

pl. nuclei

A large spherical or lobular organelle surrounded by a dedicated membrane which functions as the main storage compartment for the genetic material of eukaryotic cells, including the DNA comprising chromosomes, as well as the site of RNA synthesis during transcription. The vast majority of eukaryotic cells have a single nucleus, though some cells may have more than one nucleus, either temporarily or permanently, and in some organisms there exist certain cell types (e.g. mammalian erythrocytes) which lose their nuclei upon reaching maturity, effectively becoming anucleate. The nucleus is one of the defining features of eukaryotes; the cells of prokaryotes such as bacteria lack nuclei entirely.[2]


O

[edit]
occluding junction
See tight junction.
ochre
One of three stop codons used in the standard genetic code; in RNA, it is specified by the nucleotide triplet UAA. The other two stop codons are named amber and opal.
Okazaki fragments
Short sequences of nucleotides which are synthesized discontinuously by DNA polymerase and later linked together by DNA ligase to create the lagging strand during DNA replication. Okazaki fragments are the consequence of the unidirectionality of DNA polymerase, which only works in the 5' to 3' direction.
oligo dT
A short, single-stranded DNA oligonucleotide consisting of a sequence of repeating deoxythymidine (dT) nucleotides. Oligo dTs are commonly synthesized de novo to be used as primers for in vitro reverse transcription reactions during rtPCR techniques, where short chains of 12 to 18 thymine bases readily complement the poly(A) tails of mature messenger RNAs, allowing the selective amplification and preparation of a cDNA library from a pool of coding transcripts.[5]
oligogene
oligomer
Any polymeric molecule consisting of a relatively short series of connected monomers or subunits; e.g. an oligonucleotide is a short series of nucleotides.
oligonucleotide

Also abbreviated oligo.

A relatively short chain of nucleic acid residues. In the laboratory, oligonucleotides are commonly used as primers or hybridization probes to detect the presence of larger mRNA molecules or assembled into two-dimensional microarrays for high-throughput sequencing analysis.
oligosaccharide
A polymeric carbohydrate molecule consisting of a relatively short chain of connected monosaccharides. Oligosaccharides have important functions in processes such as cell signaling and cell adhesion. Longer chains are called polysaccharides.
omics
A suffix used to describe any of the diverse fields of study that conduct rigorous, systematic analyses of any of the "omes", e.g. the genome, transcriptome, proteome, metabolome, etc.,[5] each of which represents the totality of a specific class of biological content that has been or could hypothetically be isolated from an individual cell, population of cells, organism, species, or some other particular context. Thus genomics is the field of study which analyzes the totality of genes in a genome, proteomics studies the complete set of all of the proteins in a proteome, etc. The term may also be used to refer to all of these fields collectively.
oncogene
A gene that has the potential to cause cancer. In tumor cells, such genes are often mutated and/or expressed at abnormally high levels.
one gene–one polypeptide

Also one gene–one protein or one gene–one enzyme.

The hypothesis that there exists a large class of genes in which each particular gene directs the synthesis of one particular polypeptide or protein.[13] Historically it was thought that all genes and proteins might follow this rule by definition, but it is now known that many proteins are composites of different polypeptides and therefore the product of multiple genes, and also that some genes do not encode polypeptides at all but instead produce non-coding RNAs, which are never translated.
opal

Also umber.

One of three stop codons used in the standard genetic code; in RNA, it is specified by the nucleotide triplet UGA. The other two stop codons are named amber and ochre.
open chromatin
See euchromatin.
open reading frame (ORF)
The part of a reading frame that has the ability to be translated from DNA or RNA into protein; any continuous stretch of codons that contains a start codon and a stop codon.
operator
A regulatory sequence within an operon, typically located between the promoter sequence and the structural genes of the operon, to which an uninhibited repressor protein can bind, thereby physically obstructing RNA polymerase from initiating the transcription of adjacent cistrons.[18]
operon
A functional unit of gene expression consisting of a cluster of neighboring structural genes which are collectively under the control of a single promoter, along with one or more adjacent regulatory sequences such as operators. The set of genes is transcribed together, resulting in a single polycistronic messenger RNA molecule encoding multiple distinct polypeptides which may then be translated together or undergo splicing to create multiple mRNAs which are translated independently; the result is that the genes contained in the operon are either expressed together or not at all. Regulatory proteins, including repressors and activators, usually bind specifically to the regulatory sequences of a given operon; by some definitions, the genes that code for these regulatory proteins are also considered part of the operon.
opsonin
Any substance, especially certain blood-serum proteins such as immunoglobulins, that in binding to the surface of foreign cells or particulate matter increases the susceptibility of the foreign material to phagocytosis by phagocytes.[3] Opsonins work by linking foreign particles to specific receptors on the surface of phagocytic cells in a process known as opsonization.[5]
organelle
A spatially distinct compartment or subunit within a cell which has a specialized function. Organelles occur in both prokaryotic and eukaryotic cells. In the latter they are often separated from the cytoplasm by being enclosed with their own membrane bilayer (whence the term membrane-bound organelles), though organelles may also be functionally specific areas or structures without a surrounding membrane; some cellular structures which exist partially or entirely outside of the cell membrane, such as cilia and flagella, are also referred to as organelles. There are numerous types of organelles with a wide variety of functions, including the various compartments of the endomembrane system (e.g. the nuclear envelope, endoplasmic reticulum, and Golgi apparatus), mitochondria, chloroplasts, lysosomes, endosomes, and vacuoles, among others. Many organelles are unique to particular cell types or species.
origin of replication (ORI)

Also replication origin or simply origin.

A particular location within a DNA molecule at which DNA replication is initiated. Origins are usually defined by the presence of a particular replicator sequence or by specific chromatin patterns.
osmosis
osmotic shock

Also osmotic stress.

Physiological dysfunction caused by a sudden change in the concentration of dissolved solutes in the extracellular environment surrounding a cell, which provokes the rapid movement of water across the cell membrane by osmosis, either into or out of the cell. In a severely hypertonic environment, where extracellular solute concentrations are extremely high, osmotic pressure may force large quantities of water to move out of the cell (plasmolysis), leading to its desiccation; this may also have the effect of inhibiting transport of solutes into the cell, thus denying it the substrates necessary to sustain normal cellular activities. In a severely hypotonic environment, where extracellular solute concentrations are much lower than intracellular concentrations, water is forced to move into the cell (turgescence), causing it to swell in size and potentially burst, or triggering apoptosis.
outron
A sequence near the 5'-end of a primary mRNA transcript that is removed by a special form of splicing during post-transcriptional processing. Outrons are located entirely outside of the transcript's coding sequences, unlike introns.
overexpression
An abnormally high level of gene expression which results in an excessive number of copies of one or more gene products. Overexpression produces a pronounced gene-related phenotype.[19][20]
oxidative phosphorylation

Also electron transport-linked phosphorylation or terminal oxidation.

The process by which cells use chemical energy obtained by the oxidation of nutrients to power the production of adenosine triphosphate (ATP). Oxidative phosphorylation couples two related processes: in the electron transport chain, a series of enzyme-catalyzed redox reactions transfers electrons from energetic donors such as NADH and FADH through various intermediates and ultimately to a terminal electron acceptor such as molecular oxygen (O
2
); the energy liberated by these reactions is simultaneously used in chemiosmosis to move protons (H+
) across a membrane and against their concentration gradient, generating an electrochemical potential which powers ATP synthase, an enzyme that catalyzes the phosphorylation of ADP into ATP. In eukaryotes, both of these processes are carried out by proteins embedded in the membranes of mitochondria and chloroplasts; in prokaryotes, they occur in the cell membrane.
oxidative stress
oxygen cascade
The flow of oxygen from environmental sources (e.g. the air in the atmosphere) to the mitochondria of a cell, where oxygen atoms participate in biochemical reactions that result in the oxidation of energy-rich substrates such as carbohydrates in a process known as aerobic respiration.


P

[edit]
p53

Also Tumor protein P53 (TP53), transformation-related protein 53 (TRP53), and cellular tumor antigen p53.

A class of regulatory proteins encoded by the TP53 gene in vertebrates which bind DNA and regulate gene expression in order to protect the genome from mutation and block progression through the cell cycle if DNA damage does occur.[4] It is mutated in more than 50% of human cancers, indicating it plays a crucial role in preventing cancer formation.
pachynema

Also pachytene stage.

In meiosis, the third of five substages of prophase I, following zygonema and preceding diplonema. During pachynema, the synaptonemal complex facilitates crossing over between the synapsed homologous chromosomes, and the centrosomes begin to move apart from each other.[13]
palindromic sequence

Also palindrome.

A nucleic acid sequence of a double-stranded DNA or RNA molecule in which the unidirectional sequence (e.g. 5' to 3') of nucleobases on one strand is identical to the sequence in the same direction (e.g. 5' to 3') on the complementary strand. In other words, a sequence is said to be palindromic if it is equal to its own reverse complement. Palindromic motifs are common recognition sites for restriction enzymes.
paracellular transport
The transfer of substances across an epithelium by passing through the extracellular space between cells, in contrast to transcellular transport, where substances travel through cells by crossing the intracellular cytoplasm.
paracrine
Describing or relating to a class of agonist signaling molecules produced and secreted by regulatory cells into the extracellular environment and then transported by passive diffusion to target cells other than those which produced them. The term may refer to the molecules themselves, sometimes called paramones, to the cells that produce them, or to signaling pathways which rely on them.[5] Compare autocrine, endocrine, and juxtacrine.
paratope

Also antigen-binding site.

An idiotope, i.e. the specific site or region within an antibody that recognizes and binds to a particular antigen or epitope.[21] The uniqueness of a paratope allows it to bind to only one epitope with very high affinity. At the end of each arm of the Y-shaped antibody is an identical paratope, and each paratope comprises a total of six complementarity-determining regions (three from each of the light and heavy chains) which protrude from a series of antiparallel beta sheets in the antibody's higher structure.[22] The term is also sometimes used to refer to the specific site on a ligand molecule which defines the ligand's specificity for other molecules such as cell-surface receptors.[3]
parent cell
The original or ancestral cell from which a given set of descendant cells, known as daughter cells, have divided by mitosis or meiosis.
passenger
A DNA fragment of interest designed to be spliced into a 'vehicle' such as a plasmid vector and then cloned.[13]
passive transport
The movement of a solute across a membrane by traveling down an electrochemical or concentration gradient, using only the energy stored in the gradient and not any energy from external sources.[3] Contrast active transport.
Pasteur effect

Also Pasteur-Meyerhof effect.

A phenomenon observed in facultatively anaerobic cells, including animal tissues and many microorganisms such as yeast, whereby the presence of oxygen in the environment inhibits the cell's use of ethanol fermentation pathways to generate energy, and drives the cell to instead make use of the available oxygen in aerobic respiration;[23] or more generally the observed decrease in the rate of glycolysis or of lactate production in cells exposed to oxygenated air.[5]
PCR
See polymerase chain reaction.
PCR product
See amplicon.
pentose
Any monosaccharide containing five carbon atoms. The compounds ribose and deoxyribose are both pentose sugars, which, in the form of cyclic five-membered rings, serve as the central structural components of the ribonucleotides and deoxyribonucleotides that make up RNA and DNA, respectively.
peptidase
See protease.
peptide
A short chain of amino acid monomers linked by covalent peptide bonds. Peptides are the fundamental building blocks of longer polypeptide chains and hence of proteins.
peptide bond
A covalent chemical bond between the carboxyl group of one amino acid and the amino group of an adjacent amino acid, formed by a dehydration reaction catalyzed by peptidyl transferase, an enzyme within the ribosome, during translation. A linear chain of amino acids linked by peptide bonds may be called a peptide or polypeptide.
peptidoglycan

Also murein.

A glycoconjugate complex of interwoven peptides and polysaccharides that is a primary constituent of the cell wall in all bacteria and archaea, consisting of strands of glycosaminoglycans crosslinked by short oligopeptides (usually 4–10 residues[3]) to form a rigid lattice of indefinite size.[5] The proportion of the cell wall that is peptidoglycan varies widely by strain and is often used to aid strain identification: the higher peptidoglycan content of the cell walls of Gram-positive bacteria causes them to stain a darker color than Gram-negative bacteria.
pericentriolar material (PCM)
perinuclear space
The space between the inner and outer membranes of the nuclear envelope.
peripheral membrane protein

Also extrinsic membrane protein.

Any of a class of membrane proteins which attach only temporarily to the cell membrane, either by penetrating the lipid bilayer or by attaching to other proteins which are permanently embedded within the membrane.[24] The ability to reversibly interact with membranes makes peripheral membrane proteins important in many different roles, commonly as regulatory subunits of channel proteins and cell surface receptors. Their domains often undergo rearrangement, dissociation, or conformational changes when they interact with the membrane, resulting in the activation of their biological activity.[25] In protein purification, peripheral membrane proteins are typically more water-soluble and much easier to isolate from the membrane than integral membrane proteins.
periplasmic space

Also periplasm.

In a bacterial cell, the space between the cell membrane and the cell wall.[5]
peroxisome
A small membrane-bound organelle found in many eukaryotic cells which specializes in carrying out oxidative reactions with various enzyme peroxidases and catalase, generally to mitigate damage from reactive oxygen species but also as a participant in various metabolic pathways such as beta-oxidation of fatty acids.[26]
persistence
1. The tendency of a moving cell to continue moving in the same direction as previously; that is, even in isotropic environments, there inevitably still exists an inherent bias by which, from instant to instant, cells are more likely not to change direction than to change direction. Averaged over long periods of time, however, this bias is less obvious and cell movements are better described as a random walk.[3]
2. The ability of some viruses to remain present and viable in cells, organisms, or populations for very long periods of time by any of a variety of strategies, including retroviral integration and immune suppression, often in a latent form which replicates very slowly or not at all.[3]
Petri dish
A shallow, transparent plastic or glass dish, usually circular and covered with a lid, which is widely used in biology laboratories to hold solid or liquid growth media for the purpose of culturing cells. They are particularly useful for adherent cultures, where they provide a flat, sterile surface conducive to colony formation from which scientists can easily isolate and identify individual colonies.
phagemid
A bacteriophage with a genome encoding a mobile plasmid that can be excised by co-infection of the host cell with a helper phage. Phagemids are useful as vectors for library production.[3]
phagocyte
Any cell capable of phagocytosis, especially any of various cell types of the immune system which engulf and ingest harmful foreign molecules, bacteria, and dead or dying cells, including neutrophils and macrophages.[3]
phagocytosis
The process by which foreign cells, molecules, and small particulate matter are engulfed and ingested via endocytosis by specialized cells known as phagocytes (a class which includes macrophages and neutrophils).[4]
phagosome
A large, intracellular, membrane-bound vesicle formed as a result of phagocytosis and containing whatever previously extracellular material was engulfed during that process.[4]
pharmacogenomics
The study of the role played by the genome in the body's response to pharmaceutical drugs, combining the fields of pharmacology and genomics.
phenome
The complete set of phenotypes that are or can be expressed by a genome, cell, tissue, organism, or species; the sum of all of its manifest chemical, morphological, and behavioral characteristics or traits.
phenomic lag
A delay in the phenotypic expression of a genetic mutation owing to the time required for the manifestation of changes in the affected biochemical pathways.[9]
phenotype
The composite of the observable morphological, physiological, and behavioral traits of an organism that result from the expression of the organism's genotype as well as the influence of environmental factors and the interactions between the two.
phenotypic switching
A type of phenotypic plasticity in which a cell rapidly undergoes major changes to its morphology and/or function, usually via epigenetic modifications, allowing it to quickly switch back and forth between disparate phenotypes in response to changes in the local microenvironment.
phosphatase
Any of a class of enzymes that catalyze the hydrolytic cleavage of a phosphoric acid monoester into a phosphate ion and an alcohol, e.g. the removal of a phosphate group from a nucleotide via the breaking of the ester bond connecting the phosphate to a ribose or deoxyribose sugar or to another phosphate, a process termed dephosphorylation. The opposite process is performed by kinases.
phosphate
Any chemical species or functional group derived from phosphoric acid (H
3
PO
4
) by the removal of one or more protons (H+
); the completely ionized form, [PO
4
]3−
, consists of a single, central phosphorus atom covalently bonded to four oxygen atoms via three single bonds and one double bond. Phosphates are abundant and ubiquitous in biological systems, where they occur either as free anions in solution, known as inorganic phosphates and symbolized Pi, or bonded to organic molecules via ester bonds. The huge diversity of organophosphate compounds includes all nucleotides, whose phosphate groups are linked by phosphodiester bonds to form the structural backbones of long nucleotide chains such as DNA and RNA, and the high-energy diphosphate and triphosphate substituents of individual nucleotides such as ADP and ATP serve as essential energy carriers in all cells. Phospholipids are major components of most membranes. Enzymes known as kinases and phosphatases catalyze the addition and removal of phosphate groups to and from these and other biomolecules.
phosphate backbone

Also phosphodiester backbone, sugar–phosphate backbone, and phosphate–sugar backbone.

The linear chain of alternating phosphate and sugar compounds that results from the linking of consecutive nucleotides in the same strand of a nucleic acid molecule, and which serves as the structural framework of the nucleic acid. Each individual strand is held together by a repeating series of phosphodiester bonds connecting each phosphate group to the ribose or deoxyribose sugars of two adjacent nucleotides. These bonds are created by ligases and broken by nucleases.
A defining element of nucleic acid structure is the linear chain of alternating sugars (orange) and phosphates (yellow) known as the phosphate backbone, which acts as a scaffold to which nucleobases are attached. The phosphorus atom of each phosphate group forms two ester bonds to specific carbon atoms within the pentose sugars—ribose in RNA and deoxyribose in DNA—of two adjacent nucleosides.
phosphodiester bond
A pair of ester bonds linking a phosphate molecule with the two pentose rings of consecutive nucleosides on the same strand of a nucleic acid. Each phosphate forms a covalent bond with the 3' carbon of one pentose and the 5' carbon of the adjacent pentose; the repeated series of such bonds that holds together the long chain of nucleotides comprising DNA and RNA molecules is known as the phosphate or phosphodiester backbone.
phospholipid
Any of a subclass of lipids consisting of a central alcohol (usually glycerol) covalently bonded to three functional groups: a negatively charged phosphate group, and two long fatty acid chains. This arrangement results in a highly amphipathic molecule which in aqueous solutions tends to aggregate with similar molecules in a lamellar or micellar conformation with the hydrophilic phosphate "heads" oriented outward, exposing them to the solution, and the hydrophobic fatty acid "tails" oriented inward, minimizing their interactions with water and other polar compounds. Phospholipids are the major structural membrane lipid in almost all biological membranes except the membranes of some plant cells and chloroplasts, where glycolipids dominate instead.[3]
phospholipid bilayer
See lipid bilayer.
phosphorylation
The attachment of a phosphate ion, PO3−
4
, to another molecule or ion or to a protein by covalent bonding. Phosphorylation and the inverse reaction, dephosphorylation, are essential steps in numerous biochemical pathways, including in the production of adenosine triphosphate (ATP) (as in oxidative phosphorylation); in the metabolism of glucose and the synthesis of glycogen; and in the post-translational modification of amino acid residues in many proteins. Enzymes which catalyze phosphorylation reactions are known as kinases; those that catalyze dephosphorylation are known as phosphatases.
pinocytosis
A form of endocytosis in which liquid and suspended solids from the extracellular environment are captured in inward invaginations of the cell membrane which then "bud off" into enclosed vesicles in the cytoplasm. The contents of these vesicles are then passed to organelles such as endosomes by fusion of the vesicular and organellar membranes. Pinocytosis is the predominant form of endocytosis occurring in most cells, such that the term is often used interchangeably with endocytosis as a whole.[5]
piRNA
See Piwi-interacting RNA.
pitch
The number of base pairs contained within a single complete turn of the DNA double helix,[13] used as a measure of the "tightness" or density of the helix's spiral.
Piwi-interacting RNA (piRNA)
plasma membrane
See cell membrane.
plasmid
Any small DNA molecule that is physically separated from the larger body of chromosomal DNA and can replicate independently. Plasmids are typically small (less than 100 kbp), circular, double-stranded DNA molecules in prokaryotes such as bacteria, though they are also sometimes present in archaea and eukaryotes.
plasmid partitioning
The process by which plasmids which have been replicated inside a parent cell are distributed equally between daughter cells during cell division.[3]
plasmid-mediated resistance
The development of resistance to toxins or antibiotics which is enabled by the horizontal transfer of resistance genes encoded within small, independently replicating DNA molecules known as plasmids. This process occurs naturally via mechanisms such as bacterial conjugation, but is also a common aspect of genetic engineering methods such as molecular cloning.
plasmolysis
The temporary shrinkage of the protoplasm of a plant or bacterial cell away from the cell wall, caused by loss of water from the cell.[5]
plastid
Any of a class of membrane-bound organelles found in the cells of some eukaryotes such as plants and algae which are hypothesized to have evolved from endosymbiotic cyanobacteria; examples include chloroplasts, chromoplasts, and leucoplasts. Plastids retain their own circular chromosomes which replicate independently of the host cell's genome. Many contain photosynthetic pigments which allow them to perform photosynthesis, while others have been retained for their ability to synthesize unique chemical compounds.
pleomorphism
1. Variability in the size, shape, or staining of cells and/or their nuclei, particularly as observed in histology and cytopathology, where morphological variation is frequently an indicator of a cellular abnormality such as disease or tumor formation.
2. In microbiology, the ability of some microorganisms such as certain bacteria and viruses to alter their morphology, metabolism, or mode of reproduction in response to changes in their environment.
plithotaxis
The tendency of cells within a monolayer to migrate in the direction of the local highest tension or maximal principal stress, exerting minimal shear stress on neighboring cells and thereby propagating the tension across many intercellular junctions and causing the cells to exhibit a sort of collective migration.[27]
ploidy
The number of complete sets of chromosomes in a cell, and hence the number of possible alleles present within the cell at any given autosomal locus.
A cell's ploidy level is defined by the number of copies it has of each specific chromosome: if the cell has two copies of each of three distinct chromosomes, it is said to be diploid (2N).
pluripotency
plus-strand
See coding strand.
point mutation
A mutation by which a single nucleotide base is changed, inserted, or deleted from a sequence of DNA or RNA.
poly(A) tail
A post-transcriptional modification consisting of a chain of repeated adenosine residues, 40–250 nucleotides in length, attached to the 3' end of nearly all mature eukaryotic messenger RNA transcripts (those of histones being a notable exception).[5]
polyadenylation
The addition of a series of multiple adenosine ribonucleotides, known as a poly(A) tail, to the 3'-end of a primary RNA transcript, typically a messenger RNA. Primary transcripts are first cleaved 10–30 nucleotides downstream of a highly conserved AAUAAA sequence, then the poly(A) tail is generated from the chaining of multiple ATP molecules through the action of polynucleotide adenylyltransferase.[5] A class of post-transcriptional modification, polyadenylation serves different purposes in different cell types and organisms. In eukaryotes, the addition of a poly(A) tail is an important step in the processing of a raw transcript into a mature mRNA, ready for export to the cytoplasm where translation occurs; in many bacteria, polyadenylation has the opposite function, instead promoting the RNA's degradation.
polyclonal
Describing cells, proteins, or molecules descended or derived from more than one clone (i.e. from more than one genome or genetic lineage) or made in response to more than one unique stimulus. Antibodies are often described as polyclonal if they have been produced or raised against multiple distinct antigens or multiple variants of the same antigen, such that they can recognize more than one unique epitope.[2] Contrast monoclonal.
polylinker
See multiple cloning site.
polymer
A macromolecule composed of multiple repeating subunits or monomers; a chain or aggregation of many individual molecules of the same compound or class of compound.[2] The formation of polymers is known as polymerization and generally only occurs when nucleation sites are present and the concentration of monomers is sufficiently high.[3] Many of the major classes of biomolecules are polymers, including nucleic acids and polypeptides.
polymerase
Any of a class of enzymes which catalyze the synthesis of polymeric molecules, especially nucleic acid polymers, typically by encouraging the base pairing of free nucleotides with those of an existing complementary template strand. DNA polymerases and RNA polymerases are essential for DNA replication and transcription, respectively.
polymerase chain reaction (PCR)
Any of a wide variety of molecular biology methods involving the rapid production of millions or billions of copies of a specific DNA sequence, allowing scientists to selectively amplify fragments of a very small sample to a quantity large enough to study in detail. In its simplest form, PCR generally involves the incubation of a target DNA sample of known or unknown sequence with a reaction mixture consisting of oligonucleotide primers, a heat-stable DNA polymerase, and free deoxyribonucleotide triphosphates (dNTPs), all of which are supplied in excess. This mixture is then alternately heated and cooled to pre-determined temperatures for pre-determined lengths of time according to a specified pattern which is repeated for many cycles, typically in a thermal cycler which automatically controls the required temperature variations. In each cycle, the most basic of which includes a denaturation phase, annealing phase, and elongation phase, the copies synthesized in the previous cycle are used as templates for synthesis in the next cycle, causing a chain reaction that results in the exponential growth of the total number of copies in the reaction mixture. Amplification by PCR has become a standard technique in virtually all molecular biology laboratories.
A diagram of the exponential amplification of a specific DNA sequence via the polymerase chain reaction (PCR)
polymerization
The formation of a polymer from its constituent monomers; the chemical reaction or series of reactions by which monomeric subunits are covalently linked together into a polymeric chain or branching aggregate; e.g. the polymerization of a nucleic acid chain by linking consecutive nucleotides, a reaction catalyzed by a polymerase enzyme.
polymorphism
1. In genetics, the regular and simultaneous existence of two or more discontinuous alleles or genotypes in the same population where the frequency of each allele is greater than can be explained by recurrent mutation alone, typically occurring in more than 1 percent of the population's individuals.[3] An example is the different human blood types (A, B, AB, and O).[5]
2. In chemistry, the existence of the same substance in two or more different crystalline forms.[5]
polypeptide
A long, continuous, and unbranched polymeric chain of amino acid monomers linked by covalent peptide bonds, typically longer than a peptide. Proteins generally consist of one or more polypeptides folded or arranged in a biologically functional way.
polyploid
(of a cell or organism) Having more than two homologous copies of each chromosome; i.e. any ploidy level that is greater than diploid. Polyploidy may occur as a normal condition of chromosomes in certain cells or even entire organisms, or it may result from errors in cell division or mutations causing the duplication of the entire chromosome set.
polyribosome
See polysome.
polysaccharide
A linear or branched polymeric chain of carbohydrate monomers (monosaccharides). Examples include glycogen and cellulose.[4]
polysome

Also polyribosome or ergosome.

A functional unit of protein synthesis consisting of multiple ribosomes attached along the length of the same messenger RNA transcript.[3]
polysomy
The condition of a cell or organism having at least one more copy of a particular chromosome than is normal for its ploidy level, e.g. a diploid organism with three copies of a given chromosome is said to show trisomy. Every polysomy is a type of aneuploidy.
polytene chromosome
position effect
Any effect on the expression or functionality of a gene or sequence that is a consequence of its location or position within a chromosome or other DNA molecule. A sequence's precise location relative to other sequences and structures tends to strongly influence its activity and other properties, because different loci on the same molecule can have substantially different genetic backgrounds and physical/chemical environments, which may also change over time. For example, the transcription of a gene located very close to a nucleosome, centromere, or telomere is often repressed or entirely prevented because the proteins that make up these structures block access to the DNA by transcription factors, while the same gene is transcribed at a much higher rate when located in euchromatin. Proximity to promoters, enhancers, and other regulatory elements, as well as to regions of frequent transposition by mobile elements, can also directly affect expression; being located near the end of a chromosomal arm or to common crossover points may affect when replication occurs and the likelihood of recombination. Position effects are a major focus of research in the field of epigenetic inheritance.
positional cloning

Also map-based cloning.

A strategy for identifying and cloning a candidate gene based on knowledge of its locus or position alone and with little or no information about its products or function, in contrast to functional cloning. This method usually begins by comparing the genomes of individuals expressing a phenotype of unknown provenance (often a hereditary disease) and identifying genetic markers shared between them. Regions defined by markers flanking one or more genes of interest are cloned, and the genes located between the markers can then be identified by any of a variety of means, e.g. by sequencing the region and looking for open reading frames, by comparing the sequence and expression patterns of the region in mutant and wild-type individuals, or by testing the ability of the putative gene to rescue a mutant phenotype.[13]
positive (+) sense strand
See coding strand.
positive control

Also positive regulation.

The initiation, activation, or enhancement of some biological process by the presence of a specific molecular entity (e.g. an activator or inducer), in the absence of which the process cannot proceed or is otherwise diminished.[5] In gene regulation, for example, the binding of an activating molecule such as a transcription factor to a promoter may recruit RNA polymerase to a coding sequence, thereby causing it to be transcribed. Contrast negative control.
positive supercoiling
The supercoiling of a double-stranded DNA molecule in the same direction as the turn of the double helix itself (e.g. a right-handed coiling of a helix with a right-handed turn).[9] Contrast negative supercoiling.
post-transcriptional modification
post-translational modification
potency
precursor cell

Also blast cell.

A partially differentiated or intermediate stem cell with the ability to further differentiate into only one cell type; i.e. a unipotent stem cell that is the immediate parent cell from which fully differentiated cell types divide. The term "precursor cell" is sometimes used interchangeably with progenitor cell, though this term may also be considered technically distinct.
Pribnow box
primary structure
primary transcript
The unprocessed, single-stranded RNA molecule produced by the transcription of a DNA sequence as it exists before post-transcriptional modifications such as alternative splicing convert it into a mature RNA product such as an mRNA, tRNA, or rRNA. A precursor mRNA or pre-mRNA, for example, is a primary transcript which, after processing, becomes a mature mRNA ready for translation.
primase
Any of a class of enzymes that catalyze the synthesis of short, ~10-base RNA oligonucleotides, which by complementing the lagging strand during DNA replication are used as primers by DNA polymerase to initiate the synthesis of Okazaki fragments.[3]
primer
A short, single-stranded oligonucleotide, typically 5–100 bases in length, which "primes" or initiates nucleic acid synthesis by hybridizing to a complementary sequence on a template strand and thereby providing an existing 3'-end from which a polymerase can extend the new strand. Natural systems exclusively use RNA primers to initiate DNA replication and some forms of prokaryotic transcription, whereas the in vitro syntheses performed in many laboratory techniques such as PCR often use DNA primers. In modern laboratories, primers are carefully designed, often in "forward" and "reverse" pairs, to complement specific and unique sequences in target DNA molecules, with consideration given to their melting and annealing temperatures, and then purchased from commercial suppliers which create oligonucleotides on demand by de novo synthesis.
primer dimer (PD)
primer walking
priming
The initiation of nucleic acid synthesis by the hybridization or annealing of one or more primers to a complementary sequence within a template strand.
pro-enzyme
See zymogen.
pro-protein

Also pro-peptide.

An inactive precursor of a protein or polypeptide that is converted into the active form by some post-translational modification, such as by cleaving a specific peptide sequence from the precursor or by attaching other molecules to specific amino acid residues. The names of protein precursors are often prefixed with pro-, as in proinsulin. Enzyme precursors may be called pro-enzymes or zymogens.
probe
Any reagent used to make a single measurement in a biochemical assay such as a gene expression experiment. Molecules which have a specific affinity for one or more other molecules may be used to probe for the presence of those other molecules in samples of unknown composition. Probes are often labelled or otherwise used as reporters to indicate whether or not a specific chemical reaction is taking place. See also hybridization probe.
probe-set
A collection of two or more probes designed to measure a single molecular species, such as a collection of oligonucleotides designed to hybridize to various parts of the mRNA transcripts generated from a single gene.
process molecular gene concept
An alternative definition of a gene which emphasizes the contribution of non-DNA factors to the process by which the information encoded in a DNA sequence results in the synthesis of a polypeptide.
prometaphase
The second stage of cell division in mitosis, following prophase and preceding metaphase, during which the nuclear membrane disintegrates, the chromosomes inside form kinetochores around their centromeres, microtubules emerging from the poles of the mitotic spindle reach the nuclear space and attach to the kinetochores, and motor proteins associated with the microtubules begin to push the chromosomes toward the center of the cell.
promoter
A sequence or region of DNA, usually 100–1,000 base pairs long, which regulates the expression of one or more associated genes by containing binding sites for transcription factors which recruit RNA polymerase to the sequence and initiate transcription. Promoters are typically located immediately upstream of the genes they regulate, near to and often including the transcription start site.
promotion
See upregulation.
prophase
The first stage of cell division in both mitosis and meiosis, occurring after interphase and before prometaphase, during which the DNA of the chromosomes is condensed into chromatin, the nucleolus disintegrates, centrosomes move to opposite ends of the cell, and the mitotic spindle forms.
protease

Also peptidase.

Any of a class of enzymes which catalyze proteolysis, i.e. the decomposition of proteins into smaller polypeptides or individual amino acids, by cleaving peptide bonds via hydrolysis. Proteases are ubiquitous components of numerous biological pathways, and therefore it is often necessary to inhibit them in order for laboratory techniques involving protein activity to be effective.
proteasome

Also ingensin, macropain, prosome, multicatalytic proteinase, and multicatalytic endopeptidase complex.

A large complex of protease enzymes that selectively degrades intracellular proteins which have been tagged for degradation by ubiquitination. Proteasomes play important roles in the timing and onset of cellular processes through the signal-mediated proteolysis of certain enzymes and regulatory proteins; they also contribute to the stress response by removing abnormal proteins and to the immune response by generating antigenic peptides.[5]
protein
A polymeric macromolecule composed of one or more long chains of amino acids linked by peptide bonds. Proteins are the three-dimensional structures created when these chains fold into specific higher-order arrangements following translation, and it is this folded structure which determines a protein's chemical activity and hence its biological function. Ubiquitous and fundamental in all living organisms, proteins are the primary means by which the activities of life are performed, participating in the vast majority of the biochemical reactions that occur inside and outside of cells. They are often classified according to the type(s) of reaction(s) they facilitate or catalyze, by the chemical substrate(s) they act upon, or by their functional role in cellular activity; e.g. as structural proteins, motor proteins, enzymes, transcription factors, or links within biochemical pathways.
protein complex
An assembly or aggregate of multiple proteins held together by intermolecular forces, especially one with a particular biological function. Complexes may include many of the same protein or all different proteins. Numerous cellular activities, including DNA replication, transcription, and translation, rely on protein complexes.[4]
protein folding
The physical process by which the linear chains of amino acids (i.e. polypeptides) synthesized during translation are changed from random coils into stable, orderly, three-dimensional shapes (i.e. proteins) by assuming a higher-order structure or conformation which permits the protein to be biologically functional, known as its native state. Folding is the consequence of amino acid residues participating in intermolecular electrostatic interactions with each other and with their surroundings, including other molecules, and so is strongly influenced by the particularities of the local chemical environment. The time it takes to properly fold a protein can vary greatly, but the process often begins while chain synthesis is still ongoing. Some chains may have motifs or domains which lack intrinsic order and remain unfolded across a wide range of chemical conditions. Having the correct three-dimensional structure is essential for proper protein function, and misfolded proteins are generally biologically inactive, though mutant folds can occasionally modify functionality in useful ways.
protein kinase
Any of a class of enzymes which phosphorylate proteins by catalyzing the transfer of a phosphate from ATP to an amino acid residue and often causing a functionally relevant conformational change as a result. The great majority of protein kinases phosphorylate the hydroxyl side chains of either serine, threonine, or tyrosine, though other types also exist.[3] Separate classes of kinases phosphorylate non-protein molecules such as lipids and carbohydrates.
protein purification
protein sorting

Also protein targeting.

The set of biological mechanisms by which proteins are directed and transported to appropriate destinations within or outside of the cell. Proteins must often be routed into the interior of organelles, embedded within a membrane, or secreted into the extracellular environment in order to serve their functions, and information contained in the protein itself instructs this delivery process.[28] In eukaryotic cells, an expansive network of organelles and pathways is specialized to facilitate protein sorting, including the endoplasmic reticulum and the Golgi apparatus.
protein tag
protein targeting
See protein sorting.
protein-coding gene
A gene containing a coding sequence which can be transcribed and translated to produce a protein, as opposed to an RNA gene, which produces non-coding RNA transcripts that are not translated into proteins but instead have functions in and of themselves.
proteinogenic amino acid
Any of the 20 canonical amino acids which are encoded by the standard genetic code and incorporated into peptides and ultimately proteins during translation. The term may also be inclusive of an additional two amino acids encoded by non-standard codes which can be incorporated by special translation mechanisms.
protein–protein interaction (PPI)
proteoglycan
Any heavily glycosylated protein, i.e. a core polypeptide with one or more covalently attached glycosaminoglycan chains. Proteoglycans are therefore considered a subclass of glycoproteins in which the carbohydrate units are long, linear polysaccharide polymers containing amino sugars[29] and generally bearing a net negative charge under physiological conditions due to the presence of sulfates and uronic acid groups. They are a major component of the extracellular matrix between animal cells, where they form large hydrated complexes commonly employed in connective tissues such as cartilage.
proteolysis
The decomposition of proteins into their component polypeptides or individual amino acids by cleaving the peptide bonds linking the amino acids together via hydrolysis. Proteolysis is an important reaction used not only for degrading and inactivating proteins but sometimes also to activate them by removing amino acid residues which inhibit their activity.[3] It is usually catalyzed by enzymes known as proteases.
proteome
The entire set of proteins that is or can be expressed by a particular genome, cell, tissue, or species at a particular time (such as during a single lifespan or during a specific developmental stage) or under particular conditions (such as when compromised by a certain disease).
proteomics
The study of the proteome of a particular genome, cell, or organism, i.e. the sum total of all of the proteins produced from it by translation. Proteomics technologies allow scientists to purify and identify proteins and polypeptides and determine which ones are most and least abundant at a given time or under a given experimental condition.
protomer
Any molecular subunit from which a larger polymeric macromolecule is built, including those subunits which are not strictly monomers and can themselves be divided into subunits. For example, a heterodimer of tubulin proteins is the protomer for microtubule assembly.[3]
proton motive force
See chemiosmotic coupling.
protoplasm
The biological contents enclosed within a membrane-bound space, variously referring to the cytoplasm, or the cytoplasm and nucleoplasm considered collectively, and sometimes exclusive of vacuoles.
protoplast
A plant, fungal, or bacterial cell which has had its cell wall removed by mechanical, chemical, or enzymatic means; or the complete contents (the protoplasm) of an intact cell excluding the cell wall.
pulsatile secretion
The secretion of substances from a cell, organelle, or tissue in a regular, rhythmic, pulse-like pattern. Many intercellular signaling molecules such as hormones and neurotransmitters are released in this manner in order to maintain homeostasis or to sensitize target cells by stimulating their production of surface receptors.
purine

Abbreviated in shorthand with the letter R.

A double-ringed heterocyclic organic compound which, along with pyrimidine, is one of two molecules from which all nitrogenous bases (including the nucleobases used in DNA and RNA) are derived. Adenine (A) and guanine (G) are classified as purines. The letter R is sometimes used to indicate a generic purine; e.g. in a nucleotide sequence read, R may be used to indicate that either purine nucleobase, A or G, can be substituted at the indicated position.
putative gene
A specific nucleotide sequence suspected to be a functional gene based on the identification of its open reading frame. The gene is said to be "putative" in the sense that no function has yet been described for its products.
pyknosis

Also pycnosis or karyopyknosis.

The irreversible condensation of chromatin inside the nucleus as the cell undergoes necrosis or apoptosis, resulting in a compact mass which stains strongly and is conspicuous under a microscope.[13] It is followed by karyorrhexis.
pyrimidine

Abbreviated in shorthand with the letter Y.

A single-ringed heterocyclic organic compound which, along with purine, is one of two molecules from which all nitrogenous bases (including the nucleobases used in DNA and RNA) are derived. Cytosine (C), thymine (T), and uracil (U) are classified as pyrimidines. The letter Y is sometimes used to indicate a generic pyrimidine; e.g. in a nucleotide sequence read, Y may be used to indicate that either pyrimidine nucleobase – C, T, or U – can be substituted at the indicated position.
pyrimidine dimer
A type of molecular lesion caused by photochemical damage to DNA or RNA, whereby exposure to ultraviolet (UV) radiation induces the formation of covalent bonds between pyrimidine bases occupying adjacent positions in the same polynucleotide strand, which in turn may cause local conformational changes in secondary structure and prevent base pairing with the opposite strand. In DNA, the dimerization reaction occurs between neighboring thymine and cytosine residues (TT, CC, or TC); it can also occur between cytosine and uracil residues in double-stranded RNA. Pyrimidine dimers are usually quickly corrected by nucleotide excision repair, but uncorrected lesions can inhibit or arrest polymerase activity during transcription or replication.
pyruvic acid


Q

[edit]
quantitative PCR (qPCR)

Also real-time PCR (rtPCR).

quiescent culture
A cell culture in which there is little or no active cell growth or replication but in which the cells nonetheless continue to survive, as observed with some confluent cultures.[2]


R

[edit]
random walk
A popular description of the path followed by a locomotive cell or particle when there is no bias in movement, i.e. when the direction of movement at any given instant is not influenced by the direction of movement in the preceding instant. The essential randomness of cell movement in a uniform environment is only apparent over long periods of time, however; in the short term, cells can and do exhibit a tendency to continue moving in the same direction.[3]
rDNA
1. An abbreviation of recombinant DNA.
2. An abbreviation of ribosomal DNA.
reading frame
A way of dividing the nucleotide sequence in a DNA or RNA molecule into a series of consecutive, non-overlapping groups of three nucleotides, known as triplets, which is how the sequence is interpreted or "read" by ribosomes during translation. In coding DNA, each triplet is referred to as a codon and corresponds to a particular amino acid to be added to the nascent peptide chain during translation. In general, only one reading frame (the so-called open reading frame) in a given sequence encodes a functional protein, though there are exceptions. A frameshift mutation results in a shift in the normal reading frame which affects all downstream codons and usually results in a completely different and senseless amino acid sequence.
readthrough
The continued transcription or translation of a DNA or mRNA sequence, respectively, beyond encoded termination signals such as a terminator sequence or a stop codon.[3]
real-time PCR (rtPCR)
See quantitative PCR.
reassociation kinetics
The measurement and manipulation of the rate of reannealing of complementary strands of DNA, generally by heating and denaturing a double-stranded molecule into single strands and then observing their rehybridization at a cooler temperature. Because the base pair G+C requires more energy to anneal than the base pair A+T, the rate of reannealing between two strands depends partly on their nucleotide sequence, and it is therefore possible to predict or estimate the sequence of the duplex molecule by the time it takes to fully hybridize. Reassociation kinetics is studied with C0t analysis: fragments reannealing at low C0t values tend to have highly repetitive sequences, while higher C0t values imply more unique sequences.[13]
receptor
A protein which initiates a cellular response to an external stimulus or propagates a molecular signal by binding a specific ligand, often a dedicated signaling molecule. Numerous types of receptors exist which serve an enormous variety of functions. Cell-surface receptors, such as those that bind acetylcholine and insulin, are embedded within the cell membrane with their binding sites exposed to the extracellular space; intracellular receptors, including many hormone receptors, are located in the cytoplasm, where they bind ligands that have diffused across the membrane and into the cell.[4]
receptor-mediated endocytosis
reciprocal translocation
A type of chromosomal translocation by which there is a reciprocal exchange of chromosome segments between two or more non-homologous chromosomes. When the exchange of material is evenly balanced, reciprocal translocations are usually harmless.
A reciprocal translocation between chromosome 4 and chromosome 20
recognition site
A specific motif or sequence, either of nucleotides in a nucleic acid molecule or of amino acids in a protein, that is "recognized" or identified by another protein in order to direct the protein's activity to a specific molecule or location. Recognition motifs may consist of a simple consecutive sequence within a single molecule or may involve multiple non-consecutive motifs, e.g. amino acids in separate parts of the same polypeptide which are brought into juxtaposition by the quaternary structure created during protein folding. Recognition sites often help to distinguish the nucleic acid or protein bearing the motif from other similar molecules and thereby identify it as a valid target for some biochemical activity, or to specify a locus or subregion within the larger macromolecule at which the activity is to occur. In this sense recognition sites are critical for properly localizing proteins to their biochemical targets. A protein's recognition site is often but not necessarily the same as its binding site or target site.[5]
recombinant DNA (rDNA)
Any DNA molecule in which laboratory methods of genetic recombination have brought together genetic material from multiple sources, thereby creating a sequence that would not otherwise be found in a naturally occurring genome. Because DNA molecules from all organisms share the same basic chemical structure and properties, DNA sequences from any species, or even sequences created de novo by artificial gene synthesis, may be incorporated into recombinant DNA molecules. Recombinant DNA technology is widely used in genetic engineering.
recombinase
Any of a class of enzymes united by their ability to catalyze some form of genetic recombination, by any of a variety of mechanisms, e.g. by site-specific recombination.
recombination
See genetic recombination, homologous recombination, and chromosomal crossover.
recombinator
Any nucleotide sequence that increases the likelihood of homologous recombination in nearby regions of the genome, e.g. the Chi sequence in certain species of bacteria.[13]
recon
The smallest unit of a DNA molecule capable of undergoing homologous recombination, i.e. a pair of consecutive nucleotides, adjacent to each other in cis.[13]
regulon
A group of non-contiguous genes which are regulated as a unit, generally by virtue of having their expression controlled by the same regulatory element or set of elements, e.g. the same repressor or activator. The term is most commonly used with prokaryotes, where a regulon may consist of genes from multiple operons.
repeat
Any pattern of nucleobases within a nucleic acid sequence which occurs in multiple copies in the same nucleic acid molecule such as a chromosome or within a genome. Repeated sequences are classified according to their length, structure, location, mode of replication, or evolutionary origin. They may be any length, but are often short motifs of less than 100 bases; they may be direct or inverted, and may occur in tandem arrays with the copies immediately adjacent to each other or interspersed with non-repeated sequences. Significant fractions of most eukaryotic genomes consist of repetitive DNA, much of it retroviral in origin, though repeats may also result from errors in normal cellular processes, as with duplications during DNA replication or cell division. Because so many genetic mechanisms depend on the binding or complementing of locally unique sequences, sequences containing or adjacent to repeats are particularly prone to errors in replication and transcription by strand slippage, or to forming problematic secondary structures, and thus repeats are often unstable in the sense that the number of copies tends to expand or diminish stochastically with each round of replication, causing great variation in copy number even between different cells in the same organism. When repeats occur within genes or regulatory elements, these properties often result in aberrant expression and lead to disease. Repeats are also essential for normal genome function in other contexts, as with telomeres and centromeres, which consist largely of repetitive sequences.
repetitive DNA

Also repetitious DNA.

A region or fragment of DNA consisting largely or entirely of repeated nucleotide sequences.
replacement mutation
See nonsynonymous mutation.
replication
1. The process by which certain biological molecules, notably the nucleic acids DNA and RNA, produce copies of themselves.
2. A technique used to estimate technical and biological variation in experiments for statistical analysis of microarray data. Replicates may be technical replicates, such as dye swaps or repeated array hybridizations, or biological replicates, biological samples from separate experiments which are used to test the effects of the same experimental treatment.
replication eye

Also replication bubble.

The eye-shaped structure that forms when a pair of replication forks, each growing away from the origin, separates the strands of the double helix during DNA replication.
replication fork

Also Y fork.

The point at which the paired strands of a double-stranded DNA molecule are separated by helicase during DNA replication, breaking the hydrogen bonds between the complementary strands and thereby forming a structure with two branching single strands of DNA. Once unpaired, these strands serve as templates from which DNA polymerase synthesizes the leading strand and lagging strand. As replication proceeds, helicase moves along the DNA and continues to separate the strands, causing the replication fork to move as well.[3] A pair of replication forks forms when helicases work in opposite directions from a single origin of replication, creating a replication eye.
replication rate
The speed at which deoxyribonucleotides are incorporated into an elongating chain by DNA polymerases during DNA replication; or more generally the speed at which any chromosome, genome, cell, or organism makes a complete, independently functional copy of itself.
replicator
1. Any fragment or region of DNA that contains a replication origin.[13]
2. Any molecule or structure capable of copying itself; namely, nucleic acids, but also crystals of many minerals, e.g. kaolinite.
replicon
Any molecule or region of DNA or RNA that replicates from a single origin of replication.
replisome
The entire complex of molecular machinery that carries out the process of DNA replication, including all proteins, nucleic acids, and other molecules which participate at an active replication fork.
reporter
In genetic engineering, a gene which when properly expressed encodes a gene product that is easily detected or visualized with biochemical assays (e.g. green fluorescent protein, β-galactosidase, chloramphenicol O-acetyltransferase, etc.), allowing researchers to use its expression in order to study the functions and properties of associated regulatory sequences. Reporters are commonly cloned into plasmid vectors in proximity to putative promoters, enhancers, or response elements, which are then mutated in order to precisely identify the specific recognition motifs within these sequences that are necessary for expression. In the broadest sense, reporters may also include things like molecular tags, fluorescent labels, and hybridization probes which render their conjugated molecules conspicuous or able to be purified; or they may be used similarly to selectable markers, to distinguish cells that express a given product from those that do not, so that researchers can easily identify mutants of interest or verify the success of an experimental treatment or laboratory procedure.[5]
repression
See downregulation.
repressor
A DNA-binding protein that inhibits the expression of one or more genes by binding to the operator and blocking the attachment of RNA polymerase to the promoter, thus preventing transcription. This process is known as negative gene regulation or repression.
rescue
The restoration of a defective cell or tissue to a healthy or normal condition,[13] or the reversion or recovery of a mutant gene to its normal functionality, especially in the context of experimental genetics, where an experiment (e.g. a drug, cross, or gene transfer) resulting in such a restoration is said to rescue the normal phenotype.
residue
An individual monomer or subunit of a larger polymeric macromolecule; e.g. a nucleic acid is composed of nucleotide residues, and a peptide or protein is composed of amino acid residues.[13]
response element
Any short sequence of DNA that serves some function related to the activity of a protein or other biomolecule, especially a sequence within a promoter region that is able to bind specific transcription factors in order to regulate transcription of specific genes.
restitution
The spontaneous rejoining of an experimentally broken chromosome which restores the original configuration.
restitution nucleus
A nucleus containing twice the expected number of chromosomes owing to an error in cell division, especially an unreduced, diploid product of meiosis resulting from the failure of the first or second meiotic division.
restriction cloning
The use of restriction sites and the enzymes specific for them in order to clone recombinant DNA molecules.
restriction enzyme

Also restriction endonuclease, restriction exonuclease, or restrictase.

An endonuclease or exonuclease enzyme that recognizes and cleaves a nucleic acid molecule into fragments at or near specific recognition sequences known as restriction sites by breaking the phosphodiester bonds of the nucleic acid backbone. Restriction enzymes are naturally occurring in many organisms, but are also routinely used for artificial modification of DNA in laboratory techniques such as restriction cloning.
restriction fragment
Any DNA fragment that results from the cutting of a DNA strand by a restriction enzyme at one or more restriction sites.
restriction fragment length polymorphism (RFLP)
Variability within a population of organisms observed in the size of the restriction fragments produced when genomic DNA (or any particular DNA molecule) is digested by one or more restriction endonucleases. This variability results from a corresponding polymorphism in the locations of restriction sites within the molecule(s) due to slight differences in nucleotide sequence between individuals. RFLP is frequently exploited in the laboratory to construct physical maps of the genome, to identify the specific locus occupied by a particular gene, and to detect genetic differences between closely related individuals or determine that different samples originated from the same individual. Analysis of restriction fragments can also reveal the presence of a mutation that may itself cause disease or be closely linked to one that does.[5]
restriction map
A diagram of known restriction sites within a known DNA sequence, such as a plasmid vector, obtained by systematically exposing the sequence to various restriction enzymes and then comparing the lengths of the resulting fragments, a technique known as restriction mapping. See also gene map.
restriction mapping
The use of type II restriction endonucleases to cleave DNA molecules at specific restriction sites in order to produce characteristic patterns of fragments which can be resolved by size using gel electrophoresis. Digesting DNA molecules such as genomic DNA or plasmids with one or multiple restriction enzymes makes it possible to deduce from the sizes of the resulting fragments the order or arrangement of the restriction sites within the molecule and the distances between them, and thus to construct reliable maps with restriction sites effectively serving as genetic markers.[5]
restriction site

Also restriction recognition site.

A short, specific sequence of nucleotides (typically 4 to 8 bases in length) that is reliably recognized by a particular restriction enzyme. Because restriction enzymes usually bind as homodimers, restriction sites are generally palindromic sequences spanning both strands of a double-stranded DNA molecule. Restriction endonucleases cleave the phosphate backbone between two nucleotides within the recognized sequence itself, while other types of restriction enzymes make their cuts at one end of the sequence or at a nearby sequence.
A double-stranded DNA molecule containing the sequence GAATTC and its palindromic complement functions as a restriction site for the bacterial enzyme EcoRI, which recognizes and cuts or "digests" it in the manner shown here, by breaking phosphodiester bonds in the backbones of both strands and leaving behind single-stranded overhangs at the ends of each of the now separate molecules.
retrogene
A gene or other DNA sequence which has arisen in a genome by the stable integration of the genetic material of a retrovirus, e.g. by the reverse transcription of viral RNA and the subsequent insertion of the resulting DNA fragments into the host cell's genomic DNA.[5] These endogenous viral elements can then be replicated along with the host's own genes and thus persist indefinitely in the host genome, and in many cases retain the ability to produce functional viral proteins from this latent state, by which they may continue to copy, excise, and/or transpose themselves.
reverse genetics
An experimental approach in molecular genetics in which a researcher starts with a known gene and attempts to determine its function or its effect on phenotype by any of a variety of laboratory techniques, commonly by deliberately mutating the gene's DNA sequence or by repressing or silencing its expression and then screening the mutant organisms for changes in phenotype. When the gene of interest is the only one in the genome whose expression has been manipulated, any observed phenotypic changes are assumed to be influenced by it. This is the opposite of forward genetics, in which a known phenotype is linked to one or more unknown genes.
reverse transcriptase (RT)
An enzyme capable of synthesizing a complementary DNA molecule from an RNA template, a process termed reverse transcription.
reverse transcription
The synthesis of a DNA molecule from an RNA template, i.e. the opposite of ordinary transcription. This process, mediated by an enzyme known as a reverse transcriptase, is used by many viruses to replicate their genomes, as well as by retrotransposons and in some eukaryotic cell types.
ribonuclease (RNase)
Any of a class of nuclease enzymes which catalyze the hydrolytic cleavage of phosphodiester bonds in RNA molecules, thus severing polymeric strands of ribonucleotides into smaller components. Compare deoxyribonuclease.
ribonucleic acid (RNA)
A polymeric nucleic acid molecule composed of a series of ribonucleotides which incorporate a set of four nucleobases: adenine (A), guanine (G), cytosine (C), and uracil (U). Unlike DNA, RNA is more often found as a single strand folded onto itself, rather than a paired double strand. Various types of RNA molecules serve in a wide variety of essential biological roles, including coding, decoding, regulating, and expressing genes, as well as functioning as signaling molecules and, in certain viral genomes, as the primary genetic material itself.
ribonucleoprotein (RNP)
A nucleoprotein that is a complex of one or more RNA molecules and one or more proteins. Examples include ribosomes and the enzyme ribonuclease P.
ribonucleotide
A nucleotide containing ribose as its pentose sugar component, and the monomeric subunit of ribonucleic acid (RNA) molecules. Ribonucleotides canonically incorporate any of four nitrogenous bases: adenine (A), guanine (G), cytosine (C), and uracil (U). Compare deoxyribonucleotide.
ribonucleotide reductase (RNR)

Also ribonucleoside diphosphate reductase.

An enzyme which catalyzes the formation of deoxyribonucleotides via the reductive dehydroxylation of ribonucleotides, specifically by removing the 2' hydroxyl group from the ribose ring of ribonucleoside diphosphates (rNDPs). RNR plays a critical role in regulating the overall rate of DNA synthesis such that the ratio of DNA to cell mass is kept constant during cell division and DNA repair.
ribose
A monosaccharide sugar which, as D-ribose in its pentose ring form, is one of three primary components of the ribonucleotides from which ribonucleic acid (RNA) molecules are built. Ribose differs from its structural analog deoxyribose only at the 2' carbon, where ribose has an attached hydroxyl group that deoxyribose lacks.
ribosomal DNA (rDNA)
A DNA sequence that codes for ribosomal RNA (rRNA). In many eukaryotic genomes, rDNA occupies large, highly conserved regions of multiple chromosomes and is rich in both genes and repeats.
ribosomal RNA (rRNA)
A type of non-coding RNA which is the primary constituent of ribosomes, binding to ribosomal proteins to form the small and large subunits. It is ribosomal RNA which enables ribosomes to perform protein synthesis by working as a ribozyme that catalyzes the set of reactions comprising translation. Ribosomal RNA is transcribed from the corresponding ribosomal DNA (rDNA) and is the most abundant class of RNA in most cells, bearing responsibility for the translation of all encoded proteins despite never being translated itself.
ribosome
A macromolecular complex made of both RNA and protein which serves as the site of protein synthesis by translation. Ribosomes have two subunits, each of which consists of one or more strands of ribosomal RNA bound to various ribosomal proteins: the small subunit, which reads the messages encoded in messenger RNA molecules, and the large subunit, which links amino acids in sequence to form a polypeptide chain. Ribosomes are essential and ubiquitous in all cell types and are used by all known forms of life.
riboswitch
A regulatory sequence within a messenger RNA transcript that can bind a small effector molecule, preventing or disrupting translation and thereby acting as a switch that regulates the mRNA's expression.
ribozyme
An RNA molecule with enzymatic activity,[4] i.e. one that is capable of catalyzing one or more specific biochemical reactions, similar to protein enzymes. Ribozymes function in numerous capacities, including in ribosomes as part of the large subunit ribosomal RNA.
RNA
See ribonucleic acid.
RNA gene
A gene that codes for any of the various types of non-coding RNA (e.g. rRNA and tRNA).[13]
RNA interference (RNAi)
RNA polymerase

Often abbreviated RNAP or RNApol.

Any of a class of polymerase enzymes that synthesize RNA molecules from a DNA template. RNA polymerases are essential for transcription and are found in all living organisms and many viruses. They build long single-stranded polymers called transcripts by adding ribonucleotides one at a time in the 5'-to-3' direction, relying on the template provided by the complementary strand to transcribe the nucleotide sequence faithfully. Unlike DNA polymerases, RNA polymerases notably do not require oligonucleotide primers to initiate synthesis; i.e. they are capable of synthesizing RNA molecules de novo.
RNA splicing
RNA-induced silencing complex (RISC)
A ribonucleoprotein complex which works to silence endogenous and exogenous genes by participating in various RNA interference pathways at the transcriptional and translational levels. RISC can bind both single-stranded and double-stranded RNA fragments and then cleave them or use them as guides to target complementary mRNAs for degradation.
RNase
See ribonuclease.
Robertsonian translocation (ROB)
A type of chromosomal translocation by which double-strand breaks at or near the centromeres of two acrocentric chromosomes cause a reciprocal exchange of segments that gives rise to one large metacentric chromosome (composed of the long arms) and one extremely small chromosome (composed of the short arms), the latter of which is often subsequently lost from the cell with little effect because it contains very few genes. The resulting karyotype shows one fewer than the expected total number of chromosomes, because two previously distinct chromosomes have essentially fused together. Carriers of Robertsonian translocations are generally not associated with any phenotypic abnormalities, but do have an increased risk of generating meiotically unbalanced gametes.
rolling circle replication (RCR)
rough endoplasmic reticulum (RER)
A type of membrane in the endoplasmic reticulum with numerous ribosomes conspicuously attached to its surface, in contrast to the "smooth" endoplasmic reticulum which lacks ribosomes. The rough ER serves as the site of protein synthesis for the majority of the cell's secreted and transmembrane proteins, as well as the site of synthesis of membrane lipids. It may be continuous with the smooth ER or exist separately.[3]
rRNA
See ribosomal RNA.
rtPCR
1. An abbreviation of real-time polymerase chain reaction, synonymous with quantitative PCR.
2. An abbreviation of reverse transcription polymerase chain reaction.


S

[edit]
S phase

Also synthesis phase or synthetic phase.

The phase of the cell cycle during which nuclear DNA is replicated, occurring after the G1 phase and before the G2 phase.[2]
salvage pathway
Any metabolic pathway that utilizes compounds formed in catabolism for the anabolism or biosynthesis of new compounds, e.g. by recycling building block monomers such as free purine and pyrimidine bases to make new nucleotides.[5]
samesense mutation
See synonymous mutation.
Sanger sequencing
A method of DNA sequencing based on the in vitro replication of a DNA template sequence, during which fluorochrome-labeled, chain-terminating dideoxynucleotides are randomly incorporated in the elongating strand; the resulting fragments are then sorted by size with electrophoresis, and the particular fluorochrome terminating each of the size-sorted fragments is detected by laser chromatography, thus revealing the sequence of the original DNA template through the order of the fluorochrome labels as one reads from small-sized fragments to large-sized fragments. Though Sanger sequencing has been replaced in some contexts by next-generation methods, it remains widely used for its ability to produce relatively long sequence reads (500+ nucleotides) and its very low error rate.
An outline of the Sanger sequencing method
saturation hybridization
An in vitro nucleic acid hybridization reaction in which one polynucleotide component (either DNA or RNA) is supplied in great excess relative to the other, causing all complementary sequences in the other polynucleotide to pair with the excess sequences and form hybrid duplex molecules.[13]
scRNA
See small conditional RNA.
second messenger

Also secondary messenger.

A molecule or compound (often a protein) that is caused to accumulate in an effector cell by the action of a hormone, growth factor, or other agonist and thereby brings about the action of that agonist on the cell. Second messengers are therefore critical mediators of a diverse variety of signal transduction pathways, including the synthesis of cyclic AMP by adenylate cyclase and of cyclic GMP by guanylate cyclase, the opening of ion channels, and the phosphorylation of proteins by serine/threonine-specific or tyrosine-specific protein kinases.[5]
secondary structure
The arrangement or folding of a polypeptide's primary structure into higher-order, locally organized structures, primarily via hydrogen bonding between non-adjacent amino acid residues, in particular α helices and β sheets; or the arrangement of double-stranded nucleic acid chains into the shape of a double helix stabilized by hydrogen bonds between the complementary bases.[5]
second-generation sequencing
See massively parallel sequencing.
selectable marker
A gene or other genetic material whose expression in cultured cells confers a selective advantage in the culture environment, causing cells expressing the gene to have one or more traits suitable for artificial selection. Selectable markers are widely used in the laboratory as a type of reporter, usually to indicate the success of a procedure meant to introduce exogenous DNA into a host cell such as transfection or transformation. A common example is an antibiotic resistance gene which is transformed into competent bacterial cells cultured on a medium containing the particular antibiotic, such that only those cells which have successfully taken up and expressed the gene are able to survive and grow into colonies.
selfish genetic element

Also selfish DNA or parasitic DNA.

Any genetic material (e.g. a gene or any other DNA sequence) which can enhance its own replication and/or transmission into subsequent generations at the expense of other genes in the genome, even if doing so has no positive effect or even a net negative effect on the fitness of the genome as a whole. Selfish elements usually work by producing self-acting gene products which repeatedly copy and paste their own coding sequences into other parts of the genome, independently of normal DNA replication (as with transposable elements); by facilitating the uneven swapping of chromosome segments during genetic recombination events (as with unequal crossing over); or by disrupting the normally equal redistribution of replicated material during mitosis or meiosis such that the probability that the selfish element is present in a given daughter cell is greater than the normal 50 percent (as with gene drives).
semiconservative replication
The standard mode of DNA replication that occurs in all living cells, in which each of the two parental strands of the original double-stranded DNA molecule are used as template strands, with DNA polymerases replicating each strand separately and simultaneously in antiparallel directions. The result is that each of the two double-stranded daughter molecules is composed of one of the original parental strands and one newly synthesized complementary strand, such that each daughter molecule conserves the precise sequence of information (indeed the very same atoms) from one-half of the original molecule. Contrast conservative replication and dispersive replication.
Three different modes of DNA replication. In semiconservative replication, each of the two daughter molecules is built from one of the original parental strands and one newly synthesized strand. In conservative replication, the original parent molecule remains intact while the replicated molecule is composed of two newly synthesized strands. In dispersive replication, each of the daughter molecules is an uneven mix of old and new, with some segments consisting of the two parental strands and others consisting of two newly synthesized strands. Only semiconservative replication occurs naturally.
sense
A distinction made between the individual strands of a double-stranded DNA molecule in order to easily and specifically identify each strand. The two complementary strands are distinguished as sense and antisense or, equivalently, the coding strand and the template strand. It is the antisense/template strand which is actually used as the template for transcription; the sense/coding strand merely resembles the sequence of codons on the RNA transcript, which makes it possible to determine from the DNA sequence alone the expected amino acid sequence of any protein translated from the RNA transcript. Which strand is which is relative only to a particular RNA transcript and not to the entire DNA molecule; that is, either strand can function as the sense/coding or antisense/template strand.
sense codon
Any codon that specifies an amino acid, as opposed to a stop codon, which does not specify any particular amino acid but instead signals the end of translation.
sequence
See nucleic acid sequence.
In bioinformatics, a graphical representation of the conservation of nucleobases or amino acids at each position within a nucleic acid or protein sequence. Sequence logos are created by aligning many sequences and used to depict consensus sequences as well as the degree of variability within the pool of aligned sequences.
A sequence logo depicts the statistical frequency with which each nucleobase (or amino acid) occurs within a given sequence. Each position in the sequence is represented by a vertical stack of letters; the total height of the stack indicates the degree of consensus at that position between all of the aligned sequences, and the height of each individual letter in the stack indicates the proportion of the aligned sequences having that nucleobase at that position. A single very large letter filling most of the stack indicates that most or all of the aligned sequences have that particular nucleobase at that position.
sequence-tagged site (STS)
Any DNA sequence that occurs exactly once within a particular genome, and whose location and nucleotide sequence are known with confidence.
sequencing
The determination of the order or sequence of nucleotides in a nucleic acid molecule, or of amino acids in a peptide, by any means. Sequences are usually written as a linear string of letters which conveniently summarizes much of the atomic-level structure of the molecule.
sex chromosome
See allosome.
sex linkage
The presence of a particular gene or DNA sequence on a sex chromosome (in mammals either the X chromosome or the Y chromosome) rather than on an autosome; these genes are said to be sex-linked. Expression of sex-linked genes varies by organism depending on the mechanism of sex determination and the types of sex chromosomes present, but the associated phenotypes often exclusively appear in either the homogametic or heterogametic sex.[2]
Shine–Dalgarno sequence
In many prokaryotic messenger RNAs, the consensus sequence AGGAGGU, located 6–8 bases upstream of the translation start codon, which functions as a binding site for the ribosome by complementing a sequence in the ribosomal RNA.[13]
short arm

Denoted in shorthand with the symbol p.

In condensed chromosomes where the positioning of the centromere creates two segments or "arms" of unequal length, the shorter of the two arms of a chromatid. Contrast long arm.
short interspersed nuclear element (SINE)
short tandem repeat (STR)
See microsatellite.
shotgun sequencing
A method of sequencing entire genomes in which genomic DNA is randomly fragmented (e.g. by sonication or restriction digests), cloned into plasmid vectors, and then sequenced using primers that anneal to flanking sequences in the plasmids. Computer software is used to align the sequenced fragments via overlapping contigs,[5] allowing scientists to deduce the relative genomic locations of each fragment and thereby assemble a complete genome.
signal peptidase
signal peptide

Also leader peptide, prepeptide, and presequence.

Any sequence of amino acids, usually 15–30 residues in length, that functions as a molecular signal directing the sorting and transport of the polypeptide bearing it to a specific location within a cell or organelle. Signal peptides are commonly located close to either the N-terminal or C-terminal ends of nascent or recently synthesized polypeptides, especially those destined for secretion from the cell or integration into a membrane, and are typically cleaved off by a dedicated signal peptidase when their polypeptide reaches the endoplasmic reticulum.[5] Similar but distinct varieties of protein targeting are accomplished with nuclear localization signals and post-translational protein tags.
signal transduction
The process by which a chemical, electrical, or mechanical signal is converted into a cellular response, or the transmission or propagation of such a signal through a cell as a series of molecular events known as a signaling pathway. For example, the extracellular interaction of a hormone, growth factor, or some other chemical agonist with a specific cell surface receptor can trigger a cascade of sequential biochemical reactions which propagate through the cell membrane and into the cytoplasm, provoking the synthesis of second messengers and leading to amplification of the signal or activation of other pathways. Other modes of transduction involve agonists which diffuse across the membrane freely, eliciting intracellular changes without amplification, or rapid shifts in cell polarity which transmit electrical impulses, such as those that cause the axons of neural cells to release neurotransmitters at synapses.[5]
silencer
A sequence or region of DNA that can be bound by a repressor, thereby blocking the transcription of a nearby gene.
silencing
The total or near-total loss of expression of a particular gene or DNA sequence by any mechanism, natural or artificial, whether before, during, or after transcription or translation, which completely prevents the normal gene product from being produced and thereby deprives the cell of its ordinary function. Gene silencing may occur via natural regulatory mechanisms such as condensation of the relevant segment of DNA into a transcriptionally inactive, heterochromatic state, in which case the term is more or less equivalent to repression; genes are also commonly silenced artificially for research purposes by using techniques such as knockdown (e.g. by RNA interference) or knockout (by deleting the gene from the genome entirely). See also downregulation.
silent allele
An allele that does not produce a detectable product.[13] Compare null allele.
silent mutation
A type of neutral mutation which does not have an observable effect on the organism's phenotype. Though the term "silent mutation" is often used interchangeably with synonymous mutation, synonymous mutations are not always silent, nor vice versa. Missense mutations which result in a different amino acid but one with similar functionality (e.g. leucine instead of isoleucine) are also often classified as silent, since such mutations usually do not significantly affect protein function.
simple sequence repeat (SSR)
See microsatellite.
single-nucleotide polymorphism (SNP)
Any substitution of a single nucleotide which occurs at a specific position within a genome and with measurable frequency within a population; for example, at a specific base position in a DNA sequence, the majority of the individuals in a population may have a cytosine (C), while in a minority of individuals, the same position may be occupied by an adenine (A). SNPs are usually defined with respect to a "standard" reference genome; an individual human genome differs from the reference human genome at an average of 4 to 5 million positions, most of which consist of SNPs and short indels. See also polymorphism.
single-strand break (SSB)
The loss of continuity of the phosphate-sugar backbone in one strand of a DNA duplex.[9] See also nick; contrast double-strand break.
single-stranded
Composed of a single, unpaired nucleic acid molecule, i.e. one linear strand of nucleotides sharing a single phosphodiester backbone, as opposed to a duplex of two such strands joined by base pairing. See also single-stranded DNA and single-stranded RNA.
single-stranded DNA (ssDNA)
Any DNA molecule that consists of a single nucleotide polymer or strand, as opposed to a pair of complementary strands held together by hydrogen bonds (double-stranded DNA). In most circumstances, DNA is more stable and more common in double-stranded form, but high temperatures, low concentrations of dissolved salts, and very high or low pH can cause double-stranded molecules to decompose into two single-stranded molecules in a denaturation process known as melting; this reaction is exploited by naturally occurring enzymes such as those involved in DNA replication as well as by laboratory techniques such as polymerase chain reaction.
siRNA
See small interfering RNA.
sister chromatids
A pair of identical copies (chromatids) produced as the result of the DNA replication of a chromosome, particularly when both copies are joined together by a common centromere; the pair of sister chromatids is called a dyad. The two sister chromatids are ultimately separated from each other into two different cells during mitosis or meiosis.
site-directed mutagenesis
small conditional RNA (scRNA)
A class of small RNA molecules engineered so as to change conformation conditionally in response to cognate molecular inputs, often with the goal of controlling signal transduction pathways in vitro or in vivo.
small interfering RNA (siRNA)
small nuclear RNA (snRNA)
A class of small non-coding RNA molecules, approximately 100–300 nucleotides in length and rich in uridine residues, found in association with specific proteins as part of ribonucleoprotein complexes known as snRNPs within nuclear speckles and Cajal bodies of the eukaryotic nucleus.[5] SnRNPs assemble into larger complexes known as spliceosomes which play important roles in the splicing of pre-mRNA transcripts (hnRNAs) before they are exported to the cytoplasm.
small nucleolar RNA (snoRNA)
A class of small non-coding RNA molecules whose primary function is to direct the post-transcriptional modification of other RNAs, mainly transfer RNAs (tRNA), small nuclear RNAs (snRNA), and especially ribosomal RNAs (rRNA) as a part of ribosome synthesis in the nucleolus. SnoRNAs contain antisense sequences that complement sequences within these target RNAs and guide ribonucleoprotein complexes to them, which can then catalyze specific nucleoside modifications, typically methylation or pseudouridylation.
small temporal RNA (stRNA)
A subclass of microRNAs, originally described in nematodes, which regulate the timing of developmental events by binding to complementary sequences in the 3' untranslated regions of messenger RNAs and inhibiting their translation. In contrast to siRNAs, which serve similar purposes, stRNAs bind to their target mRNAs after the initiation of translation and without affecting mRNA stability, which makes it possible for the target mRNAs to resume translation at a later time.
small ubiquitin-like modifier (SUMO)
Any of a family of small proteins, each approximately 100 amino acids, which are covalently conjugated to and removed from charged residues of other proteins in a form of post-translational modification known as SUMOylation, thereby functioning as a protein tag in a manner resembling ubiquitin.
smooth endoplasmic reticulum (SER)
A type of membrane in the endoplasmic reticulum that lacks ribosomes on its surface, thus visibly contrasting with the "rough" endoplasmic reticulum. Smooth ER tends to be tubular rather than sheet-like, and may form an extension of the rough ER or exist separately. It is especially abundant in cells concerned with lipid metabolism.[3]
snoRNA
See small nucleolar RNA.
snRNA
See small nuclear RNA.
soluble RNA (sRNA)
See transfer RNA.
somatic cell

Also vegetal cell or soma.

Any biological cell forming the body of an organism, or, in multicellular organisms, any cell other than a gamete, germ cell, or undifferentiated stem cell. Somatic cells are theoretically distinct from cells of the germ line, meaning the mutations they have undergone can never be transmitted to the organism's descendants, though in practice exceptions do exist.
somatic cell nuclear transfer (SCNT)
somatic crossover
See mitotic recombination.
sonication

Also ultrasonication.

The application of sound energy at ultrasonic frequencies in order to agitate particles in a chemical or biological sample. Intense acoustic vibrations produce pressure waves and cavitations that propagate through a liquid medium, thereby converting the sound energy to mechanical energy which can disperse solutes, disrupt intermolecular interactions, and break covalent bonds. At various amplitudes sonication can be used to increase the permeability of cell or nuclear membranes, a technique known as sonoporation, or to completely destroy them and release their contents for isolation and extraction. It has a wide variety of applications in industry and research, including creating nanoparticles such as liposomes, shearing DNA and proteins into smaller fragments, degassing liquids, and ultrasonic cleaning.
Southern blotting
A molecular biology method used to detect a specific sequence in DNA samples. The method combines separation of DNA fragments by gel electrophoresis, transfer of the DNA to a synthetic membrane, and subsequent identification of target fragments with radio-labeled or fluorescent hybridization probes. Compare northern blotting, western blotting, and eastern blotting.
spacer

Also intergenic spacer (IGS) or non-transcribed spacer (NTS).

Any sequence or region of non-coding DNA separating neighboring genes, whether transcribed or not. The term is used in particular to refer to the non-coding regions between the many repeated copies of the ribosomal RNA genes.[9] See also intergenic region.
spatially-restricted gene expression
The expression of one or more genes only within a specific anatomical region or tissue, often in response to a paracrine signal. The boundary between the jurisdictions of two spatially restricted genes may generate a sharp phenotypic gradient there, as with striping patterns.
spindle apparatus
The cytoskeletal structure that forms during cell division in eukaryotic cells, consisting of a network of long, flexible microtubules extending from each pole of the parent cell and attaching to kinetochores at the centromeres of homologous chromosomes or sister chromatids near the cell equator. As the microtubules shorten, they pull the chromosomes apart, separating them into different daughter cells. Proper formation of this apparatus is critical in both mitosis and meiosis, where it is respectively referred to as the mitotic spindle and meiotic spindle.
spliceosome
A large ribonucleoprotein complex found primarily in the nucleus of eukaryotic cells, composed of multiple small nuclear ribonucleoproteins (snRNPs) which are themselves composed of small nuclear RNAs (snRNAs) and a variety of snRNP-specific proteins. Spliceosomes assemble at the junctions between exons and introns within primary RNA transcripts and catalyze the removal of the introns and the ligation of the flanking exons in a form of post-transcriptional processing known as RNA splicing.[30]
splicing
Any natural or artificial process involving the excision of oligonucleotide sequences from nucleic acid molecules (either DNA or RNA) or of peptide sequences from proteins and the subsequent re-ligation of the flanking fragments into a single continuous molecule lacking the excised sequence. RNA splicing in particular is an important form of post-transcriptional processing whereby introns are removed from primary mRNA transcripts and the exons rejoined (and sometimes rearranged) to create mature mRNAs; a similar process also occurs with the removal of inteins and the rejoining of exteins in the post-translational modification of certain proteins. The term may also refer more generally to artificial techniques for creating recombinant sequences in genetic engineering.[5]
sRNA
See transfer RNA.
ssDNA
See single-stranded DNA.
ssRNA
See single-stranded RNA.
standard genetic code
The genetic code used by the vast majority of living organisms for translating nucleic acid sequences into proteins. In this system, of the 64 possible permutations of three-letter codons that can be made from the four nucleotides, 61 code for one of the 20 amino acids, and the remaining three code for stop signals. For example, the codon CAG codes for the amino acid glutamine and the codon UAA is a stop codon. The standard genetic code is described as degenerate or redundant because some amino acids can be coded for by more than one different codon.
The standard genetic code specifies a set of 20 different amino acids from triplet arrangements of the four different RNA nucleobases (A, G, C, and U). To read this chart, choose one of the four letters in the innermost ring and then move outward, adding two more letters to complete a codon triplet: a total of 64 unique codons can be made this way, 61 of which signal the addition of one of the 20 amino acids (identified by single-letter abbreviation as well as by full name and chemical structure) to a nascent peptide chain, while the remaining three codons are stop codons signalling the termination of translation. Also indicated are some of the chemical properties of the amino acids and the various ways in which they can be modified.
start codon
The first codon translated by a ribosome from a mature messenger RNA transcript, used as a signal to initiate peptide synthesis. In the standard genetic code, the start codon always codes for the same amino acid, methionine, in eukaryotes and for a modified methionine in prokaryotes. The most common start codon is the triplet AUG. Contrast stop codon.
statistical genetics
A branch of genetics concerned with the development of statistical methods for drawing inferences from genetic data. The theories and methodologies of statistical genetics often support research in quantitative genetics, genetic epidemiology, and bioinformatics.
stem cell
Any biological cell which has not yet differentiated into a specialized cell type and which can divide through mitosis to produce more undifferentiated stem cells.
stem-loop
See hairpin.
sticky end
A term used to describe the end of a double-stranded DNA molecule where one strand is longer than the other by one or more nucleobases, creating a single-stranded "overhang" of unpaired bases, in contrast to a so-called blunt end, where no such overhang exists because the terminal nucleobases on each strand are base-paired with each other. Blunt ends and sticky ends are relevant when ligating linear DNA molecules, e.g. in restriction cloning, because many restriction enzymes cleave the phosphate backbone in a way that leaves behind terminal overhangs in the digested fragments. These sticky-ended molecules ligate much more readily with other sticky-ended molecules having complementary overhangs, allowing scientists to ensure that specific DNA fragments are ligated together in specific places.
stop codon

Also termination codon.

A codon that signals the termination of protein synthesis during translation of a messenger RNA transcript. In the standard genetic code, three different stop codons are used to dissociate ribosomes from the growing amino acid chain, thereby ending translation: UAG (nicknamed "amber"), UAA ("ochre"), and UGA ("opal"). Contrast start codon.
strand
An individual chain of nucleotides comprising a nucleic acid polymer, existing either independently (in which case the nucleic acid molecule is said to be single-stranded) or base-paired in a duplex with another, separate strand (in which case it is said to be double-stranded).
stringency
The effect of conditions such as temperature and pH upon the degree of complementarity that is required for a hybridization reaction to occur between two single-stranded nucleic acid molecules. In the most stringent conditions, only exact complements can successfully hybridize; as stringency decreases, an increasing number of mismatches can be tolerated between the two hybridizing strands.[18]
stRNA
See small temporal RNA.
structural gene
A gene that codes for any protein or RNA product other than a regulatory factor. Structural gene products include enzymes, structural proteins, and certain non-coding RNAs.
structural protein
Any protein which contributes to the mechanical shape and structure of cells, organelles, or tissues (e.g. collagen and actin), as distinguished from proteins which serve some other purpose, such as enzymes. This distinction is not well-defined, however, as many proteins have both structural and non-structural roles.[2]
subcellular localization
1. The subdivision of the interior of a cell into functionally distinct spaces or compartments (e.g. membrane-bound organelles) and the delegation of particular cellular functions and activities to these particular spaces.
2. The determination by any of various laboratory methods (e.g. fluorescent labelling) of the precise location(s) within a cell where a specific molecule has occupancy, or at which a specific activity occurs.
submetacentric
(of a linear chromosome or chromosome fragment) Having a centromere positioned close to but not exactly in the middle of the chromosome, resulting in chromatid arms of slightly different lengths.[6] Compare metacentric.
substitution
A type of point mutation in which a single nucleotide and its attached nucleobase is replaced by a different nucleotide.
substrate
1. A chemical compound or molecule upon which a particular enzyme directly acts, often but not necessarily binding the molecule by forming one or more chemical bonds.[2] See also ligand.
2. The substance, biotic or abiotic, upon which an organism grows or lives, or by which it is supported; e.g. a particular growth medium used in cell culture. See also substratum.
substratum
A solid surface to which a cell or organism adheres or by which it is supported, or over which it moves.[3] See also substrate.
subunit
A single unit of a multi-unit compound or molecular aggregate; e.g. a monomer from which a larger polymer is composed (as with nucleotides in nucleic acids), or an individual polypeptide chain in a multi-chain protein, or an entire protein which participates alongside other proteins as part of a protein complex.[2][4]
SUMOylation

Also sumoylation.

A type of post-translational modification in which a SUMO protein is conjugated to a polar residue of another protein (usually a lysine) via a covalent isopeptide bond. This effectively tags the second protein, making it distinguishable to other biomolecules and in many cases allowing it to participate in specific reactions or to interact with specific protein complexes. SUMOlyation is closely related to ubiquitination, relying on the same E1/E2/E3 enzymes to transfer SUMO to specific recognition motifs in the target protein, though detaching SUMO depends on SUMO-specific proteases. It plays important roles in numerous cellular processes, including protein localization, transcriptional regulation, stress-response pathways, and cell cycle checkpoints, among others. SUMOlyation is also used in the laboratory as a molecular label and to help solubilize proteins which are difficult to purify.
supercoiling
supersecondary structure
suppression
See downregulation.
suspension culture
A type of cell culture in which individual cells or aggregates of cells are suspended in a liquid growth medium, and usually prevented from settling by continuous gentle agitation. Many prokaryotic and eukaryotic cell types readily proliferate in suspension cultures, but they are particularly useful for culturing non-adherent cell lines such as hematopoietic cells, plant cells, and insect cells. Compare adherent culture.
symporter
Any of a class of transmembrane transporter proteins which facilitate the transport of two or more different molecules across the membrane at the same time and in the same direction; e.g. glucose and sodium ions. Contrast antiporter and uniporter.
synapsis
synaptonemal complex
A complex of scaffolding proteins that mediates synapsis and homologous recombination between the chromatids of homologous chromosomes during prophase I of meiosis.
syncytium

Also symplasm; pl. syncytia.

A multinucleate cell, i.e. a cell containing more than one nucleus or, in the broadest sense, more than one nuclear genome (a meaning which is equated with polyploidy). Syncytia may form as a result of cell fusion between uninucleate cells, migration of a nucleus from one cell to another, or multiple nuclear divisions without accompanying cytokinesis (forming a coenocyte).[9] The term may also refer to cells which are interconnected by specialized membranes with gap junctions as in some neuromuscular cell types.
syndesis
The synapsis of chromosomes during meiosis.[13]
synezis
The aggregation of chromosomes into a dense knot that adheres to one side of the nucleus, commonly observed during leptonema in certain organisms.[13]
synonymous mutation

Also synonymous substitution or samesense mutation.

A type of mutation in which the substitution of one nucleotide base for another results, after transcription and translation, in an amino acid sequence which is identical to the original unmutated sequence. This is possible because of the degeneracy of the genetic code, which allows different codons to code for the same amino acid. Though synonymous mutations are often considered silent, this is not always the case; a synonymous mutation may affect the efficiency or accuracy of transcription, splicing, translation, or any other process by which genes are expressed, and thus become effectively non-silent. Contrast nonsynonymous mutation.
synthesis phase
See S phase.


T

[edit]
tandem repeat
A pattern within a nucleic acid sequence in which one or more nucleobases are repeated and the repetitions are directly adjacent (i.e. tandem) to each other. An example is ATGACATGACATGAC, in which the sequence ATGAC is repeated three times.
target site
The site or locus upon another molecule at which a protein performs a particular biochemical activity; e.g. the nucleotide motif at which a restriction endonuclease cleaves a DNA molecule, often but not necessarily the same as the enzyme's recognition site (i.e. a restriction enzyme may recognize one motif, known as a restriction site, and cleave at another).[5]
TATA box

Also Goldberg-Hogness box.

A highly conserved non-coding DNA sequence containing a consensus of repeating T and A base pairs that is commonly found in promoter regions of genes in archaea and eukaryotes. The TATA box often serves as the site of initiation of transcription or as a binding site for transcription factors.
taxis
A directional response by a cell or a population of cells to a specific stimulus; a movement or other activity occurring in a non-random direction and dependent on the direction from which the stimulus originated.[3] This contrasts with kinesis, a response without directional bias.
TCA
See citric acid cycle.
telestability
The transmission of three‐dimensional structural stability from a stable part of a macromolecule to a distal part of the same molecule, especially an inherently less stable part.[5] Instability may also be transmitted in this way, e.g. destabilization of the DNA double helix may occur at a locus that is relatively distant from the binding site of a DNA-binding protein.[13]
telocentric
(of a linear chromosome or chromosome fragment) Having a centromere positioned at the terminal end of the chromosome (near or within the telomere), resulting in only a single arm.[6] Compare acrocentric.
telomere
A region of repetitive nucleotide sequences at each end of a linear chromosome which protects the end of the chromosome from deterioration and from fusion with other chromosomes. Since each round of replication results in the shortening of the chromosome, telomeres act as disposable buffers which are sacrificed to perpetual truncation instead of nearby genes; telomeres can also be lengthened by the enzyme telomerase.
telomeric silencing
The repression of transcription of genes in regions adjacent to telomeres. Telomeres also appear to reduce the accessibility of subtelomeric chromatin to modification by DNA methyltransferases.[13]
telophase
The final stage of cell division in both mitosis and meiosis, occurring after anaphase and before or simultaneously with cytokinesis, during which a nuclear membrane is synthesized around each set of chromatids, nucleoli are reassembled, and the mitotic spindle is disassembled. Following cytokinesis, the new daughter cells resume interphase.
template strand

Also antisense strand, negative (-) sense strand, and noncoding strand.

The strand of a double-stranded DNA molecule which is used as a template for RNA synthesis during transcription. The sequence of the template strand is complementary to the resulting RNA transcript. Contrast coding strand; see also sense.
terminalization
In cytology, the progressive shift of chiasmata from their original to more distal positions as meiosis proceeds through diplonema and diakinesis.[13]
termination codon
See stop codon.
terminator
A DNA sequence or its RNA complement which signals the termination of transcription by triggering processes that ultimately arrest the activity of RNA polymerase or otherwise cause the release of the RNA transcript from the transcriptional complex. Terminator sequences are usually found near the 3'-ends of the coding sequences of genes or operons. They generally function after being themselves transcribed into the nascent RNA strand, whereupon the part of the strand containing the sequence either directly interacts with the transcriptional complex or forms a secondary structure such as a hairpin loop which signals the recruitment of enzymes that promote its disassembly.[13]
tetramer
A molecular aggregate consisting of four subunits.[2] The term is often used to refer to protein complexes composed of four proteins, e.g. haemoglobin, or to individual proteins composed of four polypeptides. Compare monomer, dimer, and trimer.
three-prime end
See 3'-end.
three-prime untranslated region
See 3' untranslated region.
thymidine (T, dT)

Also deoxythymidine.

One of the four standard nucleosides used in DNA molecules, consisting of a thymine base with its N9 nitrogen bonded to the C1 carbon of a deoxyribose sugar. The prefix deoxy- is commonly omitted, since there are no ribonucleoside analogs of thymidine used in RNA, where it is replaced with uridine instead.
thymine (T)

Also 5-methyluracil.

A pyrimidine nucleobase used as one of the four standard nucleobases in DNA molecules. Thymine forms a base pair with adenine. In RNA, thymine is not used at all, and is instead replaced with uracil.
thymine dimer
See pyrimidine dimer.
tight junction

Also occluding junction or zonula occludens.

A type of specialized intercellular junction characterized by very close contact between the plasma membranes of adjacent cells, which are held together by large multiprotein complexes that completely or nearly completely occlude the passage of water and solutes through the intercellular space between cells. Tight junctions occur in many vertebrate tissues, especially between the epithelial and endothelial cells that line the surfaces of most organs and vessels. These cells are completely encircled by tight junctions which create a gasket-like seal that separates each cell's plasma membrane into apical and basolateral domains and prevents the exchange of extracellular materials between them.[5][3]
tissue
In a multicellular organism, a contiguous aggregation of cells held together by a common extracellular matrix and specialized to perform a particular function. Some tissues are composed primarily of a single cell type; others are a heterogeneous mixture of many cell types.[2] Tissues represent a level of multicellular organization between that of individual cells and that of organs, which may be composed of one or more distinct types of tissue.[3]
tissue culture
The growth and maintenance, or "culturing", of multicellular tissues, or of cells harvested from tissues, under carefully controlled conditions in vitro, in the strictest sense by taking a piece of explanted tissue directly from a living plant or animal and maintaining it outside of the body of the source organism. In common usage, the term may also refer to cell culture in general, especially when growing certain cell types which have been harvested from tissues but dispersed from their original tissue-specific organization into a population of more or less independently growing cells.[3]
tissue-specific gene expression
Gene function and expression which is restricted to a particular tissue or cell type. Tissue-specific expression is usually the result of an enhancer which is activated only in the proper cell type.
tonicity
A measure of the effective osmotic pressure gradient of one solution relative to another solution, used especially to describe the water potential that exists between two aqueous solutions separated by a semipermeable membrane (as with a cell, where the intracellular cytosol is separated from the extracellular fluid by the plasma membrane). Tonicity depends on the relative concentrations of solutes on either side of the membrane, which determine the direction and extent to which solvent molecules move across the membrane by osmosis; it is affected only by those solutes which cannot cross the membrane, as those which can cross freely can achieve equilibrium without any net movement of solute. The extracellular environment is commonly described as hypotonic, hypertonic, or isotonic with respect to the intracellular environment.
Tonicity describes the pressure to restore osmotic equilibrium between the inside and outside of cells by moving water across the cell membrane. Red blood cells tend to lose water and shrivel up in a severely hypertonic environment (left) or gain water and swell to bursting in a severely hypotonic environment (right), but the water potential is balanced in an isotonic environment (center).
tonoplast
See vacuole.
topoisomerase
Any of a class of DNA-binding enzymes which catalyze changes in the topological state of a double-stranded DNA molecule by nicking or cutting the sugar-phosphate backbone of one or both strands, relaxing the torsional stress inherent in the double helix and unwinding or untangling the paired strands before re-ligating the nicks. This process is usually necessary prior to replication and transcription. Topoisomerases thereby convert DNA between its relaxed and supercoiled, linked and unlinked, and knotted and unknotted forms without changing the sequence or overall chemical composition, such that the substrate and product molecules are structural isomers, differing only in their shape and their twisting, linking, and/or writhing numbers.
totipotency
A state of cell potency in which a cell or nucleus fully retains the ability to differentiate into all of the cell types represented in the adult organism, or to give rise to all of these cell types upon transplantation into an appropriate cytoplasm (as in nuclear transfer). Such cells or nuclei are said to be totipotent. The zygote that serves as the progenitor cell for sexually reproducing multicellular organisms is the archetypal totipotent cell; almost all of the cells into which it ultimately differentiates are not totipotent, though some cells such as stem cells remain totipotent or pluripotent throughout the organism's life.[2]
tracer
A molecule or a specific atom within a molecule that has been chemically or radioactively labelled so that it can be easily tracked or followed through a biochemical process or located in a cell or tissue.[4]
trailer sequence
See 3' untranslated region.
trans
On the opposite side; across from; acting from a different molecule. Contrast cis.
trans-acting
Affecting a gene or sequence on a different nucleic acid molecule or strand. A locus or sequence within a particular DNA molecule such as a chromosome is said to be trans-acting if it or its products influence or act upon other sequences located relatively far away or on an entirely different molecule or chromosome. For example, a DNA-binding protein acts "in trans" if it binds to or interacts with a sequence located on any strand or molecule different from the one on which it is encoded. Contrast cis-acting.
trans-splicing
A form of RNA splicing in which different RNA transcripts, synthesized in separate transcription events, are spliced together into a single, continuous transcript. This contrasts with the more conventional "cis-splicing", where segments of the same transcript are excised or re-arranged.[3]
transactivation
An experimental approach to artificially control gene expression by introducing a transactivator gene under the control of an inducible promoter into a genome. The transactivator encodes a transcription factor capable of acting in trans upon one or more other genes by recognizing and specifically binding their promoters; thus by inducing the transactivator gene, the expression of many other genes can be experimentally manipulated.[3]
transcribed spacer
A spacer sequence that is transcribed and thus included in the primary ribosomal RNA transcript (as opposed to a non-transcribed spacer) but subsequently excised and discarded during the maturation of functional RNAs of the ribosome.[13]
transcript
A product of transcription; that is, any RNA molecule which has been synthesized by RNA polymerase using a complementary DNA molecule as a template. When transcription is completed, transcripts separate from the DNA and become independent primary transcripts. Particularly in eukaryotes, multiple post-transcriptional modifications are usually necessary for raw transcripts to be converted into stable and persistent molecules, which are then described as mature, though not all transcribed RNAs undergo maturation. Many transcripts are accidental, spurious, incomplete, or defective; others are able to perform their functions immediately and without modification, such as certain non-coding RNAs.
transcript of unknown function (TUF)
Any RNA transcript whose function is unclear. Such transcripts may include functional non-coding RNAs which have not yet been studied in detail as well as spurious transcripts without any definite function. The DNA sequences from which TUFs are transcribed are generally located in intergenic or intronic regions of the genome. See also junk RNA.
transcriptase
See RNA polymerase.
transcription
The first step in the process of gene expression, in which an RNA molecule, known as a transcript, is synthesized by enzymes called RNA polymerases using a gene or other DNA sequence as a template. Transcription is a critical and fundamental process in all living organisms and is necessary in order to make use of the information encoded within a genome. All classes of RNA must be transcribed before they can exert their effects upon a cell, though only messenger RNA (mRNA) proceeds to translation to produce a functional protein, whereas the many types of non-coding RNA fulfill their duties without being translated. Transcription is also not always beneficial for a cell: when it occurs at the wrong time or at a functionless locus, or when mobile elements or infectious pathogens utilize the host's transcription machinery, the resulting transcripts (not to mention the waste of valuable energy and resources) are often harmful to the host cell or genome.
A simplified diagram of transcription. RNA polymerase (RNAP) synthesizes an RNA transcript (blue) in the 5'-to-3' direction, using one of the DNA strands as a template, while a complex of multiple transcription factors binds to a promoter upstream of the gene.
transcription factor (TF)
Any protein that controls the rate of transcription of genetic information from DNA to RNA by binding to a specific DNA sequence and promoting or blocking the recruitment of RNA polymerase to nearby genes. Transcription factors can effectively turn "on" and "off" specific genes in order to make sure they are expressed at the right times and in the right places; for this reason, they are a fundamental and ubiquitous mechanism of gene regulation.
transcription start site (TSS)

Also transcription initiation site.

The specific location within a gene at which RNA polymerase begins transcription, defined by the specific nucleotide or codon corresponding to the first ribonucleotide(s) to be assembled in the nascent transcript (which is not necessarily the same as the first codon to be translated). This site is usually considered the beginning of the coding sequence and is the reference point for numbering the individual nucleotides within a gene. Nucleotides upstream of the start site are assigned negative numbers and those downstream are assigned positive numbers, which are used to indicate the positions of nearby sequences or structures relative to the TSS. For example, the binding site for RNA polymerase might be a short sequence immediately upstream of the TSS, from approximately -80 to -5, whereas an intron within the coding region might be defined as the sequence starting at nucleotide +207 and ending at nucleotide +793.
transcription unit
The segment of DNA between the initiation site and the termination site of transcription, containing the coding sequences for one or more genes. All genes within a transcription unit are transcribed together into a single transcript during a single transcription event; the resulting polycistronic RNA may subsequently be cleaved into separate RNAs, or may be translated as a unit and then cleaved into separate polypeptides.[13]
transcriptional bursting
The intermittent nature of transcription and translation mechanisms. Both processes occur in "bursts" or "pulses", with periods of gene activity separated by irregular intervals.
transcriptome
The entire set of RNA molecules (often referring to all types of RNA but sometimes exclusively to messenger RNA) that is or can be expressed by a particular genome, cell, population of cells, or species at a particular time or under particular conditions. The transcriptome is distinct from the exome and the translatome.
transcriptomics
The study of the transcriptome of a particular genome, cell, or organism, i.e. the sum total of all of the RNA transcripts produced from it by transcription. Transcriptomics technologies allow scientists to isolate and sequence transcriptomes, which can then be mapped to the genome to determine which genes are being expressed or which cellular processes are active and which are dormant at a given time.
transcytosis
The transport of molecules across the interior of a cell, i.e. through the cytoplasm, especially a polarized cell such as an epithelial cell with contrasting apical and basal surfaces, thereby providing a spatially oriented transport system. Molecules undergoing transcytosis are usually contained within vesicles.[5]
transductant
A cell which has undergone transduction and been successfully transduced.
transduction
The transfer of genetic material between cells by a virus or viral vector, either naturally or artificially.
transfectant
A cell which has undergone transfection and been successfully transfected.
transfection
The deliberate experimental introduction of exogenous nucleic acids into a cell or embryo. In the broadest sense the term may refer to any such transfer and is sometimes used interchangeably with transformation, though some applications restrict the usage of transfection to the introduction of naked or purified non-viral DNA or RNA into cultured eukaryotic cells (especially animal cells) resulting in the subsequent incorporation of the foreign DNA into the host genome or the non-hereditary modification of gene expression by the foreign RNA. As a contrast to both standard non-viral transformation and transduction, transfection has also occasionally been used to refer to the uptake of purified viral nucleic acids by bacteria or plant cells without the aid of a viral vector.[13]
transfer RNA (tRNA)

Formerly referred to as soluble RNA (sRNA).

A special class of RNA molecule, typically 76 to 90 nucleotides in length, that serves as a physical adapter allowing mRNA transcripts to be translated into sequences of amino acids during protein synthesis. Each tRNA contains a specific anticodon triplet corresponding to an amino acid that is covalently attached to the tRNA's opposite end; as translation proceeds, tRNAs are recruited to the ribosome, where each mRNA codon is paired with a tRNA containing the complementary anticodon. Depending on the organism, cells may employ as many as 41 distinct tRNAs with unique anticodons; because of codon degeneracy within the genetic code, several tRNAs containing different anticodons carry the same amino acid.
transferase
Any of a class of enzymes which catalyze the chemical transfer of a functional group or substituent from one molecule to another.[2] For example, acetyltransferases catalyze the movement of an acetyl group in a process known as acetylation; methyltransferases catalyze the movement of one or more methyl groups in a process known as methylation.
transfer-messenger RNA (tmRNA)
A type of RNA molecule in some bacteria which has dual tRNA-like and mRNA-like properties, allowing it to simultaneously perform a number of different functions during translation.
transformant
A cell or organism which has taken up extracellular DNA by transformation and which can express genes encoded by it.
transformation
transgene
Any gene or other segment of genetic material that has been isolated from one organism and then transferred either naturally or by any of a variety of genetic engineering techniques into another organism, especially one of a different species. Transgenes are usually introduced into the second organism's germ line. They are commonly used to study gene function or to confer an advantage not otherwise available in the unaltered organism.
transition
A point mutation in which a purine nucleotide is substituted for another purine (AG) or a pyrimidine nucleotide is substituted for another pyrimidine (CT). Contrast transversion.
translation
The second step in the process of gene expression, in which the messenger RNA transcript produced during transcription is read by a ribosome to produce a functional protein.
translatome
The entire set of messenger RNA molecules that are translated by a particular genome, cell, tissue, or species at a particular time or under particular conditions. Like the transcriptome, it is often used as a proxy for quantifying levels of gene expression, though the transcriptome also includes many RNA molecules that are never translated.
translocation
A type of chromosomal abnormality caused by the structural rearrangement of large sections of one or more chromosomes. There are two main types: reciprocal and Robertsonian.
transmembrane protein
See integral polytopic protein.
transmission genetics
The branch of genetics that studies the mechanisms involved in the transfer of genes from parents to offspring.[13]
transport protein

Also transporter.

Any transmembrane protein which functions by permitting the movement of particular molecules, proteins, or other substances across a membrane, either actively or passively and in either or both directions (by which they may be further subclassified into uniporters, antiporters, and symporters).[3] Channel proteins and nuclear pores are examples of transport proteins.
transporter
See transport protein.
transposable element (TE)

Also transposon.

Any of a diverse variety of selfish mobile genetic elements consisting of self-acting DNA sequences capable of replicating themselves semi-autonomously and inserting into random or specific sites within a host genome, a process known as transposition. Transposons contain one or more genes which encode enzymes known as transposases capable of recognizing sequences within a flanking pair of inverted repeats, such that the enzymes effectively catalyze their own replication, excision, and/or re-insertion into other DNA molecules by any of various mechanisms.[3]
transposase
Any of a class of self-acting enzymes capable of binding to the flanking sequences of the transposable element which encodes them and catalyzing its movement to another part of the genome, typically by an excision/insertion mechanism or a replicative mechanism, in a process known as transposition.
transposition
The process by which a nucleic acid sequence known as a transposable element changes its position within a genome, either by excising and re-inserting itself at a different locus (cut-and-paste) or by duplicating itself and inserting into another locus without moving the original element from its original locus (copy-paste). These reactions are catalyzed by an enzyme known as a transposase which is encoded by a gene within the transposable element itself; thus the element's products are self-acting and can autonomously direct their own replication. Transposed sequences may re-insert at random loci or at sequence-specific targets, either on the same DNA molecule or on different molecules.
transvection
transversion
A point mutation in which a purine nucleotide is substituted for a pyrimidine nucleotide, or vice versa (e.g. AC or AT). Contrast transition.
tricarboxylic acid cycle (TCA)
See citric acid cycle.
triglyceride

Also triacylglycerol and triacylglyceride.

Any of a class of chemical compounds which are ester derivatives of glycerol, consisting of a glycerol backbone connected to any three fatty acid substituents via ester bonds. Triglycerides are one of three major classes of esters formed by fatty acids in biological systems, along with phospholipids and cholesteryl esters. They are the primary constituent of adipose tissue in vertebrates.
trimer
A molecular aggregate consisting of three subunits.[31] The term is often used to refer to protein complexes composed of three proteins, e.g. many membrane porins, or to individual proteins composed of three polypeptides. Compare monomer, dimer, and tetramer.
trinucleotide repeat
Any sequence in which an individual nucleotide triplet is repeated many times in tandem, whether in a gene or non-coding sequence. At most loci some degree of repetition is normal and harmless, but mutations which cause specific triplets (especially those of the form CnG) to increase in copy number above the normal range are highly unstable and responsible for a variety of genetic disorders.
triplet
A unit of three successive nucleotides in a DNA or RNA molecule.[13] A triplet within a coding sequence that codes for a specific amino acid is known as a codon.
trisomy
A type of polysomy in which a diploid cell or organism has three copies of a particular chromosome instead of the normal two.
tRNA
See transfer RNA.
tRNA-ligase
See aminoacyl-tRNA synthetase.
tropism

Also tropic movement.

The directional growth or movement of a cell or organism in response to a stimulus, e.g. light, heat, the pull of gravity, or the presence of a particular chemical, such that the response is dependent on the direction of the stimulus (as opposed to a non-directional nastic response). Positive tropism is growth or movement toward the stimulus; negative tropism is away from the stimulus.[2] See also taxis and kinesis.
tumorigenesis
turgor pressure

Also turgidity.

The force within a cell which pushes the plasma membrane against the cell wall,[32] a type of hydrostatic pressure influenced by the osmotic flow of water into and out of the cell. Turgidity is observed in plants, fungi, bacteria, and some protists with cell walls, but generally not in animal cells.
turnover number
In enzymology, a measure of the rate at which a particular enzyme catalyzes a particular biochemical reaction, usually expressed as the average number of substrate molecules it is capable of converting into reaction products per unit time at a given concentration of enzyme.[33]
twisting number


U

[edit]
ubiquitin (Ub)
A small protein of 76 amino acids found in great quantities (ubiquitously) in all eukaryotic cells, employed chiefly as a post-translational protein tag, by which its C-terminal glycine residue is covalently bonded to electrically charged residues within other proteins or polypeptides, a process known as ubiquitination. Ubiquitin tags have functions in the heat-shock response, protein sorting, proteolysis, membrane trafficking, cell signaling, regulation of the cell cycle, X chromosome inactivation, and histone modification, among others.[3]
ubiquitination

Also ubiquitylation.

The labelling of a biomolecule (often another protein) by covalently attaching a ubiquitin protein to it—generally via the formation of an amide bond between the ubiquitin's C-terminal glycine and positively charged side chains (often lysine or arginine residues) of the labelled molecule, an ATP-dependent reaction catalyzed by ubiquitin-conjugating enzymes[3]—thus making it identifiable to molecules capable of recognizing ubiquitin epitopes. Ubiquitination is a widely used post-translational modification by which proteins are tagged; the attachment of a single ubiquitin molecule (monoubiquitination) can variously activate or inhibit a protein's activity, while the attachment of a chain of multiple consecutively linked ubiquitin molecules (polyubiquitination) commonly targets the protein for degradation by proteasomes.
umber
See opal.
uncharged tRNA
A transfer RNA without an attached amino acid. Contrast charged tRNA.
underwinding
See negative supercoiling.
unequal crossing over
uniporter
A type of transport protein which catalyzes the movement of a single, specific solute or chemical species across a lipid membrane in either direction.[4] Contrast antiporter and symporter.
Three classes of membrane transport proteins can be distinguished by function: uniporters, symporters, and antiporters.
unique DNA

Also non-repetitive DNA.

A class of DNA sequences determined by C0t analysis to be present only once in the analyzed genome, as opposed to repetitive sequences. Most structural genes and their introns are unique.[13]
unstable mutation
A mutation with a high frequency of reversion.[13]
untranslated region (UTR)
Any non-coding sequence which is transcribed along with a protein-coding sequence, and thus included within a messenger RNA, but which is not ultimately translated during protein synthesis. A typical mRNA transcript includes two such regions: one immediately upstream of the coding sequence, known as the 5' untranslated region (5'-UTR), and one downstream of the coding sequence, known as the 3' untranslated region (3'-UTR). These regions are not removed during post-transcriptional processing (unlike introns) and are usually considered distinct from the 5' cap and the 3' polyadenylated tail (both of which are later additions to a primary transcript and not themselves products of transcription). UTRs are a consequence of the fact that transcription usually begins considerably upstream of the start codon of the coding sequence and terminates long after the stop codon has been transcribed, whereas translation is more precise. They often include motifs with regulatory functions.
upregulation

Also promotion.

Any process, natural or artificial, which increases the level of gene expression of a certain gene. A gene which is observed to be expressed at relatively high levels (such as by detecting higher levels of its mRNA transcripts) in one sample compared to another sample is said to be upregulated. Contrast downregulation.
upstream
Towards or closer to the 5'-end of a chain of nucleotides, or the N-terminus of a peptide chain. Contrast downstream.
upstream activating sequence (UAS)

Also upstream activator sequence and upstream activation sequence.

A type of cis-acting regulatory element found in the DNA of yeast such as Saccharomyces cerevisiae, usually a few hundred base pairs upstream of the transcription initiation site within the promoter of a protein-coding gene, which helps to increase the gene's expression by serving as a binding site for transcriptional transactivators, analogous to the function of an enhancer in multicellular eukaryotes.[34][5]
uracil (U)
A pyrimidine nucleobase used as one of the four standard nucleobases in RNA molecules. Uracil forms a base pair with adenine. In DNA, uracil is not used at all, and is instead replaced with thymine.
uridine (U, Urd)
One of the four standard nucleosides used in RNA molecules, consisting of a uracil base with its N9 nitrogen bonded to the C1 carbon of a ribose sugar. In DNA, uridine is replaced with thymidine.


V

[edit]
vacuole
Any of a class of enclosed, fluid-filled compartments present in many eukaryotic cells as well as bacteria, often large and conspicuous under the microscope and serving any of a huge variety of functions, including acting as a resizable reservoir for the storage of water, metabolic waste, toxins, or foreign material; maintaining cellular homeostasis and hydrostatic pressure; supporting immune functions; housing symbiotic bacteria; and assisting in the degradation and recycling of old cellular components.[2]
variable number tandem repeat (VNTR)
Any of a class of tandem repeats for which the copy number of the repeated sequence at a particular locus tends to vary between individuals of the same species. VNTRs may occur throughout the genome, both within and outside of coding DNA, and if the copy number is stably inherited may be used in DNA fingerprinting to uniquely identify individuals or to determine their genealogical relatedness to other individuals.
variegation
Variation or irregularity in a particular phenotype, especially a conspicuous visible trait such as color or pigmentation, occurring simultaneously in different parts of the same individual organism due to any of a variety of causes, such as X-inactivation, mitotic recombination, transposable element activity, position effects, or infection by pathogens.
vector
Any DNA molecule used as a vehicle to artificially transport foreign genetic material into another cell, where it can be replicated and/or expressed. Vectors are typically engineered recombinant DNA sequences consisting of an insert (often a transgene) and a longer "backbone" sequence containing an origin of replication, a multiple cloning site, and a selectable marker. Vectors are widely used in molecular biology laboratories to isolate, clone, or express the insert in the target cell.
vectorization
vegetal cell
See somatic cell.
vesicle
Any membrane-bound space completely enclosed by its own membrane, which is separate though usually derived from other membranes (often the cell membrane) either by budding or by mechanical disruption such as sonication.[3] The term is applied to many different structures but especially to the small, roughly spherical compartments created during endocytosis and exocytosis, as well as to lysosomes and various other small intracellular or extracellular organelles.[2]


W

[edit]
Warburg effect
western blotting
A blotting method used for detecting and identifying specific proteins in heterogeneous biological samples. The technique involves separating proteins by size with gel electrophoresis and then immobilizing them upon a nitrocellulose, nylon, or other synthetic membrane, after which they may be visualized by autoradiography or by labelling with chemiluminescent, radioactive, or enzyme-linked antibodies, lectins, or other specific binding agents.[5] Compare Southern blotting, northern blotting, and eastern blotting.
An outline of the steps involved in a western blot
whole genome sequencing (WGS)
The process of determining the entirety or near-entirety of the DNA sequences comprising an organism's genome with a single procedure or experiment, generally inclusive of all chromosomal and extrachromosomal (e.g. mitochondrial) DNA.
wild type (WT)

Denoted in shorthand with a + superscript.

The phenotype of the typical form of a species as it occurs in nature; a product of the standard "normal" allele at a given locus, as opposed to that produced by a non-standard mutant allele.
wobble base pairing
writhing number (W)

Also writhe.

An index of the superhelical coiling of a DNA molecule. The writhing number does not have a precise quantitative definition but instead represents the degree of supercoiling. Together, the writhing number and the twisting number determine the linking number.[5]


X

[edit]
X chromosome
One of two sex chromosomes present in organisms which use the XY sex-determination system, and the only sex chromosome in the X0 system. The X chromosome is found in both males and females and typically contains much more gene content than its counterpart, the Y chromosome.
X-inactivation
The process by which one of the two copies of the X chromosome is silenced by being irreversibly condensed into transcriptionally inactive heterochromatin in the cells of female therian mammals. A form of dosage compensation, X-inactivation prevents females from producing twice as many gene products from genes on the X chromosome as males, who only have one copy of the X chromosome. Which X chromosome is inactivated is randomly determined in the early embryo, making it possible for cell lineages with different inactive Xs to exist in the same organism.
X-linked trait
A phenotypic trait whose expression is governed or influenced by one or more genes located on the X chromosome (making it a sex-linked trait).


Y

[edit]
Y chromosome
One of two sex chromosomes present in organisms which use the XY sex-determination system. The Y chromosome is found only in males and is typically much smaller than its counterpart, the X chromosome.
Y fork
See replication fork.
yeast artificial chromosome (YAC)


Z

[edit]
Z-DNA
zinc finger (ZF)
A supersecondary polypeptide structural motif and DNA-binding domain occurring in many DNA-binding proteins, characterized by a series of non-adjacent amino acid residues which fold into a three-dimensional arrangement capable of coordinating one or more zinc ions (Zn2+
) between them, thus stabilizing the fold into a definite structure that can interact specifically with other biomolecules such as nucleic acids or other polypeptides. There are many distinct classes of zinc fingers using different ligands and spatial arrangements to achieve coordination; in perhaps the most common variant, a short alpha helix is oriented antiparallel to a β sheet, with two histidine residues in the former and two cysteines in the latter forming the coordination complex. Zinc fingers bind DNA as the primary functional domain of many transcription factors.
zonula adherens
See adherens junction.
zonula occludens
See tight junction.
zygonema

Also zygotene stage.

In meiosis, the second of five substages of prophase I, following leptonema and preceding pachynema. During zygonema, synapsis occurs, physically binding homologous chromosomes to each other, and the cell's centrosome divides into two daughter centrosomes, each containing a single centriole.[13]
zygosity
The degree to which multiple copies of a gene, chromosome, or genome have the same genetic sequence; e.g. in a diploid organism with two complete copies of its genome (one maternal and one paternal), the degree of similarity of the alleles present in each copy. Individuals carrying two different alleles for a particular gene are said to be heterozygous for that gene; individuals carrying two identical alleles are said to be homozygous for that gene. Zygosity may also be considered collectively for a group of genes, or for the entire set of genes and genetic loci comprising the genome.
zygote
A type of eukaryotic cell formed as the direct result of a fertilization event between two gametes. In multicellular organisms, the zygote is the earliest developmental stage.


See also

[edit]

References

[edit]

Further reading

[edit]
[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
The Glossary of cellular and molecular biology (M–Z) is a reference compendium offering concise definitions of specialized terminology used in the study of cellular biology—which examines the structure, function, and regulation of cells as the fundamental units of life—and molecular biology—which investigates the structure, interactions, and functions of biological macromolecules such as DNA, RNA, and proteins in living processes. This alphabetical segment spans entries from macrophage (a key immune cell involved in phagocytosis) to zymogen (an inactive enzyme precursor), encompassing over several thousand terms that bridge classical cytology with modern genomics and proteomics. Essential to researchers, educators, and students, the glossary highlights pivotal concepts in areas like membrane dynamics (, the site of ), genetic mechanisms (, alterations in DNA ), signaling pathways (, a regulatory modification), and macromolecular synthesis (, the protein assembly machinery), up to developmental endpoints such as formation. These definitions draw from rigorous , emphasizing precise to facilitate interdisciplinary communication in fields ranging from to medicine. Notable for its utility in clarifying evolving terminology amid advances in techniques like (, a in bacterial immune systems) and , the M–Z section supports comprehension of complex phenomena, including ( versus ) and epigenetic regulation ( of histones). By providing cross-references and etymological notes where relevant, it serves as an indispensable tool for decoding the molecular underpinnings of health, disease, and evolution.

M

Membrane and Organelle Structures

In cellular and molecular biology, and structures form the foundational compartments that enable eukaryotic cells to organize biochemical processes, maintain , and respond to environmental cues. These structures, primarily composed of macromolecules and bilayers, compartmentalize the into specialized regions, facilitating selective , production, and degradation pathways. Key components include lipid-based barriers embedded with proteins, as well as membrane-enclosed organelles that house enzymatic reactions essential for cellular function. Macromolecules are large, complex molecules typically formed by the of smaller monomeric units, such as into proteins, nucleotides into nucleic acids, and monosaccharides into , playing critical roles in cellular architecture, , and information storage. These polymers, often exceeding thousands of daltons in molecular weight, provide structural integrity to and while enabling dynamic interactions like folding and assembly into functional complexes. In the context of and structures, macromolecules such as proteins and integrate to form stable yet fluid barriers that support compartmentalization. Membrane refers to the or intracellular membranes that enclose cells and s, consisting of a bilayer that acts as a selective permeability barrier, regulating the influx and efflux of ions, nutrients, and waste products. This structure separates the cell's interior from the extracellular environment while allowing communication through embedded proteins, thereby maintaining cellular integrity and enabling processes like and . Membranes are dynamic, with fluidity influenced by temperature and lipid composition, which is essential for organelle biogenesis and vesicle trafficking. Membrane lipid denotes the amphipathic molecules, primarily phospholipids and sterols like , that self-assemble into the hydrophobic core of the , providing a semi-permeable matrix that excludes water-soluble substances while allowing lateral of embedded components. These , with hydrophilic heads and hydrophobic tails, spontaneously form bilayers in aqueous environments, contributing approximately 50% of the 's mass and modulating membrane thickness and curvature for shaping. , in particular, intercalates between phospholipids to reduce fluidity at physiological temperatures, enhancing stability in structures like the plasma and mitochondrial inner . Membrane protein encompasses integral proteins, which span the via transmembrane domains, and peripheral proteins, which associate loosely with the membrane surface, collectively facilitating , enzymatic activity, signaling, and within cellular compartments. Integral membrane proteins, such as channels and receptors, often traverse the bilayer multiple times with alpha-helical segments, enabling selective ion permeation and across organelles like the and mitochondria. Peripheral proteins, bound via electrostatic interactions, aid in cytoskeletal anchoring and vesicle docking, contributing to the asymmetry and functionality of membrane-bound structures. Membrane-bound organelle describes intracellular compartments enclosed by one or more bilayers, such as the nucleus, , and Golgi apparatus, which segregate specialized enzymatic reactions and prevent cross-talk between incompatible biochemical pathways. These organelles create distinct microenvironments by acting as diffusion barriers, with their membranes enriched in specific and proteins that support functions like protein synthesis and modification. In eukaryotic cells, this compartmentalization enhances efficiency, as seen in the 's role in folding secretory proteins away from cytosolic processes. Microbody is a single-membrane-bound , typically 0.5–1.5 μm in diameter, including and , that houses oxidative enzymes for detoxifying and metabolizing or photorespiratory intermediates. These organelles contain a dense proteinaceous matrix with catalases and oxidases, enabling the breakdown of generated during beta-oxidation, thus protecting cellular components from oxidative damage. In cells, within facilitate the conversion of stored to carbohydrates during , highlighting their role in metabolic compartmentalization. Micronucleus, in protozoans like (e.g., ), is a smaller, diploid nucleus distinct from the larger macronucleus, serving as the for and transmission during conjugation and . Unlike the transcriptionally active macronucleus, the remains dormant during vegetative growth but undergoes precise and segregation during , ensuring genetic stability. This dimorphic nuclear organization allows to separate somatic and reproductive functions within a single cell, a unique adaptation in membrane-bound nuclear structures. Midbody is a transient, microtubule-based structure formed at the intercellular bridge during the final stages of , consisting of bundled central spindle fibers and associated proteins that orchestrate the of daughter cells. Composed of compacted antiparallel enveloped by a plasma membrane, the midbody recruits endosomal sorting complex required for transport () machinery to constrict and sever the bridge, ensuring complete cell separation. Post-, midbody remnants can persist as signaling organelles, influencing polarity and proliferation in the daughter cells. Mitochondrion is a double-membrane-bound , often termed the powerhouse of the cell, that generates ATP through via the embedded in its inner membrane cristae. The outer membrane permits free diffusion of small molecules, while the inner membrane, highly folded to increase surface area, maintains a proton gradient essential for activity, coupling it to the tricarboxylic acid cycle in the matrix. Mitochondria also contain their own circular DNA and ribosomes, supporting semi-autonomous replication and protein synthesis for respiratory complexes. Mitophagy is the selective autophagic process that targets dysfunctional or excess mitochondria for lysosomal degradation, involving ubiquitination and engulfment by autophagosomes to maintain mitochondrial quality and prevent accumulation. Triggered by stressors like hypoxia or damaged respiratory chains, mitophagy employs receptors such as BNIP3 or Parkin to tag mitochondria for clearance, thereby preserving cellular and averting . This pathway is crucial for turnover, with defects linked to neurodegenerative diseases due to impaired mitochondrial dynamics.

Metabolism and Biochemical Processes

Metabolism
refers to the entirety of chemical reactions occurring within cells to maintain life, providing and enabling the synthesis of cellular components. It is divided into two main categories: , which consumes to construct complex molecules from simpler precursors, such as the assembly of proteins from ; and , which breaks down macromolecules to release , often in the form of ATP, as seen in the degradation of glucose. These interconnected processes ensure cellular and respond to environmental cues, with catabolic pathways typically predominating during energy demand to fuel anabolic reactions.
Metabolic pathway
A consists of a sequence of enzyme-catalyzed reactions that transform substrates into products, facilitating the flow of metabolites through interconnected steps. For example, is an anaerobic pathway that converts glucose into pyruvate, generating ATP and NADH without oxygen, while the (also known as the ) oxidizes in mitochondria to produce electron carriers for . These pathways are regulated by enzymes and allosteric effectors to balance energy production and biosynthetic needs, with disruptions often leading to metabolic disorders.
Metabolic waste
Metabolic waste encompasses the byproducts of cellular metabolism that accumulate during energy conversion and must be eliminated to avoid toxicity, such as carbon dioxide from oxidative respiration and urea from amino acid catabolism. In vertebrates, urea is synthesized in the liver through the urea cycle, a series of reactions that detoxifies ammonia—a harmful byproduct of protein breakdown—before excretion by the kidneys. These wastes reflect the efficiency of catabolic processes, with their removal essential for maintaining pH balance and preventing cellular stress.
Metabolite
A is a small, low-molecular-weight organic or inorganic that serves as an intermediate, reactant, or product in metabolic pathways, not directly encoded by the but generated through enzymatic activity. Examples include glucose-6-phosphate in or in the Krebs cycle, which link catabolic and anabolic reactions to support energy transfer and . Metabolites dynamically reflect cellular state, influencing signaling and regulation, as their concentrations modulate in pathways.
Metabolome
The metabolome represents the complete set of all small-molecule metabolites present in a cell, tissue, or at a specific time, providing a snapshot of metabolic activity and environmental interactions. Unlike the static , the metabolome is highly dynamic, encompassing thousands of compounds like , , and sugars that vary with physiological conditions. It serves as the downstream readout of genomic and proteomic functions, enabling the study of how perturbations in pathways affect overall cellular health.
Methylation
involves the covalent addition of a (CH₃) to biomolecules such as , proteins, or metabolites, serving as a key regulatory modification in cellular processes. In typically occurs at the 5-position of residues in CpG dinucleotides, recruiting repressive proteins to silence and maintain epigenetic memory. For metabolites, alters and activity, as in the conversion of norepinephrine to epinephrine, while protein on histones or non-histone targets influences structure and signaling. This process is reversible and tightly controlled, impacting development, disease, and metabolic adaptation.
Methyltransferase
Methyltransferases are a class of transferase enzymes that catalyze the transfer of a methyl group from S-adenosylmethionine (SAM) to acceptor substrates, including nucleic acids, proteins, and small molecules. DNA methyltransferases (DNMTs), such as DNMT1 for maintenance and DNMT3 for de novo methylation, establish epigenetic patterns by modifying cytosines, essential for gene regulation. In metabolic contexts, enzymes like catechol O-methyltransferase (COMT) detoxify catecholamines, highlighting their role in both regulatory and catabolic functions. These enzymes ensure precise control over methylation, with dysregulation linked to cancers and neurological disorders.
Mistranslation
Mistranslation refers to errors in protein synthesis where the incorporates incorrect into the polypeptide chain, deviating from the mRNA codon sequence. Such errors arise from tRNA misacylation, near-cognate tRNA selection, or ribosomal frameshifting, occurring at rates of about 10⁻⁴ per codon in eukaryotes but increasing under stress. While generally deleterious, leading to misfolded proteins and cellular , mistranslation can confer adaptive benefits, such as phenotypic variability during environmental challenges. Quality control mechanisms, including editing by synthetases and degradation by the , mitigate these errors to preserve integrity.
Monosaccharide
A is the simplest form of , consisting of a single unit with the general Cₙ(H₂O)ₙ, serving as a fundamental building block for more complex carbohydrates and a primary energy source. Common examples include glucose (an aldohexose), (a ketohexose), and (a ), which exist in linear or cyclic forms and participate in pathways like for ATP production. Monosaccharides also contribute to structural roles, such as in synthesis, and their facilitates metabolic regulation and transport across membranes.
Motor protein
Motor proteins are ATP-dependent enzymes that generate mechanical force for intracellular transport and motility by undergoing conformational changes along cytoskeletal filaments. Key families include kinesins and dyneins, which move cargos along toward the plus or minus ends, respectively, and myosins, which interact with filaments for processes like vesicle trafficking and . These proteins hydrolyze ATP to produce directed movement at speeds up to 800 nm/s, essential for organizing organelles and maintaining cellular architecture. Disruptions in motor protein function impair and contribute to neurodegenerative diseases.

Mitosis and Cell Division Mechanisms

Mitosis is the process of nuclear division in eukaryotic cells that produces two genetically identical daughter cells, each containing the same number of chromosomes as the parent cell. This mechanism ensures the faithful distribution of replicated DNA during , primarily occurring in somatic cells to support growth, tissue maintenance, and repair. The process is tightly regulated by cyclin-dependent kinases and involves a series of morphological changes coordinated by the mitotic spindle, a structure composed of that attach to chromosomes and pull them apart. The M phase, or mitotic phase, encompasses the entire period of from through , during which the cell's duplicated is segregated and the divides. In , chromosomes condense and the breaks down, allowing spindle formation; involves microtubule attachment to kinetochores on chromosomes; aligns chromosomes at the cell's equator; separates toward opposite poles; and reforms nuclei followed by . This phase is preceded by (G1, S, G2) and is triggered by the activation of (MPF), a complex of and CDK1, which phosphorylates targets to drive these transitions. Errors in M phase progression can lead to , a hallmark of cancer. Metaphase is the stage of where condensed chromosomes align at the metaphase plate, an imaginary equatorial plane in the cell, ensuring balanced segregation. from opposite spindle poles capture kinetochores on , generating tension that stabilizes attachments and satisfies the spindle assembly checkpoint, preventing premature onset. This alignment is crucial for equal chromosome distribution, and disruptions, such as microtubule poisons, can arrest cells here, as visualized in cytogenetic studies. Mitogens are extracellular signaling molecules, such as growth factors (e.g., , PDGF), that bind to cell surface receptors and initiate intracellular cascades to stimulate progression from G1 into and ultimately entry into . These signals activate pathways like MAPK/ERK, which upregulate cyclins and inhibit cell cycle inhibitors like p27, promoting proliferation in response to environmental cues. In the absence of mitogens, cells typically arrest in G0/G1, highlighting their role in coordinating division with nutrient availability and tissue demands. The quantifies the proportion of cells actively undergoing in a , calculated as the number of cells in M phase divided by the total number of cells, often expressed as a or per in tissue sections. It serves as a proxy for proliferative activity, with higher indices indicating rapid cell turnover, as seen in tumors where values exceed 10% in aggressive cancers versus <1% in quiescent tissues. This metric is widely used in for grading malignancies and assessing treatment responses. Mitotic recombination refers to the rare exchange of genetic material between homologous chromosomes during in somatic cells, typically suppressed to maintain genomic stability but occurring at low frequencies (e.g., 10^{-5} to 10^{-6} per ). Unlike meiotic recombination, it arises from double-strand breaks repaired via pathways involving proteins like RAD51, potentially generating that contributes to tumorigenesis. This process is more pronounced in cells with DNA damage or replication stress. Mitotic describes the reversible transformation of adherent cells into a spherical during early , driven by actomyosin contractility and cortical stiffening to detach from the substrate and create space for spindle assembly. This change, mediated by RhoA signaling and myosin II, ensures proper bipolar spindle formation and chromosome alignment, with failure leading to missegregation; for instance, inhibiting increases rates by up to 20-fold in epithelial cells. Mitotic segregation involves the precise partitioning of to daughter cells during , orchestrated by the shortening of and the spindle elongation to move to opposite poles. The spindle assembly checkpoint monitors attachments to prevent errors, but failures result in unequal distribution, causing ; dysfunction, for example, leads to random segregation and loss in up to 50% of divisions. Meiosis is a specialized reductional in germ cells that produces four haploid gametes from a diploid precursor, involving two sequential divisions (meiosis I and II) after one , with ensuring diversity. In meiosis I, homologous chromosomes pair and segregate via chiasmata formed by crossovers, reducing ; meiosis II resembles , separating . This process, regulated by proteins like Spo11 for double-strand breaks and MLH1 for crossovers, is essential for and averts diploid gametes that could cause . Mixoploidy denotes the presence of cells with varying levels within the same or tissue, often arising from errors in early embryonic mitotic divisions, such as chromosome missegregation or failure. In human preimplantation embryos, mixoploidy affects up to 30% of blastomeres due to asymmetric unification, where parental pronuclei segregate unevenly, leading to aneuploid, diploid, or polyploid lineages that can propagate or be selected against during development. This mosaicism underlies some congenital disorders and implantation failures.

Molecular Genetics and Mutations

Molecular genetics is the field that examines the molecular basis of , focusing on the structure, function, and regulation of genes, while represent permanent alterations in the DNA sequence that can drive evolutionary change, , and pathological conditions. These changes are heritable when occurring in cells and can propagate through cell divisions, influencing phenotypes from subtle variations to severe disorders. Key concepts in this domain include techniques for identifying and manipulating genetic elements, as well as specific types of sequence alterations starting with the letter M. A is defined as a heritable change in the nucleotide sequence of an organism's DNA, serving as the primary source of genetic variation. Such alterations can arise spontaneously during DNA replication or be induced by external agents, potentially leading to beneficial adaptations, neutral polymorphisms, or deleterious effects like disease predisposition. Map-based cloning, also known as positional cloning, is a strategy to isolate genes responsible for specific traits or diseases by using genetic linkage maps to narrow down chromosomal regions of interest. This method involves constructing high-resolution genetic maps with molecular markers, followed by chromosome walking or jumping to identify candidate genes, and has been instrumental in cloning genes for monogenic disorders such as cystic fibrosis. Massively parallel sequencing, often referred to as next-generation sequencing, enables high-throughput analysis of DNA sequences to detect s across entire genomes or targeted regions simultaneously. By generating millions of short reads in parallel, it facilitates the identification of rare variants and structural changes at unprecedented scale and speed, revolutionizing mutation detection in clinical diagnostics and research. Medical genetics applies molecular tools to study and diagnose hereditary disorders, integrating genomic data to understand patterns and mechanisms. Techniques like sequencing and variant analysis help classify according to guidelines from organizations such as the American College of Medical Genetics, aiding in personalized treatment for conditions like inherited cancers. A megabase (Mb) is a unit of measurement for DNA length, equivalent to one million base pairs, commonly used to describe genome sizes and chromosomal segments. For instance, the human genome spans approximately 3,000 Mb, providing a scale for mapping large-scale structural variations. A microdeletion refers to the loss of a small segment of chromosomal DNA, typically less than 5 Mb, which can cause genetic syndromes due to haploinsufficiency of key genes. Examples include 22q11.2 deletion syndrome, where such losses lead to developmental abnormalities like congenital heart defects and immune deficiencies. MicroRNA (miRNA) consists of small non-coding RNAs, about 22 long, that post-transcriptionally repress by binding to target mRNAs, leading to their degradation or translational inhibition. These molecules regulate diverse processes like development and stress responses, with dysregulation implicated in cancers and other diseases. Microsatellites are tandem repeats of short DNA motifs (1–6 base pairs) that exhibit high polymorphism due to variable repeat numbers, making them valuable for and tracing mutations in linkage studies. Their instability contributes to mutation hotspots, and they are widely used in forensic and to assess . Minisatellites are longer tandem repeats of 10–100 units, highly variable in length, historically employed in DNA fingerprinting for individual identification. Unlike microsatellites, their greater size and mutation rates via unequal crossing-over make them suitable for paternity testing and evolutionary analyses, though next-generation methods have largely supplanted them. A occurs when a single change in a codon results in the substitution of one for another in the encoded protein, potentially altering its function. For example, the sickle cell mutation in replaces with , causing abnormalities. Molecular cloning involves inserting a DNA fragment of interest into a vector, such as a , to propagate and amplify it within a host organism like . This technique enables the production of for studying gene function, protein expression, and generating libraries for sequencing. Mosaicism describes the presence of two or more genetically distinct cell populations in an individual, arising from post-zygotic that create variation across cell lineages. Somatic mosaicism can contribute to diseases like neurocutaneous syndromes, while explains recurrent mutations in offspring without parental carrier status. Mutagenesis is the process of inducing mutations in DNA using chemical agents, radiation, or biological means to generate genetic variants for research or breeding. Chemical mutagens like alkylating agents cause base modifications, while ionizing radiation induces breaks, both increasing mutation rates to study gene function or create crop improvements. A mutator gene is a genetic variant that elevates the overall mutation rate in an organism by impairing DNA repair or replication fidelity. Such genes, like mutS in bacteria, promote adaptive evolution in stressful environments but can accelerate tumorigenesis in multicellular organisms.

N

Nucleotide and Nucleic Acid Components

Nucleotides serve as the fundamental monomeric units of nucleic acids, consisting of a nitrogenous base, a pentose sugar (ribose in RNA or deoxyribose in DNA), and one or more phosphate groups linked via phosphoester bonds. These molecules polymerize through phosphodiester linkages between the 3' hydroxyl of one nucleotide's sugar and the 5' phosphate of another, forming the backbone of DNA and RNA strands essential for genetic information storage and transfer. In DNA, the sugar is 2'-deoxyribose, enabling the stable double-helix structure, while RNA incorporates ribose, which supports its role in transient processes like messenger and transfer RNA functions. The nitrogenous bases in are heterocyclic aromatic compounds classified into purines and pyrimidines, which dictate base-pairing specificity in nucleic acids. Purines, (A) and (G), feature a fused and ring system, with containing an amino group at position 6 and a carbonyl at 6 and amino at 2. Pyrimidines, (C), (T) in DNA, and uracil (U) in RNA, possess a single six-membered ring; has an amino group at 4 and carbonyls at 2, carbonyls at 2 and 4 with a methyl at 5, and uracil similar but without the methyl. These bases pair via bonds—A with T (two bonds) or U, and G with C (three bonds)—stabilizing the DNA double helix and enabling accurate replication and transcription. Nicotinamide adenine dinucleotide (NAD) functions as a vital coenzyme in cellular reactions, existing primarily as the oxidized form NAD⁺ and reduced form NADH. NAD⁺ accepts a ion (H⁻) from substrates in catabolic pathways like and the tricarboxylic acid cycle, converting to NADH, which then donates electrons to the for ATP production. Structurally, NAD comprises two linked by a bond: one with (derived from ) as the base and the other with , both attached to and phosphates. This dinucleotide's regulates energy , with the NAD⁺/NADH ratio influencing enzymatic activities in . Nicotinamide adenine dinucleotide phosphate (NADP), a phosphorylated of NAD at the 2' position of the , primarily supports anabolic processes as NADPH. Unlike NAD, which predominates in , NADPH provides reducing power for biosynthetic reactions, including and synthesis, as well as defense via reduction. The additional group alters substrate specificity, directing NADP to enzymes in the , a major NADPH source. This distinction ensures compartmentalized control, with NADPH maintaining cellular reducing environments distinct from NADH's oxidative roles. Nascent refers to newly synthesized biomolecules that are incomplete or in early stages of formation, such as nascent transcripts attached to during transcription or nascent polypeptide chains emerging from ribosomes. In contexts, nascent captures the primary transcript before processing like splicing or capping, revealing real-time transcription dynamics. For proteins, the nascent chain is the growing polypeptide tethered to at the ribosomal peptidyl transferase center, vulnerable to folding chaperones or targeting factors until release. This term highlights transient intermediates critical for understanding synthesis regulation and quality control in cellular . Nick translation is a technique for labeling DNA probes by incorporating modified nucleotides during repair of single-strand nicks introduced by DNase I, utilizing the 5'→3' exonuclease and polymerase activities of E. coli DNA polymerase I. The enzyme excises nucleotides ahead of the nick while synthesizing new DNA with labeled dNTPs, resulting in uniformly labeled fragments of 200–500 base pairs suitable for hybridization assays. Developed as an in vitro method, it enables high-specific-activity labeling for detecting specific DNA sequences in Southern blots or in situ hybridizations, offering advantages over end-labeling by producing probes with internal labels resistant to exonuclease degradation. Non-canonical amino acids (ncAAs) are non-standard residues beyond the 20 encoded by the universal , incorporated into proteins via engineered orthogonal tRNA/ pairs that reassign stop codons like (UAG). This genetic code expansion allows site-specific insertion of ncAAs with unique chemical properties, such as photocrosslinkers or fluorescent tags, for studying protein function, interactions, or therapeutic modifications. Methods like suppression introduce ncAAs during , enabling over 460 variants (as of 2023) to probe cellular processes without altering the core code.

Nuclear and Cellular Processes

In cellular and molecular biology, nuclear and cellular processes encompass critical mechanisms that ensure the integrity and proper segregation of genetic material during the . These processes include the management of DNA strand breaks, which are essential for repair pathways, and structural changes to the nucleus that facilitate organization and division. Disruptions in these processes can lead to genomic instability, contributing to diseases such as cancer and developmental disorders. Key elements involve single-strand interruptions, error-prone repair of double-strand breaks, and the disassembly of nuclear barriers during , all of which maintain genomic fidelity under physiological stress. A nick refers to a specific type of DNA damage characterized by the breakage of a single between adjacent in one strand of the DNA double helix, leaving the complementary strand intact. This lesion arises from enzymatic activity, , or replication errors and serves as an intermediate in various repair pathways. Unlike double-strand breaks, nicks do not immediately threaten stability but can convert to more severe damage if unrepaired, highlighting their role in initiating targeted repair to preserve sequence fidelity. Nicking enzymes, also known as nicking endonucleases, are specialized restriction enzymes that catalyze the formation of these single-strand breaks at precise DNA sequences without cleaving the opposite strand. These enzymes play a pivotal role in DNA repair initiation by creating controlled nicks that recruit repair factors, such as those involved in or . For instance, nicking endonucleases such as the I-AniI nickase variant have been engineered to stimulate site-specific gene correction in human cells by facilitating at efficiencies up to one-quarter that of double-strand cleaving counterparts. Seminal work has demonstrated that such nicks trigger recombination pathways distinct from those at double-strand breaks, enabling precise repair without extensive end resection. Non-homologous end joining (NHEJ) is a primary pathway for repairing double-strand breaks (DSBs) in DNA, involving the direct ligation of broken ends without requiring a homologous template, which renders it inherently error-prone due to potential nucleotide loss or addition at junctions. The classical NHEJ (C-NHEJ) mechanism relies on the Ku70/Ku80 heterodimer to bind DSB ends, recruiting DNA-PKcs and the XRCC4/LIG4 complex for end processing and sealing, often resulting in small insertions or deletions (indels) that can alter gene function. This pathway predominates in non-dividing cells like neurons and lymphocytes, where its robustness outweighs fidelity concerns, but its error-prone nature contributes to mutagenesis in contexts like V(D)J recombination. Recent analyses confirm that while C-NHEJ can be precise for compatible ends, incompatible ends necessitate nucleolytic processing, with end-joining efficiency decreasing up to 4.5-fold and exhibiting increased error-proneness, such as higher frequencies of imprecise ligations and larger deletions, in aging cells. Nuclear envelope breakdown (NEBD) occurs at the onset of in open , dismantling the double-membrane barrier that separates the nucleoplasm from the to allow spindle access to condensed . This process is orchestrated by of and pore complexes by cyclin-dependent kinases (CDKs) and other regulators, leading to fragmentation of the and diffusion of nuclear contents into the . NEBD ensures efficient chromosome alignment and segregation by enabling kinetochore-microtubule attachments, a feature conserved in metazoans but absent in closed of yeasts. Defects in NEBD timing can delay onset, underscoring its coordination with the spindle assembly checkpoint for faithful division. Non-disjunction describes the failure of homologous chromosomes or sister chromatids to separate properly during meiosis I/II or mitosis, respectively, resulting in daughter cells with abnormal chromosome numbers known as aneuploidy. This segregation error often stems from weakened cohesin complexes, merotelic kinetochore attachments, or spindle checkpoint deficiencies, leading to one cell gaining an extra chromosome (trisomy) and another losing one (monosomy). In humans, meiotic non-disjunction is a leading cause of aneuploidies like Down syndrome (trisomy 21), with maternal age exacerbating risks due to declining oocyte quality. Mitotic non-disjunction, prevalent in cancer cells, drives chromosomal instability and tumor evolution by generating aneuploid progeny that evade apoptosis.

Non-coding Elements and Regulation

Non-coding DNA (ncDNA) comprises the majority of the eukaryotic and includes sequences that do not proteins but are essential for regulating and maintaining genomic structure. These elements encompass introns, which are intervening sequences transcribed into pre-mRNA but removed during splicing to produce mature mRNA, thereby influencing mRNA processing efficiency and patterns. Promoters serve as proximal regulatory regions located near the transcription start site, where they recruit and transcription factors to initiate gene transcription. Enhancers, in contrast, are distal cis-acting DNA sequences that enhance transcription rates by looping to interact with promoters, often functioning regardless of their orientation or distance from the target gene. Non-coding RNAs (ncRNAs) represent a diverse class of transcripts that lack protein-coding potential yet perform critical cellular functions, including structural roles in protein synthesis and . Transfer RNAs (tRNAs) act as adaptors during , recognizing mRNA codons via their anticodon loops and delivering corresponding to the for polypeptide chain elongation. Ribosomal RNAs (rRNAs) form the core components of , providing both structural scaffolding and catalytic activity to facilitate formation between . Regulatory ncRNAs, such as small interfering RNAs (siRNAs), mediate by incorporating into the (RISC), where they guide the sequence-specific cleavage and degradation of complementary target mRNAs, thereby silencing . Negative control mechanisms in gene regulation rely on repressor proteins that bind to operator sequences in DNA, thereby blocking RNA polymerase access and inhibiting transcription under non-permissive conditions. In prokaryotic systems, such as the trp operon, the repressor binds to the operator in the presence of tryptophan, halting transcription of biosynthetic genes to prevent unnecessary synthesis. This repression can be relieved by corepressor release or inducer binding, allowing dynamic control of gene expression in response to environmental cues.

O

Organelle Functions and Dynamics

Organelles represent specialized, often membrane-enclosed compartments within eukaryotic cells that execute discrete physiological roles, enabling compartmentalization of cellular processes for efficiency and regulation. These structures, such as the nucleus, mitochondria, and lysosomes, exhibit dynamic behaviors including biogenesis, trafficking, fusion, and fission, which adapt to cellular needs like demands or stress responses. In the context of , mitochondria harness oxidative processes to generate ATP, while lysosomes handle waste degradation through hydrolytic breakdown. Nuclear dynamics, involving , further support transcriptional control essential for . An is defined as a subcellular structure, typically membrane-bound in eukaryotes, that performs one or more dedicated functions analogous to organs in a multicellular body, such as protein synthesis in ribosomes or in peroxisomes. Lysosomes exemplify waste management organelles, acting as the cell's primary digestive units by fusing with endocytic vesicles or autophagosomes to degrade macromolecules, pathogens, and damaged organelles via over 50 acid hydrolases maintained in a low-pH lumen (around 4.5–5.0). This catabolic activity recycles , , and , preventing toxic accumulation and supporting nutrient ; disruptions in lysosomal function underlie diseases like lysosomal storage disorders. dynamics, such as microtubule-mediated transport of lysosomes to peripheral sites during , ensure targeted degradation and cellular resilience. Oxidative phosphorylation constitutes the terminal stage of aerobic respiration, coupling in the mitochondrial inner membrane to ATP production, which accounts for up to 90% of a cell's needs under oxygen-replete conditions. The process unfolds across the (complexes I–IV), where NADH and FADH₂ donate electrons to reduce oxygen to water, simultaneously pumping protons into the to establish an (Δψ ≈ 150–180 mV). This proton motive force powers (complex V), which rotates to synthesize ATP from ADP and inorganic phosphate as protons re-enter the matrix, yielding approximately 2.5–3 ATP per NADH oxidized. Seminal work by Peter Mitchell established this chemiosmotic mechanism in 1961, revolutionizing by linking directly to without high-energy intermediates. Mitochondrial dynamics, including fusion and fission, modulate OXPHOS efficiency by distributing respiratory complexes and responding to metabolic shifts. Open , synonymous with , denotes transcriptionally permissive genomic regions where DNA is loosely associated with , allowing access by regulatory proteins and enhancing in active loci. This state arises from histone modifications like and H3/H4 , which neutralize positive charges on histone tails, destabilizing nucleosome-DNA interactions and promoting nucleosome eviction or sliding for increased . occupies peripheral nuclear zones in , contrasting with compact , and its dynamics—facilitated by ATP-dependent remodelers like —enable rapid transcriptional responses to signals, such as during development or stress. In the nucleus, an orchestrating genetic information flow, open chromatin ensures selective activation of housekeeping and inducible genes, with assays like quantifying genome-wide.

Operator and Operon Systems

In prokaryotic gene regulation, an is defined as a functional unit of DNA containing a cluster of genes under the control of a single promoter, allowing coordinated transcription of those genes into a polycistronic mRNA. This organization enables to efficiently respond to environmental changes by expressing multiple related proteins simultaneously from one transcriptional event, as proposed in the foundational model by Jacob and Monod. Operons typically include structural genes encoding enzymes or proteins for a specific , flanked by regulatory elements such as the promoter and terminator sequences. The operator is a short DNA sequence located adjacent to the promoter, serving as the binding site for regulatory proteins that modulate transcription initiation. In the classic operon model, the operator acts as a switch: when a repressor protein binds to it, RNA polymerase is sterically hindered from proceeding to transcribe the downstream genes, thereby repressing gene expression. This binding is reversible and often modulated by small molecules, such as metabolites, which alter the repressor's affinity for the operator DNA. Operators are typically 15–20 base pairs long and exhibit specific sequence motifs that match the DNA-binding domains of repressors. A paradigmatic example is the in , which regulates the metabolism of . This consists of three structural genes—lacZ (encoding ), lacY (encoding lactose permease), and lacA (encoding thiogalactoside transacetylase)—controlled by a single promoter and operator. In the absence of , the LacI protein binds to the operator, preventing transcription; however, when is present, its isomer binds to LacI, causing a conformational change that releases the from the operator and allows to initiate transcription. This inducible system exemplifies negative regulation, where the default state is repression relieved by an inducer. Another key example is the trp operon, which controls tryptophan biosynthesis in E. coli and demonstrates repressible regulation. The operon includes five structural genes (trpE through trpA) transcribed as a single mRNA, with the operator serving as the binding site for the TrpR repressor. When tryptophan levels are high, it binds to TrpR, activating the repressor's affinity for the operator and halting transcription to conserve resources; low tryptophan levels dissociate the complex, permitting expression. These operon systems highlight the elegance of prokaryotic transcriptional control, where operator-repressor interactions fine-tune gene expression in response to nutrient availability.

Origin and Replication Initiation

The is a specific DNA sequence where the duplication of the begins, serving as the starting point for bidirectional in both prokaryotes and eukaryotes. In prokaryotes, such as , the primary origin, known as oriC, consists of repeated DnaA-binding boxes flanked by an AT-rich DNA unwinding element (DUE) that facilitates initial strand separation. This AT-rich region, typically 13 base pairs long with three 13-mer repeats, lowers the melting temperature of the DNA double , allowing initiator proteins to access single-stranded DNA for loading. In eukaryotes, origins are more diverse and less sequence-specific, often associated with AT-rich motifs recognized by the (ORC), which recruits additional factors like Cdc6 and Cdt1 to load the MCM . Replication initiation at the origin involves the assembly of a (pre-RC) that prepares the DNA for unwinding and polymerase engagement. In , protein, in its ATP-bound form, oligomerizes on oriC to form a helical filament that bends and distorts the DNA, exposing the AT-rich DUE for melting by . This process, first biochemically reconstituted in 1981 using purified enzymes, ensures precise once-per-cell-cycle firing to maintain genome stability. Eukaryotic initiation similarly begins with binding to AT-rich or nucleosome-depleted regions, followed by MCM2-7 double hexamer loading during , which is activated in by kinases like CDK and DDK to form the CMG for fork progression. These mechanisms prevent re-replication, with regulatory elements like geminin in metazoans inhibiting premature MCM loading. Okazaki fragments are short, discontinuous DNA segments synthesized on the lagging strand during replication, arising because can only extend in the 5' to 3' direction while the template strand runs antiparallel to the replication fork. Discovered in 1968 through pulse-labeling experiments in E. coli, these fragments, typically 1000–2000 in and 100–200 in eukaryotes, are primed by RNA polymerases (primases) and later joined by after flap endonuclease processing. Their formation ensures efficient lagging strand synthesis starting from the origin, where initial primers are laid near the unwound AT-rich site. This discontinuous mode complements the continuous leading strand synthesis, enabling the replication fork—detailed under replication mechanisms—to advance bidirectionally from the origin.

P

Protein Synthesis and Modification

Protein synthesis begins with the formation of bonds, which are covalent linkages between the carboxyl group of one and the amino group of another, created during the process on ribosomes. This eliminates a molecule and links sequentially, forming the backbone of proteins. The resulting chain, known as a polypeptide, consists of a linear of connected by these bonds, serving as the initial product of before further maturation. The primary structure of a protein refers to this exact linear sequence of in the polypeptide chain, determined by the and critical for dictating higher-order structures and function. Once synthesized, the polypeptide undergoes , a process where the chain adopts its functional three-dimensional conformation, often assisted by molecular chaperones that prevent aggregation and guide the nascent chain to its native state. Chaperones, such as and , bind to hydrophobic regions of the unfolding polypeptide, stabilizing intermediates and facilitating ATP-dependent release upon correct folding. Post-translational modifications (PTMs) occur after , involving covalent alterations to the polypeptide, such as , which adds groups to serine, , or residues, or , which attaches moieties to or other sites, thereby regulating protein activity, localization, stability, and interactions. These modifications diversify protein function without altering the primary sequence and are essential for cellular signaling and quality control. Many enzymes are produced as inactive precursors called pro-enzymes or zymogens, which contain an additional pro-peptide segment that inhibits activity until proteolytic cleavage removes it, preventing premature or uncontrolled catalysis. For instance, digestive enzymes like trypsinogen are secreted as zymogens and activated in the gut by specific proteases. Proteases, also known as peptidases, are enzymes that hydrolyze peptide bonds within proteins, either internally (endopeptidases) or at termini (exopeptidases), playing key roles in digestion, signaling, and protein turnover. Quality control during protein synthesis includes degradation of misfolded or damaged polypeptides via the , a large multi-subunit complex that recognizes ubiquitin-tagged proteins and degrades them into short peptides using ATP-dependent . Ubiquitination marks proteins for delivery to the 26S , where the core particle performs the cleavage, ensuring cellular and preventing toxic aggregates.

Phosphorylation and Signaling Pathways

Phosphorylation serves as a fundamental in cellular and molecular biology, enabling rapid and reversible regulation of protein function within signaling pathways. This process involves the covalent attachment of a group, typically to , , or residues on proteins, which can alter their activity, localization, or interactions. In , phosphorylation acts as a , propagating extracellular signals into intracellular responses that influence processes such as , differentiation, and . Kinases and phosphatases play antagonistic roles in this dynamic equilibrium, with kinases catalyzing addition and phosphatases facilitating its removal to ensure precise temporal control of signaling cascades. A is an that catalyzes the transfer of the γ-phosphate group from (ATP) to specific residues on target proteins, thereby modulating their enzymatic activity or conformational state. This event often activates or inactivates downstream effectors, amplifying signals in pathways like the (MAPK) cascade. For instance, in response to growth factors, receptor kinases phosphorylate adaptor proteins, initiating a kinase relay that culminates in nuclear activation. The encodes over 500 protein kinases, underscoring their prevalence in eukaryotic signaling networks. Phosphorylation itself refers to the kinase-mediated addition of a group, which can lead to by promoting conformational changes that expose active sites or enhance substrate binding, or inactivation by inducing inhibitory interactions. In metabolic regulation, for example, of by inactivates the , halting synthesis during states. Conversely, in progression, activates transcription factors essential for . This duality allows to fine-tune signaling fidelity, preventing aberrant pathway that could contribute to diseases like cancer. Counterbalancing kinases, a phosphatase is an that hydrolyzes phosphate esters, removing phosphate groups from phosphorylated proteins and thereby reversing kinase actions to terminate or attenuate signals. Protein phosphatases, such as protein phosphatase 1 (PP1) and PP2A, dephosphorylate key substrates in insulin signaling, restoring glucose uptake homeostasis after stimulation. Dysregulation of phosphatase activity, as seen in knockout models, disrupts kinase-phosphatase integration, leading to prolonged signaling and cellular dysfunction. These enzymes ensure signaling specificity by acting on discrete phosphosites, with over 100 phosphatase genes identified in mammals. In the context of signaling reception, phospholipids form the structural backbone of cell membranes and serve as critical platforms for signal initiation, where they recruit and activate upon binding to receptors. Phosphatidylinositol lipids, for instance, undergo to generate second messengers like (PIP2), which anchors signaling complexes at the plasma membrane. In G-protein coupled receptor pathways, phospholipid hydrolysis by liberates diacylglycerol, activating and propagating calcium-mediated signals. This lipid environment not only facilitates kinase docking but also compartmentalizes transduction events to maintain spatial precision. The links in signaling molecules such as (), a key second messenger derived from ATP via activation in response to hormones like . This bond's by phosphodiesterases (PDEs) terminates cAMP signaling, converting it to inactive 5'-AMP and preventing unchecked activation of , which phosphorylates targets in metabolic and transcriptional pathways. PDEs compartmentalize cAMP gradients within cells, ensuring localized control of downstream events. Downstream of phosphorylation cascades, signaling often converges on transcriptional regulation, where positive control mechanisms enhance gene expression through activator proteins that bind DNA and recruit RNA polymerase. Phosphorylation activates these transcription factors, such as CREB in cAMP-responsive pathways, by increasing their affinity for co-activators and promoter regions, thereby boosting transcription rates. This contrasts with negative control by repressors and allows cells to amplify mitogenic signals into proliferative responses. The promoter is a proximal DNA sequence upstream of genes that serves as the for and associated factors to initiate transcription, with its activity frequently modulated by of regulatory proteins. For example, of STAT proteins by Janus kinases enables their dimerization and binding to promoter elements, driving interferon-responsive . Promoter architecture, including TATA boxes and enhancers, integrates these signals to dictate tissue-specific transcription levels.

Plasmids and Genetic Vectors

Plasmids are small, extrachromosomal, circular DNA molecules that replicate independently of the chromosomal DNA in bacteria and some other microorganisms, often carrying genes that confer advantages such as antibiotic resistance or metabolic capabilities. These molecules serve as fundamental tools in molecular biology for genetic engineering, enabling the cloning and expression of foreign DNA in host cells. Genetic vectors, including plasmids, facilitate the introduction and propagation of recombinant DNA constructs, with their design incorporating elements like origins of replication, selectable markers, and multiple cloning sites to support stable maintenance and manipulation. In cloning procedures, the refers to the inserted DNA fragment of interest that is ligated into a vector, distinguishing it from the vector's backbone sequences, which provide the necessary replication and selection functions. This passenger DNA is typically amplified and expressed within the host, allowing researchers to study function or produce proteins. A specialized type of vector, the phagemid, combines features of plasmids and bacteriophages, featuring a plasmid replication origin for propagation in and phage packaging signals that enable single-stranded DNA production for applications like libraries. Phagemids are particularly useful in for generating high-diversity or libraries, as they allow efficient rescue and propagation via helper phages. To ensure stable inheritance, plasmid partitioning mechanisms actively segregate plasmids to daughter cells during bacterial division, involving centromere-like DNA sites (parS) and proteins such as ParA (an ) and ParB (a ) that drive directed movement and equal distribution. These systems, often encoded by par operons on low-copy plasmids, prevent loss that could occur due to random segregation, maintaining plasmid copy numbers as low as 1-2 per cell. Disruptions in partitioning can lead to plasmid instability, impacting long-term experiments. A key consideration in vector design is the position effect, where the relocation of a within a vector or alters its expression levels due to influences from surrounding regulatory elements, structure, or proximity, potentially causing silencing or variegated expression. In genetic vectors, this effect is mitigated by incorporating insulators or using site-specific integration to preserve consistent activity, as random insertion sites can variably impact promoter strength and accessibility.

Ploidy and Chromosomal Variations

Ploidy refers to the number of complete sets of in a cell's nucleus, a fundamental aspect of cellular that influences organismal development, , and . In eukaryotic cells, the baseline ploidy level is diploid (2n), where cells contain two homologous sets of , one inherited from each parent. Variations in ploidy arise from errors in or evolutionary adaptations, leading to conditions such as (abnormal numbers) or (multiple complete sets). These variations can have profound effects, from causing developmental disorders in animals to conferring advantages like increased vigor in plants. Chromosomal variations also encompass structural features of chromosomes that relate to ploidy stability, particularly in how centromeres organize during and . The is the region where the assembles, ensuring proper chromosome segregation; its position influences chromosome behavior in dividing cells. Disruptions in ploidy or structure often stem from events like non-disjunction during , where chromosomes fail to separate correctly. Metacentric chromosomes are characterized by a centrally located , resulting in two arms of approximately equal length, which promotes balanced segregation during . This configuration is common in many species and helps maintain genomic stability by allowing even distribution of genetic material. For instance, human is metacentric, facilitating its role in anchoring the symmetrically. Monocentric chromosomes possess a single, well-defined , typically functional for attachment in one location, distinguishing them from holocentric chromosomes found in some and . This single centromere ensures precise bipolar attachment during , preventing missegregation that could lead to changes. Monocentric organization is the predominant form in vertebrates and higher , supporting efficient congression to the plate. Monoploid, also known as haploid (n), describes a cell or with a single complete set of chromosomes, representing the basic genomic content without homologs. In , monoploid individuals can be generated from diploid ancestors via techniques like anther culture, providing a tool for breeding pure lines free of heterozygosity. This level is essential in gametes for , where fusion restores diploidy, and it underlies the study of minimal genome requirements in model organisms like . Monosomy occurs when a diploid lacks one from a homologous pair, resulting in a 2n-1 configuration and severe genetic imbalance due to hemizygosity for genes on the missing . In humans, partial monosomy 21 exemplifies this, often leading to developmental delays, though full autosomal monosomies are typically lethal. This variation highlights the dosage sensitivity of the , where the absence of a single disrupts critical networks. Polyploid cells or organisms contain more than two complete sets of chromosomes, such as triploid (3n) or tetraploid (4n), often arising from fertilization errors or somatic doubling. is rare and usually inviable in animals but prevalent in , where it drives and hybrid vigor; for example, (Triticum aestivum) is hexaploid (6n), combining genomes from three ancestral to enhance adaptability. This variation alters , influencing traits like cell size and fertility, and is a key mechanism in . Polysomy refers to the presence of extra copies of one or more specific chromosomes beyond the normal complement, such as (three copies) or tetrasomy (four copies), a form of that contrasts with euploid by affecting only subsets of the . In humans, (Down syndrome) is the most common viable polysomy, resulting from an extra and causing due to overexpressed genes in the "Down syndrome critical region." Polysomy often originates from and can be studied in model systems like to understand effects on cellular function.

Q

Quantitative Genetics and Traits

Quantitative genetics is the study of the inheritance of continuous phenotypic traits in populations, where variation arises from the combined effects of multiple genetic loci and environmental influences, rather than discrete Mendelian factors. This field reconciles Mendelian genetics with biometrical observations of trait distributions, such as height or yield, which follow a due to polygenic control and gene-environment interactions. Ronald Fisher laid the foundational theoretical framework in 1918 by demonstrating how could produce continuous variation through additive effects of many genes, enabling statistical analysis of and breeding values. Multifactorial inheritance, a core concept in , describes traits influenced by multiple genes (polygenes) each with small effects, alongside non-genetic factors like or stress, leading to a for susceptibility or trait extremes. For instance, exemplifies this, where genome-wide association studies (GWAS) have identified hundreds of loci contributing to variance, yet environmental factors account for up to 20-40% of total variation in well-nourished populations. estimates, calculated as the proportion of phenotypic variance attributable to genetic variance, range from 0.4 to 0.8 for such traits, guiding applications in and . A (QTL) refers to a genomic region containing one or more genes that contribute to variation in a quantitative trait, often identified through linkage mapping in experimental crosses or association studies in natural populations. QTL analysis integrates phenotypic measurements with genotypic markers, such as restriction fragment length polymorphisms (RFLPs), to detect chromosomal intervals associated with trait differences, as pioneered by Lander and Botstein in 1989 for backcross or recombinant inbred populations. In crops like , early QTL mapping revealed loci for fruit weight, each explaining 10-30% of variance, facilitating without exhaustive sequencing. Modern extensions, including high-resolution fine-mapping, refine QTL boundaries to candidate genes, enhancing precision breeding for complex traits.

Quorum Sensing and Cell Communication

Quorum sensing (QS) is a cell-to-cell communication process in that enables population-density-dependent through the production and detection of diffusible signaling molecules known as autoinducers. This mechanism allows unicellular to coordinate behaviors such as , production, and formation, effectively behaving as a multicellular entity when cell density reaches a critical threshold. The phenomenon was first observed in the marine bacterium fischeri, where light emission intensifies in dense populations within host light organs, a discovery attributed to the autoinduction of synthesis. In , the most studied QS systems involve acyl-homoserine s (AHLs) as autoinducers, synthesized by LuxI-type enzymes using S-adenosylmethionine and acyl-acyl carrier proteins. These small, diffusible molecules accumulate extracellularly as bacterial density increases; once a threshold is met, AHLs bind to cognate LuxR-type transcriptional regulators, forming complexes that activate or repress target genes. For instance, in V. fischeri, the autoinducer N-3-oxohexanoyl-homoserine binds LuxR to induce the lux operon, driving only at high densities to avoid energy waste in dilute environments. Similarly, in the opportunistic Pseudomonas aeruginosa, multiple AHL-based systems (e.g., Las and Rhl) coordinate virulence factors like and production, enhancing infection in patients. Gram-positive bacteria employ different autoinducers, primarily small peptides (autoinducing peptides, AIPs) that are exported and often modified post-translationally. These peptides bind to membrane-bound histidine kinase receptors, triggering a phosphorelay cascade that activates intracellular response regulators and modulates gene expression. A classic example is the competence-stimulating peptide in Streptococcus pneumoniae, which at high densities induces DNA uptake machinery for genetic transformation. Across both Gram-negative and Gram-positive systems, QS ensures synchronized responses to environmental cues, such as nutrient availability or host proximity, underscoring its role in bacterial adaptation and pathogenesis.

R

Replication and Repair Mechanisms

In DNA replication, the process ensures accurate duplication of genetic material through coordinated enzymatic activities at specific sites. A replicon represents the fundamental unit of DNA replication, defined as a segment of the that is duplicated from a single , allowing cells to regulate replication timing and frequency during the . This model, originally proposed by , Brenner, and Cuzin, posits that each replicon operates independently, with initiation controlled by replicator sequences and associated proteins to prevent over-replication. In eukaryotes, replicons vary in size, typically spanning 30–500 kilobase pairs, enabling multiple origins to fire sequentially for efficient duplication. The is the multiprotein complex that assembles at the replication origin and progresses along the DNA, orchestrating strand separation and synthesis. It includes core components such as the CMG (comprising Cdc45, MCM2-7, and GINS in eukaryotes), DNA polymerases (Pol δ and Pol ε), , and accessory factors like single-stranded DNA-binding proteins that stabilize unwound DNA. The replisome's structure enables coupled leading-strand synthesis (continuous) and lagging-strand synthesis (discontinuous, producing short ), with activity driving fork progression at rates of up to 1000 per second in and approximately 50 per second in eukaryotes. Cryo-electron microscopy studies reveal dynamic interactions within the replisome, such as tethering to the core, which coordinate bidirectional replication while minimizing errors. Central to this machinery is the replication fork, a Y-shaped structure formed as the double helix unwinds, exposing single-stranded templates for new strand synthesis. Helicase enzymes, such as DnaB in bacteria or MCM in eukaryotes, use ATP hydrolysis to separate strands at the fork apex, generating positive supercoils ahead that are relieved by topoisomerases. The fork progresses bidirectionally from origins, with leading-strand polymerase synthesizing continuously in the 5' to 3' direction, while the lagging strand requires RNA primers for discontinuous extension. This asymmetry ensures semi-conservative replication, where each daughter molecule retains one parental strand, and fork stalling can trigger repair pathways to maintain genomic integrity. Reverse transcriptase is a specialized RNA-dependent that synthesizes (cDNA) from an template, reversing the typical transcription flow and enabling retroviral genome integration. Discovered independently by and by Temin and Mizutani in 1970, the enzyme from viruses like catalyzes using a tRNA initiator, producing a DNA- hybrid that is later processed into double-stranded DNA. Its mechanism involves two active sites: polymerase for nucleotide addition and RNase H for hybrid degradation, allowing template switching and long-terminal repeat formation essential for proviral integration. Beyond viruses, reverse transcriptase homologs in eukaryotes, such as telomerase's RNA component, facilitate maintenance. Supplying the deoxyribonucleotide triphosphates (dNTPs) necessary for replication is , the sole catalyzing the reduction of ribonucleoside diphosphates (NDPs) or triphosphates (NTPs) to (dNDPs or dNTPs). operates via a radical-based mechanism, generating a tyrosyl radical in its R1 subunit that propagates to the substrate's 2'-position for hydrogen abstraction, with class I s (common in eukaryotes) using a di-iron cofactor for radical persistence. by dNTPs and ATP ensures balanced pools, preventing from imbalances; for instance, dATP inhibits the to halt replication during stress. This rate-limiting step couples dNTP production to demands, with activity peaking in to support fork progression. In contrast to bidirectional theta replication of chromosomal DNA, rolling circle replication is a unidirectional, theta-independent mechanism used by certain plasmids and viruses to generate linear concatemers from a circular template. Initiation occurs via a nick in one strand by a plasmid-encoded Rep protein, which acts as a primer for 5' to 3' extension, displacing the nicked strand and producing single-stranded DNA that is later replicated to form multimeric circles. This mode, prevalent in small staphylococcal plasmids, allows high-copy number maintenance without host origins and includes termination via second-strand synthesis and resolution by . It contrasts with theta replication by avoiding bidirectional forks, instead elongating a tail that can exceed the for packaging in phage.

Ribosomes and RNA Processing

Ribonuclease (RNase) refers to a class of enzymes that catalyze the degradation of molecules, playing crucial roles in such as maturation, turnover, quality control, and regulatory processes in both prokaryotic and eukaryotic cells. These enzymes include endoribonucleases, which cleave internal phosphodiester bonds, and exoribonucleases, which remove from the ends of RNA strands, thereby maintaining cellular RNA and preventing the accumulation of aberrant transcripts. For instance, pancreatic RNase A, an endonuclease, specifically cleaves single-stranded RNA after residues, contributing to RNA processing and digestion. Ribonucleoprotein (RNP) denotes a complex formed by molecules bound to proteins, which facilitates various cellular functions including RNA transport, stability, and catalysis; a prominent example is the , a large RNP assembly essential for pre-mRNA splicing in eukaryotes. In the spliceosome, small nuclear RNPs (snRNPs) consisting of small nuclear RNAs (snRNAs) and associated proteins recognize splice sites and mediate excision, highlighting the structural and functional integration of RNA and protein components in dynamic macromolecular machines. RNPs like these are also involved in and other RNA-processing pathways, underscoring their versatility in regulation. Ribosomal RNA (rRNA) is a type of that constitutes the core structural and functional element of ribosomes, the cellular organelles responsible for protein synthesis. In prokaryotes, ribosomes contain 16S, 23S, and 5S rRNAs, while eukaryotic ribosomes feature 18S, 28S, 5.8S, and 5S rRNAs, which fold into complex secondary and tertiary structures that provide scaffolds for ribosomal proteins and catalytic sites for . These rRNA molecules not only stabilize the ribosome's architecture but also contribute directly to peptidyl transferase activity, ensuring accurate decoding of mRNA and tRNA positioning during protein elongation. Modifications on rRNA, such as pseudouridylation and 2'-O-methylation, further enhance ribosomal fidelity and efficiency. The ribosome is a ribonucleoprotein complex that serves as the primary site for protein synthesis, catalyzing the formation of peptide bonds between amino acids during translation of mRNA into polypeptide chains. Composed of a small subunit (e.g., 30S in prokaryotes or 40S in eukaryotes) and a large subunit (50S or 60S), the ribosome's rRNA components form the peptidyl transferase center in the large subunit, where the ribosomal RNA itself acts as a ribozyme to accelerate peptide bond formation without requiring protein-based catalysis. This process involves the A-site accommodating incoming aminoacyl-tRNA, the P-site holding peptidyl-tRNA, and translocation to the E-site for exit, enabling rapid synthesis rates exceeding 20 amino acids per second in vivo. Ribosomes are highly conserved across life forms, reflecting their ancient evolutionary origin and indispensable role in cellular function. A is an molecule capable of catalyzing specific biochemical reactions, much like protein enzymes, thereby demonstrating RNA's dual role as both genetic material and catalyst. Discovered in the , ribozymes include self-splicing introns in group I and II introns, which excise themselves from pre-rRNA or pre-mRNA without protein assistance, and RNase P, a ribozyme that processes tRNA precursors by cleaving leader sequences. These catalytic RNAs often rely on magnesium ions for folding into active conformations that stabilize transition states, supporting the hypothesis where RNA preceded proteins in early evolution. Beyond natural examples, engineered ribozymes have applications in biotechnology, such as targeted RNA cleavage for gene silencing. RNA polymerase is the responsible for synthesizing strands from a DNA template during transcription, reading the DNA in a 3' to 5' direction while polymerizing ribonucleotides in a 5' to 3' orientation to produce complementary RNA. In prokaryotes, a single handles all transcription, whereas eukaryotes possess three distinct types: synthesizes rRNA, produces mRNA and some non-coding RNAs, and transcribes tRNA and 5S rRNA. The core , often associated with sigma factors in bacteria or general transcription factors in eukaryotes, initiates at promoter regions, elongates through nucleotide addition at rates up to 50-100 nucleotides per second, and terminates at specific signals, ensuring precise control. RNA splicing is the post-transcriptional process in eukaryotes whereby introns—non-coding sequences—are precisely removed from pre-mRNA, and exons—coding sequences—are joined together by the to generate mature mRNA for . The , composed of U1, U2, U4, U5, and U6 snRNPs along with numerous proteins, assembles stepwise on the pre-mRNA, recognizes conserved 5' and 3' splice sites and sequences, and catalyzes two reactions: first forming a lariat intermediate with the , then ligating the exons. This mechanism allows for , enabling a single to produce multiple protein isoforms, which vastly expands proteomic diversity from a limited . Errors in splicing can lead to diseases, emphasizing its regulatory importance in cellular biology.

Restriction Enzymes and Cloning Techniques

Restriction enzymes, also known as restriction endonucleases, are proteins produced by bacteria that cleave DNA at specific recognition sequences, typically 4- to 8-base-pair palindromic sites, as a defense mechanism against invading bacteriophages. These enzymes recognize double-stranded DNA and hydrolyze phosphodiester bonds, generating either blunt or sticky (cohesive) ends depending on the enzyme type; for example, EcoRI cuts at GAATTC, producing 5' overhangs that facilitate ligation in cloning. Discovered in the 1960s and 1970s by Werner Arber, Hamilton Smith, and Daniel Nathans, who shared the 1978 Nobel Prize in Physiology or Medicine for this work, restriction enzymes revolutionized molecular biology by enabling precise DNA manipulation. Type II restriction enzymes, such as those from Escherichia coli, are most commonly used in cloning due to their specificity and requirement for only Mg²⁺ as a cofactor, without needing ATP. A restriction site refers to the specific nucleotide sequence where a restriction enzyme binds and cleaves DNA, often symmetrical and palindromic to allow recognition by the enzyme's dimeric structure. These sites are naturally occurring but can be engineered into vectors for targeted cutting; for instance, the HindIII site AAGCTT is widely used in plasmid construction. In cloning techniques, multiple restriction sites are incorporated into plasmids—circular DNA vectors that replicate independently in host cells—to allow insertion of foreign DNA fragments. Restriction fragment length polymorphism (RFLP) is a technique that detects variations in DNA sequence by digesting genomic DNA with restriction enzymes and analyzing the resulting fragment sizes via , revealing polymorphisms due to mutations that create or eliminate restriction sites. First described in 1980, RFLP was pivotal in early and linkage analysis, such as mapping the locus, though it has largely been supplanted by PCR-based methods due to its labor-intensive nature. In practice, Southern blotting follows digestion to hybridize probes to specific fragments, enabling identification of alleles with differing lengths; for example, a (SNP) altering a site can shift fragment size from 5 kb to 3 kb. A restriction map is a diagrammatic representation of a DNA molecule's restriction sites, ordered by position and distance, constructed by comparing fragment patterns from single and double digests with various enzymes. This mapping is essential for cloning strategies, as it identifies suitable sites for inserting genes without disrupting essential sequences; for instance, in the pBR322 plasmid, a restriction map guides the replacement of antibiotic resistance genes with recombinant inserts. Generated using software or traditional gel analysis, restriction maps provide a blueprint for subcloning and verifying constructs, with seminal work in the 1970s enabling the first recombinant DNA molecules. Reverse genetics approaches leverage restriction enzymes to study gene function by deliberately disrupting or altering specific DNA sequences, contrasting with forward genetics that starts from phenotypes. Techniques include inserting restriction sites via site-directed mutagenesis to create knockouts or using enzymes to excise genes from vectors for targeted integration; for example, CRISPR-Cas9 has integrated restriction-like precision, but classical reverse genetics relied on enzymes like BamHI to generate loss-of-function mutants in model organisms. This method, formalized in the , accelerated by allowing hypothesis-driven testing, such as disrupting oncogenes to assess tumorigenicity.

S

Signaling and Second Messengers

In cellular and molecular biology, signaling pathways enable cells to respond to external stimuli through a series of molecular interactions that transmit information from the cell surface to intracellular targets. encompasses the receptor-mediated detection of ligands, such as hormones or growth factors, followed by activation of intracellular cascades that culminate in effector responses like changes or metabolic adjustments. These pathways often rely on second messengers, small diffusible molecules that amplify the initial signal by activating downstream enzymes and altering cellular activities. Second messengers play a pivotal role in signal amplification and diversification. Cyclic adenosine monophosphate (cAMP), identified by Earl W. Sutherland in the 1950s, functions as a classic second messenger in response to (GPCR) activation by hormones like epinephrine. Upon ligand binding, GPCRs stimulate to convert ATP into cAMP, which then binds to and activates (PKA), leading to of target proteins and regulation of processes such as breakdown in liver cells. This mechanism allows a single receptor activation to generate thousands of cAMP molecules, thereby propagating the signal efficiently across the cell. Calcium ions (Ca²⁺) represent another fundamental second messenger, orchestrating rapid and versatile responses to diverse stimuli including neurotransmitters and mechanical stress. In unstimulated cells, cytosolic Ca²⁺ levels are maintained low (~100 nM) by pumps and stores in the ; upon receptor activation—often via generating inositol 1,4,5-trisphosphate (IP₃)—Ca²⁺ is released, raising concentrations to micromolar levels. This transient elevation binds to proteins like , activating kinases and phosphatases that control functions from synaptic transmission to cell motility, with spatiotemporal patterns encoding signal specificity. A silencer is a cis-regulatory DNA element that represses transcription by providing binding sites for proteins, thereby fine-tuning spatial and temporal expression patterns. Unlike enhancers, which promote activation, silencers recruit corepressors or histone-modifying complexes to compact and inhibit recruitment, often acting over long distances via looping interactions. For instance, in developmental genes, silencers bound by factors like Polycomb group proteins ensure tissue-specific silencing, preventing . The small ubiquitin-like modifier () contributes to signaling through post-translational conjugation, a reversible process termed SUMOylation that modulates protein interactions and localization without targeting for degradation, unlike ubiquitination. , a ~11 kDa protein, is activated by the (SAE1/SAE2), transferred to the E2 conjugase Ubc9, and attached to residues on targets via ligases, influencing pathways such as activation in immune responses. This modification stabilizes transcription factors or alters their nuclear import, thereby integrating SUMOylation into broader networks for adaptive cellular decisions.

Splicing and RNA Maturation

Splicing and RNA maturation are essential post-transcriptional processes that transform pre-mRNA into mature mRNA, enabling proper in eukaryotic cells. These modifications, including 5' capping, intron splicing, 3' , and , occur co- or post-transcriptionally and are critical for mRNA stability, nuclear export, translation efficiency, and regulation of protein diversity. Disruptions in these processes can lead to diseases such as and certain cancers, highlighting their biological importance. The is a large, dynamic ribonucleoprotein complex that catalyzes the precise excision of s from pre-mRNA through two reactions, joining exons to form mature mRNA. Composed of five small nuclear ribonucleoproteins (snRNPs) and over 100 associated proteins, the assembles stepwise on pre-mRNA, recognizing conserved splice sites at intron boundaries. This machinery ensures accurate splicing, with the major handling U2-type introns (the majority in eukaryotes) and the minor processing rare U12-type introns. Seminal studies identified the 's catalytic role, earning Phillip Sharp and Richard Roberts the 1993 in or for discovering split genes and RNA splicing.80568-1) Small nuclear RNAs (snRNAs), such as U1, U2, U4, U5, and U6, form the core of snRNPs and drive function by base-pairing with pre-mRNA splice sites and facilitating conformational changes. For instance, U1 snRNA binds the 5' splice site early in assembly, while snRNA recognizes the ; U4/U5/U6 tri-snRNP then integrates to activate . These snRNAs, transcribed by and modified with trimethylguanosine caps, are highly conserved and essential for splicing fidelity. Mutations in snRNA genes, like RNU4-2 or RNU2-2, cause neurodevelopmental disorders by impairing spliceosome assembly. Polyadenylation involves the addition of a poly(A) tail, a stretch of 50–250 residues to the 3' end of pre-mRNA, which enhances mRNA stability by protecting against exonucleases and promoting via interactions with poly(A)-binding proteins (PABPs). This process is mediated by the cleavage and specificity factor (CPSF) complex, which cleaves pre-mRNA at a polyadenylation signal (AAUAAA) and recruits poly(A) polymerase to add the tail. Poly(A) tail length dynamically regulates mRNA decay; shortening via deadenylation triggers degradation, while elongation stabilizes transcripts. In mammalian cells, polyadenylation is coupled to transcription termination, ensuring efficient 3' end formation.01137-6) RNA editing, particularly adenosine-to-inosine (A-to-I) , is a that alters mRNA sequence without changing the , often expanding diversity. Catalyzed by enzymes, A-to-I editing recodes codons (e.g., to ) or affects splicing and miRNA binding; it occurs predominantly in Alu repeats and is vital for neuronal function and immune evasion. For example, editing of GluA2 mRNA ensures calcium impermeability in receptors. Dysregulated editing, as in ADAR1 overexpression, promotes cancer by stabilizing oncogenic transcripts. Alternative splicing generates splice variants by varying inclusion, producing multiple mRNA isoforms from one and thus increasing protein diversity—up to 95% of human multiexon undergo this. Regulated by splicing factors like and hnRNPs, it allows tissue-specific expression; for instance, the yields variants with distinct functions. Aberrant alternative splicing contributes to diseases, such as inclusion of 7 in SMN2 causing . Genome-wide analyses reveal splicing variants' role in development and evolution.00817-8) Capping adds a 7-methylguanosine (m7G) structure to the 5' end of nascent pre-mRNA via three enzymatic steps: RNA triphosphatase removes the gamma phosphate, guanylyltransferase adds GMP, and methyltransferase methylates it. Performed co-transcriptionally by the capping enzyme complex associated with , the cap recruits initiation factors for , protects against 5'-3' exonucleases, and promotes splicing and . In eukaryotes, capping efficiency influences mRNA , with uncapped transcripts rapidly degraded.

Structural Elements and Cytoskeleton

The spindle apparatus is a dynamic, bipolar array of microtubules that forms during mitosis and meiosis to ensure the accurate separation of chromosomes to daughter cells. Composed primarily of microtubules—cylindrical polymers assembled from α- and β-tubulin heterodimers—this structure organizes into distinct subsets, including kinetochore microtubules that attach to chromosomes via kinetochores, interpolar microtubules that span between spindle poles, and astral microtubules that extend to the cell cortex for positioning. Key components such as the γ-tubulin ring complex (γ-TuRC) nucleate microtubules at minus ends, while motor proteins like kinesin-5 (e.g., Eg5) and dynein drive sliding and bundling to establish bipolarity. The apparatus achieves chromosome segregation through microtubule dynamics, including plus-end polymerization at kinetochores and poleward flux via depolymerization, generating tension that aligns chromosomes at the metaphase plate before pulling sister chromatids apart in anaphase. Sister chromatids are the two identical copies of a replicated , joined at the , that arise during the of the . These structures consist of duplicated DNA molecules encased in , held together by proteins until , when they separate to ensure each daughter cell receives an exact genetic complement. Functionally, sister chromatids serve as the units of inheritance during cell division, with spindle microtubules attaching to the centromere-bound kinetochores to bi-orient and segregate them, preventing . Their formation and resolution are critical for maintaining stability, as disruptions can lead to chromosomal instability observed in diseases like cancer. The chromosome scaffold refers to a proteinaceous framework that provides structural support and organization to mitotic chromosomes by anchoring and compacting loops. This scaffold is primarily composed of non-histone proteins, including complexes (e.g., hCAP-E), IIα, and family member 4 (KIF4), forming a double-stranded axial structure along axes as visualized by advanced imaging techniques like 3D-SIM and FIB/SEM. It organizes by radially looping DNA fibers, enabling the dramatic condensation of into compact chromosomes while preserving flexibility for segregation. The scaffold's integrity is reversible, responding to ions like Mg²⁺ for decondensation, and its disruption impairs higher-order architecture. The smooth endoplasmic reticulum (SER) is a network of interconnected tubules and vesicles lacking ribosomes, forming a continuous membrane system throughout the cytosol that specializes in lipid metabolism and detoxification. Unlike the rough ER, the SER's tubular morphology—often 30–60 nm in diameter—facilitates the synthesis of membrane lipids, including phospholipids, cholesterol, and steroids, via embedded enzymes that process precursors like fatty acids and glycerol phosphate. This structure is particularly abundant in cells with high lipid demands, such as hepatocytes and steroid-secreting cells, where it also sequesters calcium ions and metabolizes lipophilic compounds through cytochrome P450 enzymes. The SER's dynamic remodeling supports cellular homeostasis by exporting lipids to other organelles and the plasma membrane.

T

Transcription and Gene Expression

Transcription is the process by which RNA is synthesized from a DNA template, serving as the first step in to produce (mRNA) that carries genetic information for protein synthesis. This enzymatic reaction is catalyzed by enzymes and occurs in three main phases: , elongation, and termination. In prokaryotes, a single handles all transcription, while eukaryotes utilize three distinct polymerases— for , II for mRNA, and III for and small RNAs—each with specific regulatory mechanisms. The fidelity of transcription ensures that the RNA sequence mirrors the DNA template strand, with uracil substituting for thymine in RNA. Initiation of transcription begins when binds to the promoter region of the , a DNA sequence typically rich in adenine-thymine base pairs that facilitates DNA unwinding to form an open complex. Transcription factors, which are proteins that specifically bind to promoter or enhancer DNA sequences, play a crucial role in this phase by recruiting and modulating its activity to control the rate and specificity of transcription. General transcription factors, such as TFIID in eukaryotes, assemble at the core promoter to form the pre-initiation complex, while specific transcription factors bind to enhancers—distal regulatory elements that can loop to interact with promoters—to activate or repress in response to cellular signals. For instance, the recognizes the in many eukaryotic promoters, stabilizing the complex for polymerase loading. During elongation, RNA polymerase moves along the DNA template at a rate of approximately 20–50 nucleotides per second in eukaryotes, synthesizing a growing RNA chain by adding ribonucleotides complementary to the DNA strand, while the DNA double helix reforms behind the enzyme. This phase involves overcoming nucleosome barriers in chromatin and pausing at regulatory checkpoints to ensure accurate synthesis. Transcription factors can also influence elongation by promoting pause release or enhancing processivity, thereby fine-tuning gene output. Termination signals the end of transcription when RNA polymerase encounters a terminator sequence, a specific DNA motif that halts enzyme activity and releases the newly synthesized RNA transcript. In prokaryotes, rho-independent terminators form RNA hairpin structures followed by a uracil-rich stretch, causing polymerase to dissociate; rho-dependent terminators involve the Rho protein binding to the RNA to pull the enzyme off the DNA. Eukaryotic termination for RNA polymerase II is coupled to 3' end processing, triggered by the polyadenylation signal (AAUAAA) in the pre-mRNA, which recruits cleavage and polyadenylation factors to cleave the transcript and release polymerase. Transposons, also known as transposable elements, are mobile DNA segments that can insert into new genomic locations, thereby altering gene expression by disrupting promoters, enhancers, or coding sequences, or by introducing novel regulatory motifs. These elements, which constitute a significant portion of eukaryotic genomes (e.g., over 45% in humans), move via cut-and-paste (DNA transposons) or copy-and-paste (retrotransposons via RNA intermediates) mechanisms, potentially activating or silencing nearby genes through epigenetic modifications or provision of transcription factor binding sites. Seminal studies have shown transposons contribute to evolutionary innovation in gene regulation, such as in the co-option of elements for placental development in mammals.

Translation and Protein Assembly

Translation is the biological process by which the nucleotide sequence of (mRNA) is decoded to direct the synthesis of a polypeptide chain, occurring on ribosomes in the or on the rough . This mRNA-directed protein synthesis was first demonstrated in cell-free extracts of , where synthetic polyuridylic acid (poly-U) RNA stimulated the incorporation of into protein, establishing mRNA as the template for . The process involves at a , elongation through sequential addition of , and termination at stop codons, ensuring fidelity in converting genetic information into functional proteins. The triplet code, or triplet genetic code, refers to the organization of the into non-overlapping groups of three consecutive (codons) in mRNA, each specifying one of the 20 standard or signaling termination. This degeneracy allows multiple codons to encode the same , with 61 codons for and 3 as stop signals, providing robustness against mutations. The triplet nature was experimentally confirmed using proflavin-induced mutations in T4, which showed that insertions or deletions of single or double bases disrupted the , while triples restored it, proving the code's triplet specificity without overlap or gaps. Transfer RNA (tRNA) functions as an adaptor molecule in translation, bearing an anticodon loop that base-pairs with the complementary mRNA codon and a 3' CCA end covalently attached to its cognate via aminoacylation by specific synthetases. Discovered as a soluble RNA intermediate in rat liver cell-free systems, tRNA was shown to accept activated (aminoacyl adenylates) before transferring them to growing polypeptide chains on ribosomes. Each tRNA type recognizes a specific codon through Watson-Crick base pairing in the anticodon, with wobble pairing at the third position allowing one tRNA to decode multiple synonymous codons, thus minimizing the required number of tRNA species to about 40 in most cells. Polysomes, also known as polyribosomes, are assemblies of several ribosomes bound to and translating the same mRNA molecule simultaneously, enabling rapid and efficient production of multiple protein copies from a single transcript. First observed in electron micrographs of lysates actively synthesizing , polysomes sediment as discrete particles in sucrose gradients, with the number of ribosomes per polysome correlating to mRNA length and translation rate. In eukaryotic cells, polysomes form on free cytosolic mRNAs or membrane-bound mRNAs at the , and their disruption by agents like halts protein synthesis, underscoring their role in coordinating ribosomal activity for high-output .

Telomeres and Chromosomal Ends

Telomeres are specialized nucleoprotein structures that cap the ends of linear eukaryotic chromosomes, preventing their recognition as DNA double-strand breaks and mitigating the end-replication problem inherent to semiconservative DNA replication. In humans and other vertebrates, telomeric DNA consists of tandem repeats of the hexanucleotide sequence 5'-TTAGGG-3', typically ranging from 5 to 15 kilobases in length, with a 3' single-stranded overhang of 50–300 nucleotides that facilitates the formation of protective t-loops. This repetitive sequence, first identified through genomic analysis of human metaphase chromosomes, is highly conserved across species and non-coding, serving primarily structural rather than informational roles. The associated proteins, including the shelterin complex, bind these repeats to form a higher-order structure that shields chromosomal termini from nucleolytic degradation, illegitimate recombination, and activation of DNA damage checkpoints. Telomerase, a unique ribonucleoprotein , counteracts attrition by elongating the 3' overhang using its integral component as a template for synthesizing additional TTAGGG repeats. Composed of the catalytic subunit TERT () and the RNA subunit TERC, which provides the template sequence complementary to the telomeric repeat, telomerase extends telomeres de novo without requiring a DNA primer beyond the chromosomal end. This was discovered in extracts of the Tetrahymena thermophila, where it was shown to add telomeric sequences , a finding that revealed its RNA-dependent mechanism and addressed the incomplete replication of linear DNA ends by conventional polymerases. In somatic human cells, telomerase activity is repressed, leading to progressive telomere shortening with each division, which imposes a replicative limit and contributes to ; however, it is upregulated in stem cells and most cancers to maintain telomere length and enable indefinite proliferation. Terminal deoxynucleotidyl transferase (TdT), also known as terminal transferase, is a specialized DNA polymerase that catalyzes the template-independent addition of deoxynucleotides to the 3' hydroxyl ends of single- or double-stranded DNA, typically incorporating 10–20 random nucleotides per reaction. Unlike telomerase, TdT lacks an RNA template and relies on the availability of dNTP substrates, making it a key player in generating junctional diversity during V(D)J recombination in developing lymphocytes, where it adds non-templated (N) nucleotides to enhance antibody and T-cell receptor variability. In the context of chromosomal ends, TdT can contribute to end modification during DNA repair or recombination events, though its activity is tightly regulated and primarily active in immature immune cells to avoid genomic instability. The complex provides multilayered protection to telomeric ends by repressing DNA damage signaling pathways, such as /ATR activation and (NHEJ), while also regulating access for . Comprising six subunits—TRF1 and TRF2 (telomeric repeat-binding factors that bind double-stranded TTAGGG repeats), POT1 (protection of telomeres 1, which binds the single-stranded overhang), and bridging proteins TIN2, TPP1, and RAP1— forms a closed ring around the to compact the DNA and inhibit unwanted repair. Disruption of leads to telomere dysfunction-induced foci (TIFs), telomere fusions, and genomic instability, underscoring its essential role in end protection across cell divisions.

U

Ubiquitination and Protein Degradation

Ubiquitination is a fundamental in eukaryotic cells where the 76-amino-acid protein is covalently conjugated to target proteins, primarily to regulate their stability, localization, and interactions. This process plays a central role in maintaining (proteostasis) by targeting aberrant or excess proteins for degradation, thereby preventing cellular dysfunction. The addition of occurs through an ATP-dependent enzymatic cascade involving E1 (), E2 (ubiquitin-conjugating enzyme), and E3 () enzymes, with E3 providing substrate specificity to ensure precise targeting. Discovered in the late and early , this system was elucidated by , , and Irwin Rose, who identified the ATP-requiring conjugation mechanism in extracts, earning them the 2004 . The ubiquitination process begins with E1 activating ubiquitin by forming a high-energy thioester bond using ATP, which is then transferred to the catalytic cysteine of E2. E3 ligases bridge E2 and the substrate, facilitating the transfer of ubiquitin to a lysine residue on the target protein or to another ubiquitin molecule, forming chains. While monoubiquitination (a single ubiquitin) often modulates protein function, such as endocytosis or DNA repair, polyubiquitination— the sequential addition of multiple ubiquitins— is essential for targeting proteins to degradation pathways. Polyubiquitin chains can form through eight distinct linkages based on ubiquitin's internal lysines (K6, K11, K27, K29, K33, K48, K63) or N-terminal methionine (M1), creating a "ubiquitin code" that dictates downstream outcomes. For instance, K48-linked polyubiquitin chains, the most common degradative signal, require a minimum of four ubiquitins for efficient recognition and are assembled by E2 enzymes like UbcH5 and E3 ligases such as the anaphase-promoting complex/cyclosome (APC/C). In protein degradation, K48- and K11-linked polyubiquitin chains direct substrates to the , a large multiprotein complex that unfolds and hydrolyzes ubiquitinated proteins into peptides and recycles via deubiquitinating enzymes (DUBs). This pathway is critical for , as it selectively degrades misfolded, damaged, or unassembled proteins, such as those from stalled ribosomes or stress, preventing toxic aggregates. For example, the E3 ligase CHIP cooperates with chaperones like to ubiquitinate misfolded proteins for proteasomal clearance, ensuring cellular . K63- and M1-linked chains, in contrast, typically promote non-degradative roles, such as activating signaling for inflammation, highlighting ubiquitination's versatility beyond degradation. DUBs, numbering over 100 in humans, counteract ubiquitination by editing chains or removing , fine-tuning the process; for instance, USP7 stabilizes by deubiquitinating it. Upstream regulatory signals often initiate ubiquitination in response to cellular cues, such as by kinases that activate E3 ligases or prime substrates. In signaling pathways, activation domains (e.g., in transcription factors) or sites serve as recognition motifs for E3 recruitment, linking extracellular stimuli like growth factors to targeted degradation. For example, in the Wnt pathway, of β-catenin by GSK3β creates a for the E3 ligase β-TrCP, leading to its polyubiquitination and degradation. This integration ensures ubiquitination responds dynamically to maintain signaling fidelity and protein quality.

Upstream Regulatory Elements

Upstream regulatory elements are cis-acting DNA sequences located 5' (upstream) of the transcription start site (TSS) in eukaryotic genes, serving to modulate the initiation, rate, and tissue-specificity of transcription by . These elements collectively form the promoter region and associated distal sites, binding transcription factors (TFs) and co-regulators to assemble the preinitiation complex (PIC) and influence accessibility. Unlike coding sequences, they do not encode proteins but act as docking platforms for regulatory proteins, ensuring precise patterns during development, differentiation, and environmental responses. The core promoter, spanning approximately -40 to +40 base pairs relative to the TSS, represents the minimal upstream element required for basal transcription. It includes motifs such as the TATA box (consensus TATAAA, located ~25-35 bp upstream of the TSS), which is recognized by the TATA-binding protein (TBP) subunit of TFIID to nucleate PIC assembly. The TATA box was the first core promoter motif identified in eukaryotic genes, discovered through sequencing of histone genes in 1978 and confirmed in adenovirus major late promoter studies in 1979. Other core elements include the initiator (Inr) at the TSS (consensus YYANWYY, where Y=pyrimidine, N=any nucleotide, W=A/T), which specifies the exact transcription start, and the downstream promoter element (DPE) ~30 bp downstream, both functioning cooperatively with TBP to recruit RNA polymerase II and general TFs like TFIIB and TFIIH. Approximately 10-30% of human promoters contain a TATA box, while TATA-less promoters rely more on Inr and CpG islands for initiation. Proximal promoter elements extend from ~100 to 250 base pairs upstream of the TSS, containing specific TF binding sites that fine-tune transcription in response to cellular signals. These regions often include GC-rich sequences or CpG islands in vertebrates, which are unmethylated and associated with genes, facilitating constitutive expression via Sp1 TF binding. For instance, the (consensus GGCCAATCT) and GC box (GGGCGG) motifs, bound by NF-Y and Sp1 TFs respectively, enhance PIC stability and are found in many TATA-less promoters. Mutations in proximal elements, such as a in the GATA-1 of the PKLR gene's promoter, can lead to reduced expression and diseases like , underscoring their regulatory importance. Unlike distal elements, proximal sites exert effects primarily through direct interactions with the core promoter, without requiring extensive DNA looping. Distal upstream enhancers, often located kilobases to megabases away from the TSS, are modular sequences that potently activate transcription in a position- and orientation-independent manner. Discovered in 1981 within the simian virus 40 () genome as a 72-bp repeat that boosted β-globin gene expression up to 200-fold when placed upstream, enhancers were soon identified in cellular genes like immunoglobulin loci. They function by binding combinations of activator TFs (e.g., AP-1, ) that recruit and histone acetyltransferases, promoting chromatin opening via DNA looping mediated by and . A classic example is the immunoglobulin heavy-chain enhancer, which drives B-cell-specific expression through Oct and PU.1 TF binding. Enhancers can also act as super-enhancers in clusters, amplifying expression of key developmental genes like those in the HOX loci. While enhancers are typically activating, analogous upstream silencers bind repressors (e.g., Polycomb proteins) to inhibit transcription, as seen in PREs that maintain silencing.

V

Vectors and Viral Components

In molecular biology, a vector is a DNA molecule, such as a plasmid or viral genome, used as a vehicle to carry a specific DNA segment into a host cell for replication, expression, or manipulation. Vectors enable the introduction of foreign genetic material, facilitating techniques like and by integrating or maintaining the inserted DNA within the host genome or as an extrachromosomal element. Common types include plasmid vectors, which are small, circular, self-replicating DNA molecules derived from bacteria, and viral vectors, such as those based on adenoviruses or lentiviruses, which exploit viral machinery for efficient into eukaryotic cells. For instance, in technology, a gene of interest is ligated into the vector's , allowing propagation in bacterial hosts before transfer to target cells. This process underpins applications in , where vectors must possess features like , selectable markers, and promoters to ensure stable inheritance and expression. Viral oncogenes (v-onc) are genes acquired by viruses from host cells that promote uncontrolled and can lead to cancer when expressed in infected cells. These oncogenes originate from cellular proto-oncogenes but are altered during viral transduction, often losing regulatory elements, resulting in constitutive activity that disrupts normal signaling pathways like receptors or transcription factors. Retroviruses, such as carrying v-src (a ), exemplify this, where integration near proto-oncogenes or direct expression of v-onc drives oncogenesis. In DNA viruses like human papillomavirus, viral proteins such as E6 and E7 function analogously by inactivating tumor suppressors and Rb, though not true transduced oncogenes. Over two dozen distinct viral oncogenes have been identified, highlighting their role in viral across species. A virion is the complete, extracellular infectious form of a , consisting of a (DNA or RNA) enclosed within a protective protein coat called a , sometimes surrounded by a . The primary function of the virion is to deliver its into a susceptible host cell, initiating replication while shielding the genetic material from environmental degradation. Virions exhibit diverse architectures: helical (e.g., ), icosahedral (e.g., adenovirus), or complex (e.g., T4), with sizes ranging from 20 nm to over 300 nm depending on the family. In enveloped es like , the derived from host membranes incorporates viral glycoproteins for attachment and entry, contrasting with non-enveloped virions that rely on direct capsid-mediated penetration. This structure represents the in its dormant phase, distinct from intracellular replication forms.

Vesicles and Transport Mechanisms

Vesicles are small, membrane-bound s that play a central role in intracellular trafficking by encapsulating such as proteins, , and other molecules for directed between cellular compartments. In eukaryotic cells, vesicular ensures the proper sorting and delivery of these components, maintaining identity and cellular . This process involves the of vesicles from donor membranes, their translocation through the , and fusion with target membranes, often mediated by specific protein coats and regulatory factors. Vesicles form through from donor membranes, like the Golgi apparatus or , and mediate processes including , , and intracellular trafficking via specific coat proteins such as or COPI/II. In vesicular , these carriers ensure directional movement, with fusion to target membranes regulated by SNARE proteins and Rab to maintain cellular compartmentalization and . For example, secretory vesicles release hormones or neurotransmitters by , while endocytic vesicles internalize extracellular material for degradation or recycling. A key aspect of vesicular transport occurs between the (ER) and the Golgi apparatus, where COP-coated vesicles predominate. These non-clathrin coats, composed of coat protein complexes (COPs), facilitate selective packaging and membrane deformation for vesicle formation. Seminal work by and colleagues demonstrated that COPI vesicle assembly requires only coatomer—a soluble complex of seven subunits—and the GTP-binding protein ARF, which recruits coatomer to the Golgi membrane upon GTP binding, enabling self-assembly and budding. COPII vesicles, in contrast, mediate anterograde transport from the ER to the Golgi; their coat assembly is initiated by the Sar1, which inserts into the ER membrane and recruits the Sec23/24-Sec13/31 coat complex to capture proteins destined for . COPI vesicles primarily handle retrograde transport, retrieving escaped ER-resident proteins—such as those bearing KDEL or KKXX retrieval signals—back from the cis-Golgi or ER-Golgi intermediate compartment, thus preventing dilution of ER-specific machinery. In addition to inter-organelle shuttling, COP-coated vesicles contribute to intra-Golgi trafficking, where COPI supports bidirectional movement between Golgi cisternae to refine protein and sorting. The fission of these vesicles is driven by coat-induced , followed by uncoating via GTP , which is essential for subsequent SNARE-mediated fusion with acceptor membranes. Disruptions in COP function, as identified in yeast mutants by , underscore their indispensability, leading to accumulation of secretory precursors and impaired cellular . These mechanisms highlight the precision of vesicular logistics in eukaryotic cells, with COP coats ensuring efficient, vectorial flow of biosynthetic and recycling pathways. The represents another critical vesicular structure, particularly in and fungal cells, where it functions as a multifunctional storage bounded by a tonoplast . In , the central vacuole often occupies up to 90% of the mature cell volume, storing to maintain , which drives cell expansion and structural rigidity. It also sequesters ions, organic acids, sugars, , and secondary metabolites such as anthocyanins and alkaloids, protecting the from potential toxicity while serving as a reservoir during development or stress. Plant vacuoles exist in distinct forms, including lytic vacuoles for hydrolytic degradation of cellular waste via and protein storage vacuoles (PSVs) that accumulate globulins or glutenins in seeds and for future . In fungi, vacuoles share evolutionary and functional parallels with plant vacuoles and mammalian lysosomes, acting as acidic compartments (pH ~5-6) for storage and . They primarily store basic , , calcium ions, and other metabolites, buffering cytosolic and supporting rapid responses to fluctuations. Fungal vacuoles also harbor degradative enzymes like proteases and nucleases, enabling the breakdown of macromolecules during starvation or sporulation, as evidenced in models where vacuolar mutants exhibit impaired growth and ion tolerance. Biogenesis of fungal vacuoles involves homotypic fusion of smaller vesicles derived from the Golgi or endosomal pathways, regulated by proteins such as Vps (vacuolar protein sorting) factors, ensuring dynamic remodeling in response to environmental cues.

W

Western Blotting and Detection Methods

Western blotting, commonly referred to as immunoblotting, is a fundamental technique in for detecting and quantifying specific proteins within a complex mixture of cellular extracts. The method relies on the separation of proteins by size using , followed by their transfer to a and subsequent probing with antibodies that recognize the target protein. This approach enables the analysis of protein expression levels, post-translational modifications, and interactions, providing insights into cellular processes such as signaling pathways and mechanisms. Originally developed to improve upon earlier protein detection methods, Western blotting has become indispensable due to its specificity and sensitivity, capable of detecting proteins at picogram levels. The core procedure initiates with sample preparation, where proteins are extracted from cells or tissues using lysis buffers and quantified, often via assays like , to ensure equal loading. In , proteins are denatured with SDS and beta-mercaptoethanol, mixed with loading buffer, and electrophoresed through a polyacrylamide gel under an ; smaller proteins migrate faster, achieving separation by molecular weight with resolutions down to 1-2 . The resolved proteins are then electroblotted onto a or polyvinylidene difluoride (PVDF) membrane using a buffer system like Towbin's (25 mM Tris, 192 mM , 20% , pH 8.3), preserving the banding pattern for subsequent analysis. This transfer step, first detailed in 1979, allows proteins to bind irreversibly to the membrane while maintaining their spatial separation. Following transfer, the membrane is blocked with non-fat milk or to prevent non-specific binding, then incubated with a primary specific to the target protein, typically overnight at 4°C for optimal binding affinity. A secondary , conjugated to an like (HRP) or , or to a , is applied next to amplify the signal through antigen-antibody interactions. The term "Western blotting" was introduced in to denote this radiographic detection using radioiodinated , though modern variants avoid . Detection occurs via , where HRP catalyzes a substrate like to produce light captured on film or digital imagers, offering dynamic ranges up to 10^4-fold; fluorescent methods enable for simultaneous probing of multiple proteins using distinct emission wavelengths; and colorimetric approaches yield visible bands via enzymatic , though with lower sensitivity around nanograms. These methods collectively facilitate quantitative comparisons, such as normalizing target protein signals to housekeeping proteins like .

X

X Chromosome and Sex-linked Genetics

The X chromosome is one of the two sex chromosomes in mammals, occurring in two copies in females (XX karyotype) and a single copy in males (XY karyotype), where it plays a central role in sex determination by providing essential genes in the absence of the Y chromosome's SRY gene. It spans approximately 155 million base pairs, comprising about 5% of the human genome, and encodes approximately 900–1,400 protein-coding genes, many of which influence traits ranging from color vision to blood clotting. These genes are disproportionately involved in intellectual disability, immune function, and reproduction compared to autosomal genes, highlighting the X chromosome's evolutionary conservation despite its partial degeneration relative to the Y. Sex-linked genetics refers to the inheritance patterns of genes located on the X chromosome, which exhibit distinct modes due to hemizygosity in males and the potential for carrier states in females. first demonstrated in 1910 by through experiments with , where a recessive white-eye mutation appeared only in males, revealing X-linkage as the white-eye allele on the single X chromosome was expressed without a dominant counterpart. In humans, X-linked recessive traits, such as hemophilia A (due to F8 gene mutations) or Duchenne muscular dystrophy (DMD gene), predominantly affect males, who inherit the mutant allele from their carrier mothers and express the phenotype, while females require two mutant copies or to manifest symptoms. X-linked dominant conditions, like (FMR1 gene expansion), can affect both sexes but often show variable severity, with males typically more impacted due to single-copy expression. These patterns enable pedigree analysis to trace , with no male-to-male transmission, underscoring the chromosome's role in for affected families. To compensate for the dosage imbalance of X-linked genes—twice as many in females as in males—one X chromosome is transcriptionally silenced in each female somatic cell through a process called X-chromosome inactivation (XCI), ensuring equivalent gene expression levels between sexes. Proposed by Mary F. Lyon in 1961 based on variegated coat color patterns in female mice heterozygous for X-linked genes, random XCI occurs early in embryonic development, creating a mosaic of cells expressing either the maternal or paternal X. The inactivated X chromosome (Xi) condenses into a compact, heterochromatic structure known as the Barr body, first identified by Murray L. Barr and E. G. Bertram in 1949 as a sex-specific nuclear inclusion visible in female interphase nuclei. At the molecular level, XCI is orchestrated by the X-inactivation center (XIC), a genomic locus containing the long noncoding RNA gene XIST, whose transcript coats the future Xi in cis, recruiting Polycomb repressive complexes to mediate histone modifications (e.g., H3K27me3 enrichment) and DNA methylation for stable silencing. The antisense TSIX RNA from the XIC represses XIST on the active X (Xa), ensuring choice specificity, while about 15-25% of human X genes escape inactivation, particularly in the pseudoautosomal regions, contributing to sexual dimorphisms. This epigenetic mechanism is crucial for viability, as disruptions lead to developmental disorders like Turner syndrome (XO) or triple X syndrome (XXX).

Y

Yeast Models and Hybrid Systems

Yeast models, particularly those utilizing the budding yeast , serve as powerful eukaryotic platforms for studying genetic and molecular processes due to their conserved cellular machinery and ease of genetic manipulation. These models enable the cloning and analysis of large DNA segments and the detection of protein-protein interactions, bridging gaps in higher organism studies. Key tools include yeast artificial chromosomes (YACs) for handling extensive genomic inserts and the two-hybrid for identifying interacting proteins through transcriptional activation. Yeast artificial chromosomes (YACs) are linear vectors constructed by ligating large exogenous DNA fragments—typically ranging from 100 to over 1,000 kilobase pairs—to yeast-derived sequences including a centromere for mitotic segregation, telomeres for stability, and an autonomously replicating sequence (ARS) for replication initiation. This design allows YACs to behave as functional chromosomes in S. cerevisiae, enabling stable propagation and faithful copying during cell division, as demonstrated by pulsed-field gel electrophoresis analysis of cloned yeast and human DNA. Developed in 1987, YACs addressed limitations of smaller cloning vectors like plasmids, which were restricted to fragments under 50 kilobase pairs, thus facilitating the construction of comprehensive genomic libraries for complex eukaryotes. Their primary use in molecular biology involves mapping large-scale genome structures, such as during the Human Genome Project, where they supported the assembly of overlapping clones to span entire chromosomes despite challenges like chimeric artifacts from recombination. The two-hybrid system detects protein-protein interactions by exploiting the modular nature of the S. cerevisiae GAL4, which consists of a separable (DBD) and domain (AD). In this , a "bait" protein of interest is fused to the GAL4 DBD, which binds to upstream activating sequence (UAS) elements in the promoter of a (e.g., HIS3 for prototrophy or lacZ for activity), while a "prey" protein is fused to the GAL4 AD; if the bait and prey interact, the DBD and AD are brought into proximity, reconstituting transcriptional activation and producing a detectable signal. Originally described in using the interacting yeast proteins SNF1 and SNF4 to validate the system, this method allows screening of cDNA libraries to identify novel interactors under selectable conditions like galactose induction. Its high-throughput potential has enabled genome-wide interactome mapping, such as identifying over 5,600 interactions in yeast and thousands in human proteins, providing insights into signaling pathways and disease mechanisms while minimizing false positives through controls for auto-. As a genetic tool, it complements vector-based systems like those in viral components by focusing on interaction dynamics rather than DNA propagation.

Z

Zymogens and Enzyme Activation

Zymogens, also known as proenzymes, are inactive precursor forms of , particularly proteolytic enzymes, that are synthesized to prevent premature or uncontrolled enzymatic activity within the cell. This inactivation is achieved through the inclusion of a propeptide or activation segment that sterically hinders the , ensuring that is spatially and temporally regulated. Such a strategy is essential in to avoid self-digestion of cellular components and to coordinate function with physiological needs, as seen in digestive and hemostatic processes. The primary mechanism of zymogen involves limited proteolytic cleavage, where a specific in the propeptide is hydrolyzed, often by another , leading to the removal of the inhibitory segment and a conformational change that exposes the . This process can be triggered by enzymatic cofactors, such as , or non-enzymatic factors like pH shifts that promote . In some cases, the propeptide not only inhibits activity but also assists in proper folding, stability, and intracellular sorting of the during . Seminal studies on this mechanism, including the identification of an activation peptide released during , established the foundational understanding of proteolytic conversion. A representative example is the of , the form of the , produced in pancreatic acinar cells. In the , duodenal specifically cleaves an octapeptide (Ala-Pro-Phe-Asp-Asp-Asp-Asp-Lys) from the of , releasing the inhibitory propeptide and inducing a conformational shift to form active . This initial then enables a proteolytic cascade, where further activates other pancreatic zymogens like and procarboxypeptidase, amplifying protein digestion. The kinetics of this process are highly efficient, with activation rates proportional to concentration, underscoring its role in regulated gastrointestinal . Zymogen activation is also central to the blood coagulation cascade, where circulating zymogens such as prothrombin and factors VII, IX, and X are sequentially converted to active serine proteases through limited proteolysis. For instance, prothrombin is cleaved by activated factor X to yield thrombin, which then promotes fibrin clot formation; this cascade amplifies the response to vascular injury while localizing activity to the site of damage. Such zymogen-based cascades exemplify how cells achieve rapid, controlled amplification of proteolytic signals in response to physiological cues.

Zygotes and Cell Fusion

In cellular and molecular biology, a is the single diploid cell formed immediately after the fusion of two haploid gametes, typically a and an ovum, during fertilization in . This initial product of syngamy marks the beginning of embryonic development, where the inherits a complete set of chromosomes—one set from each parent—restoring the diploid number characteristic of the species. The 's formation is a highly regulated process essential for and species propagation, observed across eukaryotes from animals to . Cell during formation involves the precise merging of plasma from the male and female gametes, mediated by specialized proteins such as fusogens. In mammals, for instance, the -egg fusion is facilitated by proteins like IZUMO1 on the surface and JUNO on the , which form a heteromeric complex to trigger membrane merger and prevent . This fusion event releases the 's genetic material into the , leading to the decondensation of the nucleus and its integration with the female . Post-fusion, the undergoes rapid cellular rearrangements, including the extrusion of polar bodies from prior meiotic divisions, ensuring the correct . Following fertilization, the enters a phase of transcriptional silence, relying on maternal mRNAs and proteins stored in the for initial development. Zygotic genome activation (ZGA) represents the critical transition where the embryonic genome begins active transcription, typically occurring at the 2- to 8-cell stage in mammals. This activation is timed by regulatory mechanisms, including and the removal of repressive marks like , allowing to initiate zygotic . ZGA is species-specific; for example, in Xenopus laevis, it initiates at the mid-blastula transition around the 12th cleavage, driven by the accumulation of cyclin-dependent kinases that phosphorylate transcription factors. Disruptions in ZGA can lead to developmental arrest, underscoring its role in establishing totipotency and lineage specification.

References

Add your contribution
Related Hubs
User Avatar
No comments yet.