Recent from talks
Nothing was collected or created yet.
Restriction site
View on WikipediaIn molecular biology, restriction sites, or restriction recognition sites, are regions of a DNA molecule containing specific (4-8 base pairs in length[1]) sequences of nucleotides; these are recognized by restriction enzymes, which cleave the DNA at or near the site. These are generally palindromic sequences[2] (because restriction enzymes usually bind as homodimers), and a particular restriction enzyme may cut the sequence between two nucleotides within its recognition site, or somewhere nearby.
Function
[edit]For example, the common restriction enzyme EcoRI recognizes the palindromic sequence GAATTC and cuts between the G and the A on both the top and bottom strands. This leaves an overhang (an end-portion of a DNA strand with no attached complement) known as a sticky end[2] on each end of AATT (AATTC, i.e. TTAAC). The overhang can then be used to ligate in (see DNA ligase) a piece of DNA with a complementary overhang (another EcoRI-cut piece, for example).
Some restriction enzymes cut DNA at a restriction site in a manner which leaves no overhang, called a blunt end.[2] Blunt ends are much less likely to be ligated by a DNA ligase because the blunt end doesn't have the overhanging base pair that the enzyme can recognize and match with a complementary pair.[3] Sticky ends of DNA however are more likely to successfully bind with the help of a DNA ligase because of the exposed and unpaired nucleotides. For example, a sticky end trailing with AATTG is more likely to bind with a ligase than a blunt end where both the 5' and 3' DNA strands are paired. In the case of the example the AATTG would have a complementary pair of TTAAC which would reduce the functionality of the DNA ligase enzyme.[4]
Applications
[edit]Restriction sites can be used for multiple applications in molecular biology such as identifying restriction fragment length polymorphisms (RFLPs). Restriction sites are also important consideration to be aware of when designing plasmids.
Databases
[edit]Several databases exist for restriction sites and enzymes, of which the largest noncommercial database is REBASE.[5][6] Recently, it has been shown that statistically significant nullomers (i.e. short absent motifs which are highly expected to exist) in virus genomes are restriction sites indicating that viruses have probably got rid of these motifs to facilitate invasion of bacterial hosts.[7] Nullomers Database contains a comprehensive catalogue of minimal absent motifs many of which might potentially be not-yet-known restriction motifs.
See also
[edit]References
[edit]- ^ Russell, Peter J. (2006). iGenetics: A Mendelian Approach. Benjamin Cummings. ISBN 978-0805346664.
- ^ a b c Lehninger, Albert L.; Nelson, David L.; Cox, Michael M. (2008). Principles of Biochemistry (5th ed.). New York, NY: W.H. Freeman and Company. p. 305–306. ISBN 978-0-7167-7108-1.
- ^ Mousavi-Khattat, Mohammad; Rafati, Adele; Gill, Pooria (5 February 2015). "Fabrication of DNA nanotubes using origami-based nanostructures with sticky ends". Journal of Nanostructure in Chemistry. 5 (2): 177–183. doi:10.1007/s40097-015-0148-z.
- ^ Gao, Song; Zhang, Jiannan; Miao, Tianjin; Ma, Di; Su, Ying; An, Yingfeng; Zhang, Qingrui (28 March 2015). "A Simple and Convenient Sticky/Blunt-End Ligation Method for Fusion Gene Construction". Biochemical Genetics. 53 (1–3): 42–48. doi:10.1007/s10528-015-9669-x. PMID 25820211. S2CID 16709792.
- ^ Roberts, Richard J.; Vincze, Tamas; Posfai, Janos; Macelis, Dana (2009-10-21). "REBASE—a database for DNA restriction and modification: enzymes, genes and genomes". Nucleic Acids Research. 38 (suppl_1): D234 – D236. doi:10.1093/nar/gkp874. ISSN 0305-1048. PMC 2808884. PMID 19846593.
- ^ Roberts, Richard J.; Vincze, Tamas; Posfai, Janos; Macelis, Dana (2014-11-05). "REBASE—a database for DNA restriction and modification: enzymes, genes and genomes". Nucleic Acids Research. 43 (D1): D298 – D299. doi:10.1093/nar/gku1046. ISSN 1362-4962. PMC 4383893. PMID 25378308.
- ^ Koulouras, Grigorios; Frith, Martin C (2021-04-06). "Significant non-existence of sequences in genomes and proteomes". Nucleic Acids Research. 49 (6): 3139–3155. doi:10.1093/nar/gkab139. ISSN 0305-1048. PMC 8034619. PMID 33693858.
Restriction site
View on GrokipediaFundamentals
Definition
A restriction site, also known as a recognition site, is a specific, short DNA sequence—typically 4 to 8 base pairs in length—that is recognized and cleaved by a restriction endonuclease, commonly referred to as a restriction enzyme.[8] These enzymes act as molecular scissors, enabling precise cuts in double-stranded DNA molecules at these predetermined locations.[9] The phenomenon of restriction was first observed in the 1950s through studies on bacteriophage infections of Escherichia coli, with key insights into host-controlled modification emerging in the 1960s from work on bacteriophage lambda. This laid the groundwork for the 1970s isolation and purification of the first Type II restriction enzymes, such as HindII in 1970 and EcoRI in 1971, and the mapping of their corresponding recognition sequences.[10] These discoveries elucidated the restriction-modification systems, where enzymes cleave unmethylated foreign DNA at specific sites while protecting the host genome through methylation.[11] In the context of genetic engineering, the double-helical structure of DNA necessitates such precise cutting tools to manipulate genetic material without unintended damage, facilitating techniques like recombinant DNA construction.[12] For instance, the EcoRI restriction site is defined by the palindromic sequence 5'-GAATTC-3', which the enzyme recognizes across both strands of the DNA double helix.Recognition Sequence
A recognition sequence, also known as the recognition site or target sequence, is a short, specific stretch of double-stranded DNA that restriction enzymes identify and bind to initiate cleavage. These sequences typically range from 4 to 8 nucleotides in length, with 6-base-pair motifs being the most common among type II restriction endonucleases, which are widely used in molecular biology. The composition of these sequences is highly precise, often featuring dyad symmetry—palindromic arrangements where the sequence reads the same on both strands in the 5' to 3' direction—facilitating symmetric binding by the dimeric enzyme structure. For instance, sequences like 5'-GAATTC-3' lead to the generation of sticky (cohesive) ends with 5' overhangs upon cleavage, whereas 5'-CCCGGG-3' produces blunt ends with flush termini. This structural variation in the recognition sequence influences the type of DNA fragments generated, impacting downstream applications such as ligation efficiency in cloning. Restriction enzymes demonstrate stringent sequence specificity, requiring an exact match to the recognition sequence for effective binding and cleavage; single nucleotide mismatches within the core motif generally abolish or severely reduce enzymatic activity, ensuring targeted action on foreign DNA. While the primary specificity is dictated by the recognition sequence itself, certain enzymes exhibit sensitivity to immediately adjacent flanking sequences, which can modulate cleavage rates by up to several fold depending on their nucleotide composition. This flanking influence arises from subtle interactions that affect the enzyme's conformational changes during catalysis, though it remains secondary to the core sequence fidelity. In the context of bacterial restriction-modification (RM) systems, recognition sequences serve as conserved elements for host defense, enabling the discrimination and degradation of invading foreign DNA such as bacteriophages while sparing the methylated host genome. These systems, comprising a restriction endonuclease and a cognate methyltransferase, are phylogenetically widespread across prokaryotic species, with the recognition motifs maintained to provide robust protection against horizontal gene transfer threats. The conservation of RM architectures underscores the evolutionary pressure to preserve these sequences as integral components of innate immunity in microbes. To accommodate natural sequence variations or degenerate sites recognized by certain enzymes, recognition sequences are conventionally notated using the International Union of Pure and Applied Chemistry (IUPAC) ambiguity codes. These include R for purines (A or G), Y for pyrimidines (C or T), S for strong hydrogen-bonding pairs (G or C), W for weak pairs (A or T), M for amino bases (A or C), K for keto bases (G or T), B for all bases except A (C, G, or T), D for all except C (A, G, or T), H for all except G (A, C, or T), and V for all except T (A, C, or G), with N denoting any base (A, C, G, or T). This standardized notation allows precise description of partially degenerate motifs without listing every possible variant, aiding in database annotation and enzyme classification.Function and Mechanism
Enzyme Recognition
Restriction enzymes, particularly type II endonucleases, identify their target sequences through a highly specific binding process that involves the formation of homodimers or, in some cases, homotetramers, which symmetrically interact with the palindromic DNA recognition sites. These enzymes initially bind non-specifically to DNA via electrostatic interactions with the phosphodiester backbone, allowing them to scan the genome efficiently. Upon encountering a potential recognition site, the enzyme undergoes a conformational change, wrapping around the DNA double helix and making direct contacts with both the major and minor grooves. For instance, in enzymes like EcoRI and BamHI, structural elements such as recognition arms insert into the minor groove, while the core domains primarily engage the major groove to probe the base sequence. This binding is stabilized by approximately 15-20 hydrogen bonds between amino acid side chains and the bases of the recognition sequence, complemented by van der Waals interactions that provide additional specificity by sensing the shape and hydrophobicity of the base edges.[13][14] The specificity of recognition is determined by precise contact points for each base in the 4-8 bp sequence, often involving direct readout via hydrogen bonds to exocyclic groups on the bases and indirect readout through deformation of the DNA backbone. Enzymes like EcoRV exemplify this by forming specific hydrogen bonds with outer base pairs (e.g., G-A contacts via residues like Asn185) and using van der Waals forces with methyl groups for inner pairs, ensuring discrimination against non-cognate sites. Magnesium ions (Mg²⁺) play a crucial role in some enzymes by coordinating with catalytic motifs (e.g., PD...D/EXK) during the transition to the cleavage-competent state, although their primary function is in catalysis; in recognition, they may stabilize the enzyme-DNA complex in certain type II systems. This multi-step process—initial non-specific binding, partial recognition, and tight specific complex formation—achieves high fidelity, with enzymes like EcoRV bending DNA by up to 50° upon cognate site verification to lock in the interaction.[13][15][16] In the context of restriction-modification (RM) systems, the endonuclease is paired with a cognate methyltransferase that modifies the host DNA at the same recognition site, typically by adding methyl groups to adenine (N6-methyladenine) or cytosine (5-methylcytosine) residues, thereby preventing self-cleavage. This modification disrupts key hydrogen bonds or steric contacts in the endonuclease active site, as seen in EcoRV where adenine methylation abolishes binding affinity. The kinetic aspect of recognition involves facilitated diffusion, where the enzyme forms a non-specific complex and performs one-dimensional sliding along the DNA, combined with three-dimensional hopping, to locate target sites rapidly without off-target cleavage; studies on BssHII demonstrate linear scanning that halts at the first cognate site encountered, ensuring precise and efficient genome protection.[16][13][17]Cleavage Specificity
Restriction enzymes cleave DNA at precise positions within or adjacent to their recognition sequences, generating either cohesive (sticky) ends with single-stranded overhangs or blunt ends without overhangs. In the most common type II restriction endonucleases, cleavage occurs at fixed positions relative to the recognition site, typically producing 5' or 3' overhangs of 1-4 nucleotides in length or blunt ends.[18] For instance, EcoRI recognizes the sequence 5'-GAATTC-3' and cleaves between G and A on both strands, resulting in 5' overhangs of four bases (AATT):5'-G AATTC-3'
3'-CTTAA G-5'
5'-G AATTC-3'
3'-CTTAA G-5'
5'-CCC GGG-3'
3'-GGG CCC-5'
5'-CCC GGG-3'
3'-GGG CCC-5'
