Homeobox
Homeobox
Main page
2306092

Homeobox

logo
Community Hub0 subscribers
What are your thoughts?
Be the first to start a discussion here.
Be the first to start a discussion here.
Homeobox

A homeobox is a DNA sequence, around 180 base pairs long, that regulates large-scale anatomical features in the early stages of embryonic development. Mutations in a homeobox may change large-scale anatomical features of the full-grown organism.

Homeoboxes are found within genes that are involved in the regulation of patterns of anatomical development (morphogenesis) in animals, fungi, plants, and numerous single cell eukaryotes. Homeobox genes encode homeodomain protein products that are transcription factors sharing a characteristic protein fold structure that binds DNA to regulate expression of target genes. Homeodomain proteins regulate gene expression and cell differentiation during early embryonic development, thus mutations in homeobox genes can cause developmental disorders.

Homeosis is a term coined by William Bateson to describe the outright replacement of a discrete body part with another body part, e.g. antennapedia—replacement of the antenna on the head of a fruit fly with legs. The "homeo-" prefix in the words "homeobox" and "homeodomain" stems from this mutational phenotype, which is observed when some of these genes are mutated in animals. The homeobox domain was first identified in a number of Drosophila homeotic and segmentation proteins, but is now known to be well-conserved in many other animals, including vertebrates.

The existence of homeobox genes was first discovered in Drosophila by isolating the gene responsible for a homeotic transformation where legs grow from the head instead of the expected antennae. Walter Gehring identified a gene called antennapedia that caused this homeotic phenotype. Analysis of antennapedia revealed that this gene contained a 180 base pair sequence that encoded a DNA binding domain, which William McGinnis termed the "homeobox". The existence of additional Drosophila genes containing the antennapedia homeobox sequence was independently reported by Ernst Hafen, Michael Levine, William McGinnis, and Walter Jakob Gehring of the University of Basel in Switzerland and Matthew P. Scott and Amy Weiner of Indiana University in Bloomington in 1984. Isolation of homologous genes by Edward de Robertis and William McGinnis revealed that numerous genes from a variety of species contained the homeobox. Subsequent phylogenetic studies detailing the evolutionary relationship between homeobox-containing genes showed that these genes are present in all bilaterian animals.

The characteristic homeodomain protein fold consists of a 60-amino acid long domain composed of three alpha helices. The following shows the consensus homeodomain (~60 amino acid chain):

Helix 2 and helix 3 form a so-called helix-turn-helix (HTH) structure, where the two alpha helices are connected by a short loop region. The N-terminal two helices of the homeodomain are antiparallel and the longer C-terminal helix is roughly perpendicular to the axes of the first two. It is this third helix that interacts directly with DNA via a number of hydrogen bonds and hydrophobic interactions, as well as indirect interactions via water molecules, which occur between specific side chains and the exposed bases within the major groove of the DNA.

Homeodomain proteins are found in eukaryotes. Through the HTH motif, they share limited sequence similarity and structural similarity to prokaryotic transcription factors, such as lambda phage proteins that alter the expression of genes in prokaryotes. The HTH motif shows some sequence similarity but a similar structure in a wide range of DNA-binding proteins (e.g., cro and repressor proteins, homeodomain proteins, etc.). One of the principal differences between HTH motifs in these different proteins arises from the stereochemical requirement for glycine in the turn which is needed to avoid steric interference of the beta-carbon with the main chain: for cro and repressor proteins the glycine appears to be mandatory, whereas for many of the homeotic and other DNA-binding proteins the requirement is relaxed.

Homeodomains can bind both specifically and nonspecifically to B-DNA with the C-terminal recognition helix aligning in the DNA's major groove and the unstructured peptide "tail" at the N-terminus aligning in the minor groove. The recognition helix and the inter-helix loops are rich in arginine and lysine residues, which form hydrogen bonds to the DNA backbone. Conserved hydrophobic residues in the center of the recognition helix aid in stabilizing the helix packing. Homeodomain proteins show a preference for the DNA sequence 5'-TAAT-3'; sequence-independent binding occurs with significantly lower affinity. The specificity of a single homeodomain protein is usually not enough to recognize specific target gene promoters, making cofactor binding an important mechanism for controlling binding sequence specificity and target gene expression. To achieve higher target specificity, homeodomain proteins form complexes with other transcription factors to recognize the promoter region of a specific target gene.

See all
User Avatar
No comments yet.