MLH1
MLH1
Main page
1577138

MLH1

logo
Community Hub0 subscribers
Read side by side
from Wikipedia
MLH1
Available structures
PDBOrtholog search: PDBe RCSB
Identifiers
AliasesMLH1, mutL homolog 1, COCA2, FCC2, HNPCC, HNPCC2, hMLH1
External IDsOMIM: 120436; MGI: 101938; HomoloGene: 208; GeneCards: MLH1; OMA:MLH1 - orthologs
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_026810
NM_001324522

RefSeq (protein)

NP_001311451
NP_081086

Location (UCSC)Chr 3: 36.99 – 37.05 MbChr 9: 111.06 – 111.1 Mb
PubMed search[3][4]
Wikidata
View/Edit HumanView/Edit Mouse

DNA mismatch repair protein Mlh1 or MutL protein homolog 1 is a protein that in humans is encoded by the MLH1 gene located on chromosome 3. The gene is commonly associated with hereditary nonpolyposis colorectal cancer. Orthologs of human MLH1 have also been studied in other organisms including mouse and the budding yeast Saccharomyces cerevisiae.

Function

[edit]

Variants in this gene can cause hereditary nonpolyposis colon cancer (Lynch syndrome). It is a human homolog of the E. coli DNA mismatch repair gene, mutL, which mediates protein-protein interactions during mismatch recognition, strand discrimination, and strand removal. Defects in MLH1 are associated with the microsatellite instability observed in hereditary nonpolyposis colon cancer. Alternatively spliced transcript variants encoding different isoforms have been described, but their full-length natures have not been determined.[5]

Role in DNA mismatch repair

[edit]

MLH1 protein is one component of a system of seven DNA mismatch repair proteins that work coordinately in sequential steps to initiate repair of DNA mismatches in humans.[6] Defects in mismatch repair, found in about 13% of colorectal cancers, are much more frequently due to deficiency of MLH1 than deficiencies of other DNA mismatch repair proteins.[7] The seven DNA mismatch repair proteins in humans are MLH1, MLH3, MSH2, MSH3, MSH6, PMS1 and PMS2.[6] In addition, there are Exo1-dependent and Exo1-independent DNA mismatch repair subpathways.[8]

DNA mismatches occur where one base is improperly paired with another base, or where there is a short addition or deletion in one strand of DNA that is not matched in the other strand. Mismatches commonly occur as a result of DNA replication errors or during genetic recombination. Recognizing those mismatches and repairing them is important for cells because failure to do so results in microsatellite instability] and an elevated spontaneous mutation rate (mutator phenotype). Among 20 cancers evaluated, microsatellite instable colon cancer (mismatch repair deficient) had the second highest frequency of mutations (after melanoma).

A heterodimer between MSH2 and MSH6 first recognizes the mismatch, although a heterodimer between MSH2 and MSH3 also can start the process. The formation of the MSH2-MSH6 heterodimer accommodates a second heterodimer of MLH1 and PMS2, although a heterodimer between MLH1 and either PMS3 or MLH3 can substitute for PMS2. This protein complex formed between the 2 sets of heterodimers enables initiation of repair of the mismatch defect.[6]

Other gene products involved in mismatch repair (subsequent to initiation by DNA mismatch repair genes) include DNA polymerase delta, PCNA, RPA, HMGB1, RFC and DNA ligase I, plus histone and chromatin modifying factors.[9][10]

Deficient expression in cancer

[edit]
Micrograph showing loss of staining for MLH1 in colorectal adenocarcinoma in keeping with DNA mismatch repair (left of image) and benign colorectal mucosa (right of image).
Cancers deficient in MLH1
Cancer type Frequency of deficiency in cancer Frequency of deficiency in adjacent field defect
Stomach 32%[11][12] 24–28%
Stomach (foveolar type tumors) 74%[13] 71%
Stomach in high-incidence Kashmir Valley 73%[14] 20%
Esophageal 73%[15] 27%
Head and neck squamous cell carcinoma (HNSCC) 31–33%[16][17] 20–25%
Non-small cell lung cancer (NSCLC) 69%[18] 72%
Colorectal 10%[7]

Epigenetic repression

[edit]

Only a minority of sporadic cancers with a DNA repair deficiency have a mutation in a DNA repair gene. However, a majority of sporadic cancers with a DNA repair deficiency do have one or more epigenetic alterations that reduce or silence DNA repair gene expression.[19] In the table above, the majority of deficiencies of MLH1 were due to methylation of the promoter region of the MLH1 gene. Another epigenetic mechanism reducing MLH1 expression is over-expression of miR-155.[20] MiR-155 targets MLH1 and MSH2 and an inverse correlation between the expression of miR-155 and the expression of MLH1 or MSH2 proteins was found in human colorectal cancer.[20]

Deficiency in field defects

[edit]

A field defect is an area or "field" of epithelium that has been preconditioned by epigenetic changes and/or mutations so as to predispose it towards development of cancer. As pointed out by Rubin, "The vast majority of studies in cancer research has been done on well-defined tumors in vivo, or on discrete neoplastic foci in vitro.[21] Yet there is evidence that more than 80% of the somatic mutations found in mutator phenotype human colorectal tumors occur before the onset of terminal clonal expansion."[22] Similarly, Vogelstein et al.[23] point out that more than half of somatic mutations identified in tumors occurred in a pre-neoplastic phase (in a field defect), during growth of apparently normal cells.

In the Table above, MLH1 deficiencies were noted in the field defects (histologically normal tissues) surrounding most of the cancers. If MLH1 is epigenetically reduced or silenced, it would not likely confer a selective advantage upon a stem cell. However, reduced or absent expression of MLH1 would cause increased rates of mutation, and one or more of the mutated genes may provide the cell with a selective advantage. The expression-deficient MLH1 gene could then be carried along as a selectively neutral or only slightly deleterious passenger (hitch-hiker) gene when the mutated stem cell generates an expanded clone. The continued presence of a clone with an epigenetically repressed MLH1 would continue to generate further mutations, some of which could produce a tumor.

Repression in coordination with other DNA repair genes

[edit]

In a cancer, multiple DNA repair genes are often found to be simultaneously repressed.[19] In one example, involving MLH1, Jiang et al.[24] conducted a study where they evaluated the mRNA expression of 27 DNA repair genes in 40 astrocytomas compared to normal brain tissues from non-astrocytoma individuals. Among the 27 DNA repair genes evaluated, 13 DNA repair genes, MLH1, MLH3, MGMT, NTHL1, OGG1, SMUG1, ERCC1, ERCC2, ERCC3, ERCC4, RAD50, XRCC4 and XRCC5 were all significantly down-regulated in all three grades (II, III and IV) of astrocytomas. The repression of these 13 genes in lower grade as well as in higher grade astrocytomas suggested that they may be important in early as well as in later stages of astrocytoma. In another example, Kitajima et al.[25] found that immunoreactivity for MLH1 and MGMT expression was closely correlated in 135 specimens of gastric cancer and loss of MLH1 and MGMTappeared to be synchronously accelerated during tumor progression.

Deficient expression of multiple DNA repair genes are often found in cancers,[19] and may contribute to the thousands of mutations usually found in cancers (see Mutation frequencies in cancers).

Meiosis

[edit]

In addition to its role in DNA mismatch repair, MLH1 protein is also involved in meiotic crossing over.[26] MLH1 forms a heterodimer with MLH3 that appears to be necessary for oocytes to progress through metaphase II of meiosis.[27] Female and male MLH1(-/-) mutant mice are infertile, and sterility is associated with a reduced level of chiasmata.[26][28] During spermatogenesis in MLH1(-/-) mutant mice chromosomes often separate prematurely and there is frequent arrest in the first division of meiosis.[26] In humans, a common variant of the MLH1 gene is associated with increased risk of sperm damage and male infertility.[29]

A current model of meiotic recombination, initiated by a double-strand break or gap, followed by pairing with an homologous chromosome and strand invasion to initiate the recombinational repair process. Repair of the gap can lead to crossover (CO) or non-crossover (NCO) of the flanking regions. CO recombination is thought to occur by the Double Holliday Junction (DHJ) model, illustrated on the right, above. NCO recombinants are thought to occur primarily by the Synthesis Dependent Strand Annealing (SDSA) model, illustrated on the left, above. Most recombination events appear to be the SDSA type.

MLH1 protein appears to localize to sites of crossing over in meiotic chromosomes.[26] Recombination during meiosis is often initiated by a DNA double-strand break (DSB) as illustrated in the accompanying diagram. During recombination, sections of DNA at the 5' ends of the break are cut away in a process called resection. In the strand invasion step that follows, an overhanging 3' end of the broken DNA molecule then "invades" the DNA of an homologous chromosome that is not broken forming a displacement loop (D-loop). After strand invasion, the further sequence of events may follow either of two main pathways leading to a crossover (CO) or a non-crossover (NCO) recombinant (see Genetic recombination). The pathway leading to a CO involves a double Holliday junction (DHJ) intermediate. Holliday junctions need to be resolved for CO recombination to be completed.

In the budding yeast Saccharomyces cerevisiae, as in the mouse, MLH1 forms a heterodimer with MLH3. Meiotic CO requires resolution of Holliday junctions through actions of the MLH1-MLH3 heterodimer. The MLH1-MLH3 heterodimer is an endonuclease that makes single-strand breaks in supercoiled double-stranded DNA.[30][31] MLH1-MLH3 binds specifically to Holliday junctions and may act as part of a larger complex to process Holliday junctions during meiosis.[30] MLH1-MLH3 heterodimer (MutL gamma) together with EXO1 and Sgs1 (ortholog of Bloom syndrome helicase) define a joint molecule resolution pathway that produces the majority of crossovers in budding yeast and, by inference, in mammals.[32]

Clinical significance

[edit]

It can also be associated with Turcot syndrome.[33]

Interactions

[edit]

MLH1 has been shown to interact with:

See also

[edit]

References

[edit]

Further reading

[edit]
[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
MLH1, also known as mutL homolog 1, is a gene located on the short arm of chromosome 3 at position 3p22.2 that encodes a key protein in the DNA mismatch repair (MMR) pathway.[1] This protein, MLH1, functions as a scaffold in the MMR system by forming heterodimers, primarily MutLα with PMS2, to detect and correct base-base mismatches and small insertion/deletion loops resulting from DNA polymerase errors during replication.[2] Germline mutations in MLH1 disrupt this repair mechanism, leading to genomic instability characterized by microsatellite instability (MSI), and are responsible for approximately 50% of cases of Lynch syndrome, an autosomal dominant hereditary cancer predisposition syndrome.[3][4] The MLH1 protein's N-terminal ATPase domain facilitates interactions with MutS complexes, such as MSH2-MSH6 (MutSα) for base mismatches or MSH2-MSH3 (MutSβ) for insertion/deletion loops, enabling the recruitment of downstream effectors like the exonuclease EXO1 and replication factor PCNA to excise and resynthesize the faulty DNA strand.[2] Beyond its core role in post-replicative repair, MLH1 contributes to meiotic recombination through heterodimerization with MLH3 to form MutLγ, which resolves crossovers during gametogenesis, and participates in DNA damage response pathways by activating cell cycle checkpoints in response to genotoxic stress.[1] A conserved motif in the intrinsically disordered linker region of MLH1 is vital for its structural integrity and functional dimerization, highlighting the precision required for its catalytic activity.[5] Pathogenic variants in MLH1, including missense mutations, frameshifts, and large deletions, underlie Lynch syndrome, conferring lifetime risks of up to 70% for colorectal cancer, 40-60% for endometrial cancer, and elevated risks for ovarian, gastric, and other extracolonic malignancies.[6] These mutations were first identified as homologs to the bacterial mutL gene in families with hereditary nonpolyposis colorectal cancer (HNPCC), now synonymous with Lynch syndrome, accounting for 2-4% of all colorectal cancers.[4] In sporadic cancers, biallelic inactivation of MLH1 via somatic mutations or promoter hypermethylation similarly drives MSI-high tumors, which exhibit distinct clinical behaviors and responses to therapies like immune checkpoint inhibitors.[7]

Gene and Protein Overview

Genomic Location and Structure

The MLH1 gene was cloned and identified in 1994 by two independent research groups through sequence homology searches to the Escherichia coli MutL gene, a key component of bacterial DNA mismatch repair, with mutations in MLH1 linked to hereditary nonpolyposis colorectal cancer families.[8][9] This discovery built on prior mapping efforts that localized the gene to the short arm of chromosome 3.[10] The human MLH1 gene is situated at chromosomal position 3p22.2, spanning approximately 57 kb on the forward strand from genomic coordinates 36,993,466 to 37,050,846 (GRCh38 assembly).[1][11] It comprises 19 exons, with the primary protein-coding transcript (NM_000249) utilizing all 19 exons to produce a 2,494 bp mRNA.[10][12] The exon-intron organization was detailed in early genomic studies, revealing a compact structure where exons 1–7 encode a highly conserved N-terminal region homologous to MutL, encompassing the ATPase domain essential for the protein's function.[13] Later exons, particularly 15–19 in the C-terminal region, contribute to interaction motifs, including those supporting endonuclease activity within the mismatch repair complex.[10] The promoter region of MLH1 features a CpG island approximately 750 bp upstream of the transcription start site, which is prone to aberrant hypermethylation, leading to transcriptional silencing in certain pathologies.[14] This epigenetic feature is conserved and has been implicated in somatic inactivation mechanisms. Overall, the gene's architecture reflects strong evolutionary conservation from prokaryotic MutL homologs, with key functional domains distributed across specific exons to support DNA repair processes.[13]

Protein Structure and Isoforms

The human MLH1 protein is a 756-amino-acid polypeptide with a calculated molecular mass of approximately 85 kDa.[15] It consists of two major structural domains: an N-terminal domain (residues 7–315) harboring the ATPase activity essential for its function, and a C-terminal domain (residues approximately 316–756) that includes motifs for protein dimerization and interactions in DNA repair processes.[16] The N-terminal ATPase domain features a conserved nucleotide-binding site that facilitates ATP hydrolysis, while the C-terminal region contains helical motifs and interaction interfaces, such as the dimerization domain for heterodimer formation with PMS2, and sites that contribute to the overall mismatch repair complex assembly.[16] Additionally, the protein includes a disordered linker region connecting the N- and C-terminal domains, which harbors regulatory motifs influencing structural flexibility and partner binding.[17] MLH1 exhibits alternative splicing, resulting in multiple transcript variants and protein isoforms. According to UniProt, there are at least three validated isoforms, with isoform 1 (756 amino acids) serving as the canonical full-length form; other isoforms arise from exon skipping or alternative splice sites, such as variants lacking portions of exons 6, 9, 10, or 11, potentially altering the ATPase domain or linker region.[18] NCBI data indicate up to five or more protein-coding transcript variants (e.g., variants 14–16 encoding isoform 5, a shorter form), though functional differences, such as impacts on ATPase activity or stability, remain largely unclear and are noted primarily in database annotations without extensive experimental validation.[1] Post-translational modifications, particularly phosphorylation, play a key role in regulating MLH1 activity and localization. Multiple serine residues are phosphorylated, including S87 in the ATPase domain, which inhibits DNA binding when modified, and sites in the linker region such as S446, S456, and S477, which modulate MutLα complex formation and mismatch repair efficiency.[19] These modifications, often mediated by kinases like ATM/ATR in response to DNA damage, enhance protein stability or alter interactions, with phosphorylation at S477 being dominant for MMR pathway activation.[20]

Biological Roles

DNA Mismatch Repair Pathway

MLH1 functions as a core component of the post-replicative DNA mismatch repair (MMR) pathway in human cells, where it corrects base-base mismatches and small insertion-deletion loops that arise during DNA replication to preserve genomic integrity.[21] MLH1 forms the MutLα heterodimer with PMS2, a complex essential for downstream repair events following initial mismatch detection.[22] This heterodimer is recruited to pre-formed mismatch sites bound by the MutSα heterodimer (MSH2-MSH6), which recognizes replication errors with high specificity.[23] The interaction between MutSα and MutLα is ATP-dependent, enabling the assembly of a repair-competent complex at the lesion.[24] The MMR process proceeds through a coordinated series of steps initiated by MutSα binding to the mismatch, which undergoes a conformational change upon ATP binding to form a sliding clamp that translocates along the DNA.[25] MutLα is then recruited to this site, where the ATPase activity of MLH1 hydrolyzes ATP to activate the latent endonuclease domain in PMS2, introducing strand-specific nicks predominantly in the discontinuous daughter strand.[26] This incision is stimulated by the presence of PCNA (proliferating cell nuclear antigen), loaded onto the DNA by RFC (replication factor C), which directs the repair machinery and ensures strand discrimination.[22] Following nicking, EXO1 (exonuclease 1) excises the error-containing oligonucleotide tract from the nick toward the mismatch in a 5' to 3' direction, creating a repair gap. The gap is then filled by DNA polymerase δ in a PCNA- and RFC-dependent manner, with RPA (replication protein A) stabilizing the single-stranded DNA, and the process is completed by DNA ligase I sealing the nick. Defects in MLH1 disrupt this pathway, leading to unrepaired mismatches and a characteristic microsatellite instability (MSI-high) phenotype due to the accumulation of errors in repetitive DNA sequences.[27] Such deficiencies also promote overall hypermutation by failing to correct replication errors, thereby elevating the mutation rate across the genome.[28]

Meiotic Recombination

MLH1 plays a critical role in meiotic recombination by forming the MutLγ heterodimer with MLH3, which functions as an endonuclease to resolve double Holliday junctions (dHJs) into crossovers during the pachytene stage of prophase I.[29] This heterodimer, conserved from yeast to mammals, specifically promotes class III crossovers that exhibit positive interference, ensuring proper chromosome pairing and segregation.[30] The endonuclease activity of MutLγ is activated in a mismatch repair-independent manner, preferentially nicking branched DNA structures like dHJs to generate crossover products, which is essential for chiasma formation.[29] During meiosis, MLH1 localizes to late recombination nodules on the synaptonemal complex in pachytene spermatocytes and oocytes, marking sites of crossover resolution. In human fetal oocytes, MLH1 foci appear at late zygotene and persist through pachytene, with an average of approximately 70 foci per nucleus, corresponding to crossover sites distributed along chromosome arms.[31] Similarly, in mouse spermatocytes, MLH1 expression peaks in pachytene cells, where it colocalizes with MLH3 at these nodules, facilitating Holliday junction resolution within the synaptonemal complex.[32] This localization underscores MLH1's distinct meiotic function, separate from its role in general DNA repair. Studies in MLH1 knockout mice demonstrate the protein's necessity for fertility and genomic stability in germ cells. Homozygous Mlh1-deficient mice exhibit complete infertility in both sexes due to meiotic arrest, with spermatocytes showing prematurely separated chromosomes and failure to progress beyond metaphase I.[33] These mutants display a profound reduction in chiasmata—often 10- to 100-fold lower than wild-type—leading to unstable bivalents and increased aneuploidy, particularly in oocytes where chromosome missegregation results in gamete abnormalities.[34] In females, this manifests as elevated oocyte aneuploidy rates, highlighting MLH1's role in ensuring crossover-dependent chromosome segregation.[35]

DNA Damage Response

Beyond its roles in mismatch repair and meiotic recombination, MLH1 participates in DNA damage response (DDR) pathways, contributing to the activation of cell cycle checkpoints in response to genotoxic stress such as interstrand crosslinks and ionizing radiation.[36] Specifically, MLH1 is involved in signaling the G2/M checkpoint, where its deficiency impairs arrest following DNA damage, leading to genomic instability.[37] MLH1 interacts with proteins like ATM to regulate checkpoint activation, independent of its MMR function, and promotes apoptosis in response to certain alkylating agents via PARP-dependent pathways.[38] This DDR role underscores MLH1's broader function in maintaining cellular responses to exogenous and endogenous DNA lesions.

Regulation and Expression

Tissue-Specific Expression Patterns

MLH1 demonstrates a pattern of expression that is elevated in tissues characterized by high cellular proliferation, including the gastrointestinal tract, reproductive organs, and certain neural structures. Data from the Genotype-Tissue Expression (GTEx) project indicate median mRNA levels of approximately 60 transcripts per million (TPM) in the small intestine (terminal ileum) and testis, 40 TPM in the colon (sigmoid and transverse) and ovary, and 20–40 TPM across various brain regions such as the amygdala and cortex.[39] These levels are notably lower in non-proliferative tissues, such as skeletal muscle (around 10 TPM) and adipose tissue (5–10 TPM), underscoring MLH1's association with active DNA replication and repair demands.[39] Protein expression aligns with these mRNA patterns, showing general nuclear localization in the nucleoplasm across multiple tissues, with enhanced staining intensity in epithelial cells of the colon, small intestine, and testis. The Human Protein Atlas reports low overall tissue specificity (Tau score ≈ 0.1–0.2), but consistent elevated expression in proliferative epithelia and germ cells, consistent with MLH1's role in mismatch repair during rapid cell division.[40] In the brain, moderate protein levels are observed in neuronal and glial cells, potentially supporting genomic stability in post-mitotic environments.[40] Developmentally, MLH1 is detectable in human fetal tissues from 10–20 weeks gestation across organs like the intestine, kidney, and lung, though quantitative comparisons suggest baseline expression that ramps up in adult proliferative contexts to meet heightened repair needs.[1] The canonical transcript isoform 1 (NM_000249) predominates in most tissues, including the colon, while shorter isoforms (e.g., NM_001167617) contribute to expression in the testis, potentially fine-tuning meiotic functions.[1] This isoform distribution supports MLH1's dual roles in somatic repair and germline recombination.

Epigenetic and Transcriptional Regulation

The expression of the MLH1 gene is tightly regulated at both epigenetic and transcriptional levels to ensure proper DNA mismatch repair function. A primary mechanism of epigenetic control involves methylation of the CpG island within the MLH1 promoter region, which spans approximately 1781 base pairs and includes a 1128-base-pair CpG island containing 93 CpG sites. This methylation acts as a potent repressor of MLH1 transcription by recruiting methyl-CpG-binding proteins and histone deacetylases, leading to chromatin condensation and gene silencing. In normal physiological contexts, low-level methylation maintains basal expression, but hypermethylation can significantly attenuate MLH1 output, highlighting its role as a reversible epigenetic switch.[41][42] Transcriptional regulation of MLH1 involves its TATA-less and GC-rich promoter, characteristic of genes involved in constitutive DNA repair processes.[43] Additionally, microRNAs such as miR-155 contribute to fine-tuning MLH1 expression by post-transcriptionally downregulating its mRNA, thereby modulating mismatch repair efficiency.[44] MLH1 expression is coordinated with other mismatch repair (MMR) genes, particularly MSH2, through shared regulatory pathways that ensure stoichiometric balance in repair complexes. Studies have demonstrated a direct correlation between MLH1 and MSH2 transcript levels, suggesting common transcriptional controls, such as hypoxia-induced co-repression via shifts in Myc/Max complexes independent of HIF-1. This coordination is evident in normal tissues, where synchronized expression prevents imbalances that could impair MMR fidelity.[45][46] Constitutional epimutations represent a rare form of heritable MLH1 silencing, characterized by monoallelic promoter hypermethylation in the absence of underlying sequence alterations, leading to reduced gene expression across normal tissues. Unlike fixed genetic mutations, these epimutations arise de novo as focal epigenetic events at the EPM2AIP1-MLH1 locus and exhibit reversibility, often failing to transmit stably across generations due to meiotic erasure or dilution during gametogenesis. This reversibility distinguishes them from somatic epimutations and underscores the dynamic nature of epigenetic inheritance in MMR gene regulation.[47][48][49]

Pathological Implications

Somatic and Germline Mutations in Cancer

Germline mutations in the MLH1 gene are a primary cause of Lynch syndrome, also known as hereditary nonpolyposis colorectal cancer (HNPCC), accounting for approximately 40-50% of all cases. Over 2,000 distinct pathogenic variants have been cataloged in databases such as ClinVar, including frameshift, nonsense, splice-site, and missense mutations that disrupt protein function and impair DNA mismatch repair (MMR).[50] These inherited alterations lead to a high lifetime risk of colorectal cancer, with Lynch syndrome responsible for 3-5% of all such cases worldwide.[51] In affected individuals, colorectal tumors often exhibit microsatellite instability (MSI) due to MMR deficiency, though the syndrome's full penetrance varies by variant type and modifier factors.[52] Somatic mutations in MLH1 predominantly involve epigenetic silencing through promoter hypermethylation, observed in roughly 15% of sporadic colorectal cancers and 15-20% of sporadic endometrial cancers.[53][54] This hypermethylation inactivates MLH1 expression, resulting in MMR-deficient, MSI-high tumors that drive tumorigenesis via accumulated mutations in oncogenes and tumor suppressors. In contrast to germline cases, somatic alterations are acquired and tumor-specific, often arising in the context of serrated neoplasia pathways.[55] The cancer spectrum associated with MLH1 alterations is broad, with colorectal cancer being the most prevalent in Lynch syndrome, comprising the majority of initial diagnoses (lifetime risk 40-80%), followed by endometrial (30-60%), ovarian (4-15%), and gastric (up to 13%) cancers.[56] Mutations in MLH1 and the related MSH2 gene underlie nearly 90% of Lynch syndrome cases overall.[57] Recent studies have also linked MLH1 inactivation to rare MSI-high lung cancers, particularly those tied to heavy smoking exposure and high tumor mutational burden.[58] In sporadic MLH1-silenced tumors, epigenetic repression frequently co-occurs with the BRAF V600E mutation, which helps differentiate these from hereditary cases and is absent in Lynch syndrome.[59] Additionally, overexpression of microRNA-155 (miR-155) contributes to MLH1 downregulation by directly targeting its 3' untranslated region, promoting MSI and genomic instability in colorectal cancers.[60] This miRNA-mediated mechanism highlights a post-transcriptional layer of regulation in sporadic tumorigenesis.

Deficiency in Field Defects and Premalignancy

Field defects refer to regions of histologically normal or premalignant tissue exhibiting molecular alterations in DNA mismatch repair (MMR) genes, such as MLH1, which predispose to neoplastic progression without overt morphological changes. In colorectal adenomas, particularly serrated polyps, MLH1 promoter hypermethylation and protein loss occur in 28-72% of cases, contributing to field cancerization by creating a mutagenic environment that facilitates adenoma-to-carcinoma transition. Similarly, in gastric mucosa affected by atrophic gastritis, often linked to Helicobacter pylori infection, MLH1 expression loss is observed in association with chronic inflammation, though sporadic prevalence varies with disease severity. These defects highlight MLH1's role in early, widespread genomic instability beyond focal lesions. Clonal expansion of MLH1-deficient cells from normal epithelium arises primarily through age-related or inflammation-driven promoter hypermethylation, leading to biallelic silencing and propagation across tissue fields. This methylation spreads from promoter regions into adjacent CpG islands in normal colonic mucosa, detectable as early as in aberrant crypt foci, the putative precursors to adenomas. Detection relies on immunohistochemistry (IHC) showing absent MLH1 nuclear staining in epithelial cells and loss of heterozygosity (LOH) analysis confirming somatic second hits in methylated alleles, allowing identification in non-neoplastic mucosa adjacent to adenomas. Such clonal propagation underscores the transition from epigenetic silencing in scattered normal cells to dominant fields of deficiency. In premalignant contexts, MLH1 loss elevates microsatellite instability (MSI) within histologically normal colonic fields, promoting accumulation of insertion/deletion mutations that drive progression to dysplasia. Mouse models demonstrate that even partial Mlh1 reduction in normal mucosa induces MSI and chromosomal instability, mirroring human field defects where MSI signatures appear prior to adenoma formation. This process coordinates with methylation of other repair genes like MGMT, where concurrent silencing in adjacent normal mucosa and polyps amplifies G:C to A:T transition mutations, enhancing premalignant potential in 7-49% of colorectal polyps depending on histology. Epigenetic regulation, such as histone modifications, facilitates this coordinated silencing but is detailed elsewhere.

Clinical Applications

Diagnostic Testing for Deficiency

Diagnostic testing for MLH1 deficiency is essential for identifying individuals at risk for Lynch syndrome and guiding clinical management in colorectal and endometrial cancers. The primary screening methods include immunohistochemistry (IHC) and microsatellite instability (MSI) testing on tumor tissue, followed by targeted assays for promoter methylation and germline sequencing when indicated. These approaches allow for the differentiation between germline (hereditary) and somatic (sporadic) causes of deficiency, with universal screening recommended for all patients with relevant tumors.[61] Immunohistochemistry (IHC) serves as a cornerstone for initial detection, assessing the expression of MLH1 protein in tumor cell nuclei compared to adjacent normal tissue. Loss of nuclear MLH1 staining, often accompanied by loss of PMS2 (its heterodimer partner), indicates mismatch repair (MMR) deficiency with a sensitivity of approximately 90-92% for identifying MLH1-associated Lynch syndrome cases. This method is cost-effective, requires minimal tissue, and is widely available, making it the preferred first-line screen per international guidelines. However, discordant results can occur due to variants of uncertain significance or retained expression in some pathogenic cases, necessitating confirmatory testing.[62][61] Microsatellite instability (MSI) testing complements IHC by evaluating tumor DNA for instability at mononucleotide markers such as BAT-25, BAT-26, NR-21, NR-24, and MONO-27 using polymerase chain reaction (PCR). Tumors classified as MSI-high exhibit instability in ≥30% of markers, strongly suggesting MLH1 deficiency with a sensitivity of about 93% and high reproducibility. MSI-high status prompts further evaluation to rule out sporadic causes, particularly in colorectal cancers where it occurs in 15% of cases. This PCR-based assay is particularly useful when IHC yields equivocal results or tissue is limited.[63][61] For tumors showing MLH1 loss on IHC, promoter methylation assays distinguish epigenetic silencing from germline mutations. Techniques such as bisulfite sequencing or methylation-specific multiplex ligation-dependent probe amplification (MS-MLPA) detect hypermethylation of the MLH1 promoter region, which accounts for 10-15% of sporadic MSI-high colorectal cancers. Positive methylation typically indicates a somatic event, obviating the need for germline testing, while negative results warrant proceeding to sequencing.[61] Germline sequencing confirms Lynch syndrome in cases without somatic explanations, using next-generation sequencing (NGS) panels that analyze MLH1 for pathogenic variants, including point mutations, insertions/deletions, and large rearrangements (detected via deletion/duplication analysis). The 2025 Canadian consensus guidelines endorse reflex testing algorithms: universal MMR screening via IHC/MSI for colorectal and endometrial cancers, followed by BRAF V600E mutation or MLH1 promoter methylation analysis for MLH1-deficient tumors, and germline NGS panels if these are negative to identify hereditary cases. This cascade approach enhances efficiency and identifies at-risk family members for cascade testing.[61][64]

Therapeutic Targeting and Recent Advances

Therapeutic strategies targeting MLH1 deficiency primarily exploit the resulting microsatellite instability-high (MSI-H) or mismatch repair-deficient (dMMR) phenotype, which generates a high tumor mutational burden and neoantigen load that enhances immunogenicity.[65] PD-1 inhibitors, such as pembrolizumab, have demonstrated significant efficacy in MSI-H/dMMR tumors across various solid cancers, including those associated with Lynch syndrome, by promoting durable responses through T-cell activation against neoantigens.[66] In advanced endometrial cancer, pembrolizumab plus chemotherapy as first-line therapy yielded longer progression-free survival compared to chemotherapy alone in MSI-H/dMMR cases.[67] Gene-specific outcomes in Lynch syndrome patients treated with immune checkpoint blockade reveal variations; for instance, Lynch syndrome patients, including MLH1 germline carriers, exhibit high complete response rates (up to 35% overall) to PD-1 inhibitors, with MLH1 carriers showing higher rates than PMS2 carriers, though overall survival may differ from PMS2 carriers, with non-PMS2 groups (including MLH1) showing prolonged median overall survival.[68] MLH1-deficient tumors often display resistance to 5-fluorouracil (5-FU)-based chemotherapy due to impaired incorporation of the drug's metabolites into DNA, limiting its cytotoxic effects in dMMR cells.[69] Conversely, these tumors show increased sensitivity to irinotecan and oxaliplatin, as preclinical and clinical data indicate that dMMR status enhances platinum-induced DNA damage and topoisomerase inhibition without the protective mismatch repair mechanism.[70] Adjuvant oxaliplatin-based regimens have provided survival benefits in stage III MSI-H colon cancer, unlike 5-FU alone.[71] For cases involving MLH1 promoter hypermethylation, epigenetic therapies like the DNA methyltransferase inhibitor decitabine aim to restore gene expression by demethylating the promoter region, potentially reactivating mismatch repair function.[72] Preclinical studies demonstrate that decitabine induces transient MLH1 re-expression in methylated colorectal cancer cells through promoter demethylation and nucleosome remodeling, though resilencing occurs post-treatment.[72] Clinical exploration in colon cancer trials has shown decitabine's ability to modulate epigenetic marks, supporting its potential in combination regimens for methylated dMMR tumors.[73] Recent advances from 2024-2025 highlight neoadjuvant PD-1 blockade as a transformative approach for early-stage dMMR solid tumors, achieving pathological complete responses that eliminate the need for surgery in up to 100% of select cases across tumor types, regardless of location.[74] This strategy facilitates nonoperative management, often integrating radiation or chemotherapy, with sustained organ preservation observed in rectal and nonrectal dMMR cancers.[75] Precision therapies for constitutional MLH1 epimutations emphasize enhanced genetic testing protocols to identify these rare heritable events early, enabling tailored surveillance and potential preventive interventions akin to Lynch syndrome management.[76]

Protein Interactions

Key Binding Partners in Repair Complexes

MLH1 primarily functions as a scaffold protein in DNA mismatch repair (MMR) by forming obligate heterodimers with other MutL homologs, which are essential for the assembly of repair complexes. In somatic cells, MLH1 heterodimerizes with PMS2 to form the MutLα complex, which is recruited to mismatch sites to coordinate downstream excision and resynthesis.[77] This 1:1 heterodimer relies on the C-terminal domain (CTD) of MLH1 for stable dimerization with the corresponding CTD of PMS2, enabling the ATPase activity primarily associated with the N-terminal domains of both proteins to activate the endonuclease function on PMS2.[78] In meiotic cells, MLH1 instead pairs with MLH3 to form the MutLγ complex, which plays a specialized role in resolving recombination intermediates, again through 1:1 stoichiometry and C-terminal dimerization motifs that facilitate ATPase-dependent activation.[79] These heterodimers represent the core structural units of MLH1 in repair machinery, with the ATPase domains hydrolyzing ATP to propagate conformational changes necessary for complex assembly; recent studies confirm transient higher-order assemblies at repair sites in vivo.[80][79] Beyond dimerization, MLH1 within these complexes interacts with upstream recognition factors and downstream effectors to form transient, multiprotein assemblies during repair. The MutSα heterodimer (MSH2-MSH6) directly binds MLH1 in the MutLα complex, recruiting it to DNA mismatches via interactions involving the N-terminal ATPase domains of MLH1 and facilitating the transition from mismatch recognition to incision.[77] Similarly, proliferating cell nuclear antigen (PCNA), the sliding clamp on DNA, associates with MLH1-PMS2 through a conserved QIGLTDF motif in the C-terminal region of MLH1, stabilizing the complex at replication forks and enhancing processivity in the Exo1-independent MMR pathway.[81] For excision, exonuclease 1 (EXO1) binds MLH1 via an MLH1-interaction peptide (MIP) motif on EXO1 and a complementary patch on MLH1 around residue I403, forming part of the MutLα-EXO1 subcomplex that executes strand-specific degradation.[79] These interactions are transient and ATP-dependent, assembling stoichiometrically as 1:1 or higher-order complexes only at active repair sites before dissociating post-resolution.[82]

Interactions with Non-Repair Proteins

MLH1 engages in interactions beyond its core role in mismatch repair (MMR), influencing processes such as replication fork stability, transcriptional control of proliferation, and cell cycle checkpoint activation. These auxiliary partnerships highlight MLH1's broader contributions to genomic integrity and cellular homeostasis. MLH1 directly interacts with the Bloom syndrome helicase (BLM), a RecQ family member involved in resolving DNA structures during replication and recombination. This association functions independently of conventional MMR, supporting replication fidelity by facilitating the processing of stalled replication forks and preventing their collapse into double-strand breaks, particularly in MMR-deficient contexts where fork breakage is elevated. In cells lacking functional MMR, the MLH1-BLM interaction aids in fork restart and reduces chromosomal instability, underscoring its protective role against replication stress-induced aberrations.[83] MLH1 also binds to the proto-oncoprotein c-MYC, linking DNA repair pathways to the regulation of cell proliferation. This interaction occurs at the C-terminal region of c-MYC and modulates transcriptional activity, potentially integrating mismatch detection with growth control mechanisms to coordinate repair with proliferative signals. By associating with c-MYC's heterodimeric partner MAX through distinct MMR components, MLH1 may fine-tune oncogene-driven transcription, thereby coupling genomic surveillance to cellular expansion and preventing unchecked proliferation in response to DNA damage.[84] Furthermore, MLH1 interacts with the ataxia-telangiectasia mutated (ATM) kinase, contributing to the activation of the G1 cell cycle checkpoint following mismatch-induced damage. This binding enables ATM to phosphorylate downstream targets like CHK2, enforcing G1 arrest to allow repair before S-phase entry, a process distinct from MMR's excision-based correction. In scenarios of replication stress or alkylation damage, the MLH1-ATM partnership enhances signaling through MAPK pathways, promoting apoptosis or senescence if unrepaired mismatches persist.[85][86]

References

User Avatar
No comments yet.