Hubbry Logo
search
logo
1381824

Protein phosphorylation

logo
Community Hub0 Subscribers
Read side by side
from Wikipedia
Model of a phosphorylated serine residue
Serine in an amino acid chain, before and after phosphorylation.

Protein phosphorylation is a reversible post-translational modification of proteins in which an amino acid residue is phosphorylated by a protein kinase by the addition of a covalently bound phosphate group. Phosphorylation alters the structural conformation of a protein, causing it to become activated, deactivated, or otherwise modifying its function.[1] Approximately 13,000 human proteins have sites that are phosphorylated.[2]

The reverse reaction of phosphorylation is called dephosphorylation, and is catalyzed by protein phosphatases. Protein kinases and phosphatases work independently and in a balance to regulate the function of proteins.[3]

The amino acids most commonly phosphorylated are serine, threonine, tyrosine, and histidine.[4][5] These phosphorylations play important and well-characterized roles in signaling pathways and metabolism. However, other amino acids can also be phosphorylated post-translationally, including arginine, lysine, aspartic acid, glutamic acid and cysteine, and these phosphorylated amino acids have been identified to be present in human cell extracts and fixed human cells using a combination of antibody-based analysis (for pHis) and mass spectrometry (for all other amino acids).[5][6][7][8]

Protein phosphorylation was first reported in 1906 by Phoebus Levene at the Rockefeller Institute for Medical Research with the discovery of phosphorylated vitellin.[9] However, it was nearly 50 years until the enzymatic phosphorylation of proteins by protein kinases was discovered.[10]

History

[edit]

In 1906, Phoebus Levene at the Rockefeller Institute for Medical Research identified phosphate in the protein vitellin (phosvitin)[9] and by 1933 had detected phosphoserine in casein, with Fritz Lipmann.[11] However, it took another 20 years before Eugene P. Kennedy described the first "enzymatic phosphorylation of proteins".[10] The first phosphorylase enzyme was discovered by Carl and Gerty Cori in the late 1930s. Carl and Gerty Cori found two forms of glycogen phosphorylase which they named A and B but did not correctly understand the mechanism of the B form to A form conversion. The interconversion of phosphorylase b to phosphorylase a was later described by Edmond Fischer and Edwin Krebs, as well as, Wosilait and Sutherland, involving a phosphorylation/dephosphorylation mechanism.[12] It was found that an enzyme, named phosphorylase kinase and Mg-ATP were required to phosphorylate glycogen phosphorylase by assisting in the transfer of the γ-phosphoryl group of ATP to a serine residue on phosphorylase b. Protein phosphatase 1 is able to catalyze the dephosphorylation of phosphorylated enzymes by removing the phosphate group. Earl Sutherland explained in 1950, that the activity of phosphorylase was increased and thus glycogenolysis stimulated when liver slices were incubated with adrenalin and glucagon. Phosphorylation was considered a specific control mechanism for one metabolic pathway until the 1970s, when Lester Reed discovered that mitochondrial pyruvate dehydrogenase complex was inactivated by phosphorylation. Also in the 1970s, the term multisite phosphorylation was coined in response to the discovery of proteins that are phosphorylated on two or more residues by two or more kinases. In 1975, it was shown that cAMP-dependent proteins kinases phosphorylate serine residues on specific amino acid sequence motifs. Ray Erikson discovered that v-Src was a kinase and Tony Hunter found that v-Src phosphorylated tyrosine residues on proteins in the 1970s.[13] In the early 1980, the amino-acid sequence of the first protein kinase was determined which helped geneticists understand the functions of regulatory genes. In the late 1980s and early 1990s, the first protein tyrosine phosphatase (PTP1B) was purified and the discovery, as well as, cloning of JAK kinases was accomplished which led to many in the scientific community to name the 1990s as the decade of protein kinase cascades.[14][15] Edmond Fischer and Edwin Krebs were awarded the Nobel prize in 1992 "for their discoveries concerning reversible protein phosphorylation as a biological regulatory mechanism".[16]

Abundance

[edit]

Reversible phosphorylation of proteins is abundant in both prokaryotic and even more so in eukaryotic organisms.[17][18][19][20] For instance, in bacteria 5–10% of all proteins are thought to be phosphorylated.[21][22] By contrast, it is estimated that one third of all human proteins is phosphorylated at any point in time, with 230,000, 156,000, and 40,000 unique phosphorylation sites existing in human, mouse, and yeast, respectively.[2] In yeast, about 120 kinases (out of ~6,000 proteins total) cause 8,814 known regulated phosphorylation events, generating about 3,600 phosphoproteins (about 60% of all yeast proteins).[23][24] Hence, phosphorylation is a universal regulatory mechanism that affects a large portion of proteins. Even if a protein is not phosphorylated itself, its interactions with other proteins may be regulated by phosphorylation of these interacting proteins.

Mechanisms and functions of phosphorylation

[edit]

Phosphorylation introduces a charged and hydrophilic group in the side chain of amino acids, possibly changing a protein's structure by altering interactions with nearby amino acids. Some proteins such as p53 contain multiple phosphorylation sites, facilitating complex, multi-level regulation. Because of the ease with which proteins can be phosphorylated and dephosphorylated, this type of modification is a flexible mechanism for cells to respond to external signals and environmental conditions.[25]

Kinases phosphorylate proteins and phosphatases dephosphorylate proteins. Many enzymes and receptors are switched "on" or "off" by phosphorylation and dephosphorylation. Reversible phosphorylation results in a conformational change in the structure in many enzymes and receptors, causing them to become activated or deactivated. Phosphorylation usually occurs on serine, threonine, tyrosine and histidine residues in eukaryotic proteins. Histidine phosphorylation of eukaryotic proteins appears to be much more frequent than tyrosine phosphorylation.[26] In prokaryotic proteins phosphorylation occurs on the serine, threonine, tyrosine, histidine, arginine or lysine residues.[17][18][26][27] The addition of a phosphate (PO43-) molecule to a non-polar R group of an amino acid residue can turn a hydrophobic portion of a protein into a polar and extremely hydrophilic portion of a molecule. In this way protein dynamics can induce a conformational change in the structure of the protein via long-range allostery with other hydrophobic and hydrophilic residues in the protein.

One such example of the regulatory role that phosphorylation plays is the p53 tumor suppressor protein. The p53 protein is heavily regulated[28] and contains more than 18 different phosphorylation sites. Activation of p53 can lead to cell cycle arrest, which can be reversed under some circumstances, or apoptotic cell death.[29] This activity occurs only in situations wherein the cell is damaged or physiology is disturbed in normal healthy individuals.

Upon the deactivating signal, the protein becomes dephosphorylated again and stops working.[30][citation needed] This is the mechanism in many forms of signal transduction, for example the way in which incoming light is processed in the light-sensitive cells of the retina.

Regulatory roles of phosphorylation include:

  • Biological thermodynamics of energy-requiring reactions
  • Mediates enzyme inhibition
    • Phosphorylation of the enzyme GSK-3 by AKT (Protein kinase B) as part of the insulin signaling pathway.[31]
    • Phosphorylation of src (pronounced "sarc") tyrosine kinase by C-terminal Src kinase (Csk) induces a conformational change in the enzyme, resulting in a fold in the structure, which masks its kinase domain, and is thus shut "off".[32]

Membrane transport

[edit]

Protein degradation

[edit]

Enzyme regulation (activation and inhibition)

[edit]
  • The first example of protein regulation by phosphorylation to be discovered was glycogen phosphorylase. Nobel laureates Edmond H. Fischer and Edwin G. Krebs described how phosphorylation of glycogen phosphorylase b converted it to the active glycogen phosphorylase a. It was soon discovered that glycogen synthase, another metabolic enzyme, is inactivated by phosphorylation.[35]
  • Phosphorylation of the enzyme GSK-3 by AKT (Protein kinase B) as part of the insulin signaling pathway.[31]
  • Phosphorylation of Src tyrosine kinase by C-terminal Src kinase inactivates Src by inducing a conformational change which masks its kinase domain.[32]
  • Phosphorylation of the H2AX histones on serine 139, within two million bases (0.03% of the chromatin) surrounding a double-strand break in DNA, is needed for repair of the double-strand break.[36] Phosphorylation of methylpurine DNA glycosylase at serine 172 is required for base excision repair of alkylated base damage.[37]

Protein-protein interactions

[edit]
  • Phosphorylation of the cytosolic components of NADPH oxidase, a large membrane-bound, multi-protein enzyme present in phagocytic cells, plays an important role in the regulation of protein-protein interactions in the enzyme.[38]
  • Important in protein degradation.
    • In the late 1990s, it was recognized that phosphorylation of some proteins causes them to be degraded by the ATP-dependent ubiquitin/proteasome pathway. These target proteins become substrates for particular E3 ubiquitin ligases only when they are phosphorylated.

Signaling networks

[edit]

Elucidating complex signaling pathway phosphorylation events can be difficult. In cellular signaling pathways, protein A phosphorylates protein B, and B phosphorylates C. However, in another signaling pathway, protein D phosphorylates A, or phosphorylates protein C. Global approaches such as phosphoproteomics, the study of phosphorylated proteins, which is a sub-branch of proteomics, combined with mass spectrometry-based proteomics, have been utilised to identify and quantify dynamic changes in phosphorylated proteins over time. These techniques are becoming increasingly important for the systematic analysis of complex phosphorylation networks.[39] They have been successfully used to identify dynamic changes in the phosphorylation status of more than 6,000 sites after stimulation with epidermal growth factor.[40] Another approach for understanding Phosphorylation Network, is by measuring the genetic interactions between multiple phosphorylating proteins and their targets. This reveals interesting recurring patterns of interactions – network motifs.[41] Computational methods have been developed to model phosphorylation networks[42][43] and predict their responses under different perturbations.[44]

Phosphorylation of histones

[edit]

Eukaryotic DNA is organized with histone proteins in specific complexes called chromatin. The chromatin structure functions and facilitates the packaging, organization and distribution of eukaryotic DNA. However, it has a negative impact on several fundamental biological processes such as transcription, replication and DNA repair by restricting the accessibility of certain enzymes and proteins. Post-translational modification of histones such as histone phosphorylation has been shown to modify the chromatin structure by changing protein:DNA or protein:protein interactions.[45] Histone post-translational modifications modify the chromatin structure. The most commonly associated histone phosphorylation occurs during cellular responses to DNA damage, when phosphorylated histone H2A separates large chromatin domains around the site of DNA breakage.[46] Researchers investigated whether modifications of histones directly impact RNA polymerase II directed transcription. Researchers choose proteins that are known to modify histones to test their effects on transcription, and found that the stress-induced kinase, MSK1, inhibits RNA synthesis. Inhibition of transcription by MSK1 was most sensitive when the template was in chromatin, since DNA templates not in chromatin were resistant to the effects of MSK1. It was shown that MSK1 phosphorylated histone H2A on serine 1, and mutation of serine 1 to alanine blocked the inhibition of transcription by MSK1. Thus results suggested that the acetylation of histones can stimulate transcription by suppressing an inhibitory phosphorylation by a kinase as MSK1.[47]

Kinases

[edit]

Within a protein, phosphorylation can occur on several amino acids. Phosphorylation on serine is thought to be the most common, followed by threonine. Tyrosine phosphorylation is relatively rare but lies at the head of many protein phosphorylation signalling pathways (e.g. in tyrosine kinase-linked receptors) in most of the eukaryotes. Phosphorylation on amino acids, such as serine, threonine, and tyrosine results in the formation of a phosphoprotein, when the phosphate group of the phosphoprotein reacts with the -OH group of a Ser, Thr, or Tyr sidechain in an esterification reaction.[48] However, since tyrosine phosphorylated proteins are relatively easy to purify using antibodies, tyrosine phosphorylation sites are relatively well understood. Histidine and aspartate phosphorylation occurs in prokaryotes as part of two-component signaling and in some cases in eukaryotes in some signal transduction pathways. The analysis of phosphorylated histidine using standard biochemical and mass spectrometric approaches is much more challenging than that of Ser, Thr or Tyr.[49][7][5] and[50] In prokaryotes, archaea, and some lower eukaryotes, histidine's nitrogen act as a nucleophile and binds to a phosphate group.[51] Once histidine is phosphorylated the regulatory domain of the response regulator catalyzes the transfer of the phosphate to aspartate.

Receptor tyrosine kinases

[edit]
The AXL receptor tyrosine kinase, showing the symmetry of the dimerized receptors

While tyrosine phosphorylation is found in relatively low abundance, it is well studied due to the ease of purification of phosphotyrosine using antibodies. Receptor tyrosine kinases are an important family of cell surface receptors involved in the transduction of extracellular signals such as hormones, growth factors, and cytokines. Binding of a ligand to a monomeric receptor tyrosine kinase stabilizes interactions between two monomers to form a dimer, after which the two bound receptors phosphorylate tyrosine residues in trans. Phosphorylation and activation of the receptor activates a signaling pathway through enzymatic activity and interactions with adaptor proteins.[52] Signaling through the epidermal growth factor receptor (EGFR), a receptor tyrosine kinase, is critical for the development of multiple organ systems including the skin, lung, heart, and brain. Excessive signaling through the EGFR pathway is found in many human cancers.[53]

Cyclin-dependent kinases

[edit]

Cyclin-dependent kinases (CDKs) are serine-threonine kinases which regulate progression through the eukaryotic cell cycle. CDKs are catalytically active only when bound to a regulatory cyclin. Animal cells contain at least nine distinct CDKs which bind to various cyclins with considerable specificity. CDK inhibitors (CKIs) block kinase activity in the cyclin-CDK complex to halt the cell cycle in G1 or in response to environmental signals or DNA damage. The activity of different CDKs activate cell signaling pathways and transcription factors that regulate key events in mitosis such as the G1/S phase transition. Earlier cyclin-CDK complexes provide the signal to activate subsequent cyclin-CDK complexes.[54]

Sites

[edit]

There are thousands of distinct phosphorylation sites in a given cell since:

  1. There are thousands of proteins in any particular cell.
  2. An estimated 1/10 to 1/2 of proteins are phosphorylated in some cellular state.
  3. 30–65% of proteins in humans and ~50% of proteins in yeast may be phosphorylated.[15][2]
  4. An estimated 230,000, 156,000, and 40,000 phosphorylation sites exist in human, mouse, and yeast, respectively.[2]
  5. Phosphorylation often occurs on multiple distinct sites on a given protein.

Since phosphorylation of any site on a given protein can change the function or localization of that protein, understanding the "state" of a cell requires knowing the phosphorylation state of its proteins. For example, generally, if amino acid Serine-473 in the protein AKT is phosphorylated, AKT is functionally active as a kinase, and if it is not phosphorylated, AKT is an inactive kinase.

Phosphorylation sites are crucial for proteins and their transportation and functions. They are the covalent modification of proteins through reversible phosphorylation. This enables proteins to stay inbound within a cell since the negative phosphorylated site disallows their permeability through the cellular membrane. Protein dephosphorylation allows the cell to replenish phosphates through release of pyrophosphates which saves ATP use in the cell.[55] An example of phosphorylating enzyme is found in E. coli bacteria. It possesses alkaline phosphatase in its periplasmic region of its membrane. The outermost membrane is permeable to phosphorylated molecules however the inner cytoplasmic membrane is impermeable due to large negative charges.[56] In this way, the E. coli bacteria stores proteins and pyrophosphates in its periplasmic membrane until either are needed within the cell.

Recent advancement in phosphoproteomic identification has resulted in the discoveries of countless phosphorylation sites in proteins. This required an integrative medium for accessible data in which known phosphorylation sites of proteins are organized. A curated database of dbPAF was created, containing known phosphorylation sites in H. sapiens, M. musculus, R. norvegicus, D. melanogaster, C. elegans, S. pombe and S. cerevisiae. The database currently holds 294,370 non-redundant phosphorylation sites of 40,432 proteins.[57] Other tools of phosphorylation prediction in proteins include NetPhos[58] for eukaryotes, NetPhosBac[58] for bacteria, and ViralPhos[59] for viruses.

Serine and threonine

[edit]

There are a large variety of serine residues, and the phosphorylation of each residue can lead to different metabolic consequences.

  • Protein kinase N1 is responsible for the phosphorylation of the TNF receptor-associated factor (TRAF1) on serine 139 under specific conditions. Murine TRAF1 is also phosphorylated by the same kinase, which leads to the silencing of IKK/NF-κB activity. The elimination of phosphorylation on serine 139 can be achieved by the replacement of TRAF1 with an Alanine residue, which consequently leads to the improved recruitment of TBK1.[60]
  • At the serine 789 residue, FGFR1 is phosphorylated by RSK2 when the kinase is in its active form. The signaling capabilities of FGFR1 at the serine 777 site can be weakened by phosphorylation. Serine 1047 and serine 1048 have been linked to the decreased binding affinity of ubiquitin ligase c-Cbl to EFGR when they are phosphorylated.[61]
  • When serine 349 is phosphorylated, the binding affinity between protein complex p62 and the protein Keap1 is strengthened, which is linked to stress response.[62]
  • When serine 337 is phosphorylated by protein kinase A in vitro, the DNA binding efficiency of the p50 subunit of NF-κB is greatly increased.[63]

Phosphorylation of serine and threonine residues is known to crosstalk with O-GlcNAc modification of serine and threonine residues.

Tyrosine

[edit]

Tyrosine phosphorylation is a fast, reversible reaction, and one of the major regulatory mechanisms in signal transduction. Cell growth, differentiation, migration, and metabolic homeostasis are cellular processes maintained by tyrosine phosphorylation. The function of protein tyrosine kinases and protein-tyrosine phosphatase counterbalances the level of phosphotyrosine on any protein. The malfunctioning of specific chains of protein tyrosine kinases and protein tyrosine phosphatase has been linked to multiple human diseases such as obesity, insulin resistance, and type 2 diabetes mellitus.[64] Phosphorylation on tyrosine occurs in eukaryotes, select bacterial species, and is present among prokaryotes. Phosphorylation on tyrosine maintains the cellular regulation in bacteria similar to its function in eukaryotes.[65]

Arginine

[edit]

Arginine phosphorylation in many Gram-positive bacteria marks proteins for degradation by a Clp protease.[34]

Non-canonical phosphorylation on His, Asp, Cys, Glu, Arg and Lys in human cells

[edit]

Widespread human protein phosphorylation occurs on multiple non-canonical amino acids, including motifs containing phosphorylated histidine (1 and 3 positions), aspartate, cysteine, glutamate, arginine, and lysine in HeLa cell extracts. Due to the chemical and thermal lability of these phosphorylated residues, special procedures and separation techniques are required for preservation alongside the heat stable 'classical' Ser, Thr and Tyr phosphorylation.[66]

Detection and characterization

[edit]

Antibodies can be used as powerful tool to detect whether a protein is phosphorylated at a particular site. Antibodies bind to and detect phosphorylation-induced conformational changes in the protein. Such antibodies are called phospho-specific antibodies; hundreds of such antibodies are now available. They are becoming critical reagents both for basic research and for clinical diagnosis.

Example of posttranslational modification detected on a 2D gel (spot boundaries delimited by analysis software, identification by mass spectrometry, P46462 is the protein ID in Expasy)

Post-translational modification (PTM) isoforms are easily detected on 2D gels. Indeed, phosphorylation replaces neutral hydroxyl groups on serines, threonines, or tyrosines with negatively charged phosphates with pKs near 1.2 and 6.5. Thus, below pH 5.5, phosphates add a single negative charge; near pH 6.5, they add 1.5 negative charges; above pH 7.5, they add 2 negative charges. The relative amount of each isoform can also easily and rapidly be determined from staining intensity on 2D gels.

In some very specific cases, the detection of the phosphorylation as a shift in the protein's electrophoretic mobility is possible on simple 1-dimensional SDS-PAGE gels, as it is described for instance for a transcriptional coactivator by Kovacs et al.[67] Strong phosphorylation-related conformational changes (that persist in detergent-containing solutions) are thought to underlie this phenomenon. Most of the phosphorylation sites for which such a mobility shift has been described fall in the category of SP and TP sites (i.e. a proline residue follows the phosphorylated serine or threonine residue).

Large-scale mass spectrometry analyses have been used to determine sites of protein phosphorylation. Dozens of studies have been published, each identifying thousands of sites, many of which were previously undescribed.[68][69] Mass spectrometry is ideally suited for such analyses using HCD or ETD fragmentation, as the addition of phosphorylation results in an increase in the mass of the protein and the phosphorylated residue. Advanced, highly accurate mass spectrometers are needed for these studies, limiting the technology to labs with high-end mass spectrometers. However, the analysis of phosphorylated peptides by mass spectrometry is still not as straightforward as for "regular", unmodified peptides. EThcD has been developed combining electron-transfer and higher-energy collision dissociation. Compared to the usual fragmentation methods, EThcD scheme provides more informative MS/MS spectra for unambiguous phosphosite localization.[70]

A detailed characterization of the sites of phosphorylation is very difficult, and the quantitation of protein phosphorylation by mass spectrometry requires isotopic internal standard approaches.[71] A relative quantitation can be obtained with a variety of differential isotope labeling technologies.[72] There are also several quantitative protein phosphorylation methods, including fluorescence immunoassays, microscale thermophoresis, FRET, TRF, fluorescence polarization, fluorescence-quenching, mobility shift, bead-based detection, and cell-based formats.[73][74]

In the case of intrinsically disordered proteins (IDPs), one can use topological approaches to identify conformational state transitions upon phosphorylation. IDPs can adopt disordered states which are extended or topologically compact. In glucocorticoid receptor, occupancy of these states changes upon phosphorylation, which can be detected using topological approaches.[75]

Evolution

[edit]

Protein phosphorylation is common among all clades of life, including all animals, plants, fungi, bacteria, and archaea. The origins of protein phosphorylation mechanisms are ancestral and have diverged greatly between different species. In eukaryotes, it is estimated that between 30 – 65% of all proteins may be phosphorylated, with tens or even hundreds of thousands of distinct phosphorylation sites.[76][2] Some phosphorylation sites appear to have evolved as conditional "off" switches, blocking the active site of an enzyme, such as in the prokaryotic metabolic enzyme isocitrate dehydrogenase. However, in the case of proteins that must be phosphorylated to be active, it is less clear how they could have emerged from non-phosphorylated ancestors. It has been shown that a subset of serine phosphosites are often replaced by acidic residues such as aspartate and glutamate between different species. These anionic residues can interact with cationic residues such as lysine and arginine to form salt bridges, stable non-covalent interactions that alter a protein's structure. These phosphosites often participate in salt bridges, suggesting that some phosphorylation sites evolved as conditional "on" switches for salt bridges, allowing these proteins to adopt an active conformation only in response to a specific signal.[77]

There are around 600 known eukaryotic protein kinases, making them one of the largest eukaryotic gene families. Most phosphorylation is carried out by a single superfamily of protein kinases that share a conserved kinase domain. Protein phosphorylation is highly conserved in pathways central to cell survival, such as cell cycle progression relying on cyclin-dependent kinases (CDKs), but individual phosphorylation sites are often flexible. Targets of CDK phosphorylation often have phosphosites in disordered segments, which are found in non-identical locations even in close species. Conversely, targets of CDK phosphorylation in structurally defined regions are more highly conserved. While CDK activity is critical for cell growth and survival in all eukaryotes, only very few phosphosites show strong conservation of their precise positions. Positioning is likely to be highly important for phosphates that allosterically regulate protein structure, but much more flexible for phosphates that interact with phosphopeptide-binding domains to recruit regulatory proteins.[78]

Comparisons between eukaryotes and prokaryotes

[edit]

Protein phosphorylation is a reversible post-translational modification of proteins. In eukaryotes, protein phosphorylation functions in cell signaling, gene expression, and differentiation. It is also involved in DNA replication during the cell cycle, and the mechanisms that cope with stress-induced replication blocks. Compared to eukaryotes, prokaryotes use Hanks-type kinases and phosphatases for signal transduction. Whether or not the phosphorylation of proteins in bacteria can also regulate processes like DNA repair or replication still remains unclear.[79]

Compared to the protein phosphorylation of prokaryotes, studies of protein phosphorylation in eukaryotes from yeast to human cells have been rather extensive. It is known that eukaryotes rely on the phosphorylation of the hydroxyl group on the side chains of serine, threonine, and tyrosine for cell signaling. These are the main regulatory post-translational modifications in eukaryotic cells but the protein phosphorylation of prokaryotes are less intensely studied. While serine, threonine, and tyrosine are phosphorylated in eukaryotes, histidine and aspartate is phosphorylated in prokaryotes and eukaryotes. In bacteria, histidine phosphorylation occurs in the phosphoenolpyruvate-dependent phosphotransferase systems (PTSs), which are involved in the process of internalization as well as the phosphorylation of sugars.[80]

Protein phosphorylation by protein kinase was first shown in E. coli and Salmonella typhimurium and has since been demonstrated in many other bacterial cells.[81] It was found that bacteria use histidine and aspartate phosphorylation as a model for bacterial signaling transduction. Serine, threonine and tyrosine phosphorylation are also present in bacteria. Bacteria carry kinases and phosphatases similar to that of their eukaryotic equivalent and have also developed unique kinases and phosphatases not found in eukaryotes.[80]

Pathology

[edit]

Abnormal protein phosphorylation has been implicated in a number of diseases, including cancer, Alzheimer's disease, Parkinson's disease, and other degenerative disorders.

Tau protein belongs to a group of microtubule associated proteins (MAPs) which help stabilize microtubules in cells, including neurons.[82] Association and stabilizing activity of tau protein depends on its phosphorylated state. In Alzheimer's disease, due to misfoldings and abnormal conformational changes in tau protein structure, it is rendered ineffective at binding to microtubules and unable to keep the neural cytoskeletal structure organized during neural processes. Abnormal tau inhibits and disrupts microtubule organization and disengages normal tau from microtubules into cytosolic phase.[83] The misfoldings lead to the abnormal aggregation into fibrillary tangles inside the neurons. The tau protein needs to be phosphorylated to function, but hyperphosphorylation of tau protein is one of the major influences on its incapacity to associate.[83] Phosphatases PP1, PP2A, PP2B, and PP2C dephosphorylate tau protein in vitro, and their activities are reduced in areas of the brain in Alzheimer patients.[83][84] Tau phosphoprotein is three to fourfold hyperphosphorylated in an Alzheimer patient compared to an aged non-afflicted individual. Alzheimer disease tau seems to remove MAP1 and MAP2 (two other major associated proteins) from microtubules and this deleterious effect is reversed when dephosphorylation is performed, evidencing hyperphosphorylation as the sole cause of the crippling activity.[83]

Parkinson's disease

[edit]

α-Synuclein is a protein that is associated with Parkinson's disease.[85] In humans, this protein is encoded by the SNCA gene.[86] α-Synuclein is involved in recycling synaptic vesicles that carry neurotransmitters and naturally occurs in an unfolded form. Elevated levels of α-Synuclein are found in patients with Parkinson's disease. There is a correlation between the concentration of unphosphorylated α-Synuclein present in the patient and the severity of Parkinson's disease.[87] Specifically, phosphorylation of Ser129 in α-Synuclein has an impact on severity. Healthy patients have higher levels of unphosphorylated α-Synuclein than patients with Parkinson's disease. The measurement of change in the ratio of concentrations of phosphorylated α-Synuclein to unphosphorylated α-Synuclein within a patient could be a marker of the disease progression. Antibodies that target α-Synuclein at phosphorylated Ser129 are used to study the molecular aspects of synucleinopathies.[88][89]

Phosphorylation of Ser129 is associated with the aggregation of the protein and further damage to the nervous system. The aggregation of phosphorylated α-Synuclein can be enhanced if a presynaptic scaffold protein, Sept4, is present in insufficient quantities. Direct interaction of α-Synuclein with Sept4 inhibits the phosphorylation of Ser129.[90][91][92] However, phosphorylation of Ser129 can be observed without synuclein aggregation in conditions of overexpression.[93]

References

[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
Protein phosphorylation is a reversible post-translational modification in which a phosphate group from ATP is covalently attached to specific amino acid residues, primarily serine, threonine, or tyrosine, on a target protein by enzymes known as protein kinases, with the phosphate subsequently removable by protein phosphatases.[1][2] This process alters the protein's charge, conformation, and interactions, thereby regulating its activity, localization, or binding affinity in response to cellular signals.[1][3] Occurring on approximately one-third of human proteins and involving over 200,000 identified phosphosites, phosphorylation is the most prevalent and versatile mechanism for acute and dynamic control of protein function across all domains of life.[2][4] The specificity of protein phosphorylation is achieved through intricate recognition mechanisms, including substrate sequence motifs, docking interactions, and scaffolding complexes that ensure kinases target only a subset of potential sites—typically 1 to a few hundred out of millions of residues—amidst the vast proteome.[3] In mammalian cells, this modification is mediated by roughly 500 protein kinases and around 200 protein phosphatases, enabling rapid responses to extracellular stimuli such as hormones, neurotransmitters, or growth factors via second messengers like cAMP or calcium ions.[1][5] Phosphorylation predominantly targets serine (about 86%), followed by threonine (12%) and tyrosine (2%), with the addition requiring magnesium ions as a cofactor to facilitate phosphate transfer.[2] Beyond its role in fundamental cellular processes—including signal transduction, metabolism, cell cycle progression, apoptosis, and subcellular trafficking—protein phosphorylation is dysregulated in numerous diseases, such as cancer, where aberrant kinase activity drives uncontrolled proliferation, and serves as a target for therapeutic interventions like kinase inhibitors.[3][2] Advances in chemical biology tools, including analog-sensitive kinases and fluorescent biosensors, have illuminated its spatiotemporal dynamics, underscoring its essential contribution to physiological adaptation and neural plasticity.[4]

Fundamentals

Definition and biochemical overview

Protein phosphorylation is a reversible post-translational modification involving the covalent attachment of a phosphate group (PO₄³⁻) from ATP or GTP to the side chains of specific amino acids, primarily serine (Ser), threonine (Thr), and tyrosine (Tyr) residues in proteins.[3] This process is catalyzed by protein kinases, which transfer the γ-phosphate group to the hydroxyl (-OH) moiety of these residues, and it can be reversed by protein phosphatases that hydrolyze the phosphate ester bond.[4] The phosphate group is derived from high-energy molecules like ATP, which is generated through cellular metabolic pathways such as glycolysis and oxidative phosphorylation.[3] The core biochemical reaction for O-phosphorylation on Ser, Thr, or Tyr can be represented as:
R-OH+ATPR-O-PO32+ADP \text{R-OH} + \text{ATP} \rightarrow \text{R-O-PO}_3^{2-} + \text{ADP}
where R-OH denotes the hydroxyl group of the target amino acid side chain.[4] This nucleophilic attack by the alcohol oxygen on the γ-phosphorus of ATP, facilitated by the kinase active site, results in the formation of a phosphomonoester linkage.[3] The addition of the phosphate group introduces a bulky, negatively charged moiety that significantly alters the protein's physicochemical properties, including its net charge, hydrophobicity, and steric interactions.[4] These changes can induce conformational shifts in the protein structure, either by direct electrostatic repulsion or by creating docking sites for regulatory proteins containing phospho-binding domains like SH2 or 14-3-3.[6] Such modifications enable dynamic regulation without altering the primary amino acid sequence. As a fundamental regulatory mechanism, protein phosphorylation modulates nearly every aspect of cellular biology, including signal transduction, enzymatic activity, metabolic flux, cell cycle progression, and apoptosis.00121-0) Its reversibility allows for rapid, fine-tuned responses to environmental cues, making it indispensable for maintaining homeostasis and adapting to stress across all domains of life.[3]

Historical development

The discovery of protein phosphorylation dates back to the late 19th century, when Swedish chemist Olof Hammarsten identified phosphate groups bound to the milk protein casein in 1883, marking the first recognition of a phosphoprotein.[7] This observation laid the groundwork for understanding post-translational modifications, though the functional implications remained unclear for decades. Early 20th-century studies, such as Phoebus Levene's 1906 report on phosphorylated proteins, further documented the phenomenon but did not elucidate enzymatic mechanisms.[8] A pivotal breakthrough occurred in 1955, when Edmond Fischer and Edwin Krebs demonstrated reversible phosphorylation as a regulatory mechanism while studying glycogen phosphorylase activation in rabbit muscle, revealing how kinases and phosphatases control enzyme activity through phosphate addition and removal.[9] This work established phosphorylation as a dynamic process central to metabolic regulation. In 1979, Tony Hunter and colleagues identified tyrosine phosphorylation in polyomavirus middle T antigen and the v-Src protein, expanding the known phosphoamino acids beyond serine and threonine and uncovering a novel signaling pathway implicated in oncogenesis.[10] These mid-20th-century findings shifted focus from static modifications to active cellular control. The 1980s and 1990s saw rapid advancements in molecular biology, including the 1980 cloning and sequencing of the v-src oncogene from Rous sarcoma virus by Czernilofsky et al., which confirmed its tyrosine kinase activity and facilitated studies on kinase structure and function.[11] Fischer and Krebs received the 1992 Nobel Prize in Physiology or Medicine for their foundational discoveries on reversible phosphorylation, highlighting its broad regulatory role.[9] In the modern era, post-2000 technologies enabled high-throughput phosphoproteomics, with Olsen et al.'s 2006 study identifying over 6,000 phosphorylation sites in human cells, revealing the vast scale of the phosphoproteome.00930-0) Concurrently, Manning et al. mapped the human kinome in 2002, cataloging 518 protein kinases and providing a comprehensive framework for understanding phosphorylation networks. These developments underscored the evolution of concepts from viewing phosphorylation as a mere structural variant to recognizing it as a dynamic signaling hub orchestrating cellular responses to stimuli.

Cellular abundance and distribution

Protein phosphorylation is a highly prevalent post-translational modification in eukaryotic cells, with comprehensive phosphoproteomic analyses identifying over 240,000 phosphorylation sites across the human proteome as of 2024.[12] These sites are distributed on approximately 15,000 to 20,000 proteins, representing a significant portion (75% to 100%) of the total ~20,000 proteins in the human proteome, though estimates suggest that 30% to 50% may be actively phosphorylated at any given time under basal conditions.[13][14] The modification exhibits a dynamic nature, characterized by rapid turnover rates where individual sites can undergo phosphorylation and dephosphorylation cycles hundreds to thousands of times per hour, enabling quick responses to cellular cues.[15] In terms of site distribution, the vast majority—approximately 98%—occur on serine (Ser ~86%) and threonine (Thr ~12%) residues, with tyrosine (Tyr) sites comprising about 2%.[13] Non-canonical sites, such as those on histidine or aspartate, constitute less than 1% of the total but play critical roles in specific signaling contexts, particularly in prokaryotes.[16] This distribution reflects the evolutionary adaptation of phosphorylation primarily to Ser/Thr for broad regulatory purposes in eukaryotes. Phosphorylation is ubiquitous across eukaryotes, with budding yeast (Saccharomyces cerevisiae) featuring over 40,000 sites on about 3,000 to 4,000 proteins, covering roughly 59% of its proteome.[17][18] In prokaryotes, phosphorylation is less abundant overall, with fewer sites per proteome due to simpler cellular architectures, yet it remains essential for two-component signaling systems that mediate environmental responses through histidine-to-aspartate phosphorelays.[16][19] Tissue-specific patterns reveal elevated phosphorylation abundance in signaling-intensive organs, such as the brain, where synaptic and neuronal proteins exhibit higher site densities compared to other tissues like liver or muscle.[20] Subcellularly, phosphorylated proteins are distributed across compartments, with significant enrichment in the cytosol (for soluble signaling effectors), nucleus (for transcriptional regulators), and membranes (for receptor-associated events).[21] Phosphoproteomic studies further demonstrate stimulus-induced dynamics, such as epidermal growth factor stimulation, which can double the number of detectable phosphorylated sites within minutes by activating kinase cascades.[22]

Biochemical Mechanisms

The phosphorylation reaction

Protein phosphorylation involves the enzymatic transfer of a phosphate group from the γ-position of adenosine triphosphate (ATP), typically complexed with magnesium ions (Mg²⁺), to the hydroxyl group of specific amino acid residues such as serine, threonine, or tyrosine on target proteins. This reaction is catalyzed by protein kinases, which form a ternary complex consisting of the kinase, MgATP, and the substrate protein. The active site of the kinase positions the substrate's hydroxyl group in proximity to the γ-phosphate of ATP, enabling a nucleophilic attack by the oxygen atom of the hydroxyl on the phosphorus atom. This phosphotransfer proceeds through a dissociative transition state, resembling a metaphosphate (PO₃⁻) intermediate, where the leaving group (ADP) departs with minimal assistance from the nucleophile, as evidenced by a Bronsted coefficient (β_nuc) near 0.[23][24] The catalytic mechanism is facilitated by conserved residues in the kinase active site. An aspartate residue (e.g., Asp166 in cAMP-dependent protein kinase, PKA) acts as a proton acceptor or "proton trap" to stabilize the transferred phosphate, while basic residues like lysine and the Mg²⁺ ions neutralize negative charges on the phosphates, lowering the activation energy. The optimal geometry maintains a 4.5–6 Å distance between the attacking oxygen and the γ-phosphorus during the transition state. In most cases, the rate-limiting step is the release of MgADP from the active site, rather than the phosphotransfer itself.[23][25] Kinetic parameters of the phosphorylation reaction follow Michaelis-Menten kinetics, with the Michaelis constant (K_m) for ATP typically ranging from 10 to 100 μM across eukaryotic protein kinases, reflecting high affinity for the nucleotide substrate under physiological conditions. Substrate specificity is achieved through docking motifs and consensus sequences adjacent to the phosphorylation site; for example, PKA preferentially targets motifs such as R-R-X-S/T or R-X-X-S/T, where X is any amino acid and S/T denotes serine or threonine, recognized by interactions with the kinase's substrate-binding groove. These sequences ensure selective phosphorylation, with deviations reducing efficiency.[23][26] The reaction is energetically favorable due to the large negative standard free energy change (ΔG°') associated with ATP hydrolysis, approximately -7 kcal/mol (-30 kJ/mol) under standard conditions, which drives the irreversible transfer of the phosphate group. In cellular environments, the actual ΔG is even more negative (e.g., -12 to -17 kcal/mol), further promoting the reaction. While ATP is the primary phosphate donor in eukaryotic and most bacterial systems, some bacterial kinases, such as the sporulation sensor histidine kinase BA2291 in Bacillus anthracis, utilize GTP instead, exhibiting specificity for GTP in the forward phosphotransfer and GDP in the reverse reaction.[27][28] Structurally, protein kinases share a conserved bilobal fold: the N-terminal lobe binds ATP via a nucleotide-binding cleft, while the C-terminal lobe accommodates the substrate, with the activation loop (between DFG and APE motifs) positioning key elements for catalysis. In active conformations, phosphorylation of the activation loop (e.g., Thr197 in PKA) stabilizes a charged cluster that orients the lobes correctly, enhancing catalytic efficiency by up to 100-fold.[24][29] Regulation of the phosphorylation reaction occurs through allosteric effectors and autoinhibitory mechanisms to prevent aberrant activity. Allosteric regulation involves conformational changes induced by binding partners, such as asymmetric dimerization in EGFR kinases or cyclin binding in CDKs, which reposition the activation loop to expose the active site. Autoinhibition commonly arises from the activation loop occluding the substrate-binding site or ATP cleft in inactive states (e.g., Src family kinases in a "DFG-out" conformation), which is relieved by phosphorylation or effector binding.[23][24]

Protein kinases

Protein kinases are a large superfamily of enzymes that catalyze the transfer of the γ-phosphate group from ATP to specific amino acid residues on target proteins, primarily serving as the primary effectors of protein phosphorylation in cells. In humans, the kinome comprises approximately 518 protein kinases, which are classified into eukaryotic protein kinases (ePKs) and atypical kinases based on sequence and structural features. The ePKs, which constitute the majority, are further grouped into nine major families: AGC (including protein kinase A, G, and C), CAMK (calcium/calmodulin-dependent kinases), CK1 (casein kinase 1), CMGC (including cyclin-dependent kinases, mitogen-activated protein kinases, and glycogen synthase kinase 3), STE (sterile kinases involved in mating pathways), TK (tyrosine kinases), TKL (tyrosine kinase-like), RGC (RECK-GTPase-activating), and "other" miscellaneous groups.[30] The core catalytic domain of protein kinases is highly conserved, featuring a bilobal architecture with an N-terminal lobe (N-lobe) primarily responsible for ATP binding through a nucleotide-binding cleft formed by β-sheets and an α-helix, and a C-terminal lobe (C-lobe) that facilitates substrate recognition and orientation via α-helices and loops.00689-5/fulltext) This domain, typically spanning 250-300 amino acids, includes key motifs such as the glycine-rich loop for ATP interaction and the catalytic loop (HRD motif) for phosphate transfer.00689-5/fulltext) Activation of most kinases requires phosphorylation within the activation loop (also known as the T-loop), which repositions residues to stabilize the active conformation and align the catalytic residues.00480-0) Protein kinases exhibit catalytic diversity in their substrate specificity, with the majority (~77%) being serine/threonine-specific, a smaller subset (~17%) tyrosine-specific (primarily in the TK group), and a limited number (~6%) of dual-specificity kinases capable of phosphorylating both serine/threonine and tyrosine residues, such as members of the MAP kinase kinase family.[30] Approximately 10% of the human kinome consists of pseudokinases, which lack one or more conserved catalytic residues (e.g., the aspartate in the DFG motif or lysine in the β3 strand) and are catalytically inactive but function as allosteric regulators or scaffolds in signaling complexes.[31] Regulation of protein kinase activity occurs through multiple mechanisms beyond activation loop phosphorylation, including binding of allosteric modulators, regulatory subunits, or inhibitors that alter domain conformation, as well as post-translational modifications like acetylation or ubiquitination, and subcellular localization signals that control access to substrates.00480-0) For instance, scaffold proteins can sequester kinases to specific compartments, enhancing specificity in signaling pathways.00689-5/fulltext) The protein kinase family has an ancient evolutionary origin, with homologs of the ePK fold present across all three domains of life—bacteria, archaea, and eukaryotes—indicating emergence in the last universal common ancestor or shortly thereafter, followed by extensive diversification in eukaryotes.[32] This conservation underscores the fundamental role of phosphorylation in cellular regulation from prokaryotic metabolism to complex eukaryotic signaling networks.[32]

Protein phosphatases

Protein phosphatases are a diverse group of enzymes that catalyze the dephosphorylation of proteins, removing phosphate groups from serine, threonine, or tyrosine residues to reverse the effects of kinases and maintain signaling balance. They are broadly classified by substrate specificity into serine/threonine-specific phosphatases, tyrosine-specific phosphatases, and dual-specificity phosphatases. Serine/threonine phosphatases fall into two main families: the PPP family, which includes protein phosphatase 1 (PP1) and protein phosphatase 2A (PP2A), and the PPM family, both acting on phosphoester bonds at serine or threonine sites. Tyrosine phosphatases primarily comprise the PTP family, such as PTP1B, while dual-specificity phosphatases (DSP family) target both serine/threonine and tyrosine residues, exemplified by the mitogen-activated protein kinase (MAPK) phosphatases.[33][34] The human genome encodes approximately 189 functional protein phosphatase genes, a number far smaller than the roughly 500 protein kinases, yet these phosphatases achieve tight regulation through complex interactions to ensure dynamic control of phosphorylation states essential for signal termination.[33] Mechanistically, dephosphorylation by protein phosphatases involves hydrolytic cleavage of the phosphoester bond, but the details vary by family. In the PPP family, catalysis relies on a binuclear metal center (typically involving Fe²⁺/Zn²⁺ or Mn²⁺ ions) that coordinates the substrate phosphate and activates a water molecule for direct nucleophilic attack on the phosphorus, leading to inline displacement of the protein residue in a single-step SN2-like reaction. PTP family members, in contrast, use a two-step mechanism where a conserved cysteine nucleophile attacks the phosphate to form a transient covalent thiophosphate intermediate, followed by hydrolysis of this intermediate via a water molecule assisted by an aspartate general acid/base. PPM phosphatases also depend on metals (often Mg²⁺) for water activation, similar to PPPs. Specificity is further refined by docking motifs and interactions that position substrates near the active site.[35][36][37] Structurally, PTP phosphatases share a conserved catalytic domain featuring the signature motif HCXAGXGR(S/T), which positions the nucleophilic cysteine and an arginine for phosphate stabilization, often flanked by a WPD loop that closes over the active site during catalysis. PPP and PPM phosphatases exhibit a core fold with metal-binding motifs, such as the binuclear site in PPPs formed by conserved aspartate and histidine residues. Many phosphatases function as multi-subunit complexes; for instance, PP2A forms holoenzymes with catalytic, scaffolding, and regulatory subunits that dictate substrate targeting and localization.[38][35][33] Regulation of protein phosphatases ensures precise temporal control and includes inhibitor binding and post-translational modifications. Okadaic acid, a marine toxin, selectively inhibits PPP family members like PP1 and PP2A at nanomolar concentrations by binding the active site and mimicking the transition state, thereby blocking metal coordination and water activation. Phosphatases can also be directly phosphorylated by kinases on their regulatory or catalytic domains, which may inhibit activity, alter subcellular localization, or modulate interactions with substrates and inhibitors, as seen in tyrosine phosphorylation of the PP2A catalytic subunit.[39]

Phosphorylation Sites

Canonical sites: serine, threonine, and tyrosine

Protein phosphorylation primarily occurs on the canonical amino acid residues serine (Ser), threonine (Thr), and tyrosine (Tyr), which together account for the majority of known phosphorylation events in eukaryotic cells. These residues feature hydroxyl (-OH) groups in their side chains that serve as nucleophilic targets for phosphate transfer from ATP, forming stable phosphoester bonds. Approximately 86% of identified phosphorylation sites are on serine, 12% on threonine, and 2% on tyrosine.[2] Ser and Thr phosphorylation sites share structural similarities due to their aliphatic alcohol side chains, but Thr's additional methyl group introduces steric bulkiness that can influence kinase accessibility and substrate recognition compared to the smaller Ser side chain. This bulkiness often results in lower phosphorylation frequency for Thr relative to Ser and may affect the conformational dynamics of the phosphorylated protein, promoting more restricted dianionic forms in Thr. The side-chain -OH groups of both have high pKa values around 13, making them poorly ionized at physiological pH and reliant on kinase activation for efficient phosphorylation. Kinases targeting Ser/Thr sites, known as basophilic kinases (e.g., protein kinase A), typically recognize motifs enriched in basic residues like arginine or lysine N-terminal to the target, such as the consensus RRxS/T.[40][41][42] In contrast, Tyr phosphorylation involves the phenolic -OH group, which has a lower pKa of approximately 10, allowing partial ionization at neutral pH and potentially enhancing reactivity in signaling contexts. Tyr sites are less common but play a distinct role in transmembrane signaling, often serving as docking platforms for SH2-domain-containing proteins. Tyrosine kinases frequently prefer motifs with acidic residues, such as aspartate or glutamate, surrounding the target Tyr (e.g., Yxxφ for some Src family kinases, where φ is hydrophobic). The aromatic ring in Tyr also imparts steric and electronic effects that favor phosphorylation in exposed loops of receptor proteins.[40][42] Ser and Thr phosphorylation sites are predominantly found in nuclear and cytosolic proteins, where they regulate transcription factors and metabolic enzymes, while Tyr sites are enriched in plasma membrane-associated receptors and adapters involved in growth factor signaling. Across eukaryotes, these canonical phosphorylation sites exhibit high evolutionary conservation, with phosphosites generally more preserved than non-phosphorylated counterparts, reflecting their functional importance. Computational tools like NetPhos leverage sequence motifs and machine learning to predict these sites with high accuracy, aiding in the annotation of uncharacterized proteomes.[43][44][45]

Non-canonical sites: histidine, aspartate, and others

Non-canonical phosphorylation refers to the covalent attachment of phosphate groups to amino acid residues other than the canonical serine, threonine, and tyrosine, primarily involving histidine and aspartate in well-characterized systems. Histidine phosphorylation forms a phosphoamidate bond, while aspartate phosphorylation creates an acid anhydride linkage, both of which are highly labile with half-lives on the order of minutes under physiological conditions due to rapid hydrolysis.00441-5)[46] These modifications play a central role in bacterial two-component signaling systems, where sensor histidine kinases undergo autophosphorylation on a conserved histidine residue in response to environmental stimuli, followed by phosphate transfer to an aspartate residue on the cognate response regulator to modulate gene expression or enzymatic activity.[47]30703-1) Beyond histidine and aspartate, phosphorylation can occur on cysteine, glutamate, lysine, and arginine, forming thiophosphate, mixed anhydride, phosphorimidazolate, and phosphoramidate bonds, respectively; these events are rare in eukaryotes, comprising about 1% of total phosphosites identified in comprehensive proteomic surveys.[48] For instance, cysteine phosphorylation has been implicated in redox sensing mechanisms, where it modulates protein function in response to oxidative stress, as observed in bacterial peroxiredoxins and select eukaryotic counterparts.[49] In human cells, mass spectrometry-based analyses have identified over 2,000 non-canonical phosphosites across these residues, with histidine phosphorylation prominent in metabolic enzymes such as succinyl-CoA synthetase (SUCLG1), where it regulates ATP production in the tricarboxylic acid cycle by facilitating substrate binding and phosphotransfer.[48][50] The inherent instability of these modifications poses significant challenges for detection and study, as they undergo rapid hydrolysis during standard acidic phosphopeptide enrichment protocols, leading to underestimation and biases in phosphoproteomic datasets.[51] Specialized methods, such as strong anion exchange chromatography or alkaline extraction, have mitigated these issues to reveal their prevalence. Evolutionarily, non-canonical phosphorylation is more abundant in prokaryotes, where it underpins diverse signaling cascades, but has been co-opted in eukaryotes for niche regulatory roles, often retaining prokaryotic enzymatic machinery like nucleoside diphosphate kinases.[52][53]

Biological Functions

Enzyme activation and inhibition

Protein phosphorylation serves as a key regulatory mechanism for modulating enzyme activity, primarily through inducing conformational changes that either promote or hinder catalytic function. This post-translational modification can activate enzymes by stabilizing active conformations or relieve autoinhibitory states, while inhibition often occurs via steric hindrance, charge repulsion, or pseudosubstrate mimicry that blocks substrate access to the active site. Such regulation allows for rapid, reversible control of metabolic and signaling processes, with phosphatases counteracting kinases to maintain dynamic equilibrium.[54] A prominent activation mechanism involves phosphorylation of the activation loop in protein kinases, which repositions critical residues to facilitate substrate binding and catalysis. In the catalytic subunit of cAMP-dependent protein kinase A (PKA), phosphorylation at Thr197 within the activation loop is essential for full activity, reducing the Km for peptide substrate kemptide by approximately 15-fold and for ATP by 7-fold, thereby enhancing catalytic efficiency. This phosphorylation stabilizes the activation loop in an open conformation, relieving autoinhibition by allowing proper alignment of the active site residues. Similar relief of autoinhibition is observed in other kinases, where activation loop phosphorylation disrupts intramolecular interactions that otherwise sequester the catalytic domain.[55][56][57] In contrast, phosphorylation can inhibit enzymes by introducing negative charges that cause electrostatic repulsion or by mimicking substrate binding to occlude the active site. For instance, in glycogen synthase kinase 3β (GSK3β), phosphorylation at Ser9 by kinases such as Akt creates a pseudosubstrate that binds to the positively charged priming phosphate-binding pocket, leading to steric blocking and approximately 200-fold reduction in catalytic efficiency toward primed substrates.[58] This N-terminal phosphorylation induces a conformational shift where the inhibitory tail competes with physiological substrates, exemplifying allosteric inhibition. GSK3β itself phosphorylates glycogen synthase at multiple sites (e.g., Ser641, Ser645), inactivating the enzyme and thereby suppressing glycogen synthesis; inhibition of GSK3β thus indirectly activates glycogen synthase.[59][54] Pyruvate dehydrogenase (PDH), a key enzyme in glucose oxidation, provides another example of inhibitory phosphorylation, where PDH kinases (PDKs) target specific serine residues (e.g., Ser293 on the E1α subunit) to induce a conformational change that disrupts the active site and nearly completely inactivates the enzyme, preventing pyruvate decarboxylation. Dephosphorylation by PDH phosphatases reverses this inhibition, restoring full catalytic function. These modifications often result in 10- to 1000-fold changes in enzyme activity, depending on the system; for example, activation loop phosphorylation in mitogen-activated protein kinases (MAPKs) can yield over 1000-fold enhancement.[60] Multisite phosphorylation enables fine-tuning of enzyme activity, allowing graded responses rather than binary on/off switches, as seen in glycogen synthase where sequential phosphorylation at up to nine sites progressively decreases activity, providing sensitivity to signaling intensity. In insulin signaling, this reciprocity is evident: insulin activates Akt, which phosphorylates GSK3β at Ser9 to inhibit it, promoting glycogen synthase dephosphorylation by protein phosphatase 1 (PP1) and thus enzyme activation; conversely, phosphatases like PP2A reverse inhibitory phosphorylations on PDH to sustain metabolic flux. Such kinase-phosphatase cycles ensure precise temporal control, integrating enzyme modulation into broader signaling networks.[61][62][54]

Protein-protein interactions and localization

Protein phosphorylation serves as a key regulatory mechanism for mediating protein-protein interactions by creating specific binding interfaces on target proteins. Upon phosphorylation, particularly on tyrosine residues, short linear motifs become recognized by modular domains such as Src homology 2 (SH2) domains, which bind phosphotyrosine (pTyr) with high specificity. For instance, the adaptor protein Grb2 utilizes its SH2 domain to bind pTyr residues on activated receptor tyrosine kinases, facilitating downstream signaling assembly.01077-8.pdf) Similarly, phosphorylation of serine (pSer) or threonine (pThr) residues generates docking sites for domains like 14-3-3 proteins, which recognize consensus motifs such as RSXpSXP (where pS is phosphoserine and X is any amino acid), thereby stabilizing protein complexes and modulating their stability or activity.81067-3) The specificity of these interactions arises from the structural architecture of modular domains, including SH2, phosphotyrosine-binding (PTB), WW, and forkhead-associated (FHA) domains, each tailored to distinct phosphorylation motifs. SH2 and PTB domains preferentially engage pTyr within sequences like pYXXL or NPXpY, while WW domains target pSer/Pro motifs (e.g., pSP) and FHA domains bind pThr/Ser in acidic contexts (e.g., TpSXXD). These interactions typically exhibit dissociation constants in the range of 1-10 μM, enabling reversible and tunable associations that respond to fluctuating kinase activities.00580-9) Such modular recognition ensures selective recruitment, as demonstrated by the FHA domain of Rad53 kinase binding phosphorylated Rad9 adaptor to coordinate DNA damage responses.00340-8.pdf) Phosphorylation also directs protein localization by altering binding affinities or exposing/unmasking transport signals. For example, pSer residues can function as nuclear export signals (NES) by promoting interaction with export machinery; in the case of transcription factor EB (TFEB), multisite phosphorylation at serines S142 and S211 by mTORC1 enhances CRM1-mediated nuclear export, retaining TFEB in the cytoplasm under nutrient-rich conditions.[63] Conversely, pTyr motifs recruit effectors to the plasma membrane, such as phospholipase Cγ (PLCγ), whose SH2 domains bind pTyr residues on epidermal growth factor receptor (EGFR) upon ligand stimulation, anchoring PLCγ for localized phosphoinositide hydrolysis.00665-3.pdf) These localization events often involve dynamic scaffolds, where sequential phosphorylation gradients assemble multi-kinase complexes, as seen in MAPK cascades where scaffold proteins like KSR1 tether Raf, MEK, and ERK via phospho-dependent interactions to propagate signals spatially. Illustrative examples highlight these principles in cellular regulation. Phosphorylation of β-catenin at serine residues (e.g., S33/S37 by GSK-3β) creates a binding site for the E3 ubiquitin ligase β-TrCP, promoting its incorporation into the destruction complex and subsequent proteasomal degradation to suppress Wnt signaling.00685-2) In cytokine signaling, tyrosine phosphorylation of STAT transcription factors (e.g., Y701 on STAT1) induces SH2-mediated dimerization, enabling nuclear import via importin-α/β and subsequent gene activation, a process essential for interferon responses.[64] These interactions underscore how phosphorylation not only nucleates complexes but also orchestrates subcellular trafficking to fine-tune physiological outcomes.

Signal transduction networks

Protein phosphorylation serves as a fundamental mechanism in signal transduction networks, enabling cells to process and respond to extracellular cues through orchestrated kinase cascades. These networks typically exhibit a modular architecture where an upstream receptor, such as a receptor tyrosine kinase (RTK), initiates signaling upon ligand binding, leading to autophosphorylation and recruitment of adaptor proteins like Grb2-SOS, which activate small GTPases such as Ras. Activated Ras then recruits and stimulates Raf kinases (MAP3K), which phosphorylate and activate MEK (MAP2K), ultimately resulting in the dual phosphorylation and activation of ERK (MAPK) to drive downstream effectors involved in proliferation, differentiation, and survival.[65] This sequential phosphorylation relay ensures precise signal propagation from the plasma membrane to the nucleus.[66] A key feature of these networks is signal amplification, where each phosphorylation step activates multiple substrate molecules, exponentially increasing the response magnitude. In the MAPK pathway, for instance, the multi-tiered cascade can achieve substantial gain, with each kinase level potentially activating dozens to hundreds of downstream targets, allowing a single ligand-receptor interaction to elicit robust cellular outputs like gene expression changes.[65] This amplification is particularly evident in contexts requiring rapid and strong responses, such as mitogenic stimulation.[67] Crosstalk between pathways integrates diverse signals at shared phosphorylation sites, acting as molecular hubs for decision-making. The insulin receptor substrate-1 (IRS-1), for example, undergoes tyrosine phosphorylation by insulin receptor kinase to propagate metabolic signals, but serine/threonine phosphorylation by multiple kinases—including JNK, PKC, and S6K from stress or nutrient pathways—modulates its activity, enabling integration of insulin signaling with inflammatory or growth cues.[68] Such phospho-sites on IRS-1 thus serve as nodes where competing inputs fine-tune insulin sensitivity and prevent aberrant activation.[69] Feedback loops further refine network dynamics, with negative feedback promoting homeostasis and positive feedback enabling bistability or rapid amplification. In the Raf-MEK-ERK cascade, activated ERK phosphorylates Raf-1 at inhibitory sites (e.g., Ser259, Ser289/296/301), attenuating upstream signaling to prevent overactivation and ensure transient responses.[70] Conversely, positive feedback can arise through ERK-mediated phosphorylation of scaffold proteins like kinase suppressor of Ras (KSR), which enhances scaffold assembly and kinase recruitment, thereby boosting pathway efficiency in sustained signaling scenarios.[71] Spatial organization confines kinase modules to specific cellular compartments, ensuring localized and context-specific signaling. RTK-initiated cascades often assemble at the plasma membrane via lipid rafts or adaptors, while Wnt pathway components, including the Axin-GSK3-β-catenin complex, localize to cytoplasmic puncta or endosomes for targeted β-catenin phosphorylation and degradation.[72] In the Notch pathway, phosphorylation events on the intracellular domain by kinases like CDK8 occur in nuclear or endosomal niches, integrating with membrane-bound modules to regulate developmental patterning.[73] This compartmentalization minimizes off-target effects and coordinates multi-pathway outputs.

Histone phosphorylation and chromatin regulation

Protein phosphorylation plays a pivotal role in chromatin regulation through modifications on histone tails, influencing gene expression, DNA repair, and mitotic processes. Histone H3 and H2A variants are key targets, where phosphorylation acts as a dynamic signal to alter chromatin structure and recruit regulatory proteins.[74] A prominent site is serine 10 on histone H3 (H3 Ser10), phosphorylated primarily by Aurora B kinase during mitosis. This modification correlates with chromosome condensation, facilitating proper segregation by promoting chromatin compaction. Aurora B localizes to centromeres and phosphorylates H3 Ser10 to displace heterochromatin protein 1 (HP1), enabling mitotic progression.[75][76] Another critical site is serine 139 on the histone variant H2AX (H2AX Ser139), known as γH2AX, phosphorylated by ataxia-telangiectasia mutated (ATM) and ATR kinases in response to DNA double-strand breaks. This mark rapidly spreads along chromatin flanks of damage sites, serving as a platform for recruiting repair factors like MDC1 and 53BP1 to initiate non-homologous end joining or homologous recombination.[74] These phospho-marks function by recruiting "reader" proteins that interpret the modification. For instance, 14-3-3 proteins bind specifically to phosphorylated H3 Ser10, often in combination with nearby acetylated lysines, to stabilize open chromatin states or facilitate downstream effectors during transcription. In mitosis, H3 Ser10 phosphorylation induces structural changes in nucleosomes, promoting higher-order chromatin folding essential for condensation.[77][78] In mitotic contexts, H3 Ser10 phosphorylation by Aurora B drives chromosome condensation starting in prophase, ensuring faithful distribution of genetic material; inhibition of this kinase disrupts condensation and leads to mitotic errors. For transcription activation, phosphorylation of H3 threonine 11 (Thr11) by mitogen- and stress-activated kinase 1 (MSK1) occurs at promoters of inducible genes, correlating with enhanced RNA polymerase II recruitment and chromatin accessibility.[79][80] Histone phosphorylation integrates with other modifications in combinatorial "codes." A notable phospho-acetyl switch involves H3 Ser10 phosphorylation preceding lysine 14 acetylation, which together promote transcriptional activation by weakening histone-DNA interactions and evicting repressive factors. Similarly, H3 Ser10 phosphorylation antagonizes methylation at adjacent lysine 9, releasing HP1 and opening heterochromatin for gene expression.[81][75] In immediate-early gene induction, such as c-fos and c-jun following growth factor stimulation, H3 Ser10 phosphorylation by MSK1 facilitates rapid chromatin remodeling and transcription burst, often coupled with acetylation for sustained activation. During double-strand break signaling, γH2AX amplification creates a scaffold for checkpoint kinases, halting cell cycle progression until repair, thus maintaining genomic stability.[82][74]

Membrane transport and protein degradation

Protein phosphorylation regulates membrane transport by modulating the activity of adaptor proteins and small GTPases involved in vesicular trafficking. In clathrin-mediated endocytosis, phosphorylation of serine/threonine residues on the AP-2 adaptor complex enhances its affinity for cargo and promotes vesicle formation. Specifically, phosphorylation of threonine 156 in the μ2 subunit of AP-2, mediated by kinases such as casein kinase II, is essential for efficient recognition of tyrosine-based sorting signals and transferrin receptor endocytosis.[83] Rab GTPases, which coordinate vesicle tethering and fusion along trafficking routes, are also subject to phosphorylation by serine/threonine and tyrosine kinases, altering their GTP/GDP cycling and membrane recruitment to fine-tune endocytic and secretory pathways.[84] Representative examples illustrate these mechanisms in specific signaling contexts. Upon epidermal growth factor (EGF) binding, the epidermal growth factor receptor (EGFR) undergoes autophosphorylation on multiple tyrosine residues, recruiting AP-2 and other adaptors to initiate rapid clathrin-dependent internalization and downregulation of signaling.[85] In insulin-responsive glucose uptake, Akt kinase phosphorylates the Rab GTPase-activating protein AS160 at multiple threonine sites, relieving its inhibitory effect on Rab10 and Rab8, thereby promoting translocation of GLUT4-containing vesicles to the plasma membrane in adipocytes and myocytes.[86] Phosphorylation directs protein degradation by generating phospho-dependent degradation motifs (phospho-degrons) that serve as docking sites for E3 ubiquitin ligases, marking substrates for proteasomal breakdown. The F-box protein β-TrCP, a substrate-recognition subunit of SCF ubiquitin ligase complexes, binds phosphorylated serine/threonine residues in consensus motifs (e.g., DpSGXXpS), as seen in the Wnt pathway where GSK3β phosphorylates β-catenin at serines 33 and 37, enabling β-TrCP recruitment, ubiquitination, and cytosolic degradation to suppress canonical signaling.[87] SCF complexes broadly require such priming phosphorylations to achieve specificity and efficiency in substrate ubiquitination.[88] In cell cycle progression, phosphorylation ensures temporal control of cyclin turnover via ubiquitin-mediated degradation. For instance, GSK3β phosphorylates cyclin D1 at threonine 286 in response to mitogenic withdrawal, creating a phospho-degron that recruits the SCF^{FBX4-αB crystallin} E3 ligase for polyubiquitination and proteasomal degradation, thereby preventing G1/S phase dysregulation.00635-6) This mechanism exemplifies how phosphorylation integrates with ubiquitination to synchronize degradation events, such as mitotic cyclin B destruction by the APC/C ligase following CDK1 priming phosphorylations.[89] Crosstalk between membrane transport and degradation is evident in endocytic sorting, where multi-site phosphorylation codes on cargo or adaptors dictate lysosomal targeting. Phosphorylation patterns, often involving MAPK or other kinases, modulate interactions with sorting nexins (e.g., SNX27 phosphorylation at serine 51 inhibits recycling and promotes lysosomal delivery under stress), integrating vesicular trafficking with ubiquitin-dependent degradation to clear receptors like EGFR via multivesicular bodies.[90]

Major Kinase Families

Receptor tyrosine kinases

Receptor tyrosine kinases (RTKs) are a major subclass of transmembrane protein kinases that initiate cellular responses to extracellular signals through tyrosine phosphorylation, playing pivotal roles in development, metabolism, and homeostasis. These enzymes feature a conserved architecture consisting of an extracellular ligand-binding domain, a single transmembrane helix, and an intracellular tyrosine kinase domain flanked by juxtamembrane and C-terminal regulatory regions. In humans, 58 RTKs are distributed across 20 families, classified based on structural similarities in their extracellular domains, such as immunoglobulin-like or fibronectin type III repeats.[91][92] Activation of RTKs typically occurs through ligand-induced dimerization or oligomerization, which brings the intracellular kinase domains into proximity for trans-autophosphorylation on tyrosine residues. For instance, in the epidermal growth factor receptor (EGFR) family, ligands like EGF promote an asymmetric dimer configuration, where one kinase domain allosterically activates the other, leading to sequential phosphorylation of activation loop tyrosines (e.g., Y1092 in EGFR) and subsequent docking sites.[92] The insulin receptor, a pre-formed disulfide-linked dimer, undergoes a conformational shift upon insulin binding at two distinct sites (site 1 and site 2), relieving autoinhibition and enabling autophosphorylation on three tyrosines in the activation loop (Y1158, Y1162, Y1163).[93] Similarly, vascular endothelial growth factor (VEGF) binding to VEGFR2 induces dimerization and autophosphorylation on key tyrosines (e.g., Y1175), driving endothelial cell proliferation and migration essential for angiogenesis. Upon activation, the phosphotyrosine (pTyr) residues serve as docking sites for downstream effectors containing Src homology 2 (SH2) or phosphotyrosine-binding (PTB) domains, thereby propagating phosphorylation cascades. For example, in EGFR signaling, pTyr-1068 recruits the adaptor Grb2, which links to Sos and activates the Ras-MAPK pathway, while pTyr-1101 binds the p85 subunit of PI3K to stimulate Akt-mediated survival signals.[92] In the insulin receptor pathway, phosphorylated IRS proteins dock PLCγ or Shc, amplifying metabolic and mitogenic responses through further tyrosine phosphorylation events.[93] VEGFR2 pTyr sites engage adaptors like PLCγ and Src, phosphorylating substrates that promote vascular permeability and sprout formation in angiogenesis. RTK activity is tightly regulated to prevent aberrant signaling, primarily through receptor internalization via endocytosis and subsequent lysosomal degradation. For EGFR, ligand binding triggers clathrin-mediated endocytosis, followed by ubiquitination by the E3 ligase Cbl at pTyr-1045, targeting the receptor for degradation and attenuating signaling.[92] Therapeutic inhibition of RTKs, such as the EGFR-specific tyrosine kinase inhibitor gefitinib, competitively blocks ATP binding in the kinase domain, preventing autophosphorylation and downstream activation in cancers driven by EGFR mutations.[94]

Cyclin-dependent kinases

Cyclin-dependent kinases (CDKs) constitute a family of serine/threonine kinases that play a pivotal role in regulating the eukaryotic cell cycle through sequential phosphorylation events. In humans, there are 20 identified CDKs, ranging from CDK1 to CDK20, each associating with specific cyclins to achieve activation and substrate specificity. For instance, CDK1 binds to cyclin A or B to drive mitotic progression, while CDK4 and CDK6 pair with cyclin D during the G1 phase.[95] Activation of CDKs involves a multi-step process: binding to cyclins induces conformational changes that expose the activation loop, which is then phosphorylated at a conserved threonine residue, such as Thr160 in CDK2 or Thr161 in CDK1, by cyclin-activating kinases (CAKs) like CDK7. This activating phosphorylation is counterbalanced by inhibitory phosphorylations on Thr14 and Tyr15, mediated by kinases such as Wee1, which prevent premature CDK activity; these inhibitory phosphates are subsequently removed by Cdc25 phosphatases to trigger full activation at appropriate cell cycle stages.[95][96] CDKs orchestrate cell cycle progression by phosphorylating key substrates in a phase-specific manner. In the G1 phase, the CDK4/6-cyclin D complexes phosphorylate the retinoblastoma protein (Rb) at multiple sites, leading to its inactivation and release of E2F transcription factors to promote G1/S transition. During S phase, CDK2 associated with cyclin E or A phosphorylates substrates involved in DNA replication origin firing and histone loading, ensuring faithful genome duplication.[95] Each CDK typically phosphorylates 100-200 substrates, recognizing a consensus motif of S/TPXK/R, where the proline-directed nature allows for ordered multisite phosphorylation that amplifies signaling and establishes cell cycle checkpoints.[95][97] Dysregulation of CDKs is implicated in cancer, with notable examples including amplification of CDK4 in breast and other tumors, which drives uncontrolled Rb phosphorylation and cell proliferation.[95]

Serine/threonine kinases

Serine/threonine kinases form one of the largest classes within the human kinome, comprising approximately 350 members that specifically phosphorylate serine and threonine residues on target proteins to regulate diverse cellular processes. These kinases are classified into major evolutionary groups based on sequence similarity in their catalytic domains, including the AGC, CMGC, and CAMK groups, as well as other families such as the AMPK subfamily. The AGC group, named after protein kinase A (PKA), G (PKG), and C (PKC) families, encompasses about 63 kinases that typically respond to second messengers like cyclic nucleotides or lipids.[98] Prominent examples include PKA, which is activated when cAMP binds to its regulatory subunits, displacing the catalytic subunits to initiate phosphorylation cascades; PKB/Akt, involved in cell survival signaling; and PKC, modulated by diacylglycerol and calcium.[99] The CMGC group, derived from cyclin-dependent kinases (CDKs), mitogen-activated protein kinases (MAPKs), glycogen synthase kinase 3 (GSK3), and CDC-like kinases (CLKs), includes kinases with roles in cell signaling and metabolism, excluding cell cycle-specific CDKs detailed elsewhere. MAPKs, for instance, are activated through dual phosphorylation on a conserved threonine-tyrosine motif (Thr-X-Tyr) by upstream MAP kinase kinases, enabling responses to environmental stresses such as inflammation or osmotic shock.[100] GSK3, meanwhile, regulates glycogen synthesis and Wnt signaling, often inhibited by phosphorylation at its N-terminal serine. The CAMK group, comprising kinases like calcium/calmodulin-dependent protein kinase II (CaMKII), is primarily activated by elevations in intracellular calcium levels, where calcium-bound calmodulin binds to the kinase, relieving autoinhibition and promoting autophosphorylation for sustained activity.[101] Beyond these core groups, other serine/threonine kinases include AMP-activated protein kinase (AMPK), an energy sensor in the AMPK subfamily that is allosterically activated by AMP and further stimulated by phosphorylation at Thr172 within its activation loop, triggering metabolic adaptations like glucose uptake and fatty acid oxidation during energy stress.[102] These kinases demonstrate wide substrate breadth, from ubiquitous housekeeping roles—such as PKA phosphorylating phosphorylase kinase to promote glycogenolysis in response to hormonal signals—to highly specialized functions, like Aurora kinases (in the Aurora subfamily) that ensure proper chromosome segregation and spindle assembly during mitosis.[103][104] Many serine/threonine kinases serve as therapeutic targets due to their dysregulation in diseases; sorafenib, for example, inhibits Raf kinases (part of the MAPK/ERK pathway) to suppress tumor growth in hepatocellular carcinoma and other cancers.[105]

Detection and Analysis

Experimental detection methods

Protein phosphorylation can be detected using classical methods that rely on radioactive labeling or antibody-based approaches. In vitro kinase assays employing 32P-labeled ATP allow for the direct measurement of kinase activity by incorporating radioactive phosphate into substrate proteins or peptides, providing quantitative insights into phosphorylation events under controlled conditions.[106] This technique, widely used since the 1970s, enables the identification of kinase-substrate interactions but requires careful handling due to radioactivity.[107] Western blotting with phospho-specific antibodies offers a non-radioactive alternative for detecting phosphorylation at specific sites, where antibodies raised against phosphorylated residues bind selectively to target proteins in cell lysates, revealing changes in phosphorylation status following stimuli.[108] These antibodies are particularly valuable for validating known sites and monitoring dynamic responses in complex samples.[109] Mass spectrometry (MS), particularly liquid chromatography-tandem MS (LC-MS/MS), has become the gold standard for global phosphoproteome analysis, enabling the identification and localization of thousands of phosphorylation sites in a single experiment. Phosphopeptides are enriched prior to MS analysis to overcome their low abundance, using methods such as immobilized metal affinity chromatography (IMAC) with Fe³⁺ or Ti⁴⁺ ions, which bind phosphate groups selectively, or titanium dioxide (TiO₂) chromatography, which exhibits high specificity for mono- and multi-phosphorylated peptides under acidic conditions with additives like 2,5-dihydroxybenzoic acid.[110] TiO₂-based enrichment, introduced in 2005, achieves high recovery of phosphopeptides from digests, facilitating the detection of over 10,000 sites in mammalian cell lines per LC-MS/MS run with modern instrumentation.[111] IMAC complements TiO₂ by capturing acidic phosphopeptides missed by other methods, though it can suffer from non-specific binding of acidic residues.[112] Additional confirmation techniques include treatment with protein phosphatases, such as lambda phosphatase, which removes phosphate groups from proteins; a shift in band mobility on Western blots or loss of signal with phospho-specific antibodies verifies phosphorylation dependency.[113] Kinase activity profiling using kinobeads—a multiplexed affinity capture method with sepharose beads covalently linked to broad-spectrum kinase inhibitors—enables the pull-down and quantification of active kinases from lysates via LC-MS/MS, revealing drug-target interactions and kinome-wide changes.[114] Quantitative approaches enhance the study of phosphorylation dynamics. Stable isotope labeling by amino acids in cell culture (SILAC) incorporates heavy isotopes into proteins during cell growth, allowing ratio-based quantification of phosphopeptide changes between conditions, such as in response to signaling perturbations.[115] Tandem mass tag (TMT) labeling enables multiplexed analysis of up to 18 samples, combining isobaric tags for relative quantification of phosphorylation stoichiometry across time points or treatments.[116] Single-cell phospho-flow cytometry extends these to heterogeneous populations, using intracellular staining with phospho-specific antibodies and flow cytometry to measure signaling events in individual cells, as demonstrated in leukemia samples where it resolved patient-specific phospho-networks.[117] Recent advances as of 2025 include AI-driven tools for phosphosite prediction and quantification in mass spectrometry-based analyses, improving coverage and accuracy in global phosphoproteomics.[118] Despite advances, experimental detection faces limitations, including bias toward abundant or stable phosphosites due to enrichment inefficiencies and the substoichiometric nature of phosphorylation, which challenges detection of transient or low-abundance events.[119] Low dynamic range in MS further hinders quantification of rare sites amid high-abundance proteins.[120]

Structural and functional characterization

Protein phosphorylation introduces a negatively charged phosphate group to serine, threonine, or tyrosine residues, often inducing conformational changes that alter protein structure and function. Structural characterization methods, such as X-ray crystallography, have been pivotal in elucidating these effects, particularly in revealing how phosphotyrosine (pTyr) motifs interact with recognition domains like SH2. For instance, the crystal structure of the v-Src SH2 domain bound to a pTyr-containing peptide demonstrates that the phosphate group forms hydrogen bonds with arginine residues in the binding pocket, stabilizing the complex and enabling downstream signaling. Nuclear magnetic resonance (NMR) spectroscopy complements X-ray by providing dynamic insights into phosphorylated proteins in solution, capturing transient conformational shifts induced by phosphorylation. For larger assemblies, cryo-electron microscopy (cryo-EM) has resolved phosphorylation-dependent structures, such as in kinase-substrate complexes within signaling scaffolds, where resolutions below 3 Å reveal phosphate-mediated allosteric effects.[121] Computational approaches enhance structural understanding by simulating phosphorylation's biophysical impacts. Molecular dynamics (MD) simulations model the electrostatic repulsion from the phosphate's negative charge, showing how it disrupts local secondary structures or enhances flexibility in loops. In a MD study of the SARS-CoV-2 main protease, phosphorylation at a key serine residue was shown to increase solvent exposure and hinge motion, altering active site accessibility.[122] Phospho-site predictors like DISPHOS integrate sequence and disorder propensity to forecast phosphorylation likelihood in intrinsically disordered regions, aiding in prioritizing structural studies; it achieves over 80% accuracy for serine/threonine sites in eukaryotic proteins by leveraging evolutionary disorder patterns.[123] Functional characterization employs mutagenesis to dissect phosphorylation's roles. Site-directed mutagenesis, substituting serine/threonine/tyrosine with alanine (non-phosphorylatable mimic) or aspartate/glutamate (phosphomimetic), reveals site-specific effects; for example, mutating S15 to D in p53 enhances DNA binding affinity, mimicking phosphorylated activation.[124] CRISPR-Cas9 knock-in enables precise endogenous modifications, allowing assessment of phosphorylation in native contexts; a study knocking in phospho-mimetic mutations in EGFR demonstrated sustained signaling without exogenous expression artifacts.[125] Dynamic assays probe phosphorylation's real-time consequences. Förster resonance energy transfer (FRET) sensors monitor kinase activity and conformational changes, with phosphorylation-induced proximity shifts yielding sub-second resolution; in MAPK pathways, FRET revealed ERK2 autophosphorylation timing critical for substrate recruitment.[126] Phosphoproteomics combined with pathway inhibitors dissects functional networks, where selective kinase blockade (e.g., via staurosporine analogs) identifies downstream substrates; quantitative mass spectrometry post-inhibition quantified over 5,000 phosphorylation events in exercise-stimulated muscle, linking them to metabolic adaptation.[127] Integrative methods map phosphorylation's broader implications. Kinase-substrate mapping via peptide array-based profiling and computational methods has identified thousands of high-confidence interactions across the human kinome.[128] Evolutionary conservation analysis evaluates phospho-site functionality, with conserved sites across metazoans indicating regulatory importance; comparative phosphoproteomics shows low conservation of phosphosites between distant eukaryotes like humans and yeast, with only minimal overlap observed, though higher conservation in closer species correlates with essential signaling roles.[129]

Evolutionary Aspects

Phosphorylation in prokaryotes

Protein phosphorylation in prokaryotes primarily occurs through non-canonical histidine (His) and aspartate (Asp) residues, forming high-energy acyl-phosphate bonds, which contrasts with the more common serine/threonine/tyrosine (Ser/Thr/Tyr) phosphorylation seen in eukaryotes. These modifications are central to two-component signaling systems (TCSs), which enable bacteria and archaea to sense and respond to environmental cues. Over 300,000 TCSs have been identified across prokaryotic genomes, with an average of about 30 per bacterial species, representing a key adaptation for rapid signal transduction. This low site density—reflecting the specialized nature of these systems, where phosphorylation targets are limited to sensor histidine kinases (HKs) and response regulators (RRs)—contrasts with the higher density in eukaryotes.[130] In the canonical TCS mechanism, an environmental stimulus activates the sensor HK, which autophosphorylates on a conserved His residue using ATP, forming a phosphohistidine intermediate with a high-energy phosphoanhydride bond.[131] The phosphoryl group is then rapidly transferred to an Asp residue on the cognate RR receiver domain, inducing conformational changes that activate the RR's output domain, often a DNA-binding domain for transcriptional regulation. This phosphotransfer occurs on timescales of seconds, allowing quick adaptation to stimuli such as nutrient availability or stress.[132] Dephosphorylation, often catalyzed by the HK's phosphatase activity or spontaneous hydrolysis, resets the system to prevent prolonged signaling.[132] TCSs regulate diverse prokaryotic functions, including chemotaxis, virulence, and stress responses. For instance, in Salmonella enterica, the PhoQ/PhoP TCS senses magnesium limitation and antimicrobial peptides, phosphorylating PhoP to activate genes for intramacrophage survival and acid tolerance.[130] Similarly, the NtrB/NtrC system in enteric bacteria like Escherichia coli controls nitrogen assimilation by phosphorylating NtrC in response to glutamine levels, promoting alternative nitrogen source utilization. Osmotic stress responses, such as the EnvZ/OmpR TCS in E. coli, adjust porin expression via Asp phosphorylation to maintain membrane permeability under varying osmolarity. While His/Asp phosphorylation dominates, some prokaryotes, particularly pathogens, feature eukaryotic-like Ser/Thr/Tyr kinases. In Mycobacterium tuberculosis, the Ser/Thr kinase PknB autophosphorylates and targets substrates involved in cell wall biosynthesis and virulence, enabling adaptation to host environments.[133] These systems highlight the diversity of prokaryotic phosphosignaling, with His/Asp TCSs providing a foundational, efficient framework for environmental sensing.[130]

Phosphorylation in eukaryotes

In eukaryotes, protein phosphorylation has evolved into a highly complex regulatory mechanism, reflecting the demands of multicellularity and intricate cellular organization. Unlike the sparse kinase complement in prokaryotes, eukaryotic genomes encode a substantial array of kinases; for instance, the human genome contains approximately 518 protein kinase genes, representing about 2% of the coding genome and enabling diverse signaling pathways. This expansion facilitates precise control over processes such as cell growth, differentiation, and response to environmental cues. Phosphorylation predominantly targets canonical amino acid residues—serine (Ser), threonine (Thr), and tyrosine (Tyr)—with Ser sites accounting for the majority due to their accessibility and the prevalence of Ser/Thr kinases. Additionally, eukaryotes feature phosphorylation of lipid substrates, notably phosphoinositides like phosphatidylinositol 4,5-bisphosphate (PIP2), which is converted to PIP3 by phosphoinositide 3-kinases (PI3Ks), thereby linking membrane signaling to downstream effectors such as Akt. Key kinase families underpin major eukaryotic systems. Receptor tyrosine kinases (RTKs), such as the epidermal growth factor receptor (EGFR), autophosphorylate on Tyr residues upon ligand binding, recruiting adaptor proteins to activate cascades like the MAPK/ERK pathway for cell proliferation and survival. Cyclin-dependent kinases (CDKs), including CDK1, orchestrate the cell cycle by sequentially phosphorylating substrates that drive progression through G1/S and G2/M phases, ensuring genomic fidelity. In the nucleus, phosphorylation regulates transcription; for example, phosphorylation of the RNA polymerase II C-terminal domain on Ser residues by CDK7 and CDK9 facilitates mRNA synthesis and processing. These systems highlight phosphorylation's role in temporal and spatial control, with eukaryotic proteomes exhibiting orders of magnitude more phosphorylation sites—over 200,000 in humans—compared to the hundreds typically found in prokaryotes, allowing for multilayered regulation. Eukaryotic phosphorylation incorporates adaptations for efficiency and specificity. Scaffold proteins, such as A-kinase anchoring proteins (AKAPs), tether protein kinase A (PKA) to specific subcellular locations, enhancing signal fidelity by localizing kinase-substrate interactions near targets like ion channels or cytoskeletal elements. Compartmentalization further refines this; mitochondrial kinases, including PINK1, phosphorylate proteins involved in mitophagy and energy metabolism, responding to oxidative stress. Phosphorylation cascades often display ultrasensitivity, where dual phosphorylation of substrates like MAPK creates switch-like responses to graded inputs, amplifying signals in pathways such as insulin signaling. Representative examples illustrate these features: in budding yeast (Saccharomyces cerevisiae), the CDK homolog Cdc28 phosphorylates multiple substrates to coordinate cell cycle events, from bud emergence to cytokinesis. In plants, SNF1-related protein kinase 2 (SnRK2) family members, activated by abscisic acid (ABA), phosphorylate ion channels and transcription factors to mediate stress responses like stomatal closure. These mechanisms underscore phosphorylation's centrality to eukaryotic adaptability and homeostasis.

Comparative evolution across domains

Protein phosphorylation mechanisms trace their origins to the last universal common ancestor (LUCA), where basic signaling systems, including histidine kinases integral to two-component systems, likely facilitated environmental responses in early cellular life.[134] These histidine kinases, which autophosphorylate on histidine residues before transferring phosphate to aspartate on response regulators, represent the earliest known phosphorylation-based signaling, predating the divergence of bacterial and archaeal lineages. In contrast, serine/threonine/tyrosine (Ser/Thr/Tyr) phosphorylation, mediated by eukaryotic-like kinases (ELKs), emerged later but shares a deep evolutionary root, with evidence of such kinases present across prokaryotic domains, suggesting their incorporation into LUCA's signaling repertoire before the prokaryote-eukaryote split.[134][135] The divergence between prokaryotes and eukaryotes marked a profound expansion of Ser/Thr/Tyr kinase diversity in the eukaryotic lineage, driven by extensive gene duplication events that amplified the kinase domain repertoire by several orders of magnitude compared to prokaryotic systems, from typically fewer than 50 in prokaryotes to over 500 in humans. In eukaryotes, this proliferation—from around 120 kinases in yeast to over 500 in humans—enabled intricate regulatory networks, while prokaryotes retained more streamlined sets, often fewer than 50 ELKs per genome except in complex lineages like myxobacteria. Notably, histidine kinase systems, dominant in prokaryotes for rapid signal relay, were largely lost in metazoan animals during eukaryotic evolution, possibly due to the inadequacy of their short-lived phospho-histidine signals for the extended intracellular cascades required in multicellular organisms; simpler eukaryotes like fungi and plants, however, retain these systems alongside expanded Ser/Thr/Tyr pathways.[136][134][137] Archaea exhibit hybrid phosphorylation systems that bridge prokaryotic and eukaryotic features, with widespread Ser/Thr kinases but minimal Tyr phosphorylation, reflecting an intermediate evolutionary position. For instance, haloarchaea like Haloferax volcanii possess multiple eukaryotic-like Ser/Thr kinases involved in osmoregulation and stress responses, often integrated with prokaryotic-style histidine systems, yet lack the extensive Tyr kinase networks seen in bacteria or eukaryotes. This configuration underscores archaea's role in conserving ancestral traits while adapting to extreme environments, with genomic analyses revealing fewer than 20 such kinases per genome on average.[138][139] Functionally, phosphorylation evolved from primarily environmental sensing in prokaryotes—such as nutrient detection and osmotic stress via histidine kinases—to complex developmental signaling in eukaryotes, where Ser/Thr/Tyr cascades orchestrate cell differentiation and morphogenesis. In prokaryotes, these systems enable quick adaptations to external cues, as seen in bacterial chemotaxis, whereas eukaryotic expansions support intracellular coordination, exemplified by cyclin-dependent kinases regulating cell cycle progression. This shift correlates with the rise of multicellularity, where myxobacterial prokaryotes preview eukaryotic-like developmental roles through ELK-mediated fruiting body formation.[134][140] Despite these divergences, core catalytic residues remain invariant across domains, ensuring universal phosphoryl transfer efficiency; key elements include the aspartate in the DFG motif for magnesium coordination, the glutamate-lysine salt bridge in the active site, and hydrophobic spines stabilizing the bilobal kinase fold. This deep conservation highlights the ancient optimization of the kinase core, present from LUCA onward. Furthermore, kinases co-evolve with their substrates, with specificity determinants—such as motif preferences for arginine or proline adjacent to phospho-sites—emerging early in eukaryotic history around the last eukaryotic common ancestor, and persisting through balanced duplication and selection to maintain network fidelity.[141][142]

Pathological Roles

Dysregulation in cancer

Dysregulation of protein phosphorylation plays a central role in oncogenesis through aberrant activation of kinases and loss of phosphatase function, leading to uncontrolled cell proliferation and survival. Oncogenic mutations in receptor tyrosine kinases, such as the L858R substitution in EGFR, enhance kinase activity and promote ligand-independent phosphorylation of downstream substrates, driving tumorigenesis in non-small cell lung cancer. Similarly, amplification and overexpression of HER2 in breast cancer result in constitutive autophosphorylation and sustained signaling, contributing to aggressive tumor growth in 15-30% of cases.[143][144] Key signaling pathways amplified by these phosphorylation events include the PI3K/Akt pathway, which promotes cell survival by phosphorylating pro-apoptotic regulators like FOXO transcription factors, and the MAPK/ERK pathway, which drives proliferation through phosphorylation of targets such as cyclin D1. A paradigmatic example is the BCR-ABL fusion protein in chronic myeloid leukemia, where the constitutively active tyrosine kinase phosphorylates substrates in both pathways, leading to leukemic transformation. Loss of phosphatases exacerbates this dysregulation; for instance, PTEN deletion, a lipid phosphatase with analogous tumor-suppressive effects on phosphoinositide signaling, occurs in multiple cancers and results in hyperactivation of Akt. Likewise, epigenetic silencing of PTPRB in lung adenocarcinoma reduces dephosphorylation of tyrosine residues, enhancing oncogenic signaling.[145][146][147][148][149] In diagnostics, phospho-protein profiling reveals tumor-specific patterns, such as elevated phospho-ERK levels in the majority of melanomas due to BRAF or NRAS mutations, aiding in pathway activation assessment and patient stratification. Therapeutically, tyrosine kinase inhibitors (TKIs) target these dysregulated kinases; imatinib, approved by the FDA in 2001, inhibits BCR-ABL phosphorylation and induces remission in chronic myeloid leukemia patients. However, resistance often emerges via secondary mutations in the kinase domain, such as T315I in BCR-ABL, which sterically hinder TKI binding and restore oncogenic phosphorylation.[150][151][151]

Implications in neurodegenerative diseases

Protein phosphorylation plays a critical role in the pathogenesis of neurodegenerative diseases, particularly through dysregulation of kinase and phosphatase activities that lead to aberrant accumulation of phosphorylated proteins in neurons. In Parkinson's disease (PD), the G2019S mutation in leucine-rich repeat kinase 2 (LRRK2) hyperactivates its kinase domain, enhancing autophosphorylation and substrate phosphorylation, which contributes to dopaminergic neuron loss.[152][153] This mutation is associated with 1-5% of sporadic PD cases and up to 40% in certain familial cohorts.[154] Similarly, phosphorylation of alpha-synuclein at serine 129 (pSer129) is a hallmark of PD, with nearly all alpha-synuclein in Lewy bodies being modified at this site, promoting fibril formation and aggregation that exacerbates neurodegeneration.[155][156] In Alzheimer's disease (AD), hyperphosphorylation of tau protein at over 20 sites, mediated by kinases such as glycogen synthase kinase 3β (GSK3β) and casein kinase 1 (CK1), disrupts microtubule stability and leads to the formation of neurofibrillary tangles (NFTs), a key pathological feature.[157][158][159] Dysregulation of cyclin-dependent kinase 5 (CDK5), often through upregulation of its activator p25, further drives tau hyperphosphorylation at multiple epitopes, contributing to synaptic dysfunction and cognitive decline.[160][161] These modifications impair axonal transport and neuronal integrity, accelerating disease progression. Mechanistically, kinase overactivation in neurodegeneration is often triggered by oxidative stress, which activates pathways like p38 MAPK and enhances phosphorylation events that amplify cellular damage.[162] Concurrently, inhibition or decline of protein phosphatase 2A (PP2A) activity reduces dephosphorylation of tau and alpha-synuclein, exacerbating hyperphosphorylation in both AD and PD brains.[163][164] These phospho-marks promote the formation of intracellular inclusions, such as Lewy bodies and NFTs, which sequester cellular components and impair proteostasis.[165] Moreover, hyperphosphorylated proteins disrupt autophagy, a critical clearance mechanism, by inhibiting autophagosome-lysosome fusion and reducing flux, thereby fostering further accumulation of toxic aggregates and neuronal death.[166][167] Therapeutically, targeting these dysregulations shows promise; LRRK2 inhibitors, such as BIIB122 and NEU-411, are in phase 2 clinical trials for PD patients with LRRK2 mutations, aiming to reduce kinase hyperactivity and slow symptom progression.[168][169] For AD, lithium, a GSK3β inhibitor, has demonstrated potential in preclinical models by reducing tau hyperphosphorylation and NFT formation, with ongoing evaluations for its neuroprotective effects.[170][171]

Roles in other disorders

Protein phosphorylation plays a critical role in metabolic disorders, particularly type 2 diabetes, where dysregulation of key signaling pathways contributes to insulin resistance. In insulin resistance, serine phosphorylation of insulin receptor substrate-1 (IRS-1) at sites such as Ser307 inhibits its function, impairing insulin signaling. This phosphorylation is mediated by stress-activated kinases like c-Jun N-terminal kinase (JNK), which is activated under conditions of nutrient excess and inflammation. Similarly, Toll-like receptor 4 (TLR4) activation by ligands such as free fatty acids promotes JNK-dependent IRS-1 serine phosphorylation, exacerbating insulin resistance in adipose and vascular tissues. Additionally, AMP-activated protein kinase (AMPK), a central regulator of energy homeostasis, exhibits hypoactivation in diabetes due to reduced phosphorylation at Thr172 on its α-subunit, leading to diminished glucose uptake and mitochondrial biogenesis in skeletal muscle and liver. High glucose environments further suppress this phosphorylation via Akt-dependent mechanisms, perpetuating metabolic dysfunction. In inflammatory disorders, aberrant protein phosphorylation drives excessive immune responses, including cytokine storms observed in conditions like sepsis and autoimmune diseases. The NF-κB pathway is hyperactivated through phosphorylation of IκB kinase β (IKKβ) at Ser181, which promotes IκB degradation and subsequent NF-κB nuclear translocation, amplifying pro-inflammatory cytokine production. Mitogen-activated protein kinase (MAPK) pathways, particularly p38 and ERK, undergo sustained hyperphosphorylation during cytokine storms, enhancing transcription of interleukin-6 (IL-6) and tumor necrosis factor-α (TNF-α), which perpetuate systemic inflammation. Genetic variants, such as the PTPN22 R620W polymorphism, impair the phosphatase activity of lymphoid tyrosine phosphatase (LYP), leading to excessive T-cell receptor signaling through unchecked tyrosine phosphorylation and increased risk of autoimmunity in diseases like rheumatoid arthritis. Cardiovascular diseases involve phosphorylation imbalances that affect vascular tone and cardiac remodeling. In endothelial cells, phosphorylation of endothelial nitric oxide synthase (eNOS) at Ser1177 by Akt enhances nitric oxide production, promoting vasodilation and protecting against hypertension. Conversely, in cardiac hypertrophy, calcineurin (PP2B), a serine/threonine phosphatase, is activated under stress conditions and dephosphorylates nuclear factor of activated T-cells (NFAT), allowing its nuclear translocation and induction of hypertrophic genes, contributing to pathological remodeling in heart failure. Chronic stress disrupts the phospho-balance by altering kinase-phosphatase dynamics, often through oxidative mechanisms that favor hyperphosphorylation of stress-responsive proteins while impairing dephosphorylation. For instance, sustained glucocorticoid exposure elevates JNK and IKKβ activity, shifting the equilibrium toward pro-inflammatory serine phosphorylation in multiple tissues. The PTPN22 variant exemplifies how genetic factors compound this imbalance, reducing negative regulation of T-cell phosphorylation and promoting autoimmune inflammation. Therapeutic strategies targeting phosphorylation have shown promise in these disorders. Metformin, a first-line treatment for type 2 diabetes, activates AMPK through enhanced phosphorylation at Thr172, improving insulin sensitivity and glucose metabolism independently of AMP levels. In inflammatory conditions, Janus kinase (JAK) inhibitors like tofacitinib block JAK phosphorylation following cytokine receptor engagement, suppressing downstream STAT activation and reducing cytokine storms in rheumatoid arthritis and other autoimmune diseases.

References

User Avatar
No comments yet.