Recent from talks
Nothing was collected or created yet.
Metabolomics
View on Wikipedia

Metabolomics is the scientific study of chemical processes involving metabolites, the small molecule substrates, intermediates, and products of cell metabolism. Specifically, metabolomics is the "systematic study of the unique chemical fingerprints that specific cellular processes leave behind", the study of their small-molecule metabolite profiles.[1] The metabolome represents the complete set of metabolites in a biological cell, tissue, organ, or organism, which are the end products of cellular processes.[2] Messenger RNA (mRNA), gene expression data, and proteomic analyses reveal the set of gene products being produced in the cell, data that represents one aspect of cellular function. Conversely, metabolic profiling can give an instantaneous snapshot of the physiology of that cell,[3] and thus, metabolomics provides a direct "functional readout of the physiological state" of an organism.[4] There are indeed quantifiable correlations between the metabolome and the other cellular ensembles (genome, transcriptome, proteome, and lipidome), which can be used to predict metabolite abundances in biological samples from, for example mRNA abundances.[5] One of the ultimate challenges of systems biology is to integrate metabolomics with all other -omics information to provide a better understanding of cellular biology.
History
[edit]The concept that individuals might have a "metabolic profile" that could be reflected in the makeup of their biological fluids was introduced by Roger Williams in the late 1940s,[6] who used paper chromatography to suggest characteristic metabolic patterns in urine and saliva were associated with diseases such as schizophrenia. However, it was only through technological advancements in the 1960s and 1970s that it became feasible to quantitatively (as opposed to qualitatively) measure metabolic profiles.[7] The term "metabolic profile" was introduced by Horning, et al. in 1971 after they demonstrated that gas chromatography-mass spectrometry (GC-MS) could be used to measure compounds present in human urine and tissue extracts.[8][9] The Horning group, along with that of Linus Pauling and Arthur B. Robinson led the development of GC-MS methods to monitor the metabolites present in urine through the 1970s.[10]
Concurrently, NMR spectroscopy, which was discovered in the 1940s, was also undergoing rapid advances. In 1974, Seeley et al. demonstrated the utility of using NMR to detect metabolites in unmodified biological samples.[11] This first study on muscle highlighted the value of NMR in that it was determined that 90% of cellular ATP is complexed with magnesium. As sensitivity has improved with the evolution of higher magnetic field strengths and magic angle spinning, NMR continues to be a leading analytical tool to investigate metabolism.[8][12] Recent efforts to utilize NMR for metabolomics have been largely driven by the laboratory of Jeremy K. Nicholson at Birkbeck College, University of London and later at Imperial College London. In 1984, Nicholson showed 1H NMR spectroscopy could potentially be used to diagnose diabetes mellitus, and later pioneered the application of pattern recognition methods to NMR spectroscopic data.[13][14]
In 1994 and 1996, liquid chromatography mass spectrometry metabolomics experiments[15][16] were performed by Gary Siuzdak while working with Richard Lerner (then president of the Scripps Research Institute) and Benjamin Cravatt, to analyze the cerebral spinal fluid from sleep deprived animals. One molecule of particular interest, oleamide, was observed and later shown to have sleep inducing properties. This work is one of the earliest such experiments combining liquid chromatography and mass spectrometry in metabolomics.
In 2005, the first metabolomics tandem mass spectrometry database, METLIN,[17][18] for characterizing human metabolites was developed in the Siuzdak laboratory at the Scripps Research Institute. METLIN has since grown and as of December, 2023, METLIN contains MS/MS experimental data on over 930,000 molecular standards and other chemical entities,[19] each compound having experimental tandem mass spectrometry data generated from molecular standards at multiple collision energies and in positive and negative ionization modes. METLIN is the largest repository of tandem mass spectrometry data of its kind. The dedicated academic journal Metabolomics first appeared in 2005, founded by its current editor-in-chief Roy Goodacre.
In 2005, the Siuzdak lab was engaged in identifying metabolites associated with sepsis and in an effort to address the issue of statistically identifying the most relevant dysregulated metabolites across hundreds of LC/MS datasets, the first algorithm was developed to allow for the nonlinear alignment of mass spectrometry metabolomics data. Called XCMS,[20] it has since (2012)[21] been developed as an online tool and as of 2019 (with METLIN) has over 30,000 registered users.
On 23 January 2007, the Human Metabolome Project, led by David S. Wishart, completed the first draft of the human metabolome, consisting of a database of approximately 2,500 metabolites, 1,200 drugs and 3,500 food components.[22][23] Similar projects have been underway in several plant species, most notably Medicago truncatula[24] and Arabidopsis thaliana[25] for several years.
As late as mid-2010, metabolomics was still considered an "emerging field".[26] Further, it was noted that further progress in the field depended in large part, through addressing otherwise "irresolvable technical challenges", by technical evolution of mass spectrometry instrumentation.[26]
In 2015, real-time metabolome profiling was demonstrated for the first time.[27]
Metabolome
[edit]
The metabolome refers to the complete set of small-molecule (<1.5 kDa)[22] metabolites (such as metabolic intermediates, hormones and other signaling molecules, and secondary metabolites) to be found within a biological sample, such as a single organism.[28][29] The word was coined in analogy with transcriptomics and proteomics; like the transcriptome and the proteome, the metabolome is dynamic, changing from second to second. Although the metabolome can be defined readily enough, it is not currently possible to analyse the entire range of metabolites by a single analytical method.
In January 2007, scientists at the University of Alberta and the University of Calgary completed the first draft of the human metabolome. The Human Metabolome Database (HMDB) is perhaps the most extensive public metabolomic spectral database to date[30] and is a freely available electronic database (www.hmdb.ca) containing detailed information about small molecule metabolites found in the human body. It is intended to be used for applications in metabolomics, clinical chemistry, biomarker discovery and general education. The database is designed to contain or link three kinds of data:
- Chemical data,
- Clinical data and
- Molecular biology/biochemistry data.
The database contains 220,945 metabolite entries including both water-soluble and lipid soluble metabolites. Additionally, 8,610 protein sequences (enzymes and transporters) are linked to these metabolite entries. Each MetaboCard entry contains 130 data fields with 2/3 of the information being devoted to chemical/clinical data and the other 1/3 devoted to enzymatic or biochemical data.[31] The version 3.5 of the HMDB contains >16,000 endogenous metabolites, >1,500 drugs and >22,000 food constituents or food metabolites.[32] This information, available at the Human Metabolome Database and based on analysis of information available in the current scientific literature, is far from complete.[33] In contrast, much more is known about the metabolomes of other organisms. For example, over 50,000 metabolites have been characterized from the plant kingdom, and many thousands of metabolites have been identified and/or characterized from single plants.[34][35]
Each type of cell and tissue has a unique metabolic 'fingerprint' that can elucidate organ or tissue-specific information. Bio-specimens used for metabolomics analysis include but not limit to plasma, serum, urine, saliva, feces, muscle, sweat, exhaled breath and gastrointestinal fluid.[36] The ease of collection facilitates high temporal resolution, and because they are always at dynamic equilibrium with the body, they can describe the host as a whole.[37] Genome can tell what could happen, transcriptome can tell what appears to be happening, proteome can tell what makes it happen and metabolome can tell what has happened and what is happening.[38]
Metabolites
[edit]| Part of a series on |
| Microbiomes |
|---|
Metabolites are the substrates, intermediates and products of metabolism. Within the context of metabolomics, a metabolite is usually defined as any molecule less than 1.5 kDa in size.[22] However, there are exceptions to this depending on the sample and detection method. For example, macromolecules such as lipoproteins and albumin are reliably detected in NMR-based metabolomics studies of blood plasma.[39] In plant-based metabolomics, it is common to refer to "primary" and "secondary" metabolites.[3] A primary metabolite is directly involved in the normal growth, development, and reproduction. A secondary metabolite is not directly involved in those processes, but usually has important ecological function. Examples include antibiotics and pigments.[40] By contrast, in human-based metabolomics, it is more common to describe metabolites as being either endogenous (produced by the host organism) or exogenous.[41][42] Metabolites of foreign substances such as drugs are termed xenometabolites.[43]
The metabolome derives from a large network of metabolic reactions, where outputs from one enzymatic chemical reaction are inputs to other chemical reactions. Such systems have been described as hypercycles.[citation needed]
Metabonomics
[edit]Metabonomics is defined as "the quantitative measurement of the dynamic multiparametric metabolic response of living systems to pathophysiological stimuli or genetic modification". The word origin is from the Greek μεταβολή meaning change and nomos meaning a rule set or set of laws.[44] This approach was pioneered by Jeremy Nicholson at Murdoch University and has been used in toxicology, disease diagnosis and a number of other fields. Historically, the metabonomics approach was one of the first methods to apply the scope of systems biology to studies of metabolism.[45][46][47]
There has been some disagreement over the exact differences between 'metabolomics' and 'metabonomics'. The difference between the two terms is not related to choice of analytical platform: although metabonomics is more associated with NMR spectroscopy and metabolomics with mass spectrometry-based techniques, this is simply because of usages amongst different groups that have popularized the different terms. While there is still no absolute agreement, there is a growing consensus that 'metabolomics' places a greater emphasis on metabolic profiling at a cellular or organ level and is primarily concerned with normal endogenous metabolism. 'Metabonomics' extends metabolic profiling to include information about perturbations of metabolism caused by environmental factors (including diet and toxins), disease processes, and the involvement of extragenomic influences, such as gut microflora. This is not a trivial difference; metabolomic studies should, by definition, exclude metabolic contributions from extragenomic sources, because these are external to the system being studied. However, in practice, within the field of human disease research there is still a large degree of overlap in the way both terms are used, and they are often in effect synonymous.[48]
Exometabolomics
[edit]Exometabolomics, or "metabolic footprinting", is the study of extracellular metabolites. It uses many techniques from other subfields of metabolomics, and has applications in biofuel development, bioprocessing, determining drugs' mechanism of action, and studying intercellular interactions.[49]
Analytical technologies
[edit]
The typical workflow of metabolomics studies is shown in the figure. First, samples are collected from tissue, plasma, urine, saliva, cells, etc. Next, metabolites are extracted often with the addition of internal standards and derivatization.[38] During sample analysis, metabolites are quantified (liquid chromatography or gas chromatography coupled with MS and/or NMR spectroscopy).[50] The raw output data can be used for metabolite feature extraction and further processed before statistical analysis (such as principal component analysis, PCA). Many bioinformatic tools and software are available to identify associations with disease states and outcomes, determine significant correlations, and characterize metabolic signatures with existing biological knowledge.[51]
Separation methods
[edit]Initially, analytes in a metabolomic sample comprise a highly complex mixture. This complex mixture can be simplified prior to detection by separating some analytes from others. Separation achieves various goals: analytes which cannot be resolved by the detector may be separated in this step; in MS analysis, ion suppression is reduced; the retention time of the analyte serves as information regarding its identity. This separation step is not mandatory and is often omitted in NMR and "shotgun" based approaches such as shotgun lipidomics.
Gas chromatography (GC), especially when interfaced with mass spectrometry (GC-MS), is a widely used separation technique for metabolomic analysis. GC offers very high chromatographic resolution, and can be used in conjunction with a flame ionization detector (GC/FID) or a mass spectrometer (GC-MS). The method is especially useful for identification and quantification of small and volatile molecules.[52] However, a practical limitation of GC is the requirement of chemical derivatization for many biomolecules as only volatile chemicals can be analysed without derivatization. In cases where greater resolving power is required, two-dimensional chromatography (GCxGC) can be applied.
High performance liquid chromatography (HPLC) has emerged as the most common separation technique for metabolomic analysis. With the advent of electrospray ionization, HPLC was coupled to MS. In contrast with GC, HPLC has lower chromatographic resolution, but requires no derivatization for polar molecules, and separates molecules in the liquid phase. Additionally HPLC has the advantage that a much wider range of analytes can be measured with a higher sensitivity than GC methods.[53]
Capillary electrophoresis (CE) has a higher theoretical separation efficiency than HPLC (although requiring much more time per separation), and is suitable for use with a wider range of metabolite classes than is GC. As for all electrophoretic techniques, it is most appropriate for charged analytes.[54]
In direct-infusion mass spectrometry (DI-MS), sample is directly introduced into the spectrometer and separation steps are skipped. DI-MS can be employed to perform single cell metabolic analysis of human cells.[55]
Detection methods
[edit]Mass spectrometry (MS) is used to identify and quantify metabolites after optional separation by GC, HPLC, or CE. GC-MS was the first hyphenated technique to be developed. Identification leverages the distinct patterns in which analytes fragment. These patterns can be thought of as a mass spectral fingerprint. Libraries exist that allow identification of a metabolite according to this fragmentation pattern [example needed]. MS is both sensitive and can be very specific. There are also a number of techniques which use MS as a stand-alone technology: the sample is infused directly into the mass spectrometer with no prior separation, and the MS provides sufficient selectivity to both separate and to detect metabolites.
For analysis by mass spectrometry, the analytes must be imparted with a charge and transferred to the gas phase. Electron ionization (EI) is the most common ionization technique applied to GC separations as it is amenable to low pressures. EI also produces fragmentation of the analyte, both providing structural information while increasing the complexity of the data and possibly obscuring the molecular ion. Atmospheric-pressure chemical ionization (APCI) is an atmospheric pressure technique that can be applied to all the above separation techniques. APCI is a gas phase ionization method, which provides slightly more aggressive ionization than ESI which is suitable for less polar compounds. Electrospray ionization (ESI) is the most common ionization technique applied in LC/MS. This soft ionization is most successful for polar molecules with ionizable functional groups. Another commonly used soft ionization technique is secondary electrospray ionization (SESI).
In the 2000s, surface-based mass analysis has seen a resurgence, with new MS technologies focused on increasing sensitivity, minimizing background, and reducing sample preparation. The ability to analyze metabolites directly from biofluids and tissues continues to challenge current MS technology, largely because of the limits imposed by the complexity of these samples, which contain thousands to tens of thousands of metabolites. Among the technologies being developed to address this challenge is Nanostructure-Initiator MS (NIMS),[56][57] a desorption/ ionization approach that does not require the application of matrix and thereby facilitates small-molecule (i.e., metabolite) identification. MALDI is also used; however, the application of a MALDI matrix can add significant background at < 1000 Da that complicates analysis of the low-mass range (i.e., metabolites). In addition, the size of the resulting matrix crystals limits the spatial resolution that can be achieved in tissue imaging. Because of these limitations, several other matrix-free desorption/ionization approaches have been applied to the analysis of biofluids and tissues.
Secondary ion mass spectrometry (SIMS) was one of the first matrix-free desorption/ionization approaches used to analyze metabolites from biological samples.[citation needed] SIMS uses a high-energy primary ion beam to desorb and generate secondary ions from a surface. The primary advantage of SIMS is its high spatial resolution (as small as 50 nm), a powerful characteristic for tissue imaging with MS. However, SIMS has yet to be readily applied to the analysis of biofluids and tissues because of its limited sensitivity at >500 Da and analyte fragmentation generated by the high-energy primary ion beam. Desorption electrospray ionization (DESI) is a matrix-free technique for analyzing biological samples that uses a charged solvent spray to desorb ions from a surface. Advantages of DESI are that no special surface is required and the analysis is performed at ambient pressure with full access to the sample during acquisition. A limitation of DESI is spatial resolution because "focusing" the charged solvent spray is difficult. However, a recent development termed laser ablation ESI (LAESI) is a promising approach to circumvent this limitation.[citation needed] Most recently, ion trap techniques such as orbitrap mass spectrometry are also applied to metabolomics research.[58]
Nuclear magnetic resonance (NMR) spectroscopy is the only detection technique which does not rely on separation of the analytes, and the sample can thus be recovered for further analyses. All kinds of small molecule metabolites can be measured simultaneously - in this sense, NMR is close to being a universal detector. The main advantages of NMR are high analytical reproducibility and simplicity of sample preparation. Practically, however, it is relatively insensitive compared to mass spectrometry-based techniques.[59][60]
Although NMR and MS are the most widely used modern-day techniques for detection, there are other methods in use. These include Fourier-transform ion cyclotron resonance,[61] ion-mobility spectrometry,[62] electrochemical detection (coupled to HPLC), Raman spectroscopy and radiolabel (when combined with thin-layer chromatography).[citation needed]
| Technology | Sensitivity (LOD) | Sample volume | Compatible with gases | Compatible with liquids | Compatible with solids | Start-up cost | Can be used in metabolite imaging (MALDI or DESI) | Advantages | Disadvantages |
|---|---|---|---|---|---|---|---|---|---|
| GC-MS | 0.5 μM | 0.1-0.2 mL | Yes | Yes | No | <$300,000 | No |
|
|
| LC-MS | 0.5 nM | 10—100 μL | No | Yes | Yes | >$300,000 | Yes |
|
|
| NMR spectroscopy | 5 μM | 10—100 μL | No | Yes | Yes | >US$1 million | Yes |
|
|
Statistical methods
[edit]The data generated in metabolomics usually consist of measurements performed on subjects under various conditions. These measurements may be digitized spectra, or a list of metabolite features. In its simplest form, this generates a matrix with rows corresponding to subjects and columns corresponding with metabolite features (or vice versa).[8] Several statistical programs are currently available for analysis of both NMR and mass spectrometry data. A great number of free software are already available for the analysis of metabolomics data shown in the table. Some statistical tools listed in the table were designed for NMR data analyses were also useful for MS data.[63] For mass spectrometry data, software is available that identifies molecules that vary in subject groups on the basis of mass-over-charge value and sometimes retention time depending on the experimental design.[64]
Once metabolite data matrix is determined, unsupervised data reduction techniques (e.g. PCA) can be used to elucidate patterns and connections. In many studies, including those evaluating drug-toxicity and some disease models, the metabolites of interest are not known a priori. This makes unsupervised methods, those with no prior assumptions of class membership, a popular first choice. The most common of these methods includes principal component analysis (PCA) which can efficiently reduce the dimensions of a dataset to a few which explain the greatest variation.[37] When analyzed in the lower-dimensional PCA space, clustering of samples with similar metabolic fingerprints can be detected. PCA algorithms aim to replace all correlated variables with a much smaller number of uncorrelated variables (referred to as principal components (PCs)) and retain most of the information in the original dataset.[65] This clustering can elucidate patterns and assist in the determination of disease biomarkers – metabolites that correlate most with class membership.
Linear models are commonly used for metabolomics data, but are affected by multicollinearity. On the other hand, multivariate statistics are thriving methods for high-dimensional correlated metabolomics data, of which the most popular one is Projection to Latent Structures (PLS) regression and its classification version PLS-DA. Other data mining methods, such as random forest, support-vector machines, etc. are received increasing attention for untargeted metabolomics data analysis.[66] In the case of univariate methods, variables are analyzed one by one using classical statistics tools (such as Student's t-test, ANOVA or mixed models) and only these with sufficient small p-values are considered relevant.[36] However, correction strategies should be used to reduce false discoveries when multiple comparisons are conducted since there is no standard method for measuring the total amount of metabolites directly in untargeted metabolomics.[67] For multivariate analysis, models should always be validated to ensure that the results can be generalized.
Machine learning and data mining
[edit]Machine learning is a powerful tool that can be used in metabolomics analysis. Recently, scientists have developed retention time prediction software. These tools allow researchers to apply artificial intelligence to the retention time prediction of small molecules in complex mixture, such as human plasma, plant extracts, foods, or microbial cultures. Retention time prediction increases the identification rate in liquid chromatography and can lead to an improved biological interpretation of metabolomics data.[68]
Key applications
[edit]Toxicity assessment/toxicology by metabolic profiling (especially of urine or blood plasma samples) detects the physiological changes caused by toxic insult of a chemical (or mixture of chemicals). In many cases, the observed changes can be related to specific syndromes, e.g. a specific lesion in liver or kidney. This is of particular relevance to pharmaceutical companies wanting to test the toxicity of potential drug candidates: if a compound can be eliminated before it reaches clinical trials on the grounds of adverse toxicity, it saves the enormous expense of the trials.[48]
For functional genomics, metabolomics can be an excellent tool for determining the phenotype caused by a genetic manipulation, such as gene deletion or insertion. Sometimes this can be a sufficient goal in itself—for instance, to detect any phenotypic changes in a genetically modified plant intended for human or animal consumption. More exciting is the prospect of predicting the function of unknown genes by comparison with the metabolic perturbations caused by deletion/insertion of known genes. Such advances are most likely to come from model organisms such as Saccharomyces cerevisiae and Arabidopsis thaliana. The Cravatt laboratory at the Scripps Research Institute has recently applied this technology to mammalian systems, identifying the N-acyltaurines as previously uncharacterized endogenous substrates for the enzyme fatty acid amide hydrolase (FAAH) and the monoalkylglycerol ethers (MAGEs) as endogenous substrates for the uncharacterized hydrolase KIAA1363.[69][70]
Metabologenomics is a novel approach to integrate metabolomics and genomics data by correlating microbial-exported metabolites with predicted biosynthetic genes.[71] This bioinformatics-based pairing method enables natural product discovery at a larger-scale by refining non-targeted metabolomic analyses to identify small molecules with related biosynthesis and to focus on those that may not have previously well known structures.
Fluxomics is a further development of metabolomics. The disadvantage of metabolomics is that it only provides the user with abundances or concentrations of metabolites, while fluxomics determines the reaction rates of metabolic reactions and can trace metabolites in a biological system over time.
Nutrigenomics is a generalised term which links genomics, transcriptomics, proteomics and metabolomics to human nutrition. In general, in a given body fluid, a metabolome is influenced by endogenous factors such as age, sex, body composition and genetics as well as underlying pathologies. The large bowel microflora are also a very significant potential confounder of metabolic profiles and could be classified as either an endogenous or exogenous factor. The main exogenous factors are diet and drugs. Diet can then be broken down to nutrients and non-nutrients. Metabolomics is one means to determine a biological endpoint, or metabolic fingerprint, which reflects the balance of all these forces on an individual's metabolism.[72][73] Thanks to recent cost reductions, metabolomics has now become accessible for companion animals, such as pregnant dogs.[74][75]
Volatolomics is a development of metabolomics that studies volatile organic compounds (VOCs) emitted by a biological systems.
Plant metabolomics is designed to study the overall changes in metabolites of plant samples and then conduct deep data mining and chemometric analysis. Specialized metabolites are considered components of plant defense systems biosynthesized in response to biotic and abiotic stresses.[76] Metabolomics approaches have recently been used to assess the natural variance in metabolite content between individual plants, an approach with great potential for the improvement of the compositional quality of crops.[77]
See also
[edit]- Epigenomics
- Fluxomics
- Genomics
- Lipidomics
- Molecular epidemiology
- Molecular medicine
- Molecular pathology
- Precision medicine
- Proteomics
- Transcriptomics
- XCMS Online, a bioinformatics software designed for statistical analysis of mass spectrometry data
References
[edit]- ^ Daviss B (April 2005). "Growing pains for metabolomics". The Scientist. 19 (8): 25–28. Archived from the original on 13 October 2008.
- ^ Jordan KW, Nordenstam J, Lauwers GY, Rothenberger DA, Alavi K, Garwood M, et al. (March 2009). "Metabolomic characterization of human rectal adenocarcinoma with intact tissue magnetic resonance spectroscopy". Diseases of the Colon and Rectum. 52 (3): 520–525. doi:10.1007/DCR.0b013e31819c9a2c. PMC 2720561. PMID 19333056.
- ^ a b Villate A, San Nicolas M, Gallastegi M, Aulas PA, Olivares M, Usobiaga A, et al. (February 2021). "Review: Metabolomics as a prediction tool for plants performance under environmental stress". Plant Science. 303 110789. Bibcode:2021PlnSc.30310789V. doi:10.1016/j.plantsci.2020.110789. PMID 33487364. S2CID 230533604.
- ^ Hollywood K, Brison DR, Goodacre R (September 2006). "Metabolomics: current technologies and future trends". Proteomics. 6 (17): 4716–4723. doi:10.1002/pmic.200600106. PMID 16888765. S2CID 14631544.
- ^ Cavicchioli MV, Santorsola M, Balboni N, Mercatelli D, Giorgi FM (March 2022). "Prediction of Metabolic Profiles from Transcriptomics Data in Human Cancer Cell Lines". International Journal of Molecular Sciences. 23 (7): 3867. doi:10.3390/ijms23073867. PMC 8998886. PMID 35409231.
- ^ Gates SC, Sweeley CC (October 1978). "Quantitative metabolic profiling based on gas chromatography". Clinical Chemistry. 24 (10): 1663–1673. doi:10.1093/clinchem/24.10.1663. PMID 359193.
- ^ Preti, George (June 6, 2005). "Metabolomics comes of age?". The Scientist. 19 (11): 8.
- ^ a b c van der Greef J, Smilde AK (May 2005). "Symbiosis of chemometrics and metabolomics: past, present, and future". Journal of Chemometrics. 19 (5–7): 376–386. doi:10.1002/cem.941. S2CID 122419960.
- ^ Shapiro I, Kavkalo DN, Petrova GV, Ganzin AP (1989). "[Angioleiomyoma of the large-intestinal mesentery complicated by diffuse peritonitis]". Sovetskaia Meditsina (in Russian) (9): 116. PMID 2603028.
- ^ Griffiths WJ, Wang Y (July 2009). "Mass spectrometry: from proteomics to metabolomics and lipidomics". Chemical Society Reviews. 38 (7): 1882–1896. doi:10.1039/b618553n. PMID 19551169. S2CID 12237358.
- ^ Hoult DI, Busby SJ, Gadian DG, Radda GK, Richards RE, Seeley PJ (November 1974). "Observation of tissue metabolites using 31P nuclear magnetic resonance". Nature. 252 (5481): 285–287. Bibcode:1974Natur.252..285H. doi:10.1038/252285a0. PMID 4431445. S2CID 4291661.
- ^ Nicholson JK, Lindon JC (October 2008). "Systems biology: Metabonomics". Nature. 455 (7216): 1054–1056. Bibcode:2008Natur.455.1054N. doi:10.1038/4551054a. PMID 18948945. S2CID 4411723.
- ^ Holmes E, Antti H (December 2002). "Chemometric contributions to the evolution of metabonomics: mathematical solutions to characterising and interpreting complex biological NMR spectra". The Analyst. 127 (12): 1549–1557. Bibcode:2002Ana...127.1549H. doi:10.1039/b208254n. PMID 12537357.
- ^ Lenz EM, Wilson ID (February 2007). "Analytical strategies in metabonomics". Journal of Proteome Research. 6 (2): 443–458. doi:10.1021/pr0605217. PMID 17269702.
- ^ Lerner RA, Siuzdak G, Prospero-Garcia O, Henriksen SJ, Boger DL, Cravatt BF (September 1994). "Cerebrodiene: a brain lipid isolated from sleep-deprived cats". Proceedings of the National Academy of Sciences of the United States of America. 91 (20): 9505–9508. Bibcode:1994PNAS...91.9505L. doi:10.1073/pnas.91.20.9505. PMC 44841. PMID 7937797.
- ^ Cravatt BF, Prospero-Garcia O, Siuzdak G, Gilula NB, Henriksen SJ, Boger DL, et al. (June 1995). "Chemical characterization of a family of brain lipids that induce sleep". Science. 268 (5216): 1506–1509. Bibcode:1995Sci...268.1506C. doi:10.1126/science.7770779. PMID 7770779.
- ^ Smith CA, O'Maille G, Want EJ, Qin C, Trauger SA, Brandon TR, et al. (December 2005). "METLIN: a metabolite mass spectral database". Therapeutic Drug Monitoring. 27 (6): 747–751. doi:10.1097/01.ftd.0000179845.53213.39. PMID 16404815. S2CID 14774455.
- ^ Guijas C, Montenegro-Burke JR, Domingo-Almenara X, Palermo A, Warth B, Hermann G, et al. (March 2018). "METLIN: A Technology Platform for Identifying Knowns and Unknowns". Analytical Chemistry. 90 (5): 3156–3164. Bibcode:2018AnaCh..90.3156G. doi:10.1021/acs.analchem.7b04424. PMC 5933435. PMID 29381867.
- ^ "The Analytical Scientist Innovation Awards 2023". The Analytical Scientist. 2023-12-12. Retrieved 2023-12-14.
- ^ Smith CA, Want EJ, O'Maille G, Abagyan R, Siuzdak G (February 2006). "XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification". Analytical Chemistry. 78 (3): 779–787. Bibcode:2006AnaCh..78..779S. doi:10.1021/ac051437y. PMID 16448051.
- ^ Tautenhahn R, Patti GJ, Rinehart D, Siuzdak G (June 2012). "XCMS Online: a web-based platform to process untargeted metabolomic data". Analytical Chemistry. 84 (11): 5035–5039. Bibcode:2012AnaCh..84.5035T. doi:10.1021/ac300698c. PMC 3703953. PMID 22533540.
- ^ a b c Wishart DS, Tzur D, Knox C, Eisner R, Guo AC, Young N, et al. (January 1, 2007). "HMDB: the Human Metabolobe Database". Nucleic Acids Research. 35 (D1): D521 – D526. doi:10.1093/nar/gkl923. PMC 1899095. PMID 17202168.
- ^ Wishart DS, Knox C, Guo AC, Eisner R, Young N, Gautam B, et al. (1 January 2009). "HMDB: a knowledgebase for the human metabolome". Nucleic Acids Research. 37 (D1): D603 – D610. doi:10.1093/nar/gkn810. PMC 2686599. PMID 18953024.
- ^ Farag MA, Huhman DV, Dixon RA, Sumner LW (February 2008). "Metabolomics reveals novel pathways and differential mechanistic and elicitor-specific responses in phenylpropanoid and isoflavonoid biosynthesis in Medicago truncatula cell cultures". Plant Physiology. 146 (2): 387–402. doi:10.1104/pp.107.108431. PMC 2245840. PMID 18055588.
- ^ "www.Plantmetabolomics.org". Nov 7, 2012. Archived from the original on 2012-11-07. Retrieved May 20, 2020.
- ^ a b Morrow Jr KJ (1 April 2010). "Mass Spec Central to Metabolomics". Genetic Engineering & Biotechnology News. Vol. 30, no. 7. p. 1. Archived from the original on 12 August 2011. Retrieved 28 June 2010.
- ^ "Real-time analysis of metabolic products". phys.org. Retrieved May 20, 2020.
- ^ Oliver SG, Winson MK, Kell DB, Baganz F (September 1998). "Systematic functional analysis of the yeast genome". Trends in Biotechnology. 16 (9): 373–378. doi:10.1016/S0167-7799(98)01214-1. PMID 9744112.
- ^ Griffin JL, Vidal-Puig A (June 2008). "Current challenges in metabolomics for diabetes research: a vital functional genomic tool or just a ploy for gaining funding?". Physiological Genomics. 34 (1): 1–5. doi:10.1152/physiolgenomics.00009.2008. PMID 18413782. S2CID 9416755.
- ^ HMDB 4.0 – the human metabolome database in 2018.
- ^ Wishart DS, Guo A, Oler E, Wang F, Anjum A, Peters H, et al. (January 2022). "HMDB 5.0: the Human Metabolome Database for 2022". Nucleic Acids Research. 50 (D1): D622 – D631. doi:10.1093/nar/gkab1062. PMC 8728138. PMID 34986597.
- ^ Wishart DS, Jewison T, Guo AC, Wilson M, Knox C, Liu Y, et al. (January 2013). "HMDB 3.0—The Human Metabolome Database in 2013". Nucleic Acids Research. 41 (Database issue): D801 – D807. doi:10.1093/nar/gks1065. PMC 3531200. PMID 23161693.
- ^ Pearson H (March 2007). "Meet the human metabolome". Nature. 446 (7131): 8. Bibcode:2007Natur.446....8P. doi:10.1038/446008a. PMID 17330009. S2CID 2235062.
- ^ De Luca V, St Pierre B (April 2000). "The cell and developmental biology of alkaloid biosynthesis". Trends in Plant Science. 5 (4): 168–173. doi:10.1016/S1360-1385(00)01575-2. PMID 10740298.
- ^ Griffin JL, Shockcor JP (July 2004). "Metabolic profiles of cancer cells". Nature Reviews. Cancer. 4 (7): 551–561. doi:10.1038/nrc1390. PMID 15229480. S2CID 527894.
- ^ a b Ulaszewska MM, Weinert CH, Trimigno A, Portmann R, Andres Lacueva C, Badertscher R, et al. (January 2019). "Nutrimetabolomics: An Integrative Action for Metabolomic Analyses in Human Nutritional Studies". Molecular Nutrition & Food Research. 63 (1) 1800384: e1800384. doi:10.1002/mnfr.201800384. hdl:11572/214273. PMID 30176196.
{{cite journal}}: CS1 maint: article number as page number (link) - ^ a b Nicholson JK, Wilson ID (August 2003). "Opinion: understanding 'global' systems biology: metabonomics and the continuum of metabolism". Nature Reviews. Drug Discovery. 2 (8): 668–676. doi:10.1038/nrd1157. PMID 12904817. S2CID 23743031.
- ^ a b Dettmer K, Aronov PA, Hammock BD (2007). "Mass spectrometry-based metabolomics". Mass Spectrometry Reviews. 26 (1): 51–78. Bibcode:2007MSRv...26...51D. doi:10.1002/mas.20108. PMC 1904337. PMID 16921475.
- ^ Nicholson JK, Foxall PJ, Spraul M, Farrant RD, Lindon JC (March 1995). "750 MHz 1H and 1H-13C NMR spectroscopy of human blood plasma". Analytical Chemistry. 67 (5): 793–811. Bibcode:1995AnaCh..67..793N. doi:10.1021/ac00101a004. PMID 7762816.
- ^ Bentley R (1999). "Secondary metabolite biosynthesis: the first century". Critical Reviews in Biotechnology. 19 (1): 1–40. doi:10.1080/0738-859991229189. PMID 10230052.
- ^ Nordström A, O'Maille G, Qin C, Siuzdak G (May 2006). "Nonlinear data alignment for UPLC-MS and HPLC-MS based metabolomics: quantitative analysis of endogenous and exogenous metabolites in human serum". Analytical Chemistry. 78 (10): 3289–3295. doi:10.1021/ac060245f. PMC 3705959. PMID 16689529.
- ^ Lin W, Conway LP, Block A, Sommi G, Vujasinovic M, Löhr JM, et al. (June 2020). "Sensitive mass spectrometric analysis of carbonyl metabolites in human urine and fecal samples using chemoselective modification". The Analyst. 145 (11): 3822–3831. Bibcode:2020Ana...145.3822L. doi:10.1039/D0AN00150C. PMID 32393929.
- ^ Crockford DJ, Maher AD, Ahmadi KR, Barrett A, Plumb RS, Wilson ID, et al. (September 2008). "1H NMR and UPLC-MS(E) statistical heterospectroscopy: characterization of drug metabolites (xenometabolome) in epidemiological studies". Analytical Chemistry. 80 (18): 6835–6844. Bibcode:2008AnaCh..80.6835C. doi:10.1021/ac801075m. PMID 18700783.
- ^ Nicholson JK (2006). "Global systems biology, personalized medicine and molecular epidemiology". Molecular Systems Biology. 2 (1) 52. doi:10.1038/msb4100095. PMC 1682018. PMID 17016518.
- ^ Nicholson JK, Lindon JC, Holmes E (November 1999). "'Metabonomics': understanding the metabolic responses of living systems to pathophysiological stimuli via multivariate statistical analysis of biological NMR spectroscopic data". Xenobiotica; the Fate of Foreign Compounds in Biological Systems. 29 (11): 1181–1189. doi:10.1080/004982599238047. PMID 10598751.
- ^ Nicholson JK, Connelly J, Lindon JC, Holmes E (February 2002). "Metabonomics: a platform for studying drug toxicity and gene function". Nature Reviews. Drug Discovery. 1 (2): 153–161. doi:10.1038/nrd728. PMID 12120097. S2CID 17881327.
- ^ Holmes E, Wilson ID, Nicholson JK (September 2008). "Metabolic phenotyping in health and disease". Cell. 134 (5): 714–717. doi:10.1016/j.cell.2008.08.026. PMID 18775301. S2CID 6677621.
- ^ a b Robertson DG (June 2005). "Metabonomics in toxicology: a review". Toxicological Sciences. 85 (2): 809–822. doi:10.1093/toxsci/kfi102. PMID 15689416.
- ^ Silva LP, Northen TR (August 2015). "Exometabolomics and MSI: deconstructing how cells interact to transform their small molecule environment". Current Opinion in Biotechnology. 34. Elsevier BV: 209–216. doi:10.1016/j.copbio.2015.03.015. PMID 25855407.
- ^ Lu W, Su X, Klein MS, Lewis IA, Fiehn O, Rabinowitz JD (June 2017). "Metabolite Measurement: Pitfalls to Avoid and Practices to Follow". Annual Review of Biochemistry. 86 (1): 277–304. doi:10.1146/annurev-biochem-061516-044952. PMC 5734093. PMID 28654323.
- ^ Rasmiena AA, Ng TW, Meikle PJ (March 2013). "Metabolomics and ischaemic heart disease". Clinical Science. 124 (5): 289–306. doi:10.1042/CS20120268. PMID 23157406.
- ^ "Gas Chromatography Mass Spectrometry (GC-MS) Information". Thermo Fisher Scientific - US. Retrieved 2018-09-26.
- ^ Gika HG, Theodoridis GA, Wingate JE, Wilson ID (August 2007). "Within-day reproducibility of an HPLC-MS-based method for metabonomic analysis: application to human urine". Journal of Proteome Research. 6 (8): 3291–3303. doi:10.1021/pr070183p. PMID 17625818.
- ^ Soga T, Ohashi Y, Ueno Y, Naraoka H, Tomita M, Nishioka T (September 2003). "Quantitative metabolome analysis using capillary electrophoresis mass spectrometry". Journal of Proteome Research. 2 (5): 488–494. doi:10.1021/pr034020m. PMID 14582645.
- ^ Tang H, Ali A, Abdelazem E, Ottenhoff TH, Heeren RM, Mashaghi A (August 2023). "Random forest and live single-cell metabolomics reveal metabolic profiles of human macrophages upon polarization". Biotechnology and Bioengineering. 120 (8): 2314–2325. Bibcode:2023BiotB.120.2314T. doi:10.1002/bit.28494. hdl:1887/3633712. PMID 37424521.
- ^ Northen TR, Yanes O, Northen MT, Marrinucci D, Uritboonthai W, Apon J, et al. (October 2007). "Clathrate nanostructures for mass spectrometry". Nature. 449 (7165): 1033–1036. Bibcode:2007Natur.449.1033N. doi:10.1038/nature06195. PMID 17960240. S2CID 4404703.
- ^ Woo HK, Northen TR, Yanes O, Siuzdak G (July 2008). "Nanostructure-initiator mass spectrometry: a protocol for preparing and applying NIMS surfaces for high-sensitivity mass analysis". Nature Protocols. 3 (8): 1341–1349. doi:10.1038/NPROT.2008.110. PMID 18714302. S2CID 20620548.
- ^ Ghaste M, Mistrik R, Shulaev V (May 2016). "Applications of Fourier Transform Ion Cyclotron Resonance (FT-ICR) and Orbitrap Based High Resolution Mass Spectrometry in Metabolomics and Lipidomics". International Journal of Molecular Sciences. 17 (6): 816. doi:10.3390/ijms17060816. PMC 4926350. PMID 27231903.
- ^ Griffin JL (October 2003). "Metabonomics: NMR spectroscopy and pattern recognition analysis of body fluids and tissues for characterisation of xenobiotic toxicity and disease diagnosis". Current Opinion in Chemical Biology. 7 (5): 648–654. doi:10.1016/j.cbpa.2003.08.008. PMID 14580571.
- ^ Beckonert O, Keun HC, Ebbels TM, Bundy J, Holmes E, Lindon JC, et al. (2007). "Metabolic profiling, metabolomic and metabonomic procedures for NMR spectroscopy of urine, plasma, serum and tissue extracts". Nature Protocols. 2 (11): 2692–2703. doi:10.1038/nprot.2007.376. PMID 18007604. S2CID 205463871.
- ^ Habchi B, Alves S, Jouan-Rimbaud Bouveresse D, Appenzeller B, Paris A, Rutledge DN, et al. (January 2018). "Potential of dynamically harmonized Fourier transform ion cyclotron resonance cell for high-throughput metabolomics fingerprinting: control of data quality". Analytical and Bioanalytical Chemistry. 410 (2): 483–490. doi:10.1007/s00216-017-0738-3. PMID 29167936. S2CID 3769892.
- ^ King AM, Mullin LG, Wilson ID, Coen M, Rainville PD, Plumb RS, et al. (January 2019). "Development of a rapid profiling method for the analysis of polar analytes in urine using HILIC-MS and ion mobility enabled HILIC-MS". Metabolomics. 15 (2) 17. doi:10.1007/s11306-019-1474-9. PMC 6342856. PMID 30830424.
- ^ Sugimoto M, Kawakami M, Robert M, Soga T, Tomita M (March 2012). "Bioinformatics Tools for Mass Spectroscopy-Based Metabolomic Data Processing and Analysis". Current Bioinformatics. 7 (1): 96–108. doi:10.2174/157489312799304431. PMC 3299976. PMID 22438836.
- ^ Spicer R, Salek RM, Moreno P, Cañueto D, Steinbeck C (2017). "Navigating freely-available software tools for metabolomics analysis". Metabolomics. 13 (9) 106. doi:10.1007/s11306-017-1242-7. PMC 5550549. PMID 28890673.
- ^ Ren S, Hinzman AA, Kang EL, Szczesniak RD, Lu LJ (December 2015). "Computational and statistical analysis of metabolomics data". Metabolomics. 11 (6): 1492–513. doi:10.1007/s11306-015-0823-6. S2CID 15712363.
- ^ Gromski PS, Muhamadali H, Ellis DI, Xu Y, Correa E, Turner ML, et al. (June 2015). "A tutorial review: Metabolomics and partial least squares-discriminant analysis--a marriage of convenience or a shotgun wedding" (PDF). Analytica Chimica Acta. 879: 10–23. Bibcode:2015AcAC..879...10G. doi:10.1016/j.aca.2015.02.012. PMID 26002472.
- ^ "Metabolomics in the clinic: A review of the shared and unique features of untargeted metabolomics for clinical research and clinical testing". Metabolon. Retrieved 2022-11-08.
- ^ Bonini P, Kind T, Tsugawa H, Barupal DK, Fiehn O (June 2020). "Retip: Retention Time Prediction for Compound Annotation in Untargeted Metabolomics". Analytical Chemistry. 92 (11): 7515–7522. Bibcode:2020AnaCh..92.7515B. doi:10.1021/acs.analchem.9b05765. PMC 8715951. PMID 32390414.
- ^ Saghatelian A, Trauger SA, Want EJ, Hawkins EG, Siuzdak G, Cravatt BF (November 2004). "Assignment of endogenous substrates to enzymes by global metabolite profiling". Biochemistry. 43 (45): 14332–14339. doi:10.1021/bi0480335. PMID 15533037.
- ^ Chiang KP, Niessen S, Saghatelian A, Cravatt BF (October 2006). "An enzyme that regulates ether lipid signaling pathways in cancer annotated by multidimensional profiling". Chemistry & Biology. 13 (10): 1041–1050. doi:10.1016/j.chembiol.2006.08.008. PMID 17052608.
- ^ Goering AW, McClure RA, Doroghazi JR, Albright JC, Haverland NA, Zhang Y, et al. (February 2016). "Metabologenomics: Correlation of Microbial Gene Clusters with Metabolites Drives Discovery of a Nonribosomal Peptide with an Unusual Amino Acid Monomer". ACS Central Science. 2 (2): 99–108. doi:10.1021/acscentsci.5b00331. PMC 4827660. PMID 27163034.
- ^ Gibney MJ, Walsh M, Brennan L, Roche HM, German B, van Ommen B (September 2005). "Metabolomics in human nutrition: opportunities and challenges". The American Journal of Clinical Nutrition. 82 (3): 497–503. doi:10.1093/ajcn/82.3.497. PMID 16155259.
- ^ Christopher KB (March 2018). "Nutritional metabolomics in critical illness". Current Opinion in Clinical Nutrition and Metabolic Care. 21 (2): 121–125. doi:10.1097/MCO.0000000000000451. PMC 5826639. PMID 29251691.
- ^ Arlt SP, Ottka C, Lohi H, Hinderer J, Lüdeke J, Müller E, et al. (2023-05-10). Mükremin Ö (ed.). "Metabolomics during canine pregnancy and lactation". PLOS ONE. 18 (5) e0284570. Bibcode:2023PLoSO..1884570A. doi:10.1371/journal.pone.0284570. PMC 10171673. PMID 37163464.
- ^ Ottka C, Vapalahti K, Arlt SP, Bartel A, Lohi H (2023-02-02). "The metabolic differences of anestrus, heat, pregnancy, pseudopregnancy, and lactation in 800 female dogs". Frontiers in Veterinary Science. 10 1105113. doi:10.3389/fvets.2023.1105113. PMC 9932911. PMID 36816179.
- ^ Cotrim GD, Silva DM, Graça JP, Oliveira Junior A, Castro C, Zocolo GJ, et al. (January 2023). "Glycine max (L.) Merr. (Soybean) metabolome responses to potassium availability". Phytochemistry. 205 113472. Bibcode:2023PChem.205k3472C. doi:10.1016/j.phytochem.2022.113472. PMID 36270412.
- ^ Schauer N, Fernie AR (October 2006). "Plant metabolomics: towards biological function and mechanism". Trends in Plant Science. 11 (10): 508–516. Bibcode:2006TPS....11..508S. doi:10.1016/j.tplants.2006.08.007. PMID 16949327.
Further reading
[edit]- Bundy JG, Davey MP, Viant MR (2009). "Environmental metabolomics: A critical review and future perspectives". Metabolomics. 5: 3–21. doi:10.1007/s11306-008-0152-0. S2CID 22179989.
- Claudino WM, Quattrone A, Biganzoli L, Pestrin M, Bertini I, Di Leo A (July 2007). "Metabolomics: available results, current research projects in breast cancer, and future applications". Journal of Clinical Oncology. 25 (19): 2840–2846. doi:10.1200/JCO.2006.09.7550. PMID 17502626. Archived from the original on 2008-01-20.
- Ellis DI, Dunn WB, Griffin JL, Allwood JW, Goodacre R (September 2007). "Metabolic fingerprinting as a diagnostic tool". Pharmacogenomics. 8 (9): 1243–1266. doi:10.2217/14622416.8.9.1243. PMID 17924839.
- Ellis DI, Goodacre R (August 2006). "Metabolic fingerprinting in disease diagnosis: biomedical applications of infrared and Raman spectroscopy". The Analyst. 131 (8): 875–885. Bibcode:2006Ana...131..875E. doi:10.1039/b602376m. PMID 17028718.
- Fan TW, Lorkiewicz PK, Sellers K, Moseley HN, Higashi RM, Lane AN (March 2012). "Stable isotope-resolved metabolomics and applications for drug development". Pharmacology & Therapeutics. 133 (3): 366–391. doi:10.1016/j.pharmthera.2011.12.007. PMC 3471671. PMID 22212615.
- Haug K, Salek RM, Conesa P, Hastings J, de Matos P, Rijnbeek M, et al. (January 2013). "MetaboLights--an open-access general-purpose repository for metabolomics studies and associated meta-data". Nucleic Acids Research. 41 (Database issue): D781 – D786. doi:10.1093/nar/gks1004. PMC 3531110. PMID 23109552.
- Tomita M, Nishioka T (2005). Metabolomics: The Frontier of Systems Biology. Springer. ISBN 4-431-25121-9.
- Weckwerth W (2006). Metabolomics: Methods And Protocols (Methods in Molecular Biology). Humana Press. ISBN 1-588-29561-3. OCLC 493824826.
External links
[edit]Metabolomics
View on GrokipediaOverview
Definition and Scope
Metabolomics is defined as the comprehensive, qualitative, and quantitative analysis of all small-molecule metabolites present in a biological system, such as cells, tissues, organs, or organisms, under specific physiological or environmental conditions.[1] This field emerged as a key component of systems biology, focusing on the metabolome—the complete set of metabolites within a system—to capture the dynamic biochemical state at a given time.[5] The scope of metabolomics encompasses both endogenous metabolites, which are produced by the organism's own metabolic processes (such as amino acids, sugars, and lipids), and exogenous metabolites derived from external sources like diet, drugs, or microbial interactions.[6] These metabolites are typically low-molecular-weight compounds under 1,500 Da, excluding larger macromolecules such as proteins, nucleic acids, or polysaccharides.[7] This boundary ensures focus on small molecules that serve as substrates, intermediates, or products in metabolic reactions, providing a snapshot of biochemical activity without overlapping into other omics disciplines.[8] The primary objectives of metabolomics include identifying and characterizing metabolic profiles to map biochemical pathways, elucidating how these pathways respond to genetic variations, environmental stressors, or disease states, and linking metabolite changes to observable phenotypes or physiological outcomes.[5] By quantifying alterations in metabolite levels, researchers can infer functional impacts of upstream biological processes, such as gene expression or protein activity, offering insights into how organisms adapt to internal or external perturbations. In distinction from other omics fields like genomics or proteomics, metabolomics serves as the downstream readout of gene-protein interactions, directly reflecting the integrated effects of genetic, transcriptional, and translational regulation on cellular function and environmental responses.[9] This positions metabolomics as a proximal measure of phenotype, bridging the gap between molecular mechanisms and observable traits in a way that upstream omics cannot achieve alone.Importance and Interdisciplinary Role
Metabolomics serves as a vital bridge between genomics, transcriptomics, and proteomics, integrating these datasets to connect genotype with phenotype and reveal underlying functional biology. By analyzing the end products of cellular processes, it uncovers dynamic metabolic pathway dysregulations that genomics and transcriptomics alone cannot fully explain, providing deeper insights into disease mechanisms such as those in cancer and fibrosis.[10] For example, joint transcriptomic-metabolomic analyses have highlighted enhanced enzyme-metabolite coupling in breast cancer, illustrating how metabolomics elucidates molecular interactions at the systems level.[11] This interdisciplinary integration supports comprehensive pathway analysis, advancing the understanding of complex biological responses.[12] The field significantly contributes to personalized medicine through biomarker discovery and the evaluation of individual responses to therapeutic interventions or dietary changes. In pharmacotherapy, baseline metabolomic profiles predict drug efficacy, such as elevated bile acids indicating better LDL cholesterol reduction with simvastatin treatment.[13] Similarly, in nutrition, metabolomics identifies metabotypes that stratify dietary responses, enabling tailored recommendations; for instance, branched-chain amino acids serve as biomarkers for type 2 diabetes risk influenced by diet.[14] These applications facilitate patient stratification and optimize treatment outcomes by capturing personalized metabolic variations.[15] Metabolomics extends across disciplines, including toxicology, nutrition, and ecology, where it informs exposure effects and environmental impacts. In toxicology, known as toxicometabolomics, it profiles metabolic perturbations from chemical exposures, aiding risk assessment and mechanism elucidation.[16] Nutritional applications reveal dietary intake biomarkers, such as those for red meat or whole grains, linking consumption to health outcomes like diabetes risk.[14] In ecology, metabolomics assesses pollutant effects on wildlife, for example, detecting altered energy metabolism and steatosis in Atlantic bluefin tuna exposed to persistent organic pollutants.[17] Metabolic fingerprinting further enhances diagnostics by generating rapid metabolite pattern profiles for disease identification using techniques like NMR and mass spectrometry.[18] A key quantitative impact of metabolomics lies in its production of high-dimensional datasets, often encompassing thousands of metabolite features per sample, which enable holistic, systems-wide views of biological states.[19] This scale captures the complexity of metabolic networks, supporting integrative analyses that reveal emergent properties in health and disease.[10]Historical Development
Early Foundations
The foundations of metabolomics trace back to 19th-century biochemistry, when chemists began systematically identifying and characterizing small organic molecules central to biological processes. Pioneers such as Justus von Liebig advanced the understanding of metabolic transformations through studies on fermentation, respiration, and nutrient breakdown, establishing the groundwork for viewing metabolism as a network of chemical reactions. By the late 1800s, the structural elucidation of key metabolites like glucose, urea, and lactic acid had been achieved, laying the conceptual basis for later quantitative analyses of biological fluids.[20] In the early 20th century, Archibald Garrod's seminal work further solidified these roots by introducing the concept of "inborn errors of metabolism" in his 1908 Croonian Lectures and subsequent book. Garrod described disorders such as alkaptonuria, cystinuria, albinism, and pentosuria as inherited conditions arising from blocks in specific metabolic pathways, leading to the accumulation of abnormal metabolites detectable in urine. This approach marked the first targeted profiling of metabolites for diagnostic purposes, emphasizing how genetic defects disrupt normal biochemical flows.[21] From the 1940s through the 1960s, extensions of Garrod's ideas involved more routine examination of urinary metabolite patterns to identify inborn errors, facilitated by emerging separation techniques. Researchers employed paper chromatography to resolve amino acids and other compounds in urine samples, enabling population-based screening for disorders like phenylketonuria and maple syrup urine disease. These methods, though qualitative and labor-intensive, represented early systematic efforts to capture metabolic snapshots, often revealing diagnostic patterns of elevated or missing compounds in affected individuals.[22] A pivotal advancement came in 1971 with the work of Evan C. Horning and colleagues, who applied gas chromatography-mass spectrometry (GC-MS) to generate comprehensive metabolic profiles from human urine, blood, and tissues. This technique allowed the simultaneous detection of steroids, organic acids, and other volatiles, demonstrating the feasibility of broad-spectrum metabolite analysis and coining the phrase "metabolic profiles" for such datasets. Although performed ad hoc in clinical and biochemical contexts before the 1990s, these pre-high-throughput efforts in clinical chemistry focused on targeted diagnostics rather than global overviews. The formal term "metabolome"—denoting the entirety of metabolites within a biological system—was introduced later in 1998 by Stephen G. Oliver and colleagues, retrospectively framing these historical analyses as precursors to the field.[23] The discovery of the citric acid cycle (Krebs cycle) in 1937 by Hans A. Krebs and William A. Johnson exemplified early metabolic pathway mapping, integrating 19th- and early 20th-century biochemical insights into a coherent cycle of carbon oxidation that underpins cellular energy production. This elucidation of interconnected reactions highlighted the dynamic nature of metabolite interconversions, influencing subsequent profiling strategies.[24]Key Milestones and Technological Advances
The field of metabolomics emerged in the 1990s as a distinct discipline, driven by early applications of nuclear magnetic resonance (NMR) spectroscopy and mass spectrometry (MS) to profile metabolites in plant and microbial systems. These initial studies focused on comprehensive metabolic fingerprinting to link genotypes to phenotypes, marking a shift from targeted analyses to global approaches. In 2000, a study by Oliver Fiehn and colleagues on plant functional genomics demonstrated the potential of gas chromatography-MS for identifying hundreds of metabolites in Arabidopsis mutants, establishing metabolomics as complementary to genomics and proteomics.[25] A seminal contribution came in 2002 when Oliver Fiehn defined metabolomics as "a post-genomic technology for the systematic study of the unique chemical fingerprints that specific cellular processes leave behind, specifically using the complete set of metabolites within a cell, tissue, or organism."[26] In the 2000s, organizational and database efforts accelerated the field's growth, integrating metabolomics with genomics through international consortia. The Metabolomics Society was founded in 2004 to foster collaboration, standardize methods, and promote education among researchers worldwide.[27] That same year, the Human Metabolome Project (HMP) was launched to systematically identify and quantify all detectable metabolites in human tissues, biofluids, and cells, initially cataloging approximately 2,200 compounds by 2007 via high-throughput NMR and MS techniques.[28] This project not only expanded knowledge of the human metabolome but also facilitated cross-disciplinary integration, such as combining metabolomic data with genomic profiles to uncover disease biomarkers.[29] The 2010s and 2020s witnessed transformative technological advances, particularly in untargeted metabolomics, which enables broad-spectrum detection without prior knowledge of metabolites, and spatial metabolomics for mapping metabolite distributions in tissues. Untargeted approaches evolved with software like XCMS (introduced in 2006 but refined through the 2010s) for processing LC-MS data, allowing detection of thousands of features per sample.[30] Spatial metabolomics advanced significantly with matrix-assisted laser desorption/ionization (MALDI) imaging MS, which gained prominence around 2015 for high-resolution metabolite localization in biological tissues, such as visualizing lipid gradients in plant roots or drug metabolites in tumors. Concurrently, artificial intelligence (AI) has revolutionized metabolite annotation by predicting structures from MS spectra using deep learning models, improving accuracy for unknown compounds in complex datasets.[31] By 2025, databases like the Human Metabolome Database (HMDB) have grown to over 220,000 metabolite entries, reflecting expanded coverage from ongoing HMP updates and global contributions.[32] A pivotal milestone aiding metabolite localization was the 2012 Nobel Prize in Chemistry awarded to Eric Betzig, Stefan Hell, and William E. Moerner for developing super-resolved fluorescence microscopy, which overcomes the diffraction limit to achieve nanoscale resolution. This breakthrough has enhanced spatial metabolomics by enabling precise correlation of fluorescently labeled metabolites with MS imaging data, revealing subcellular distributions previously unattainable.Core Concepts
The Metabolome
The metabolome refers to the complete set of all small-molecule metabolites present within a biological system, such as a cell, tissue, organ, or organism, at a specific point in time.[33] This collection represents the downstream products of genomic, transcriptomic, and proteomic activities, providing a snapshot of physiological state influenced by genetic makeup, environmental factors, and physiological conditions.[1] First coined in 1998 to describe the metabolites synthesized by an organism in parallel to the transcriptome and proteome, the metabolome encapsulates the functional output of cellular metabolism. The metabolome can be categorized into a core or constitutional component, consisting of stable, conserved metabolites essential for basic cellular functions and present across similar biological systems, and a dynamic or fluxional component, which includes condition-specific metabolites that vary with external stimuli or internal changes.[34] The core metabolome typically involves a relatively small number of fundamental compounds, such as those in central metabolic pathways, while the dynamic portion reflects adaptive responses to factors like stress or diet.[35] This distinction highlights the metabolome's dual nature: a stable foundation overlaid with transient variability. Estimates suggest the human metabolome encompasses approximately 100,000 distinct compounds, though databases like the Human Metabolome Database (HMDB) catalog over 250,000 potential entries, including detected, quantified, and predicted metabolites, underscoring its vast chemical diversity.[36] This diversity spans major classes such as lipids, amino acids, sugars, organic acids, and nucleotides, with many exhibiting structural complexity and functional specificity.[37] Full characterization remains challenging due to the instability of certain metabolites, their occurrence at low concentrations, and the influence of variability factors like diurnal cycles, age, and sex, which can alter profiles significantly across individuals and time points.[38][39] These elements complicate comprehensive mapping but emphasize the metabolome's role as a sensitive indicator of health and environmental interactions.Metabolites: Types and Functions
Metabolites serve as the end products of cellular regulatory processes, offering a dynamic reflection of an organism's real-time physiological state in response to genetic and environmental influences, unlike the relatively static genome.[40] This positions them as key indicators in metabolomics, where the metabolome—the comprehensive set of all metabolites—captures the integrated outcomes of metabolic pathways.[40] Metabolites are commonly classified into primary and secondary categories based on their essentiality to core biological functions. Primary metabolites are vital for growth, development, and reproduction, encompassing compounds such as amino acids, nucleotides, carbohydrates, lipids, and organic acids that support fundamental processes like protein synthesis and energy production.[41] In contrast, secondary metabolites, such as alkaloids, terpenoids, and flavonoids, are not required for basic cellular maintenance but confer specialized advantages, including defense against pathogens, attraction of pollinators, and adaptation to environmental stresses.[41] An alternative classification organizes metabolites by their chemical composition, highlighting their diversity. Organic metabolites form the majority and include carbohydrates (e.g., glucose for energy), lipids (e.g., phospholipids for membranes), and organic acids (e.g., citric acid in metabolic cycles), all of which are carbon-based and endogenously produced.[42] Inorganic metabolites consist of small ions like Na⁺ (maintaining osmotic balance and nerve signaling), K⁺ (regulating membrane potentials), and Ca²⁺ (mediating muscle contraction and enzymatic activation).[43] Exogenous metabolites, often termed xenobiotics, are externally derived compounds such as drugs, pollutants, and dietary components that undergo biotransformation within the organism.[44] The functions of metabolites are multifaceted, underpinning cellular homeostasis and adaptation. They facilitate energy storage through molecules like glycogen and triglycerides, which release energy during demand. In signaling, metabolites act as hormones or second messengers, exemplified by steroid hormones (e.g., cortisol) that regulate stress responses and cyclic AMP that propagates intracellular signals. Structural roles are evident in components like cellulose in plant cell walls and phospholipids in biomembranes, providing mechanical support and compartmentalization.[42] Additionally, metabolites exert regulatory effects via allosteric modulation of enzymes, such as ATP inhibiting glycolysis to prevent overproduction, thereby fine-tuning metabolic flux.Related Fields: Metabonomics and Exometabolomics
Metabonomics represents a systems biology approach to studying the global metabolic responses of living systems to pathophysiological stimuli, environmental changes, or genetic modifications, emphasizing the dynamic, time-related multiparametric changes in the metabolome through techniques like multivariate statistical analysis of nuclear magnetic resonance (NMR) spectroscopic data. Introduced by Nicholson and colleagues in 1999, this field highlights holistic metabolic perturbations and their integration with physiological processes, often overlapping with metabolomics but with a stronger focus on functional dynamics rather than comprehensive static profiling. Historically, the distinction between metabonomics and metabolomics arose from their origins: metabonomics was coined in the UK by the Nicholson group to describe broader, response-oriented metabolic studies, while metabolomics emerged in the US through Oliver Fiehn's 2001 work, centering on the systematic identification and quantification of all metabolites in a biological system to link genotypes to phenotypes. By the mid-2020s, however, the terms have become largely interchangeable in scientific literature, both encompassing the comprehensive analysis of small-molecule metabolites under various conditions, with methodological overlaps in technologies like mass spectrometry and NMR.[45] Exometabolomics, a specialized subfield also termed metabolic footprinting, examines extracellular metabolites secreted or excreted by organisms into their surrounding environment, such as culture media, biofilms, or waste streams, providing insights into intercellular interactions and resource exchange.[46] This approach is particularly valuable in microbial ecology for mapping community-level metabolism and in industrial applications like fermentation monitoring to detect contamination or optimize yields without disrupting the system.[47] A distinctive feature of exometabolomics is its facilitation of non-invasive sampling, exemplified by breathomics, which profiles volatile organic compounds in exhaled breath to noninvasively evaluate metabolic states in clinical settings.00163-2)Analytical Technologies
Sample Preparation and Separation Methods
Sample preparation in metabolomics begins with the collection and handling of diverse biological samples, including biofluids such as urine, plasma, blood, and saliva, as well as tissues and cells.[48] For biofluids, initial steps involve centrifugation to remove debris, typically at 4°C and 15,000×g for 10 minutes, followed by storage at -80°C to preserve metabolite integrity.[48] Tissues require homogenization, often using mechanical disruption like bead milling or freeze-thaw cycles, while cells are harvested via centrifugation or filtration to capture intracellular metabolites.[49] Quenching is essential to halt ongoing metabolic processes and prevent artifactual changes; common methods include rapid addition of cold methanol (e.g., -40°C to -48°C at 60% concentration) for polar metabolites or liquid nitrogen snap-freezing for tissues and cells, minimizing leakage and degradation.[49] These steps are critical as metabolite instability, driven by enzymatic activity, can lead to rapid turnover if not addressed promptly.[50] Extraction techniques isolate metabolites from complex matrices, with solvent-based methods predominating for their efficiency in recovering diverse compound classes. For polar and semi-polar metabolites in biofluids and cells, methanol or acetonitrile precipitation (e.g., 1:8 sample-to-solvent ratio) effectively removes proteins and yields high metabolite coverage, detecting up to 201 compounds in urine with coefficients of variation below 30%.[48] Lipid extraction often employs the chloroform-methanol mixture (2:1 ratio, known as the Folch method) or Bligh-Dyer variant (chloroform:methanol:water, 1:2:0.8), which partitions non-polar lipids into the organic phase while aqueous phases capture polars.[50] Multi-phase extractions, such as sequential solvent applications, provide orthogonal data by fractionating metabolites into polar, semi-polar, and non-polar groups, enhancing coverage in tissues where matrix effects from proteins and salts can suppress recovery.[51] Derivatization is particularly vital for gas chromatography-compatible analyses, converting non-volatile or polar compounds into volatile derivatives; trimethylsilylation with agents like MSTFA replaces active hydrogens on hydroxyl, carboxyl, amino, or thiol groups, enabling analysis of amino acids, sugars, and organic acids.[52] Challenges in extraction include metabolite instability during processing and matrix interferences, which can reduce reproducibility, necessitating optimized protocols tailored to sample type.[53] Separation methods fractionate extracted metabolites prior to downstream analysis, with chromatography techniques selected based on compound properties like volatility, polarity, and charge. Gas chromatography (GC) excels for volatile and semi-volatile metabolites, operating on partitioning principles where analytes distribute between a mobile gas phase and a non-polar stationary liquid film (e.g., RTX-5MS column); temperature gradients from 50°C to 330°C at 20°C/min resolve compounds like fatty acids, sterols, and derivatized sugars with peak widths of 2-3 seconds.[52] Liquid chromatography (LC) addresses polar and non-volatile metabolites; reversed-phase LC (RPLC) uses non-polar stationary phases (e.g., C18) to separate based on hydrophobicity, ideal for lipids and steroids, while hydrophilic interaction LC (HILIC) employs polar phases to retain charged species like amino acids and nucleosides via water-enriched layers.[51] Capillary electrophoresis (CE) separates charged molecules through electrophoretic mobility in an electric field within a fused-silica capillary, influenced by charge-to-size ratio and electroosmotic flow; it is suited for ionic metabolites such as organic acids, nucleotides, and phosphorylated sugars, often in positive mode (low pH) for cations or negative mode for anions.[54] Multidimensional approaches, combining RPLC and HILIC, can increase metabolite detection by up to 108% in plasma by orthogonally resolving diverse polarities.[51] These methods mitigate challenges like co-elution in complex samples, though metabolite instability and matrix effects remain hurdles requiring careful optimization.[53]Detection and Quantification Techniques
Mass spectrometry (MS) is a cornerstone technique in metabolomics for the detection and quantification of metabolites, offering high sensitivity and the ability to analyze complex mixtures by measuring the mass-to-charge ratio (m/z) of ions.[55] In electrospray ionization mass spectrometry (ESI-MS), liquid samples are nebulized into charged droplets, enabling soft ionization suitable for polar and ionic metabolites commonly found in biological samples; this method is often coupled with liquid chromatography (LC) for enhanced separation prior to detection.[56] Matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS), on the other hand, uses a laser to desorb and ionize analytes from a solid matrix, making it particularly effective for spatial metabolomics in tissue imaging where direct profiling without extensive sample preparation is needed.[57] Metabolite identification in MS relies on accurate m/z measurements, often achieved with high-resolution instruments like Orbitrap or time-of-flight (TOF) analyzers, while tandem MS (MS/MS) provides structural confirmation through fragmentation patterns generated by colliding selected precursor ions with a gas, producing diagnostic daughter ions for database matching.[55] Nuclear magnetic resonance (NMR) spectroscopy serves as a complementary, non-destructive detection method in metabolomics, excelling in structural elucidation without requiring ionization or derivatization.[58] It detects metabolites based on their chemical shifts, which arise from the interaction of atomic nuclei (typically hydrogen or carbon) with an external magnetic field, allowing identification of molecular skeletons and functional groups in aqueous samples.[58] Proton NMR (¹H-NMR) is the most routine approach for metabolomic profiling due to its high natural abundance and sensitivity for common metabolites like amino acids and sugars, providing quantitative data proportional to signal integrals under standardized conditions.[58] Other spectroscopic techniques, such as Fourier transform infrared (FTIR) and ultraviolet-visible (UV-Vis) spectroscopy, are applied for targeted detection of specific metabolite classes, though less comprehensively than MS or NMR. FTIR identifies vibrational modes of molecular bonds, enabling rapid, label-free profiling of carbohydrates, lipids, and proteins in crude extracts via characteristic absorption bands in the infrared spectrum.[59] UV-Vis detects chromophores in aromatic or conjugated metabolites, such as flavonoids and nucleotides, through absorbance at specific wavelengths, often used in hyphenated systems for initial screening.[59] Hybrid systems like LC-MS combine chromatographic separation with MS detection to achieve high resolution, resolving thousands of features in a single run and improving specificity for low-abundance metabolites in complex matrices.[60] Quantification in metabolomics distinguishes between relative abundance (e.g., peak area ratios) and absolute measurements, with the latter providing concentrations in molar units essential for pathway modeling.[61] Absolute quantification typically employs isotope dilution MS, where stable isotope-labeled standards (e.g., ¹³C or ²H analogs) are added to samples, compensating for matrix effects and ionization variability to yield accurate levels via the ratio of heavy to light ions.[62] Detection limits for MS-based methods reach the femtomole (fmol) range, with limits of detection (LODs) as low as 50 fmol for many metabolites under optimized conditions, enabling analysis of trace-level compounds in biofluids. As of 2025, ion mobility spectrometry (IMS) is integrated into MS workflows, adding a gas-phase separation dimension based on ion shape and size to enhance resolution of isobaric metabolites and improve quantification in high-throughput metabolomics; recent advancements include cyclic IMS for spatial metabolomics applications.[64]Data Analysis Methods
Statistical Approaches
In metabolomics, statistical approaches begin with preprocessing to ensure data quality and comparability across samples. Normalization adjusts for systematic variations such as differences in sample concentration or instrument response, with common methods including sum normalization, which scales intensities to a constant total ion current, and internal standard normalization using added reference compounds to correct for technical variability.[65] Imputation addresses missing values arising from detection limits or technical noise, often employing techniques like k-nearest neighbors or zero-filling, though caution is advised as imputation can introduce bias if missingness is not random.[66] Scaling methods, such as Pareto scaling (which divides variables by the square root of their standard deviation) or unit variance scaling (dividing by standard deviation), mitigate the dominance of high-abundance metabolites in downstream analyses.[65] Univariate statistical methods evaluate individual metabolites independently to identify significant differences between groups, facilitating feature selection. T-tests are applied for comparing two groups, assuming normality of the data distribution, while analysis of variance (ANOVA) extends this to multiple groups, testing for overall differences in means.[67] Given the high dimensionality of metabolomics datasets, where thousands of features are tested simultaneously, multiple testing corrections are essential; the Benjamini-Hochberg procedure controls the false discovery rate (FDR) by adjusting p-values to limit the proportion of false positives among significant results.[68] These methods assume independence among features and normality (or large sample sizes for central limit theorem applicability), though metabolomics data often violate these, necessitating robust variants or transformations.[67] Multivariate statistical approaches handle the correlative nature of metabolomics data by analyzing multiple variables simultaneously, enabling dimensionality reduction and pattern detection. Principal component analysis (PCA) is an unsupervised method that decomposes the centered data matrix into principal components, expressed as: where contains the scores (projections of samples onto principal components), the loadings (variable contributions to components), and the residual matrix.[69] PCA assumes linear relationships, independence of observations, and often sphericity for inference, though it does not strictly require normality for decomposition. Score plots visualize sample clustering in low-dimensional space, revealing outliers or group separations, while loading plots highlight influential metabolites driving variance.[69] Partial least squares discriminant analysis (PLS-DA), a supervised extension, maximizes covariance between and a class indicator matrix , enhancing class separation; its score plots show group discrimination, and loading plots identify discriminatory features, with validation via cross-validation to avoid overfitting.[69] These techniques provide interpretable visualizations but require careful preprocessing to meet assumptions of independence and linearity.[69]Machine Learning and Data Mining
Machine learning (ML) and data mining techniques have become essential for handling the high-dimensional, noisy datasets generated in metabolomics, enabling the discovery of patterns, biomarkers, and predictive models that statistical methods alone cannot efficiently uncover. Supervised ML approaches, in particular, excel at classification and regression tasks by training on labeled data to predict outcomes such as disease states or treatment responses. Unsupervised methods facilitate exploratory analysis by identifying inherent structures without prior labels, while data mining tools like network analysis reveal complex interactions among metabolites. These methods often build on preprocessing steps from statistical approaches, such as normalization, to mitigate variability in spectral data. Recent integrations with large databases further enhance annotation accuracy, though challenges like overfitting persist in high-dimensional spaces. Supervised ML algorithms, including random forests (RF) and support vector machines (SVM), are widely applied for biomarker prediction in metabolomics. RF, an ensemble method that constructs multiple decision trees and aggregates their predictions, is particularly robust to noisy data and handles multicollinearity common in metabolite profiles. SVM, effective for high-dimensional classification, separates data using hyperplanes and has demonstrated superior performance in diagnosing metabolic-associated steatohepatitis (MASH) by integrating metabolomic features, achieving areas under the curve (AUC) exceeding 0.90 in validation sets. Feature importance in RF is often assessed via the Gini index, which measures impurity reduction at each split, allowing prioritization of influential metabolites like amino acids in disease models. A 10-metabolite RF-based model, for instance, predicted gastric cancer with 90.5% sensitivity in external cohorts, underscoring RF's interpretability and generalizability.[70] Unsupervised ML techniques, such as clustering, group metabolomic profiles to reveal subgroups or subtypes without predefined outcomes. Hierarchical clustering builds a tree-like structure of similarities, often using Ward's linkage, and has demonstrated substantial compositional differences in microbial communities via metabolomics, enabling objective similarity assessments across samples; for example, it has proven effective for stratifying head and neck cancer patients based on pre-treatment blood metabolomes, where it identified high- and low-risk groups linked to survival outcomes.[71] Sparse variants of k-means further refine this by selecting key metabolites, improving clustering quality in untargeted assays. K-means clustering partitions data into k clusters by minimizing intra-cluster variance. Association rule mining, exemplified by the Apriori algorithm, uncovers co-occurring metabolite patterns by identifying frequent itemsets with support and confidence thresholds; in metabolomics-inspired studies, it has revealed disorder co-occurrence networks in traditional Chinese medicine datasets, adaptable to metabolite synergies in biological pathways. Data mining in metabolomics emphasizes network-based analyses to model metabolite interactions as graphs. Tools like Cytoscape, through plugins such as MetScape, visualize and interpret metabolomic data within human metabolic networks, integrating gene expression and compound levels to highlight pathway enrichments. These graphs reveal connectivity patterns, such as correlations between metabolites in mass spectrometry datasets, aiding in the inference of functional modules. Deep learning methods, including autoencoders, address spectral denoising by learning compressed representations to reconstruct clean signals from noisy inputs. Denoising autoencoders have reduced systematic errors in large-scale untargeted metabolomics data, such as in GC-MS workflows, lowering quality control sample variability by up to 50% and enhancing peak detection reliability; similar approaches apply to LC-MS data.[72] As of 2025, ML integration with databases like the Global Natural Products Social Molecular Networking (GNPS) platform has advanced spectral annotation through ensemble models that predict fragmentation patterns. The Ensemble Spectral Prediction (ESP) model, for example, combines neural networks to match MS/MS spectra against libraries, improving annotation accuracy for diverse metabolites in untargeted workflows.[73] However, high-dimensional metabolomics data—often featuring thousands of features per sample—exacerbates the curse of dimensionality, leading to data sparsity and increased overfitting risk in ML models. Techniques like dimensionality reduction (e.g., principal component analysis as preprocessing) and regularization mitigate this, but empirical Bayes methods are increasingly used to control false discovery rates in annotation pipelines. For supervised ML in dietary biomarker discovery, such as urinary indicators of kiwifruit intake under habitual diets, XGBoost—an ensemble method based on gradient-boosted decision trees—has achieved high accuracy (e.g., 88% with a 7-metabolite panel).[74]Applications
Biomedical and Clinical Uses
Metabolomics plays a pivotal role in biomedical and clinical applications by enabling the identification of disease-specific metabolic signatures, which facilitate early diagnosis, risk stratification, and personalized treatment strategies in human health. Through high-throughput analysis of metabolites in biofluids such as blood, urine, and cerebrospinal fluid, researchers can detect perturbations in metabolic pathways associated with various pathologies, leveraging techniques like nuclear magnetic resonance (NMR) spectroscopy and mass spectrometry for precise quantification.[75] This approach has advanced precision medicine by integrating metabolomic data with genomics and proteomics to uncover actionable insights into disease mechanisms.[76] In biomarker discovery, metabolomics has been instrumental in identifying metabolites indicative of cancer and cardiovascular diseases. For instance, NMR-based serum profiling reveals elevated lactate levels in patients with metastatic cancers, reflecting the Warburg effect where tumor cells favor aerobic glycolysis, allowing for non-invasive detection and monitoring of disease progression.[77] Similarly, elevated branched-chain amino acids (BCAAs) in plasma serve as biomarkers for cardiometabolic risks, including insulin resistance and heart failure, as their accumulation correlates with impaired catabolism and predicts adverse cardiovascular outcomes independent of traditional risk factors.[78] Pharmacometabolomics extends these principles to predict individual drug responses and toxicities, particularly for hepatotoxic agents like acetaminophen. Pre-dose urinary metabolomic profiles can forecast susceptibility to acetaminophen-induced liver injury by identifying baseline variations in endogenous metabolites, such as those linked to glutathione synthesis pathways; upon overdose, rapid glutathione depletion allows the toxic metabolite N-acetyl-p-benzoquinone imide (NAPQI) to bind cellular proteins, and pharmacometabolomic models have shown potential for early prediction of hepatotoxicity in clinical cohorts.[79][80] In clinical trials for precision medicine, metabolomics supports phenotyping to tailor interventions, as seen in Alzheimer's disease research where altered lipid profiles, including tau-associated phospholipids and sphingolipids, correlate with neurodegeneration and cognitive decline, enabling stratification of patients for targeted therapies like anti-tau agents.[76] FDA-cleared tandem mass spectrometry-based systems for newborn screening of inborn errors of metabolism, such as amino acid and acylcarnitine profiling, are standard diagnostics, detecting over 30 disorders with high sensitivity to initiate early interventions like dietary management.[81] Longitudinal metabolomic studies further elucidate microbiome-metabolite interactions in health and disease, revealing how gut-derived short-chain fatty acids influence systemic inflammation and progression in conditions like multiple sclerosis.Environmental, Agricultural, and Industrial Applications
Metabolomics has emerged as a vital tool for monitoring environmental pollution, particularly in aquatic ecosystems, by profiling metabolic changes in organisms exposed to contaminants such as endocrine disruptors. Mass spectrometry (MS)-based approaches, including untargeted metabolomics, enable the detection of subtle disruptions in metabolic pathways, such as alterations in amino acid and lipid metabolism in fish and invertebrates following exposure to pollutants like bisphenol A and phthalates. For instance, studies on marine mussels have revealed downregulation of energy-related metabolites under multi-contaminant stress, providing biomarkers for early pollution detection without prior knowledge of specific toxins.[82][83][84] In agriculture, metabolomics supports crop improvement by identifying metabolic responses to abiotic stresses, such as drought, which often involve the accumulation of protective secondary metabolites like flavonoids. In wheat and sorghum, drought stress triggers upregulation of phenylpropanoid pathways, leading to increased levels of flavonoids such as quercetin and kaempferol glycosides, which enhance antioxidant capacity and stress tolerance. Additionally, metabolic quantitative trait loci (mQTLs) mapping integrates metabolomics with genomics to pinpoint genetic variants associated with desirable traits; for example, in rice, over 2,800 mQTLs have been identified for 900 metabolites, facilitating marker-assisted breeding for resilient varieties.[85][86][87] Industrial applications of metabolomics, particularly exometabolomics, optimize bioprocesses by analyzing extracellular metabolites in fermentation media to improve biofuel production. In lignocellulosic biomass fermentation for ethanol, exometabolomics via liquid chromatography-MS identifies inhibitory compounds like furfural and acetic acid, allowing strain engineering to improve yields through enhanced detoxification pathways. In food industries, volatile metabolomics profiling using gas chromatography-MS assesses quality attributes; for wine, it distinguishes varietal aromas by quantifying terpenes and esters, correlating volatile profiles with sensory scores to refine production and authentication.[88][46][89] Advancements in the 2020s include field-deployable portable mass spectrometers, which enable real-time metabolomics analysis in agriculture, such as on-site profiling of plant volatiles for pest detection, reducing reliance on lab-based methods. These technologies contribute to sustainability by minimizing pesticide use; metabolomics-guided breeding has developed crop varieties with inherent resistance, potentially cutting chemical inputs while maintaining yields, as demonstrated in integrated pest management studies. Exometabolomics briefly informs industrial media optimization, linking extracellular profiles to process efficiency without delving into intracellular details.[90][91][92]Challenges and Future Directions
Current Limitations
One major technical limitation in metabolomics is the incomplete coverage of the metabolome, with typical studies detecting only about 1-5% (or less) of the estimated total metabolites due to the vast chemical diversity and the constraints of current analytical platforms.[93] This partial detection arises from challenges in extracting, separating, and identifying the full spectrum of polar, non-polar, and volatile compounds across biological samples. Additionally, standardization remains a significant hurdle, as variations in protocols across laboratories hinder data comparability; the Metabolomics Standards Initiative (MSI) has proposed guidelines for reporting experimental design, sample preparation, and data processing to address this, yet adherence is inconsistent.[94] Biologically, metabolite instability poses a key challenge, as many compounds degrade rapidly due to enzymatic activity or chemical reactivity during sample collection and processing, leading to altered profiles that do not reflect in vivo states.[95] Compartmentation further complicates analysis, with metabolites exhibiting tissue-specific concentration gradients and subcellular localization that are often lost in bulk tissue extractions, masking heterogeneous distributions within organs or cells.[96] The gut microbiome introduces additional confounders, as microbial metabolites can overlap with host-derived ones in biofluids like plasma or urine, influencing interpretations of systemic metabolic changes without integrated multi-omics approaches.[97] Ethical concerns in clinical metabolomics center on data privacy, particularly for patient-derived samples where metabolite profiles may inadvertently reveal sensitive health information, necessitating robust consent processes and secure data-sharing frameworks compliant with regulations like GDPR.[98] Reproducibility issues exacerbate these challenges, with a documented crisis in high-throughput studies stemming from practices like p-hacking—selective reporting of statistically significant results—which undermines biomarker validation across cohorts.[99] Statistical tools from data analysis methods can mitigate some variability, but they cannot fully resolve underlying experimental inconsistencies.[100] A quantitative barrier is the metabolome's vast dynamic range, spanning up to 12 orders of magnitude in concentration, which exceeds the typical 4-5 orders achievable by mass spectrometry instruments, often requiring hybrid targeted-untargeted strategies to capture both low- and high-abundance metabolites.[101]Emerging Trends and Integrations
One prominent emerging trend in metabolomics is the integration of metabolomic data with other omics layers, such as genomics and proteomics, to uncover complex biological relationships. Tools like MetaboAnalyst 6.0 facilitate this multi-omics analysis by providing unified platforms for processing targeted and untargeted data, enabling joint statistical modeling and pathway enrichment across datasets.[102] For instance, integrating single nucleotide polymorphisms (SNPs) from genomic data with metabolite profiles allows researchers to link genetic variants to metabolic perturbations, as demonstrated in studies using integrative pathway analysis to identify SNP-metabolite associations in disease contexts.[103] This approach has been applied to reveal how genetic variants activate latent metabolic pathways in microbial systems, enhancing understanding of phenotypic diversity.[104] Technological advancements are pushing metabolomics toward higher resolution at the cellular level, particularly with single-cell metabolomics enabled by techniques like nano-electrospray ionization mass spectrometry (nano-ESI MS). Developed in the 2020s, nano-ESI MS allows direct analysis of metabolites from individual cells without extensive sample preparation, achieving detection limits in the attomole range and revealing heterogeneity in cellular metabolism.[105] Complementary to this, artificial intelligence (AI) is transforming de novo structure prediction for unknown metabolites, with software like SIRIUS 4 integrating fragmentation tree analysis and machine learning-based scoring to annotate molecular formulas and structures from tandem mass spectra with over 70% accuracy in benchmark datasets.[106] These tools reduce reliance on reference databases, enabling identification of novel compounds in diverse biological samples.[107] By 2025, metabolomics is witnessing the rise of portable mass spectrometry devices, which enable on-site analysis of metabolites in field settings, such as environmental monitoring or clinical diagnostics, with compact systems achieving sufficient sensitivity for volatile organic compounds and small molecules.[108] Spatial metabolomics, combining mass spectrometry imaging with metabolomics workflows, is another key trend, allowing tissue mapping of metabolite distributions at micrometer resolution to study spatial heterogeneity in diseases like cancer.[109] Global consortia, including the NIH Metabolomics Consortium and initiatives like the Global Natural Products Social Molecular Networking (GNPS), are driving database expansion, with resources like the Human Metabolome Database (HMDB) surpassing 220,000 entries and spectral libraries growing to support broader annotation of untargeted data.[110][32] Looking ahead, fluxomics represents a future direction in metabolomics, focusing on dynamic pathway rates through isotope tracing methods like 13C-labeling. In 13C metabolic flux analysis (13C-MFA), steady-state labeling patterns from 13C tracers are used to quantify intracellular fluxes, with flux rates (v) estimated from measurements of metabolite pool sizes (M), isotopic enrichment patterns, and extracellular rates by fitting comprehensive mathematical models of labeling dynamics, under steady-state or non-stationary assumptions.[111] This approach, advanced by tools for non-stationary labeling experiments, provides quantitative insights into metabolic network activity beyond static snapshots.[112]References
- https://pubmed.ncbi.nlm.nih.gov/18694716/
