CHARMM
| CHARMM | |
|---|---|
| Developers | Martin Karplus, Accelrys |
| Initial release | 1983 |
| Stable release | c47b1 / 2022[1] |
| Preview release | c48a1 / 2022[1] |
| Written in | FORTRAN 77-95, CUDA |
| Operating system | Unix-like: Linux, macOS, AIX, iOS[2] |
| Platform | x86, ARM, Nvidia GPU; Cray XT4, XT5[2] |
| Available in | English |
| Type | Molecular dynamics |
| License | Proprietary |
| Website | www |
Chemistry at Harvard Macromolecular Mechanics (CHARMM) is the name of a widely used set of force fields for molecular dynamics, and the name for the molecular dynamics simulation and analysis computer software package associated with them.[3][4][5] The CHARMM Development Project involves a worldwide network of developers working with Martin Karplus and his group at Harvard to develop and maintain the CHARMM program. Licenses for this software are available, for a fee, to people and groups working in academia.
Force fields
The CHARMM force fields for proteins include: united-atom (sometimes termed extended atom) CHARMM19,[6] all-atom CHARMM22[7] and its dihedral potential corrected variant CHARMM22/CMAP, as well as later versions CHARMM27 and CHARMM36 and various modifications such as CHARMM36m and CHARMM36IDPSFF.[8] In the CHARMM22 protein force field, the atomic partial charges were derived from quantum chemical calculations of the interactions between model compounds and water. Furthermore, CHARMM22 is parametrized for the TIP3P explicit water model. Nevertheless, it is often used with implicit solvents. In 2006, a special version of CHARMM22/CMAP was reparametrized for consistent use with implicit solvent GBSW.[9]
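The CHARMM22 force field has the following potential energy function:[7][10]
$$
V = \sum_{\text{bonds}} k_b (b - b_0)^2 + \sum_{\text{angles}} k_\theta (\theta - \theta_0)^2 + \sum_{\text{dihedrals}} k_\phi \left[1 + \cos(n\phi - \delta)\right] + \sum_{\text{impropers}} k_\omega (\omega - \omega_0)^2 + \sum_{\text{Urey-Bradley}} k_u (u - u_0)^2 + \sum_{\text{nonbonded}} \left( \epsilon_{ij} \left[ \left(\frac{R_{\min,ij}}{r_{ij}}\right)^{12} - 2 \left(\frac{R_{\min,ij}}{r_{ij}}\right)^{6} \right] + \frac{q_i q_j}{\epsilon_r r_{ij}} \right)
$$
The bond, angle, dihedral, and nonbonded terms are similar to those found in other force fields such as AMBER. The CHARMM force field also includes an improper term accounting for out-of-plane bending (which applies to any set of four atoms that are not successively bonded), where $k_\omega$ is the force constant and $\omega - \omega_0$ is the out-of-plane angle. The Urey-Bradley term is a cross-term that accounts for 1,3 nonbonded interactions not accounted for by the bond and angle terms; $k_u$ is the force constant and $u$ is the distance between the 1,3 atoms.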
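As a rough illustration of how the individual terms of such an energy function are evaluated, the following Python sketch computes a harmonic term, a periodic dihedral term, and the nonbonded energy of one atom pair. The function names, the example values, and the Coulomb conversion constant (chosen so that charges in units of e and distances in ångströms give kcal/mol) are assumptions made for this example and are not taken from a CHARMM parameter file.

```python
import math

# Assumed unit-conversion factor (kcal*angstrom/(mol*e^2)); illustrative only.
COULOMB_KCAL = 332.0716


def harmonic(k, x, x0):
    """Harmonic term k * (x - x0)**2, the form shared by the bond, angle,
    improper and Urey-Bradley sums (note: no factor of 1/2)."""
    return k * (x - x0) ** 2


def dihedral(k_phi, n, phi, delta):
    """Periodic dihedral term k_phi * [1 + cos(n*phi - delta)], angles in radians."""
    return k_phi * (1.0 + math.cos(n * phi - delta))


def nonbonded_pair(eps_ij, rmin_ij, q_i, q_j, r_ij, eps_r=1.0):
    """Lennard-Jones plus Coulomb energy for a single atom pair.

    eps_ij is the (positive) well depth and rmin_ij the distance of the
    Lennard-Jones minimum; eps_r is the relative dielectric constant.
    """
    ratio6 = (rmin_ij / r_ij) ** 6
    lennard_jones = eps_ij * (ratio6 ** 2 - 2.0 * ratio6)
    coulomb = COULOMB_KCAL * q_i * q_j / (eps_r * r_ij)
    return lennard_jones + coulomb


# Hypothetical example values, not real force field parameters.
bond_energy = harmonic(k=300.0, x=1.1, x0=1.0)
pair_energy = nonbonded_pair(eps_ij=0.1, rmin_ij=3.5, q_i=0.0, q_j=0.0, r_ij=3.5)
```

With zero charges and a separation equal to the minimum distance, the pair energy equals minus the well depth, which is a quick sanity check on the Lennard-Jones form.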
For DNA, RNA, and lipids, CHARMM27[11] is used. Some force fields may be combined, for example CHARMM22 and CHARMM27 for the simulation of protein-DNA binding. Also, parameters for NAD+, sugars, fluorinated compounds, etc., may be downloaded. These force field version numbers refer to the CHARMM version where they first appeared, but may of course be used with subsequent versions of the CHARMM executable program. Likewise, these force fields may be used within other molecular dynamics programs that support them.
In 2009, a general force field for drug-like molecules (CGenFF) was introduced. It "covers a wide range of chemical groups present in biomolecules and drug-like molecules, including a large number of heterocyclic scaffolds".[12] The general force field is designed to cover any combination of chemical groups. This inevitably comes with a decrease in accuracy for representing any particular subclass of molecules. Users are repeatedly warned on MacKerell's website not to use the CGenFF parameters for molecules for which specialized force fields already exist (as mentioned above for proteins, nucleic acids, etc.).
CHARMM also includes polarizable force fields using two approaches. One is based on the fluctuating charge (FQ) model, also termed Charge Equilibration (CHEQ).[13][14] The other is based on the Drude shell or dispersion oscillator model.[15][16]
Parameters for all of these force fields may be downloaded from the MacKerell website for free.[17]
Molecular dynamics program
The CHARMM program allows for generating and analysing a wide range of molecular simulations. The most basic kinds of simulation are minimizing a given structure and production runs of a molecular dynamics trajectory. More advanced features include free energy perturbation (FEP), quasi-harmonic entropy estimation, correlation analysis, and combined quantum mechanics–molecular mechanics (QM/MM) methods.
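To make the two basic operations concrete, the sketch below first minimizes and then propagates a single particle in a one-dimensional harmonic well. It is a toy illustration in Python, not CHARMM's implementation: the fixed-step steepest-descent scheme, the leap-frog integrator, and all parameter values are assumptions chosen for the example.

```python
def force(x, k=1.0, x0=0.0):
    """Force from the harmonic potential U = 0.5 * k * (x - x0)**2."""
    return -k * (x - x0)


def steepest_descent(x, step=0.1, n_steps=200):
    """Energy minimization: move along the force (negative gradient) with a fixed step."""
    for _ in range(n_steps):
        x = x + step * force(x)
    return x


def leapfrog(x, v, dt=0.01, n_steps=1000, mass=1.0):
    """Leap-frog dynamics: velocities are kept at half steps, positions at full steps."""
    v_half = v + 0.5 * dt * force(x) / mass      # v(t + dt/2)
    trajectory = [x]
    for _ in range(n_steps):
        x = x + dt * v_half                      # x(t + dt)
        v_half = v_half + dt * force(x) / mass   # v(t + 3*dt/2)
        trajectory.append(x)
    return trajectory


x_min = steepest_descent(2.0)      # relaxes toward the minimum at x0 = 0
traj = leapfrog(x=1.0, v=0.0)      # short trajectory; oscillates around x0
```

In a real workflow the minimized structure would be the starting point of the dynamics run; the two calls are kept independent here so that each routine can be checked on its own.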
CHARMM is one of the oldest programs for molecular dynamics. It has accumulated many features, some of which are duplicated under several keywords with slight variants. This is an inevitable result of the many outlooks and groups working on CHARMM worldwide. The changelog file, and CHARMM's source code, are good places to look for the names and affiliations of the main developers. The involvement and coordination by Charles L. Brooks III's group at the University of Michigan is salient.
Software history
Around 1969, there was considerable interest in developing potential energy functions for small molecules. CHARMM originated in Martin Karplus's group at Harvard. Karplus and his then graduate student Bruce Gelin decided the time was ripe to develop a program that would make it possible to take a given amino acid sequence and a set of coordinates (e.g., from the X-ray structure) and to use this information to calculate the energy of the system as a function of the atomic positions. Karplus has acknowledged the importance of major inputs in the development of the (at the time nameless) program, including:
- Shneior Lifson's group at the Weizmann Institute, especially Arieh Warshel, who went to Harvard and brought his consistent force field (CFF) program with him
- Harold Scheraga's group at Cornell University
- Awareness of Michael Levitt's pioneering energy calculations for proteins
In 1983, a paper finally appeared and CHARMM made its public début. Gelin's program had by then been considerably restructured. For the publication, Bob Bruccoleri came up with the name HARMM (HARvard Macromolecular Mechanics), but it seemed inappropriate. So they added a C for Chemistry. Karplus said: "I sometimes wonder if Bruccoleri's original suggestion would have served as a useful warning to inexperienced scientists working with the program."[18] CHARMM has continued to grow: version c40b2 of the executable program was released in 2015, and the stable release listed in the infobox above is c47b1 (2022).
Running CHARMM under Unix-Linux
The general syntax for using the program is:
charmm -i filename.inp -o filename.out
- charmm – The name of the program (or script which runs the program) on the computer system being used.
- filename.inp – A text file which contains the CHARMM commands. It starts by loading the molecular topologies (top) and force field (par). Then one loads the molecular structures' Cartesian coordinates (e.g. from PDB files). One can then modify the molecules (adding hydrogens, changing secondary structure). The calculation section can include energy minimization, dynamics production, and analysis tools such as motion and energy correlations.
- filename.out – The log file for the CHARMM run, containing echoed commands, and various amounts of command output. The output print level may be increased or decreased in general, and procedures such as minimization and dynamics have printout frequency specifications. The values for temperature, energy, pressure, etc. are output at that frequency.
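When many runs have to be launched, the command line above can be assembled from a script. The following Python sketch is one way to do that, under stated assumptions: the helper names `charmm_command` and `run_charmm` are hypothetical, and the executable name `charmm` may differ on a given system, as noted in the list above. Only the `-i` and `-o` options shown in the syntax line are used.

```python
import shlex
import subprocess


def charmm_command(input_file, output_file, executable="charmm"):
    """Build the argument list for: charmm -i <input> -o <output>."""
    return [executable, "-i", input_file, "-o", output_file]


def run_charmm(input_file, output_file, executable="charmm"):
    """Run the program and raise CalledProcessError on a non-zero exit status.

    Requires the executable to be installed and on the PATH.
    """
    cmd = charmm_command(input_file, output_file, executable)
    return subprocess.run(cmd, check=True)


cmd = charmm_command("filename.inp", "filename.out")
print(shlex.join(cmd))  # charmm -i filename.inp -o filename.out
```

Passing the arguments as a list rather than a single string avoids shell quoting problems when file names contain spaces.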
Volunteer computing
Docking@Home, hosted by the University of Delaware, was one of the projects built on BOINC, an open-source platform for distributed computing. It used CHARMM to analyze the atomic details of protein-ligand interactions through molecular dynamics (MD) simulations and minimizations.
World Community Grid, sponsored by IBM, ran a project named The Clean Energy Project[19] which also used CHARMM in its first phase, which has been completed.
References
1. "Versions - CHARMM". CHARMM (Chemistry at HARvard Macromolecular Mechanics). Harvard University. Retrieved 2021-03-29.
2. "Installation". CHARMM (Chemistry at HARvard Macromolecular Mechanics). Harvard University. 2016. Retrieved 2021-03-29.
3. Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S, Karplus M (1983). "CHARMM: A program for macromolecular energy, minimization, and dynamics calculations". J. Comput. Chem. 4 (2): 187–217. doi:10.1002/jcc.540040211. S2CID 91559650.
4. MacKerell, A.D. Jr.; Brooks, B.; Brooks, C. L. III; Nilsson, L.; Roux, B.; Won, Y.; Karplus, M. (1998). "CHARMM: The Energy Function and Its Parameterization with an Overview of the Program". In Schleyer, P.v.R.; et al. (eds.). The Encyclopedia of Computational Chemistry. Vol. 1. Chichester: John Wiley & Sons. pp. 271–277.
5. Brooks BR, Brooks CL 3rd, Mackerell AD Jr, Nilsson L, Petrella RJ, Roux B, Won Y, Archontis G, Bartels C, Boresch S, Caflisch A, Caves L, Cui Q, Dinner AR, Feig M, Fischer S, Gao J, Hodoscek M, Im W, Kuczera K, Lazaridis T, Ma J, Ovchinnikov V, Paci E, Pastor RW, Post CB, Pu JZ, Schaefer M, Tidor B, Venable RM, Woodcock HL, Wu X, Yang W, York DM, Karplus M (29 July 2009). "CHARMM: The biomolecular simulation program". Journal of Computational Chemistry. 30 (10): 1545–1614. doi:10.1002/jcc.21287. PMC 2810661. PMID 19444816.
6. Reiher, III WH (1985). Theoretical studies of hydrogen bonding (Thesis). Harvard University.
7. MacKerell AD Jr; et al. (1998). "All-atom empirical potential for molecular modeling and dynamics studies of proteins". J Phys Chem B. 102 (18): 3586–3616. doi:10.1021/jp973084f. PMID 24889800.
8. MacKerell AD Jr, Feig M, Brooks III CL (2004). "Extending the treatment of backbone energetics in protein force fields: limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations". J Comput Chem. 25 (11): 1400–1415. doi:10.1002/jcc.20065. PMID 15185334. S2CID 11076418.
9. Brooks CL, Chen J, Im W (2006). "Balancing solvation and intramolecular interactions: toward a consistent generalized born force field (CMAP opt. for GBSW)". J Am Chem Soc. 128 (11): 3728–3736. doi:10.1021/ja057216r. PMC 2596729. PMID 16536547.
10. Vanommeslaeghe, K.; MacKerell, A. D. (May 2015). "CHARMM additive and polarizable force fields for biophysics and computer-aided drug design". Biochimica et Biophysica Acta (BBA) - General Subjects. 1850 (5): 861–871. doi:10.1016/j.bbagen.2014.08.004. ISSN 0006-3002. PMC 4334745. PMID 25149274.
11. MacKerell AD Jr, Banavali N, Foloppe N (2001). "Development and current status of the CHARMM force field for nucleic acids". Biopolymers. 56 (4): 257–265. doi:10.1002/1097-0282(2000)56:4<257::AID-BIP10029>3.0.CO;2-W. PMID 11754339. S2CID 19502363.
12. Vanommeslaeghe K, Hatcher E, Acharya C, Kundu S, Zhong S, Shim J, Darian E, Guvench O, Lopes P, Vorobyov I, Mackerell AD Jr (2009). "CHARMM general force field: A force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields". J Comput Chem. 31 (4): 671–90. doi:10.1002/jcc.21367. PMC 2888302. PMID 19575467.
13. Patel S, Brooks CL 3rd (2004). "CHARMM fluctuating charge force field for proteins: I parameterization and application to bulk organic liquid simulations". J Comput Chem. 25 (1): 1–15. doi:10.1002/jcc.10355. PMID 14634989. S2CID 39320318.
14. Patel S, Mackerell AD Jr, Brooks CL 3rd (2004). "CHARMM fluctuating charge force field for proteins: II protein/solvent properties from molecular dynamics simulations using a nonadditive electrostatic model". J Comput Chem. 25 (12): 1504–1514. doi:10.1002/jcc.20077. PMID 15224394. S2CID 16741310.
15. Lamoureux G, Roux B (2003). "Modeling induced polarization with classical Drude oscillators: Theory and molecular dynamics simulation algorithm". J Chem Phys. 119 (6): 3025–3039. Bibcode:2003JChPh.119.3025L. doi:10.1063/1.1589749.
16. Lamoureux G, Harder E, Vorobyov IV, Roux B, MacKerell AD (2006). "A polarizable model of water for molecular dynamics simulations of biomolecules". Chem Phys Lett. 418 (1–3): 245–249. Bibcode:2006CPL...418..245L. doi:10.1016/j.cplett.2005.10.135.
17. Mackerell website
18. Karplus M (2006). "Spinach on the ceiling: a theoretical chemist's return to biology". Annu Rev Biophys Biomol Struct. 35 (1): 1–47. doi:10.1146/annurev.biophys.33.110502.133350. PMID 16689626.
19. The Clean Energy Project
External links
- Official website, with documentation and helpful discussion forums
- Official website, BIOVIA
- CHARMM tutorial; Archived 2010-10-05 at the Wayback Machine
- MacKerell website, hosts package of force field parameters for CHARMM
- C.Brooks website
- CHARMM page at Harvard
- Roux website; Archived 2006-10-12 at the Wayback Machine
- Bernard R. Brooks Group website
- Docking@Home
- CHARMM-GUI project
- CHARMMing (CHARMM Interface and Graphics); Archived 2008-08-20 at the Wayback Machine
- CHARMM Tutorial
CHARMM
Overview
Definition and Purpose
CHARMM, an acronym for Chemistry at HARvard Macromolecular Mechanics, is a versatile molecular simulation program designed for modeling biomolecular systems, including proteins, nucleic acids, lipids, and carbohydrates, using classical mechanics approaches.[3][1] It enables detailed investigations into the structure, dynamics, and interactions of these systems at atomic resolution, supporting applications in computational biophysics and structural biology.[1]
The core of CHARMM consists of empirical force fields that define potential energy functions for biomolecular interactions and a computational program that implements algorithms for energy minimization, molecular dynamics (MD) simulations, and free energy perturbation calculations.[1] These components allow users to perform energy evaluations and manipulations essential for simulating conformational changes, ligand binding, and thermodynamic properties in complex macromolecular environments.[1]
As one of the first comprehensive biomolecular simulation packages, CHARMM has facilitated pioneering studies of biomolecular behavior since its inception, providing a foundational tool for atomic-level modeling that integrates empirical potentials with advanced simulation techniques.[1]
The general form of the CHARMM potential energy function, $V$, captures these interactions through additive terms:
$$
V = \sum_{\text{bonds}} k_b (b - b_0)^2 + \sum_{\text{angles}} k_\theta (\theta - \theta_0)^2 + \sum_{\text{dihedrals}} k_\phi \left[1 + \cos(n\phi - \delta)\right] + \sum_{\text{nonbonded}} \left( \frac{q_i q_j}{\epsilon_r r_{ij}} + \frac{A_{ij}}{r_{ij}^{12}} - \frac{B_{ij}}{r_{ij}^{6}} \right)
$$
Here, the first three sums represent bonded interactions: harmonic potentials for bond lengths ($b$, equilibrium $b_0$, force constant $k_b$), bond angles ($\theta$, equilibrium $\theta_0$, $k_\theta$), and periodic dihedral angles ($\phi$, multiplicity $n$, phase $\delta$, $k_\phi$). The nonbonded sum includes Coulombic electrostatics ($q_i$, $q_j$ charges, distance $r_{ij}$, relative dielectric constant $\epsilon_r$) and Lennard-Jones van der Waals terms ($A_{ij}$ repulsive, $B_{ij}$ attractive parameters).[7]
Licensing and Availability
CHARMM is a proprietary molecular simulation program originally developed at Harvard University and commercially licensed through BIOVIA (formerly Accelrys). The academic version, known as CHARMM, became freely available to academic, government, and non-profit users starting in 2022, distributed via the official site academiccharmm.org without any licensing fees for eligible institutions.[3][4][8] In contrast, for-profit entities must acquire commercial licenses for the CHARMm variant directly from BIOVIA, ensuring controlled access to its full capabilities in industrial applications.[5][9] The software remains proprietary overall, with no open-source release, though the academic distribution includes comprehensive access to its features for non-commercial research.[10]
Academic users gain access by registering at brooks.chem.lsa.umich.edu/register, after which they can download the complete release package containing source code, documentation, test cases, topology and parameter files, and pre-built binaries for select platforms.[9][11] Commercial access involves contacting BIOVIA for tailored licensing agreements, often integrated into broader software suites like Discovery Studio.
Building from source requires a Fortran 95-compliant compiler, such as GCC gfortran (version 4.4 or later, excluding 4.5.1), Intel ifort (11.1 or later), or PGI pgf95 (11.1 or later), along with MPI and OpenMP for parallel execution.[11] The package unpacks into a directory like ~/c50b1 for version c50b1, with installation handled via configure scripts and make commands.[11]
CHARMM primarily supports Unix/Linux environments, with confirmed compatibility for platforms including em64t, gnu Linux, osx (macOS), and GPU-accelerated systems via interfaces like DOMDEC-GPU and OpenMM.[9][11] Binaries are available for macOS and certain Linux distributions, while Windows users typically compile from source or use compatibility tools like Windows Subsystem for Linux, as native binaries are not standard.[11][12]
Versions follow a cXX naming convention, such as c48a1 in 2022 or the current c50b1 as of 2025, with major releases occurring annually; detailed changelogs outlining enhancements and fixes are hosted on the documentation site.[13][14]
Community resources at academiccharmm.org include extensive documentation covering installation, usage, and advanced features, along with tutorials for setup on various platforms.[11] User support is facilitated through dedicated forums at forums-academiccharmm.org, where researchers discuss installation issues, share best practices, and access developer guides.[15] This infrastructure, enhanced by the 2022 shift to free academic access, has broadened CHARMM's reach within the scientific community.[4]
History
Origins and Early Development
CHARMM, or Chemistry at HARvard Macromolecular Mechanics, was initiated by Martin Karplus in the early 1970s at Harvard University as a computational tool initially designed for simulating protein structures and dynamics. The program's inception stemmed from Karplus's visit to Shneior Lifson's group at the Weizmann Institute in 1969, where there was growing interest in developing empirical potential energy functions to model the conformations of small molecules and extend these approaches to larger biomolecules. At the time, quantum mechanical calculations were computationally prohibitive for systems as complex as proteins, necessitating the use of classical empirical potentials to approximate intramolecular interactions and enable studies of structural perturbations, such as those induced by ligand binding in hemoglobin.[16]
Early development of CHARMM was driven by the need to bridge the gap between static X-ray crystallography data and dynamic behavior in biological macromolecules, with initial efforts focusing on energy minimization and normal mode analysis for proteins. Key collaborators included graduate students Bruce Gelin, who contributed significantly to the program's coding and implementation, and J. Andrew McCammon, who helped pioneer its application to molecular dynamics. What began as ad hoc scripts for specific calculations evolved into a more structured software package, emphasizing modular design for handling atomic coordinates, force field parameters, and simulation algorithms. The initial scope was narrow, targeting proteins using simple empirical force fields that parameterized bonded and non-bonded interactions based on available experimental data.[16][17]
The program's first major milestone came in 1977 with the publication of the inaugural molecular dynamics simulation of a protein, the bovine pancreatic trypsin inhibitor (BPTI), conducted using an early version of CHARMM. This simulation, spanning just 9.2 picoseconds, demonstrated the feasibility of capturing atomic fluctuations in a vacuum environment and revealed dynamic elements like hydrogen bonding networks that were invisible in static structures. Running on mainframe computers such as the IBM System/370, these early computations were severely limited by hardware constraints, including slow processing speeds and modest memory, restricting simulations to short timescales and small systems of a few hundred atoms. CHARMM remained an in-house tool at Harvard for research purposes until its public debut in 1983 as version c19, marking the transition to a distributable package for broader scientific use.[17]
Key Milestones and Versions
The development of CHARMM began in the 1970s, with the first formal releases occurring in the 1980s under versions c20 through c25, which introduced core capabilities for energy minimization and molecular dynamics simulations of proteins, nucleic acids, and crystalline solids.[1] These early versions, such as c20, laid the foundation for biomolecular modeling by supporting isolated molecules, solutions, and solids, with initial force fields like PARAM19 providing polar hydrogen representations for proteins and nucleic acids.[1]
In the 1990s, CHARMM advanced through versions c26 to c30, incorporating lipid parameters to enable simulations of membrane systems and enhancing nucleic acid support with the CHARMM27 force field, which improved accuracy for DNA and RNA structures.[1] Key releases included c26 in 1998 and c27 in 2000, alongside the introduction of targeted molecular dynamics in 1993 for studying conformational transitions.[1][14]
The 2000s saw versions c31 to c36, marked by the addition of cross-term map (CMAP) corrections in 2004 via c30a1 to better capture protein backbone dihedral interactions, significantly enhancing simulation fidelity for folded states.[1] This period also initiated a shift toward polarizable force fields, with the Drude oscillator model prototyped by 2007 in c34b1 for inducible dipoles in biomolecules, and support for systems scaling to 10^10 atoms in c31b1 by 2003.[1] Lipid force fields were refined in 2005, building on 1990s parameters for phospholipid bilayers.[1]
During the 2010s, versions progressed from c37 to c41, with CHARMM36 released in 2012 featuring optimized CMAP terms for proteins, lipids, and nucleic acids, improving agreement with NMR data and membrane properties.[4] Polarizable models expanded with Drude-2013 for proteins, and academic licensing began broadening access.[4] In 2013, Martin Karplus received the Nobel Prize in Chemistry, shared with Michael Levitt and Arieh Warshel, for multiscale modeling techniques that underpinned CHARMM's foundational simulations of chemical reactions in proteins.[18]
The 2020s brought versions c42 to c50, including developmental builds up to c50a1 in 2024 and releases like c49b1, integrating GPU acceleration through the CHARMM/OpenMM API introduced in c37b1 and advanced with domain decomposition in 2014 for faster molecular dynamics.[14][4] CHARMM became freely available for academic and non-profit use starting in August 2022, expanding accessibility via platforms like academiccharmm.org.[8] Polarizable force fields continued evolving, with Drude-2023 for lipids and bilayers.[4] Martin Karplus, the longtime leader of CHARMM development, passed away on December 28, 2024.[19] As of November 2025, CHARMM has received minor patches for compatibility with emerging hardware like advanced GPUs, without a major force field overhaul, maintaining stability across c50 series builds.[14]
Force Fields
Additive Force Fields
The additive force fields in CHARMM represent the standard non-polarizable models, utilizing fixed atomic partial charges and Lennard-Jones parameters to describe electrostatic and van der Waals interactions, respectively, without accounting for inducible polarization effects.[20] These force fields form the core of CHARMM's empirical potential energy function, enabling efficient simulations of biomolecular systems by balancing computational cost with accuracy in reproducing structural and thermodynamic properties.[20]
For proteins, the CHARMM22 force field, published in 1998, marked a significant advancement in all-atom modeling, with the subsequent addition of the cross-term map (CMAP) correction in 2004 to better capture backbone dihedral energetics and improve secondary structure stability, such as alpha-helices and beta-sheets.[21] Building on this, the CHARMM36m force field, introduced in 2017, refines protein parameters through targeted adjustments to dihedral and non-bonded terms, enhancing performance for both folded domains and intrinsically disordered regions by achieving closer agreement with experimental NMR chemical shifts, residual dipolar couplings, and small-angle X-ray scattering profiles.
Nucleic acid simulations rely on the CHARMM27 force field, released in 2000, which provides optimized parameters for DNA and RNA, including glycosidic torsion potentials that stabilize helical conformations and base stacking interactions. For lipids, the CHARMM36 force field, published in 2010, incorporates refined aliphatic chain parameters and headgroup interactions to accurately reproduce phase transition temperatures, bilayer thicknesses, and area per lipid in simulations of phosphatidylcholine and other membrane lipids.
The CHARMM General Force Field (CGenFF), introduced in 2009, extends the additive framework to drug-like small molecules and organic ligands, covering a broad range of functional groups compatible with biomolecular parameters, and supports automated parameterization through the CGenFF server for rapid topology generation.[22] The update, CGenFF version 5.0 (published 2025), expands the training set by adding 1,390 new molecules to the previous approximately 930, resulting in over 2,300 molecules total, improving charge assignment and bonded terms for better prediction of intramolecular geometries and non-covalent binding affinities.[23][24]
Validation of these additive force fields emphasizes quantitative comparisons with experimental data, including NMR-derived order parameters and J-couplings for proteins, X-ray diffraction-derived densities for lipid bilayers, and thermodynamic quantities like free energies of solvation for small molecules, where CHARMM36m and CGenFF achieve root-mean-square deviations of approximately 2 kcal/mol for solvation free energies and similar accuracy for other key observables. Early limitations in monovalent ion parameters, such as overestimation of Na⁺ hydration free energies, have been mitigated in updates through quantum mechanical refinements and experimental calibration against osmotic pressures and ion-DNA binding constants.[20]
Polarizable Force Fields
CHARMM incorporates polarizable force fields to account for induced electronic polarization, which allows for more accurate modeling of environmental effects on molecular interactions compared to fixed-charge additive models. These force fields dynamically adjust electrostatic properties in response to the local electric field, improving simulations of complex systems such as biomolecular interfaces and ionic environments.[25]
The primary polarizable model in CHARMM is the Drude oscillator approach, where atomic polarizability is represented by attaching a charged "Drude particle" to each non-hydrogen atom via a virtual harmonic spring; this particle is displaced in response to external electric fields, mimicking the displacement of electron clouds. The force field includes additional terms for induced dipole interactions between these oscillators, screened using Thole's damping to prevent polarization catastrophe. The polarization energy contribution combines a harmonic spring term for each Drude particle, determined by the spring constant and by the Drude particle's position relative to its equilibrium distance, with electrostatic interactions among all charges, including the induced Drude charges.[25][7]
An alternative polarizable model in CHARMM is the fluctuating charge (FQ) approach, which allows partial atomic charges to vary dynamically based on electronegativity equalization principles, enabling charge transfer and polarization effects without additional particles. This model derives from density functional theory-inspired charge responses and has been parameterized for proteins and organic liquids.[26][27]
Key implementations include the Drude-2013 force field, developed for proteins and water models like SWM4-NDP, which explicitly treats polarizability for amino acids and nucleic acids. Extensions to lipids emerged in the 2020s, with Drude polarizable parameters for phospholipids like DPPC, enabling simulations of biomembranes with explicit long-range electrostatics.
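The way a harmonic spring gives rise to polarizability can be illustrated with a single isolated oscillator. The Python sketch below is a simplified, assumed setting: one Drude particle, a uniform field along one axis, no Thole damping and no other charges. In that setting the spring force balances the field force, which yields an effective polarizability of q_D²/k_D. The function name and the numerical values are hypothetical, and units are arbitrary but consistent.

```python
def drude_response(q_d, k_d, field):
    """Equilibrium response of one Drude oscillator to a uniform field.

    q_d   : charge carried by the Drude particle
    k_d   : spring constant of the harmonic spring
    field : external electric field strength along the chosen axis
    """
    displacement = q_d * field / k_d              # k_d * d = q_d * E at equilibrium
    alpha = q_d ** 2 / k_d                        # effective polarizability
    dipole = q_d * displacement                   # induced dipole, equals alpha * E
    spring_energy = 0.5 * k_d * displacement ** 2
    field_energy = -q_d * field * displacement
    return {
        "displacement": displacement,
        "alpha": alpha,
        "dipole": dipole,
        "energy": spring_energy + field_energy,   # equals -0.5 * alpha * E**2
    }


# Hypothetical values chosen only to exercise the formulas.
resp = drude_response(q_d=-1.0, k_d=500.0, field=2.0)
```

The induced dipole does not depend on the sign of the Drude charge, since it scales with the square of q_D; only the direction of the displacement changes.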
These polarizable models incur approximately 2-3 times the computational cost of additive force fields due to the extra degrees of freedom and extended electrostatic calculations.[28][29][30][31]
Polarizable force fields in CHARMM offer advantages in capturing electronic effects at protein-ion interfaces, lipid-water boundaries, and even in excited states through QM/MM integrations, providing superior accuracy over additive models in these regimes. Validation studies demonstrate close agreement with quantum mechanical calculations for dipole moments, solvation free energies, and interaction energies, such as ion-protein binding affinities and dielectric responses.[32][33][29][34]
Parameterization and Validation
Parameter derivation in CHARMM force fields primarily relies on quantum mechanical (QM) calculations to determine bonded parameters such as bond and angle force constants, which are fitted to potential energy surfaces obtained from high-level ab initio methods like MP2/6-31G(d).[35] These QM targets ensure accurate representation of intramolecular interactions, with geometries optimized and vibrational frequencies scaled to match experimental spectra where available. Empirical fitting complements this by adjusting nonbonded parameters, such as Lennard-Jones terms, to reproduce experimental observables including liquid densities and heats of vaporization from pure solvent simulations.[35] For example, in the development of the CHARMM General Force Field (CGenFF), partial charges are derived from QM electrostatic potentials and refined against experimental thermodynamic data to enhance compatibility with biomolecular simulations.[35]
Tools like FFParam facilitate this process by automating the optimization of electrostatic and bonded parameters for both additive and polarizable Drude models, integrating QM target data for geometry and energy scans alongside empirical condensed-phase properties such as solvation free energies.[36] The CGenFF server provides an accessible platform for parameterizing small molecules, employing QM calculations for charges and conformational energies while targeting experimental densities and vibrational spectra to generate transferable parameters compatible with CHARMM biomolecular force fields.[35]
Validation of CHARMM parameters involves direct comparison to experimental observables, such as radii of gyration from small-angle X-ray scattering (SAXS) for disordered proteins and helix propensities assessed via NMR chemical shifts and J-couplings, ensuring structural accuracy across folded and unfolded states.[37] Benchmarking against other force fields, like AMBER ff99SB-ILDN, reveals CHARMM36m's competitive performance in reproducing experimental order parameters and secondary structure distributions, though AMBER variants sometimes show lower deviations in gyration radii for intrinsically disordered proteins.[37] Key metrics include root-mean-square error (RMSE) for hydration free energies, typically around 2.04 kcal/mol, and Pearson correlation coefficients exceeding 0.88 for structural alignments, indicating robust predictive power.[38]
Early challenges in CHARMM lipid force fields, such as overestimation of chain ordering in saturated lipids leading to gel-like bilayers in versions like C27r, were addressed through targeted refinements in C36, including adjustments to torsional and nonbonded parameters based on QM and experimental bilayer data, resulting in surface areas within 2% of experiment.[39] The 2025 release of CGenFF v5.0 further improves small-molecule transferability by adding 1,390 new compounds to the previous training set of approximately 930, for a total of over 2,300 compounds, enhancing agreement with QM geometries, vibrations, and dipole moments while maintaining low errors in solvent properties.[23][24] Ongoing refinements incorporate community feedback through the MacKerell lab's parameter repository, iteratively updating parameters to resolve discrepancies in diverse chemical spaces.[40]
Software Features
Molecular Dynamics Capabilities
CHARMM employs the Verlet/leap-frog integrator as its primary algorithm for propagating molecular dynamics trajectories, enabling the simulation of atomic motions under Newtonian mechanics.[41] This integrator, specified via the DYNAmics command with the LEAP keyword, updates positions and velocities in a staggered manner, offering stability and energy conservation suitable for biomolecular systems.[41] For energy minimization prior to dynamics, CHARMM supports the steepest descent (SD) method, which rapidly relaxes high-energy configurations by following the negative gradient of the potential energy, and the conjugate gradient (CONJ) method, which usually converges in fewer steps near a minimum because each search direction combines the current gradient with the previous direction.[42] These minimization algorithms are invoked through the MINImize command and are essential for preparing stable starting structures.[43]
Advanced simulation methods in CHARMM extend beyond standard dynamics to address complex thermodynamic and reactive processes. Free energy perturbation (FEP) calculations, implemented via the PERTurb command, allow estimation of free energy differences by scaling interactions between perturbed states, often used for alchemical transformations like ligand binding.[44] Umbrella sampling, facilitated by the UMBRel command, applies biasing potentials along a reaction coordinate to enhance sampling of rare events, enabling the reconstruction of potential of mean force profiles.[45] For regions involving chemical reactivity, CHARMM integrates quantum mechanics/molecular mechanics (QM/MM) hybrid approaches through the QMMM module, treating active sites quantum mechanically (e.g., via semiempirical methods like PM6) while the surrounding environment uses classical force fields.[46]
Boundary conditions in CHARMM simulations accommodate diverse system sizes and environments. Periodic boundary conditions (PBC), defined using the CRYStal command, replicate the simulation cell to mimic bulk phases, with long-range electrostatics handled by Ewald summation invoked via the EWALD keyword in nonbonded options for accurate treatment of charged systems.[47] For solvated biomolecules, stochastic boundary molecular dynamics (SBMD) confines dynamics to a reaction region with Langevin friction and random forces at the boundary, reducing computational cost while maintaining realistic solvation effects.[41]
On modern hardware, CHARMM supports molecular dynamics simulations spanning nanosecond (ns) to microsecond (μs) timescales, particularly for systems up to tens of thousands of atoms, leveraging optimized integrators and parallelization.[1] Implicit solvent models, such as generalized Born (GB) with solvent-accessible surface area (SA) nonpolar terms, are available via the GBNP command, approximating solvation without explicit water molecules to accelerate longer runs.[48]
CHARMM simulations are scripted using input files, conventionally given the .inp extension, which define topology, coordinates, parameters, and execution steps in a command-based syntax. A basic molecular dynamics run typically begins with reading topology (READ RTF) and parameter (READ PARAmeters) files, followed by generating structure (GENERate), assigning coordinates (READ COORdinates), minimizing energy (MINImize), and initiating dynamics (DYNAmics) with specified timestep, steps, and output frequencies, concluding with coordinate writes (WRITE COORdinates).[49] A minimal sketch of such an input follows; file names are placeholders, and exact keywords should be checked against the documentation of the installed version:
```
* Basic MD example (file names are placeholders)
*
! Read topology and parameters
read rtf card name top_all36_prot.rtf
read param card flex name par_all36_prot.prm
! Build the PROT segment from the sequence in the PDB file
read sequence pdb name coords.pdb
generate PROT setup
read coor pdb name coords.pdb resid
! Steepest-descent minimization
mini sd nstep 1000
! Constrain bonds to hydrogen so that a 2 fs timestep is stable
shake bonh param
! Leap-frog dynamics; the trajectory is written to unit 20 every 1000 steps
open write unformatted unit 20 name out.dcd
dyna leap start nstep 10000 timestep 0.002 -
     nprint 1000 iprfrq 1000 iuncrd 20 nsavc 1000
! Save the final coordinates
write coor card name final.crd
stop
```
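The staggered update performed by a leap-frog integrator can be illustrated with a short Python sketch. It is not CHARMM's implementation: the function name `leapfrog`, the one-dimensional harmonic oscillator, and all numerical values are assumptions chosen so that the result can be compared with the analytic solution x(t) = cos(t).

```python
import math

def leapfrog(x, v_half, force, mass, dt, nsteps):
    """Propagate one particle with the leap-frog scheme.

    x      : position at time t
    v_half : velocity at time t - dt/2 (staggered by half a step)
    """
    for _ in range(nsteps):
        v_half = v_half + dt * force(x) / mass  # velocity at t + dt/2
        x = x + dt * v_half                     # position at t + dt
    return x, v_half

# Toy system (hypothetical values): harmonic oscillator with F = -k x
k, mass, dt, nsteps = 1.0, 1.0, 0.01, 1000
x0, v0 = 1.0, 0.0

# Shift the initial velocity back by half a step to start the staggering
v_half0 = v0 - 0.5 * dt * (-k * x0) / mass

x_end, _ = leapfrog(x0, v_half0, lambda x: -k * x, mass, dt, nsteps)
t_end = nsteps * dt
print("leap-frog:", x_end, "analytic:", math.cos(t_end))
```

Positions live on whole time steps and velocities on half steps, which is what "staggered" refers to in the description above.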
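The idea behind free energy perturbation mentioned above can be sketched with the textbook exponential-averaging (Zwanzig) estimator, ΔF = −kT ln⟨exp(−ΔU/kT)⟩₀. This is a generic illustration rather than the algorithm used by the PERTurb command; the function name `fep_zwanzig`, the two harmonic potentials, and the spring constants are assumptions chosen because the exact answer, (kT/2) ln(k1/k0), is known.

```python
import numpy as np

def fep_zwanzig(delta_u, kT):
    """Estimate F1 - F0 from samples of dU = U1 - U0 drawn in state 0."""
    delta_u = np.asarray(delta_u, dtype=float)
    # Shift by the minimum before exponentiating for numerical stability
    shift = delta_u.min()
    return shift - kT * np.log(np.mean(np.exp(-(delta_u - shift) / kT)))

kT = 0.596            # kcal/mol, roughly 300 K
k0, k1 = 1.0, 2.0     # hypothetical spring constants of states 0 and 1

# Boltzmann samples of U0 = k0 x^2 / 2 are Gaussian with variance kT / k0
rng = np.random.default_rng(1)
x = rng.normal(scale=np.sqrt(kT / k0), size=200_000)

delta_u = 0.5 * (k1 - k0) * x**2          # U1(x) - U0(x) for each sample
estimate = fep_zwanzig(delta_u, kT)
exact = 0.5 * kT * np.log(k1 / k0)        # analytic result for two harmonic wells
print("estimate:", estimate, "exact:", exact)
```

In practice the transformation is usually split into several intermediate states so that consecutive states overlap well; the single-step form here works only because the two toy potentials are similar.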
Analysis and Utility Tools
CHARMM provides a suite of built-in tools for analyzing molecular dynamics (MD) trajectories, enabling researchers to extract structural and dynamic insights from simulation outputs. The COOR module facilitates root-mean-square deviation (RMSD) and root-mean-square fluctuation (RMSF) calculations, which quantify structural deviations and atomic fluctuations relative to a reference structure. For instance, the coor orient rms command aligns selected atoms, such as alpha carbons, and computes RMSD values across trajectory frames, while RMSF is derived by averaging deviations over time for each residue. Hydrogen bonding analysis is supported via the coor hbond command, which identifies donor-acceptor pairs based on geometric criteria (e.g., distance < 2.4 Å and angle > 120°) and outputs statistics like average bond counts and lifetimes for intra- or intermolecular interactions. Secondary structure assignment employs DSSP-like algorithms through the coor secs command, classifying residues into helices, sheets, or coils based on hydrogen bonding patterns and dihedral angles, with options to track temporal evolution in trajectories.[50][1]
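The align-then-measure procedure behind RMSD and RMSF can be sketched in Python with NumPy. This is not CHARMM code: the superposition uses the standard Kabsch algorithm, the function names `kabsch_fit`, `rmsd`, and `rmsf` are hypothetical, and the "trajectory" is synthetic (a rotated, shifted, and slightly perturbed copy of random reference coordinates).

```python
import numpy as np

def kabsch_fit(mobile, ref):
    """Return `mobile` (N x 3) superimposed onto `ref` (N x 3) by least squares."""
    mob_c = mobile - mobile.mean(axis=0)
    ref_c = ref - ref.mean(axis=0)
    # SVD of the covariance matrix yields the optimal rotation
    u, _, vt = np.linalg.svd(mob_c.T @ ref_c)
    d = 1.0 if np.linalg.det(vt.T @ u.T) >= 0 else -1.0  # avoid reflections
    rot = vt.T @ np.diag([1.0, 1.0, d]) @ u.T
    return mob_c @ rot.T + ref.mean(axis=0)

def rmsd(a, b):
    """Root-mean-square deviation between two N x 3 coordinate sets."""
    return float(np.sqrt(((a - b) ** 2).sum(axis=1).mean()))

def rmsf(frames):
    """Per-atom fluctuation about the mean of aligned frames (n_frames, N, 3)."""
    mean = frames.mean(axis=0)
    return np.sqrt(((frames - mean) ** 2).sum(axis=2).mean(axis=0))

# Synthetic data standing in for a trajectory of 20 atoms and 50 frames
rng = np.random.default_rng(0)
ref = rng.normal(size=(20, 3))
theta = 0.7
rot_z = np.array([[np.cos(theta), -np.sin(theta), 0.0],
                  [np.sin(theta),  np.cos(theta), 0.0],
                  [0.0, 0.0, 1.0]])
frames = np.array([
    kabsch_fit(ref @ rot_z.T + 5.0 + rng.normal(scale=0.1, size=ref.shape), ref)
    for _ in range(50)
])

rmsd_per_frame = np.array([rmsd(f, ref) for f in frames])
rmsf_per_atom = rmsf(frames)
print(rmsd_per_frame.mean(), rmsf_per_atom.mean())
```

Aligning each frame before measuring removes overall rotation and translation, so the remaining deviation reflects internal motion only.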
Energy decomposition tools in CHARMM allow dissection of the potential energy into contributions from specific residues or atom groups, aiding in the identification of stabilizing interactions. The INTEraction command computes pairwise interaction energies (e.g., van der Waals and electrostatic) between selected subsets, such as a ligand and protein residues, while the ENERGY module extends this to per-residue breakdowns by summing intra- and intermolecular terms for each residue. Correlation functions for dynamics are handled by the CORREL module, which processes time series data from trajectories to compute autocorrelation functions for quantities like dihedral angles or energies, revealing timescales of motions (e.g., via exponential fitting). These tools support quasi-harmonic analysis through the VIBRAN facility, which derives covariance matrices from trajectory fluctuations to estimate entropic contributions and low-frequency modes.[1]
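The autocorrelation analysis and exponential fitting described above can be illustrated with a NumPy sketch. It does not reproduce the CORREL module: the function name `autocorrelation` is hypothetical, and the time series is a synthetic first-order autoregressive process, chosen because its autocorrelation decays as a^lag with a known correlation time of −1/ln(a) steps.

```python
import numpy as np

def autocorrelation(series, max_lag):
    """Normalized autocorrelation C(lag) = <dx(t) dx(t+lag)> / <dx^2>."""
    x = np.asarray(series, dtype=float)
    dx = x - x.mean()
    var = np.mean(dx * dx)
    return np.array([np.mean(dx[: len(dx) - lag] * dx[lag:]) / var
                     for lag in range(max_lag + 1)])

# Synthetic time series standing in for, e.g., a dihedral angle or an energy
rng = np.random.default_rng(2)
a, n = 0.9, 200_000
noise = rng.normal(size=n)
x = np.empty(n)
x[0] = noise[0]
for t in range(1, n):
    x[t] = a * x[t - 1] + noise[t]

acf = autocorrelation(x, max_lag=10)

# Exponential fit: log C(lag) is linear in lag with slope -1/tau
lags = np.arange(len(acf))
slope, _ = np.polyfit(lags, np.log(acf), 1)
tau_est = -1.0 / slope
tau_true = -1.0 / np.log(a)
print("estimated tau:", tau_est, "expected tau:", tau_true)
```

The fit is restricted to short lags, where the correlation is still well above the statistical noise; at longer lags the logarithm of a noisy, near-zero value becomes unreliable.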
Utility functions in CHARMM streamline preprocessing and postprocessing tasks through its internal scripting language, which supports conditional statements, loops, variable substitution, and subroutine calls for automating workflows. PDB file manipulation is achieved with READ and WRITE COOR PDB commands, allowing atom selection, renumbering, and formatting adjustments, while the IC (internal coordinates) module enables mutations by parameterizing new residue topologies and refining geometries via energy minimization. Solvation box generation uses the SOLV command to add water molecules within a defined spherical or cubic boundary around the solute, followed by ion placement via the IONize command to neutralize charge. These scripts can chain operations, such as building solvated systems from initial coordinates.[1][50]
Visualization integration is inherent in CHARMM's output formats, with trajectory data saved in DCD binary files compatible with external tools like VMD and PyMOL for interactive rendering of dynamics, hydrogen bonds, and secondary structures. Built-in plotting capabilities via the CORREL and GRAPHX modules generate time series graphs for energies, forces, and RMSD, outputting to text or PostScript files for further analysis. The GRAPHX facility supports basic 3D visualization with features like atom coloring and bond rendering, though it is often supplemented by external software.[1]
Recent additions since 2023 enhance CHARMM's extensibility through the pyCHARMM Python interface, which embeds core functionality into Python scripts for custom trajectory analyses, such as integrating NumPy for advanced statistical processing of RMSD/RMSF data. This interface facilitates machine learning hooks, exemplified by the MLPot module, which couples CHARMM force fields with neural network potentials like PhysNet for enhanced sampling in free energy calculations, enabling on-the-fly potential corrections during MD. These developments, including support for Gaussian Process Regression in QM/MM simulations via delta-ML potentials, broaden utility for complex workflows while maintaining compatibility with existing tools. As of 2024, further enhancements include apoCHARMM for GPU-accelerated simulations, the MIST approach for third-order conformational entropy calculations, and the COOR SMAP command for hydration maps, as detailed in version c50b1.[51][4][52]
Implementation
Running CHARMM on Unix/Linux
CHARMM installation on Unix/Linux systems begins with downloading the source code package from the official academic distribution site, academiccharmm.org, which provides access to the latest release, such as c50b1, including source files, documentation, test cases, and topology/parameter files. Unpack the tarball into a working directory, typically ~/c50b1 or similar, ensuring sufficient disk space for compilation and libraries. Compilation requires a Fortran compiler; recommended options include GNU gfortran version 4.4 or later (excluding 4.5.1) or Intel ifort version 11.1 or later, with Intel icc for C components if needed.[53] To build, navigate to the unpacked directory and execute the configuration script, such as ./configure --with-gcc for gfortran or --with-intel for ifort, followed by make -jN -C build/cmake install using CMake for modern builds, where N is the number of parallel jobs.[11] Optional switches during configuration enable features like FFTW support via --enable-fftw or NetCDF via --with-netcdf=/path/to/netcdf. The resulting executable, named charmm, is placed in the bin subdirectory, such as ~/c50b1/bin/charmm.[53]
Environment variables facilitate execution and customization. Set CHARMMEXEC to the full path of the compiled executable (e.g., export CHARMMEXEC=~/c50b1/bin/charmm) to simplify invocation from scripts or other tools. Additionally, include the compiler and library paths in PATH, and for optional libraries, define FFTW_HOME or NETCDF_DIR pointing to their installation directories (e.g., /usr/local/netcdf). These variables ensure CHARMM locates dependencies during runtime, particularly for I/O formats like NetCDF coordinates.
Basic execution of CHARMM on Unix/Linux uses command-line redirection for input and output files. The standard syntax is charmm < input.inp > output.out, where input.inp contains the sequence of CHARMM commands (starting with a * title line) and output.out captures the log and results.[49] For interactive sessions, omit redirection and enter commands directly at the CHARMM prompt. Graphics output, if enabled via the OPEN GRAPH command in the input, requires X11 forwarding (e.g., ssh -X).
CHARMM relies on specific file types for molecular systems. Topology files, typically in Residue TOPology (.rtf) or extended .top format, define atom types, bonds, angles, and dihedrals for residues. Parameter files (.prm) provide force field constants like bond lengths and angles, loaded via READ PARA CARD or similar commands. Coordinate files specify atomic positions, commonly in Protein Data Bank (.pdb) format for initial structures or CHARMM card (.crd) format; trajectories from dynamics are stored separately as binary DCD files. A typical workflow loads these sequentially: READ RTF CARD NAME topology.rtf, READ PARA CARD NAME parameters.prm, READ COOR PDB NAME coord.pdb. For batch scripting, wrap executions in a shell script, such as:
```bash
#!/bin/bash
# Path to the compiled CHARMM executable
export CHARMMEXEC=~/c50b1/bin/charmm
# Run the input script and capture the log
"$CHARMMEXEC" < my_simulation.inp > my_simulation.out
```
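The same redirection-based invocation can also be driven from Python, which is convenient when many inputs are run in sequence. The sketch below uses only the standard library; the function name `run_charmm` is hypothetical, and the command is passed as a list so that the path to the executable (for example the value of CHARMMEXEC) stays under the caller's control.

```python
import subprocess

def run_charmm(command, input_file, output_file):
    """Run `command < input_file > output_file` and return the exit code.

    command is a list such as ["/path/to/charmm"]; standard error is merged
    into the output file so that the log is complete.
    """
    with open(input_file, "rb") as inp, open(output_file, "wb") as out:
        result = subprocess.run(command, stdin=inp, stdout=out,
                                stderr=subprocess.STDOUT, check=False)
    return result.returncode
```

A call such as `run_charmm([os.environ["CHARMMEXEC"]], "my_simulation.inp", "my_simulation.out")` would mirror the shell script, assuming `os` is imported and the variable points at a working executable. A nonzero return code signals that the run should be inspected before its output is used.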
If NetCDF support is needed, install the library (e.g., sudo apt install libnetcdf-dev on Ubuntu) and specify the path in configuration, as it supports advanced I/O for large trajectories. Similarly, FFTW is required for fast Fourier transforms in simulations; install with sudo yum install fftw-devel on CentOS/Rocky Linux and enable the switch to avoid "undefined reference" errors. Performance optimization involves compiler flags like -O3 -march=native passed via FFLAGS or FCFLAGS environment variables (e.g., export FFLAGS="-O3 -funroll-loops" before configure), which can speed up the resulting executable by 20-30% on modern x86_64 hardware without altering correctness.[54] If the build fails to produce an executable (e.g., due to mismatched MPI modules), clean the build directory with make clean and verify compiler consistency.
Platform specifics vary across Linux distributions. On Ubuntu (e.g., 24.04 LTS), use apt for dependencies like gfortran, libfftw3-dev, and libnetcdf-dev, with configuration targeting the gnu machine type. CentOS successors such as Rocky Linux 8/9 use dnf for packages such as gcc-gfortran and fftw-devel, and Intel compilers are often preferred in HPC environments for their vectorization. For containerization, as recommended in 2024 documentation, use Apptainer (successor to Singularity) over Docker for security in shared clusters; build an image from a definition file, for example with apptainer build charmm.sif image.def, and bind data directories at run time so that CHARMM and its dependencies are encapsulated portably across distributions.[55] This approach avoids system conflicts and supports reproducible runs on Ubuntu or CentOS-based nodes.
