Force field (chemistry)

Updated on Nov 29, 2024

Edit

Comment

In the context of molecular modeling, a force field (a special case of energy functions or interatomic potentials; not to be confused with force field in classical physics) refers to the functional form and parameter sets used to calculate the potential energy of a system of atoms or coarse-grained particles in molecular mechanics and molecular dynamics simulations. The parameters of the energy functions may be derived from experiments in physics or chemistry, calculations in quantum mechanics, or both.

All-atom force fields provide parameters for every type of atom in a system, including hydrogen, while united-atom interatomic potentials treat the hydrogen and carbon atoms in each methyl group (terminal methyl) and each methylene bridge as one interaction center. Coarse-grained potentials, which are often used in long-time simulations of macromolecules such as proteins, nucleic acids, and multi-component complexes, provide even cruder representations for higher computing efficiency.

Functional form

The basic functional form of potential energy in molecular mechanics includes bonded terms for interactions of atoms that are linked by covalent bonds, and nonbonded (also termed noncovalent) terms that describe the long-range electrostatic and van der Waals forces. The specific decomposition of the terms depends on the force field, but a general form for the total energy in an additive force field can be written as E total = E bonded + E nonbonded where the components of the covalent and noncovalent contributions are given by the following summations:

E bonded = E bond + E angle + E dihedral

E nonbonded = E electrostatic + E van der Waals

The bond and angle terms are usually modeled by quadratic energy functions that do not allow bond breaking. A more realistic description of a covalent bond at higher stretching is provided by the more expensive Morse potential. The functional form for dihedral energy is highly variable. Additional, "improper torsional" terms may be added to enforce the planarity of aromatic rings and other conjugated systems, and "cross-terms" that describe coupling of different internal variables, such as angles and bond lengths. Some force fields also include explicit terms for hydrogen bonds.

The nonbonded terms are most computationally intensive. A popular choice is to limit interactions to pairwise energies. The van der Waals term is usually computed with a Lennard-Jones potential and the electrostatic term with Coulomb's law, although both can be buffered or scaled by a constant factor to account for electronic polarizability and produce better agreement with experimental observations.

Parameterization

In addition to the functional form of the potentials, force fields define a set of parameters for different types of atoms, chemical bonds, dihedral angles and so on. The parameter sets are usually empirical. A force field would include distinct parameters for an oxygen atom in a carbonyl functional group and in a hydroxyl group. The typical parameter set includes values for atomic mass, van der Waals radius, and partial charge for individual atoms, and equilibrium values of bond lengths, bond angles, and dihedral angles for pairs, triplets, and quadruplets of bonded atoms, and values corresponding to the effective spring constant for each potential. Most current force fields parameters use a fixed-charge model by which each atom is assigned one value for the atomic charge that is not affected by the local electrostatic environment; proposed developments in next-generation force fields incorporate models for polarizability, in which a particle's charge is influenced by electrostatic interactions with its neighbors. For example, polarizability can be approximated by the introduction of induced dipoles; it can also be represented by Drude particles, massless, charge-carrying virtual sites attached by a springlike harmonic oscillator potential to each polarizable atom. The introduction of polarizability into force fields in common use has been inhibited by the high computational expense associated with calculating the local electrostatic field.

Although many molecular simulations involve biological macromolecules such as proteins, DNA, and RNA, the parameters for given atom types are generally derived from observations on small organic molecules that are more tractable for experimental studies and quantum calculations. Different force field parameters can be derived from dissimilar types of experimental data, such as enthalpy of vaporization (OPLS), enthalpy of sublimation, dipole moments, or various spectroscopic parameters.

Parameter sets and functional forms are defined by interatomic potentials developers to be self-consistent. Because the functional forms of the potential terms vary extensively between even closely related interatomic potentials (or successive versions of the same interatomic potential), the parameters from one interatomic potential function should clearly never be used together with another interatomic potential function.

Deficiencies

All interatomic potentials are based on many approximations and derived from different types of experimental data. Thus, they are termed empirical. Some existing energy functions do not account for electronic polarization of the environment, an effect that can significantly reduce electrostatic interactions of partial atomic charges. This problem was addressed by developing polarizable force fields or using macroscopic dielectric constant. However, application of one value of dielectric constant is questionable in the highly heterogeneous environments of proteins or biological membranes, and the nature of the dielectric depends on the model used.

All types of van der Waals forces are also strongly environment-dependent, because these forces originate from interactions of induced and "instantaneous" dipoles (see Intermolecular force). The original Fritz London theory of these forces can only be applied in vacuum. A more general theory of van der Waals forces in condensed media was developed by A. D. McLachlan in 1963 (this theory includes the original London's approach as a special case). The McLachlan theory predicts that van der Waals attractions in media are weaker than in vacuum and follow the like dissolves like rule, which means that different types of atoms interact more weakly than identical types of atoms. This is in contrast to combinatorial rules or Slater-Kirkwood equation applied for development of the classical force fields. The combinatorial rules state that interaction energy of two dissimilar atoms (e.g., C…N) is an average of the interaction energies of corresponding identical atom pairs (i.e., C…C and N…N). According to McLachlan theory, the interactions of particles in a media can even be fully repulsive, as observed for liquid helium. The conclusions of McLachlan theory are supported by direct measurements of attraction forces between different materials (Hamaker constant), as explained by Jacob Israelachvili in his book Intermolecular and surface forces. It was concluded that "the interaction between hydrocarbons across water is about 10% of that across vacuum". Such effects are unaccounted in standard molecular mechanics.

Another round of criticism came from practical applications, such as protein structure refinement. It was noted that Critical Assessment of protein Structure Prediction (CASP) participants did not try to refine their models to avoid "a central embarrassment of molecular mechanics, namely that energy minimization or molecular dynamics generally leads to a model that is less like the experimental structure". The force fields have been applied successfully for protein structure refinement in different X-ray crystallography and NMR spectroscopy applications, especially using program XPLOR. However, such refinement is driven mainly by a set of experimental constraints, whereas the interatomic potentials serve merely to remove interatomic hindrances. The results of calculations are practically the same with rigid sphere potentials implemented in program DYANA (calculations from NMR data), or with programs for crystallographic refinement that do not use any energy functions. The deficiencies of the interatomic potentials remain a major bottleneck in homology modeling of proteins. Such situation gave rise to development of alternative empirical scoring functions specifically for ligand docking, protein folding, homology model refinement, computational protein design, and modeling of proteins in membranes.

There is also an opinion that molecular mechanics may operate with energy which is irrelevant to protein folding or ligand binding. The parameters of typical force fields reproduce enthalpy of sublimation, i.e., energy of evaporation of molecular crystals. However, it was recognized that protein folding and ligand binding are thermodynamically very similar to crystallization, or liquid-solid transitions, because all these processes represent freezing of mobile molecules in condensed media. Thus, free energy changes during protein folding or ligand binding are expected to represent a combination of an energy similar to heat of fusion (energy absorbed during melting of molecular crystals), a conformational entropy contribution, and solvation free energy. The heat of fusion is significantly smaller than enthalpy of sublimation. Hence, the potentials describing protein folding or ligand binding must be weaker than potentials in molecular mechanics. Indeed, the energies of H-bonds in proteins are ~ -1.5 kcal/mol when estimated from protein engineering or alpha helix to coil transition data, but the same energies estimated from sublimation enthalpy of molecular crystals were -4 to -6 kcal/mol. The depths of modified Lennard-Jones potentials derived from protein engineering data were also smaller than in typical potential parameters and followed the like dissolves like rule, as predicted by McLachlan theory.

Future perspectives

The use of interatomic potentials in chemistry was first introduced in 1949, apparently independently by Hill and by Westheimer, applied mainly to organic chemistry to estimate properties such as strain energies among others. The functional form of the interatomic potential, focused in this article applied to biological systems, was established by Lifson in the 1960s. For over a half century, interatomic potentials have served us well, providing useful insights into and interpretation of biomolecular structure and function. Undoubtedly, it will continue to be widely used, thanks to its computational efficiency, while its reliability will continue to be improved. Yet, there are many well-known deficiencies as noted above. Further, the number of energy terms used in a given interatomic potential cannot be uniquely determined and a highly redundant number of degrees of freedom are typically used. Consequently, the "parameters" in different interatomic potentials can be vastly different. Of course, the emphasis to incorporate polarization into the standard pair-wise potentials can be very useful; however, there is no unique way of treating polarization in molecular mechanics because it is of quantum mechanical origin. Furthermore, often we are more interested in the properties derived from the dynamic dependence of the interatomic potential itself on molecular fluctuations.

One possibility is that the future development of interatomic potential ought to move beyond the current molecular mechanics approach, by using quantum mechanics explicitly to construct the interatomic potential. A number of the polarizable interatomic potentials listed below, such as density fitting and bond-polarization, already included some of the key ingredients towards this goal. The explicit polarization (X-Pol) method appears to have established the fundamental theoretical framework for a quantal force field; the next step is to develop the necessary parameters to achieve more accurate results than classical mechanics can offer.

Popular force fields

Different force fields are designed for different purposes. All are implemented in various computer software.

MM2 was developed by Norman Allinger mainly for conformational analysis of hydrocarbons and other small organic molecules. It is designed to reproduce the equilibrium covalent geometry of molecules as precisely as possible. It implements a large set of parameters that is continuously refined and updated for many different classes of organic compounds (MM3 and MM4).

CFF was developed by Arieh Warshel, Lifson and coworkers as a general method for unifying studies of energies, structures and vibration of general molecules and molecular crystals. The CFF program, developed by Levitt and Warshel, is based on the Cartesian representation of all the atoms, and it served as the basis for many subsequent simulation programs.

ECEPP was developed specifically for modeling of peptides and proteins. It uses fixed geometries of amino acid residues to simplify the potential energy surface. Thus, the energy minimization is conducted in the space of protein torsion angles. Both MM2 and ECEPP include potentials for H-bonds and torsion potentials for describing rotations around single bonds. ECEPP/3 was implemented (with some modifications) in Internal Coordinate Mechanics and FANTOM.

AMBER, CHARMM, and GROMOS have been developed mainly for molecular dynamics of macromolecules, although they are also commonly used for energy minimizing. Thus, the coordinates of all atoms are considered as free variables.

Classical force fields

Assisted Model Building and Energy Refinement (AMBER) – widely used for proteins and DNA.

Chemistry at HARvard Molecular Mechanics (CHARMM) – originally developed at Harvard, widely used for both small molecules and macromolecules

CVFF – also used broadly for small molecules and macromolecules.

COSMOS-NMR – hybrid QM/MM force field adapted to a variety of inorganic compounds, organic compounds and biological macromolecules, including semi-empirical calculation of atomic charges and NMR properties. COSMOS-NMR is optimized for NMR based structure elucidation and implemented in COSMOS molecular modelling package.

GROningen MOlecular Simulation (GROMOS) – a force field that comes as part of the GROMOS software, a general-purpose molecular dynamics computer simulation package for the study of biomolecular systems. GROMOS force field A-version has been developed for application to aqueous or apolar solutions of proteins, nucleotides, and sugars. A B-version to simulate gas phase isolated molecules is also available.

Optimized Potential for Liquid Simulations (OPLS, variants include OPLS-AA, OPLS-UA, OPLS-2001, OPLS-2005) – developed by William L. Jorgensen at the Yale University Department of Chemistry.

ECEPP – first force field for polypeptide molecules - developed by F.A. Momany, H.A. Scheraga and colleagues.

QCFF/PI – A general force fields for conjugated molecules.

Universal Force Field (UFF) – A general force field with parameters for the full periodic table up to and including the actinoids, developed at Colorado State University.

Consistent Force Field (CFF) – a family of forcefields adapted to a broad variety of organic compounds, includes force fields for polymers, metals, etc.

Condensed-phase Optimized Molecular Potentials for Atomistic Simulation Studies (COMPASS) – developed by H. Sun at Molecular Simulations Inc., parameterized for a variety of molecules in the condensed phase, now available through Accelrys.

Merck Molecular Force Field (MMFF) – developed at Merck, for a broad range of molecules.

MM2, MM3, MM4 – developed by Norman Allinger, parametrized for a broad range of molecules.

QVBMM - developed by Vernon G. S. Box, parameterized for all biomolecules and a broad range of organic molecules, and implemented in StruMM3D (STR3DI32).

Transferable Potentials for Phase Equilibria (TraPPE) – a family of molecular mechanics force fields developed by the Siepmann group at the University of Minnesota for molecular simulations of complex chemical systems.

Polarizable force fields

X-Pol: the Explicit Polarization Theory – a fragment-based electronic structure method introduced by Jiali Gao [1] at the University of Minnesota, which can be used at any level of theory—ab initio Hartree–Fock method (HF), semiempirical molecular orbital theory, correlated wave function theory, or Kohn-Sham (KS) density functional theory (DFT). It can perform over 3,200 steps (3.2 ps) of MD simulations of a fully solvated protein in water with periodic boundary conditions, consisting of about 15,000 atoms and 30,000 basis functions on one processor in 24 hours in 2008, with a full quantum mechanical representation of the whole system. Note that the first MD simulation of a protein by McCammon, Gelin, and Karplus in 1977 lasted 8.8 ps using a united-atom force field without solvent.

CFF/ind and ENZYMIX – The first polarizable force field which has subsequently been used in many applications to biological systems.

DRF90 developed by P. Th. van Duijnen and coworkers.

PIPF – The polarizable intermolecular potential for fluids is an induced point-dipole force field for organic liquids and biopolymers. The molecular polarization is based on Thole's interacting dipole (TID) model and was developed by Jiali Gao [2] at the University of Minnesota.

Polarizable Force Field (PFF) – developed by Richard A. Friesner and coworkers.

SP-basis Chemical Potential Equalization (CPE) – approach developed by R. Chelli and P. Procacci.

CHARMM – polarizable force field developed by S. Patel (University of Delaware) and C. L. Brooks III (University of Michigan).

AMBER – polarizable force field developed by Jim Caldwell and coworkers.

CHARMM – polarizable force field based on the classical Drude oscillator developed by A. MacKerell (University of Maryland, Baltimore) and B. Roux (University of Chicago).

Sum of Interactions Between Fragments Ab initio computed (SIBFA) – force field for small molecules and flexible proteins, developed by Nohad Gresh (Paris V, René Descartes University) and Jean-Philip Piquemal (Paris VI, Pierre & Marie Curie University). SIBFA is a molecular mechanics procedure formulated and calibrated on the basis of ab initio supermolecule computations. Its purpose is to enable the simultaneous and reliable computations of both intermolecular and conformational energies governing the binding specificities of biologically and pharmacologically relevant molecules. This procedure enables an accurate treatment of transition metals. The inclusion of a ligand field contribution allows computations on "open-shell" metalloproteins.

Atomic Multipole Optimized Energetics for Biomolecular Applications (AMOEBA) – force field developed by Pengyu Ren (University of Texas at Austin) and Jay W. Ponder (Washington University).

ORIENT – procedure developed by Anthony J. Stone (Cambridge University) and coworkers.

Non-Empirical Molecular Orbital (NEMO) – procedure developed by Gunnar Karlström and coworkers at Lund University (Sweden)

Gaussian Electrostatic Model (GEM) – a polarizable force field based on Density Fitting developed by Thomas A. Darden and G. Andrés Cisneros at NIEHS; and Jean-Philip Piquemal at Paris VI University.

Polarizable procedure based on the Kim-Gordon approach developed by Jürg Hutter and coworkers (University of Zürich)

Computer Simulation of Molecular Structure (COSMOS-NMR) – developed by Ulrich Sternberg and coworkers. Hybrid QM/MM force field enables explicit quantum-mechanical calculation of electrostatic properties using localized bond orbitals with fast BPT formalism. Atomic charge fluctuation is possible in each molecular dynamics step.

Reactive force fields

ReaxFF – reactive force field (interatomic potential) developed by Adri van Duin, William Goddard and coworkers. It is fast, transferable and is the computational method of choice for atomistic-scale dynamical simulations of chemical reactions. Parallelized ReaxFF allows reactive simulations on >>1000,000 atoms.

Empirical valence bond (EVB) – this reactive force field, introduced by Warshel and coworkers, is probably the most reliable and physically consistent way to use force fields in modeling chemical reactions in different environments. The EVB facilitates calculating activation free energies in condensed phases and in enzymes.

RWFF – reactive force field for water developed by Detlef W. M. Hofmann, Liudmila N. Kuleshova, and Bruno D'Aguanno. It is very fast, reproduces the experimental data of neutron scattering accurately, and allows simuling bond formation-breaking of water and acids.

Coarse-grained force fields

Virtual atom molecular mechanics (VAMM) – a coarse-grained force field developed by Korkut and Hendrickson for molecular mechanics calculations such as large scale conformational transitions based on the virtual interactions of C-alpha atoms. It is a knowledge based force field and formulated to capture features dependent on secondary structure and on residue-specific contact information in proteins.

MARTINI – a coarse-grained potential developed by Marrink and coworkers at the University of Groningen, initially developed for molecular dynamics simulations of lipids, later extended to various other molecules. The force field applies a mapping of four heavy atoms to one CG interaction site and is parameterized with the aim of reproducing thermodynamic properties.

Water models

The set of parameters used to model water or aqueous solutions (basically a force field for water) is called a water model. Water has attracted a great deal of attention due to its unusual properties and its importance as a solvent. Many water models have been proposed; some examples are TIP3P, TIP4P, SPC, flexible simple point charge water model (flexible SPC), and ST2.

Post-translational modifications and unnatural amino acids

Forcefield_PTM – An AMBER-based forcefield and webtool for modeling common post-translational modifications of amino acids in proteins developed by Chris Floudas and coworkers. It uses the ff03 charge model and has several side-chain torsion corrections parameterized to match the quantum chemical rotational surface.

Forcefield_NCAA - An AMBER-based forcefield and webtool for modeling common non-natural amino acids in proteins in condensed-phase simulations using the ff03 charge model. The charges have been reported to be correlated with hydration free energies of corresponding side-chain analogs.

Other

VALBOND - a function for angle bending that is based on valence bond theory and works for large angular distortions, hypervalent molecules, and transition metal complexes. It can be incorporated into other force fields such as CHARMM and UFF.

References

Force field (chemistry) Wikipedia

(Text) CC BY-SA

Contents