Cerius2 Forcefield Based Simulations - Preparing the Energy Expression and the Model

You should read this section if you want to understand what atom types are, how types and charges are assigned automatically, and how you can make your own atom type or charge assignments.

In addition, if you want to understand parameter assignment (Determination of which parameters are used with which atom types) and/or edit the forcefield parameters (Manual parameter assignment), you need to understand something about atom type assignment first.

All molecular modeling programs supplied by MSI perform automatic and/or semi-automatic atom-type and charge assignment (which needs to be re-done if you switch to a different forcefield). Please see the guidebook for the appropriate molecular modeling program for details on how to assign atom types and charges for the simulation engine you are using (see Available documentation).

What are atom types in forcefields?

The simulation engine needs the forcefield atom type of each atom in the model in order to determine which forcefield parameters to use. Forcefield parameters apply to particular combinations of atom types as specified by the forcefield.

Relation between forcefield atom types and chemical atoms

The forcefield atom types are related to the microchemical environment of the atoms in a way defined by the particular forcefield. For example, a methane model has only two atom types, one for the carbon and one for the hydrogens, even though each of the atoms may have a distinct atom name for labeling purposes. The hydrogen atoms are equivalent by symmetry; therefore, they would all have the same atom type in any forcefield.

As a more complicated example, consider propane, which has four distinct types of atoms: methyl carbon atoms, methyl hydrogen atoms, a methylene carbon atom, and the methylene hydrogens. In principle, a forcefield could consider these to be four distinct atom types, but in practice, the chemical difference between the carbon atoms or between the hydrogen atoms is very small, so in most forcefields the carbon atoms are all assigned the same atom type, and all the hydrogens are assigned a second atom type.

Assigning atom types to a model

Atom types and charges supplied by the structure file

Atom types are assigned by the simulation engine or the molecular modeling program. Atom types are automatically assigned by using a set of rules that link the type of an atom to its element type and its chemical microenvironment (for example, the number and nature of connected atoms). Different forcefields use different atom types and atom-typing rules, which are contained in a residue library or the forcefield file.

The atom type information can also be supplied by a molecular data file such as an .msi file (OFF), an .mdf file (Discover and OFF), or an RTF (or PSF) file (CHARMm). These structure files are typically created in the Cerius², Insight, or QUANTA molecular modeling program.

To make sure that atom types are assigned:

Cerius² warns you when you try to perform some action for which atom types need to have been assigned, if they have not been assigned.
Insight checks whether types (and charges) have been assigned when you exit the model-building module and gives you an opportunity to assign them.
QUANTA takes care of assigning the atom types when the model is saved in a structure file.

Charge information also is saved in the structure file.

To assure that you use the most appropriate atom types in your studies, you should always check the assigned atom types against the appropriate table under Forcefield Terms and Atom Types. In most cases, Cerius² and Insight automatically assign the atom types. However, these assignment engines of course require the models to be correctly built. One of the most critical types of information is the bond order, which should be set before the forcefield is assigned.

Atom typing in different modeling programs

In Cerius² and Insight, atoms with unassigned atom types are labelled with question marks when you label the model according to the atom type (FFTYPE or potential type).

In Cerius²·Discover, use the Discover Atom Typing control panel (accessed by selecting the Forcefield/Typing item on the DISCOVER card) to assign all atom types and charges (if you do not want to do this automatically). You can also select individual atoms and manually assign an atom type different from the one assigned automatically.
In Cerius²·OFF, use the Force Field Atom Typing control panel (accessed by going to the OFF SETUP deck of cards and selecting the Typing/Atoms card menu item) to assign all atom types. Charges are also assigned if the forcefield being used contains charge information. You can also select individual atoms and manually assign an atom type different from the one assigned automatically.
In Insight II, assignment of potential types (and charges) to each atom is done with the Forcefield/Potentials parameter block, which appears automatically when appropriate or can be accessed from the Biopolymer, Builder, and other modules of the Insight program. You can re-type individual atoms by using the Atom/Potential parameter block in the Biopolymer or Builder module.
QUANTA automatically assigns atom types when you construct or modify a model. Library (or "dictionary") files for commonly used chemical units (amino acids, nucleic acids, etc.) are supplied with CHARMm. You can also manually assign atom types different from what were assigned automatically, by using the Molecular Editor (accessed from Applications/Builders/3-D Builder on the main QUANTA menu bar).

Table 6 . Forcefields parameterized with nonzero atom charges or bond increments
Forcefield	Engine
CFF91-95, CFF, PCFF, COMPASS	OFF, Discover
CVFF	OFF, Discover
bks1.01	OFF
burchart1.01	OFF
burchart1.01-DREIDING2.21 (not all atom types)	OFF
burchart1.01-UNIVERSAL1.02 (Burchart atom types only)	OFF
glassff_1.01	OFF
MMFF93	CHARMm ¹
msxx_1.01	OFF
CHARMm	CHARMm

Important

If you want to assign charges different from those in the forcefield, you need to assign the charges after atoms typing (and automatic charge assignment) is done.

Finding charges, if needed

If you need to specifically assign charges, most relevant modules allow you to set atomic charges directly or specify an overall net charge for the whole structure using charge editing functions.

For small models, you can obtain values for charges by using an ab initio or semiempirical quantum chemistry module (for example, MOPAC).

For larger, distorted, models and when charge assignment is done by the charge equilibration method (in Cerius²), you usually need to perform a short minimization before assigning charges. This is because charge equilibration calculations on distorted models can lead to assignment of unrealistic charges.

Charge assignment in different modeling programs

In Cerius²·Discover, charges are automatically assigned when atoms are typed.
In Cerius²·OFF, if you are using UFF or the Dreiding forcefield, charges should be assigned to the model by using the Charges module, accessed from the OFF SETUP deck of cards (see Cerius² Forcefield Engines: OFF). The Charges module uses the charge equilibration approach developed by Rappé & Goddard (1991) to predict the charges from the model geometry and the atomic electronegativities. If the model geometry changes much during minimization, you should iterate the procedure of reassigning charges and reminimizing until the energy reaches a constant. The Charges control panel also allows you to edit or assign charges manually.
In Insight II for all forcefields except CFF, assignment of charges (and atom types) to each atom is done with the Forcefield/Potentials parameter block, which appears automatically when appropriate or can be accessed from the Biopolymer, Builder, and other modules. Potential function atom types must be (and are) assigned before charges or partial charges are assigned. The Insight program assigns atom types and partial charges to each atom in the structure based on information in a residue library file or (if not found in a residue library file) on the bond increments found in the forcefield file. You can edit the charges on individual atoms with the Atom/Charge parameter block in the Biopolymer or Builder module.
QUANTA automatically assigns charges when you construct or modify a model. Library (or "dictionary") files for commonly used chemical units (amino acids, nucleic acids, etc.) are supplied with CHARMm. You can also manually assign charges different from what were assigned automatically, by using the Molecular Editor (accessed from Applications/Builders/3-D Builder on the main QUANTA menu bar).

Parameter assignment

If you are a novice user or routinely run relatively simple calculations on relatively simple or standard models, you do not need to read this section. However, if, for example, an error message informs you of missing parameters or you want to customize your energy expression for an atypical model, then you do need to understand how the simulation engines determine what parameters are used for which atoms, bonds, angles, etc.

You should also understand something about atom type and charge assignment (Assigning forcefield atom types and charges) to make effective use of this section.

Determination of which parameters are used with which atom types

Before calculating the energy of a model, the simulation engine must construct the complete energy expression for the model by associating the correct forcefield parameters with the appropriate atoms and other coordinates. For example, methane has one type of bond (C-H) and one type of bond angle (H-C-H). The program must create a list of the four actual bonds and then associate the C-H bond parameters with each. Similarly, there are six H-C-H angles, but they are characterized by the same set of parameters.

It is important to understand how the parameters from the forcefield are associated with individual internal coordinates, because the energy, derivatives, structures, and almost all other properties calculated by the program depend on these forcefield parameters and the way in which they are associated with the internal coordinates. The following sections describe two facets of this process: atom type equivalences and wildcards in parameter definitions.

Atom type equivalences

Chemically distinct atoms often differ in some, but not all, of their forcefield parameters. For example, the bond parameters for the C-C bonds in ethene and in benzene are quite different, but the nonbond parameters for the carbon atoms are essentially the same.

In Discover, rather than duplicating the nonbond parameters in the forcefield parameter file, atom type equivalences are used to simplify the problem. In the example, the phenyl carbon atom type is equivalent to the pure sp² carbons of ethene insofar as the nonbond parameters are concerned.

The Discover program recognizes five types of equivalences for each atom type: nonbond, bond, angle, torsion, and out-of-plane. Crossterms such as bond-bond terms have the same equivalences (insofar as atom types are concerned) as the diagonal term of the topology of all the atoms defining the internal coordinates. For the bond-bond term, this means that the atom type equivalences for angles would be used.

The actual format of the equivalence data in the forcefield parameter file is detailed in the File Formats documentation. For the equivalences used in any particular forcefield, you should examine the actual forcefield parameter file for current information.

CHARMm PRM files handle equivalences for nonbond parameters by using partial wildcards, for example, N* means that the associated nonbond parameters apply to any nitrogen type that is not specifically parameterized.

For forcefields in Cerius²·OFF, wildcards are usually used.

Wildcard atom types in the parameter file

For some internal coordinates, the parameters do not depend strongly on the specific atom types of one or more atoms. For example, the parameters of torsion terms may not be strongly affected by the end atoms. This means that the torsion parameters are essentially defined by the central bond rather than its substituents.

The forcefield engines allow wildcard atom types to conveniently handle this type of situation. This special atom type, indicated by an X in CHARMm .PRM files and in relevant Cerius² forcefield files and by an asterisk (*) in Discover forcefield files, matches any atom type when the forcefield engine is searching for the parameters to associate with a particular internal coordinate. (In CHARMm, this applies only to bond, angle, torsion, and improper-torsion parameters.)

Automatic assignment of values for missing parameters

Table 7 . Automatic parameter assignment in MSI's molecular modeling programs

Modeling program Automatic parameters Comments
Cerius²
yes
not for all forcefields

Insight II (CDiscover)
yes
but not for AMBER forcefield

insight II (FDiscover)
yes
controllable in standalone mode only, not for AMBER

QUANTA
yes
but only if the PSF Generator is used

Table 7 . Automatic parameter assignment in MSI's molecular modeling programs
Modeling program	Automatic parameters	Comments
Cerius²	yes	not for all forcefields
Insight II (CDiscover)	yes	but not for AMBER forcefield
insight II (FDiscover)	yes	controllable in standalone mode only, not for AMBER
QUANTA	yes	but only if the PSF Generator is used

What happens if parameters are not found

Some classic and second-generation forcefields are not completely parameterized for all their atom types. (For rules-based forcefields (Rule-based forcefields broadly applicable to the periodic table), all parameters are generated according to rules rather than read from the forcefield file.) When parameters for classic and second-generation forcefields are not available, one of several things can happen, with varying consequences:

The missing parameters are simply ignored (i.e., set to zero in the energy expression). The simulation runs and yields results, but they may be very inaccurate.
Setup of the energy expression is interrupted, the simulation run is not started, and a message is output to the textport or text window.
Missing parameters are obtained automatically from a simpler generic set of parameters (using many wildcards, see above). The results may be reasonable, but not as accurate as if specific parameters existed.

Temporary patches for missing parameters; precautions

A forcefield may include automatic parameters for use when better-quality explicit parameters are not defined for a particular bond, angle, torsion, or out-of-plane interaction. These parameters are intended as temporary patches, to allow you to begin calculations immediately. While MSI has made every effort to ensure that the automatic parameters used in CVFF, the CFF family of forcefields, and CHARMm produce reasonable geometries for a wide variety of models, we cannot guarantee that the automatic parameters are appropriate in every instance. You therefore should always carefully evaluate any results that you obtain using automatic parameters.

How missing parameters are supplied

Discover automatically assigns values for parameters missing from the CFF and CVFF forcefields by switching to an automatic forcefield. This switching is accomplished with an equivalence table that converts the original set of atom types to a smaller set of generic atom types.

Cerius²·OFF behaves similarly.

QUANTA's parameter chooser looks through the existing CHARMm parameters for similar cases and averages them all to come up with suggested values.

Discover's automatic forcefield

In the automatic forcefield in the Discover program, the atom types for bonds, angles, torsions, and out-of-plane deformations have different levels of specificity. For example, while bond-stretching parameters are determined by the atom types of both atoms; angle-bending and torsion parameters may be determined by the atom type of only the central atom(s). A wildcard (*), representing any type of atom, is used for the end atoms of torsions and angles.

In some cases, angle-bending parameters are specified by two atoms (rather than only the central atom). This can lead to ambiguity--for example, C-C-N (if not explicitly defined in the forcefield) can be associated with c_-c_-* or with n_-c_-*. The underscore in this example is used to denote the generic (or automatic) atom types. Here, a one-sided wildcard (*#, where # is an integer indicating the precedence), is used for one of the end atoms in an angle.

Cerius²·OFF handles precedence with an additional field (P0, P1, ... P9) rather than wildcards.

In interpreting the wildcard, the Discover program and the Cerius²·OFF module use the parameter for which the integer is lower. The parameters for a C-C-N angle would, for example (if not explicitly defined in the forcefield), be taken from those for atom types n_-c_-*6 rather than c_-c_-*7, because 6 is smaller than 7.

An example

As an example, the parameters for the angle oh-c"-c" in oxalic acid (Figure 3) are not present in CFF91.

Figure 3 . Oxalic acid structure and CFF91 atom types

When the automatic parameter assignment process is used in Discover, it looks at the auto-equivalence table in the cff91.frc file to find the generic atom types for this angle (indicated in bold type):


#auto_equivalence     cff91_auto

!                          Equivalences
!            -----------------------------------------------------------------------------
!Ver Ref Type NonB Bond Bond  Angle     Angle    Torsion     Torsion     OOP       OOP
                   Inct      End Atom  Apex Atom End Atoms  Center Atoms End Atom Center Atom
!--- --- ---- ---- ---- --- -------- ----------- --------- ------------ -------- -----------
 2.0  2   c"   c"   c"   c'_    c_        c'_       c_          c'_       c_        c'_
 2.0  2   oh   o    o_   o_     o_        o_        o_          o_        o_        o_
 ...

Thus, for parameter assignment purposes only, atom type oh is reassigned to o_, c" is reassigned to c'_ for the apex atom, and c" is reassigned to c_ for the end atom. The parameters for the oh-c"-c" angle could be taken from either the o_c'_*7 or the c_c'_*9 lines in the quadratic_angle section of the cff91.frc file--o_c'_*7 is chosen because 7 is lower than 9:



#quadratic_angle      cff91_auto

> E = K2 * (Theta - Theta0)^2

!Ver  Ref     I      J      K       Theta0         K2
!---- ---   ----   ----   ----   ---------    ---------
 2.0   2      c_     c'_    *9    120.0000       40.0000
 2.0   2      n_     c'_    *8    120.0000       53.5000
 2.0   2      o_     c'_    *7    110.0000      122.0000
 2.0   2      o'_    c'_    *6    120.0000       68.0000
 2.0   2      h_     c'_    *2    110.0000       55.0000
 ...

Manual parameter assignment

Notification of missing parameters

If parameter(s) for a potential type are not present in the forcefield file and are not generated when the energy expression is set up, an appropriate error message is written to the textport or text window of your molecular modeling program. (In QUANTA, this occurs only if you use QUANTA's Applications/Builders tools to construct your model; there is no warning for models that are read in from some other application or database.)

Missing and/or automatic parameters are also listed in an output file after the completion of a simultion run. You can find out if parameters are missing before starting your run:

Cerius² notifies you if atom types are missing and lists them in the text window. It then quits trying to set up an energy expression.

: If terms are missing, in Cerius²·Discover the energy expression is set up unless one or more of the Stop check boxes in the Discover Parameters control panel (accessed by selecting Forcefield/Parameters in the DISCOVER card) are checked.
: If terms are missing, in Cerius²·OFF the energy expression is set up by only if Ignore undefined terms is checked in the Energy Expression control panel (accessed from the Energy Expression/Setup card menu item on the OPEN FORCE FIELD card).

In Insight 97.0, you can request a list of missing parameters with the Forcefield/Tabulate parameter block in the Builder or Biopolymer module.
In Insight 4.0.0, you can use the Forcefield/FF_Info parameter block (in the Builder or other modules) to check the model for unassigned potentials and charges.
In QUANTA, you can request a parameter report with the CHARMm/Parameters/Set Options menu item.

Obtaining new parameters

If the forcefield you are using is similar in functional form and atom typing to another forcefield which does contain the desired parameters, you may be able to use those parameters in your forcefield, at least on a trial basis. You may also be able to obtain new parameters (or help in deriving them yourself) from the scientific literature (see References) or from the developers of the forcefield you are using.

Editing a forcefield

Expert users can edit MSI forcefields in different ways to customize them to their needs or to create new forcefields.

Editing through a graphical interface

MSI's principal molecular modeling programs include forcefield editors (see Available documentation):

Cerius²--Use the Force Field Editor (in the OFF SETUP card deck), which allows you to edit all existing defined terms in the current forcefield and also lets you create new forcefields. The Cerius²·FFE module also allows you to create, edit, and delete atom types and to change the rules on which automatic atom-typing is based.

: You can change the functional form of terms in rules-based forcefields (see Rule-based forcefields broadly applicable to the periodic table) by adding an explicit term. Any explicit term is always used in preference to a generated term.

Important

The Cerius²·FFE module cannot be used to modify the CFF or CVFF forcefield files that are included with Cerius²--if you want to use customized versions of these forcefields, you need to modify them with Insight II 4.0.0 or as described in the Discover documentation.

Insight II 4.0.0--Use the Forcefield/Edit_FF parameter block, which is found in several modules and allows you to edit the parameters of the current forcefield for all terms except crossterms. (This parameter block is not included in Insight 97.0, since well parameterized forcefields exist for life science applications.)
QUANTA--The Edit/Atom Data/Parameters menu item allows you to change a limited set of parameters for any atom type. The PSF Generator allows you to edit automatically supplied parameters when you do a CHARMm calculation in PSF mode.

Manual editing

Expert users can edit the files that define many forcefields with a text editor:

The classical and second-generation forcefields that are available through the Discover program--How to modify the parameter file is explained in separate documentation. The forcefield files are located in the directory defined by the $BIOSYM_LIBRARY environment variable.

: The potential template rule file used by Insight can also be edited. You may add new atom types by making additions to this file (refer to the Insight II documentation for a complete description).

CHARMm--Parameter file contents and formats are explained in the electronic documentation supplied with CHARMM. The forcefield file is located in the directory defined by the $CHM_DATA environment variable--PARM.BIN is the binary version and PARM.PRM the ASCII version.
ESFF--Since this forcefield is rule-based, a parameter file needs to be generated and then edited. Please see the CDiscover book.
CFF--Since this forcefield is encrypted, it is edited indirectly, by means of a special template file, as explained in MSI Forcefields: CFF.

Important

For forcefields accessed through Cerius², we strongly recommend that you never edit these files by hand. Please use the Force Field Editor module. This is important because some forcefield values are linked to others and only the Force Field Editor reliably assures that related values are modified in a coordinated way.

Using alternative forms of energy terms

The energy expression is the heart of the forcefield. Potential energy is described in the energy expression as the sum of various terms that indicate the energy costs of bond stretching, angle bending, etc. Not all terms are present in all forcefields, and the functional forms of the terms vary among forcefields (see Forcefields).

This section and the following main section (Applying constraints and restraints) describe energy term preferences that you can set and restraint terms that can be optionally included in the energy expression.

If you are a novice user, you should alter the default energy terms and parameters as little as possible. One exception to this recommendation is nonbond methods (see Handling nonbond interactions), where you should choose the method according to the model type rather than necessarily accept the default settings.

Table 8 . Modifications of energy expression in MSI's simulation engines

term modification engine (restrictions) details
any
removal
FFE ¹, CHARMm
here

all of a type
scaling
FDiscover, CDiscover
here

all of a type
editing
FFE, Insight 4.0.0 ²
here

bond stretching
Morse vs. harmonic
FDiscover, CDiscover (CVFF)
here

bond stretching
scaling
OFF ³ (C-H bonds)
here

torsion twisting
scaled, averaged, or first found
OFF
here

out-of-plane movement
averaged or first found
OFF
here

van der Waals interactions
Lennard-Jones vs. quartic
FDiscover (standalone)
here

hydrogen bond interactions
if used, how set up
OFF, FDiscover (standalone, AMBER), CDiscover (standalone, AMBER), CHARMm
here

crossterms
removal
FDiscover, CDiscover (CVFF)
here

1-3 or bond stretching-angle bending interactions
Urey-Bradley term vs. crossterm
OFF, CHARMm (standalone only)
here

¹ The Force Field Editor module in Cerius².
² The Forcefield/Edit_FF parameter block in the Builder and other modules can be used to edit parameters in all terms except crossterms for AMBER, CVFF, and CFF. (Parameter block not present in Insight 97.0.)
³ The Open Force Field module in Cerius².

Table 8 . Modifications of energy expression in MSI's simulation engines
term	modification	engine (restrictions)	details
any	removal	FFE ¹, CHARMm	here
all of a type	scaling	FDiscover, CDiscover	here
all of a type	editing	FFE, Insight 4.0.0 ²	here
bond stretching	Morse vs. harmonic	FDiscover, CDiscover (CVFF)	here
bond stretching	scaling	OFF ³ (C-H bonds)	here
torsion twisting	scaled, averaged, or first found	OFF	here
out-of-plane movement	averaged or first found	OFF	here
van der Waals interactions	Lennard-Jones vs. quartic	FDiscover (standalone)	here
hydrogen bond interactions	if used, how set up	OFF, FDiscover (standalone, AMBER), CDiscover (standalone, AMBER), CHARMm	here
crossterms	removal	FDiscover, CDiscover (CVFF)	here
1-3 or bond stretching-angle bending interactions	Urey-Bradley term vs. crossterm	OFF, CHARMm (standalone only)	here

Most methods for changing the functional form of the energy expression are available via the graphical UIFs:

Cerius²·FFE, OFF--Controls in the Energy Terms control panel in the Open Force Field module. (You can also use the Energy Terms control panel in the Force Field Editor to modify the forcefield itself.)
CDiscover--Items in the Forcefield menu of the Cerius²·Discover module and commands in the Specify pulldown of the Insight·Discover_3 module.
FDiscover--Commands in the Parameters pulldown of the Insight·Discover module.
QUANTA--Controls in the CHARMm Energy Setup dialog box (accessed via the CHARMm/Energy Terms menu item) and the CHARMm Update Parameters dialog box.

Discover and CHARMm also offer additional functionality when run in standalone mode.

Removing terms from the energy expression

Why remove terms

You may, for example, want to save computation time during the early stages of minimization of a model that is far from its equilibrium conformation by not calculating any cross terms. Or you may have found that certain terms are insignificant with respect to the purposes of your study.

How to remove terms

You can effectively remove terms from the energy expression in several ways:

In Cerius²·OFF, you can use the Energy Terms Selection control panel in the Open Force Field module to include or exclude entire classes of terms (e.g., all bond-bond crossterms) when setting up the energy expression.
With the CVFF forcefield in Discover, you can choose to turn off (i.e., not use) all cross terms.
You can accomplish the same end for other terms in CVFF and for any class of terms in the other forcefields supplied with the Discover program by scaling terms with a zero scaling factor (see next section).
In CHARMm, you can omit ("skip") any type(s) of terms or constraints, for example, all bond terms.

Scaling or editing any selected type of term

The contributions of various terms in the potential energy expression to the total energy can be scaled up or down and/or otherwise edited. This can be useful, for example, in the early stages of minimizing very "bad" structures, where large contributions by certain terms might interfere with convergence.

How it works

The Cerius²·Force Field Editor module allows you to directly change the parameters in any term (e.g., for all C_3-H_ bond terms, but not for all bond terms) in the forcefield. The Energy Terms control panels include entry boxes for all relevant parameters.

In the Discover program, scaling applies to an entire class of energy terms (e.g., all bond terms) in the energy expression. The force constants (or some other parameters) for the chosen class of terms are multiplied by some constant factor. For example, all bond interactions can be scaled by one factor and all van der Waals radii by another.

In the Insight 4.0.0 molecular modeling program, you can use the Forcefield/FF_Edit parameter block in the Builder and other modules to directly change the parameters in any term except crossterms (e.g., for all C-H bond terms, but not for all bond terms) in the forcefield. The *_Par parameter blocks accessed through the editor include entry boxes for all relevant parameters.

Alternative bond terms

With the CVFF forcefield in Discover, you can choose to use quadratic bond terms rather than Morse bond terms. The Morse term can allow bonds to stretch to unrealistic lengths (Figure 2), so you may get quicker convergence from a hightly distorted configuration if you replace the Morse term by a harmonic term. You do this by specifying the "no Morse" version of CVFF.

Scaled torsion terms

If all torsions about a common bond were simply summed, the torsion energy term could be too large. Cerius²·OFF therefore allows several methods for scaling torsion terms (Discover and CHARMm automatically handle torsions optimally, because of how their forcefields are parameterized):

Behavior in Cerius²·OFF

The usual treatment in Cerius²·OFF is to divide the sum of all the parameterized torsion terms around a common bond by the number of torsions around that bond (as is done in Discover).
An alternative method of scaling torsions is to use the energy of only the first torsion found about a bond. This method is not generally recommended, because the torsion term used (and, therefore, the torsion energy) depends on the order in which atoms are created.
Calculated energies for torsions that are exocyclic to aromatic rings (Figure 4), tend to be too high and may be scaled by an additional factor, usually 0.4.

Figure 4 . Torsion exocyclic to an aromatic ring

Inversion terms

The inversion, improper, or out-of-plane torsion term represents the energy involved in inverting a chiral center or otherwise changing this out-of-plane angle.

In Cerius²·OFF, you may use the first inversion term found or the average of all inversion energies, but the first approach is not recommended.

Nonbond functional form

In the Cerius²·Force Field Editor module, you can use the van der Waals Energy Terms control panels to choose among several nonbond functional forms. (These are listed in the online help, which is accessed by right-mouse clicking over the Function popup.) However, you would have to change the relevant parameters as well, if you wanted good results.

You can use the DSL (the FDiscover command language) set command to choose between the usual Lennard-Jones 6-12 or 6-9 potential (e.g., term 10 in Eq. 23) and a quartic form:

Eq. 26

The quartic form is useful when you need to eliminate bad van der Waals contacts, but the second derivatives are not calculated.

Hydrogen bonds and hydrogen-bond terms

Many forcefields, especially the newer ones, fully account for hydrogen bonds by other terms in the forcefield and so do not have or require specific terms for handling hydrogen bond interactions.

However, some older forcefields include specific terms for hydrogen bonding (e.g., older versions of AMBER). Others (e.g., CHARMm) allow you to use a hydrogen bond term if you want (but MSI does not recommend this). If you are using a forcefield with explicit hydrogen bond terms, you should read this section.

Lack of hydrogen bond terms is an asset

If specific hydrogen bonds are required, generation of a list of hydrogen bonds is a major step in evaluating the energy of a system. This process involves looking at all possible pairs of hydrogen bond donors and acceptors and selecting those that meet certain criteria (Figure 5):

The hydrogen bond length is less than a defined cutoff.
The deviation of D-H-A from linearity is less than a defined cutoff. Typically, the best hydrogen bond has a D-H-A angle of 180°.

Figure 5 . Distance and angle criteria for hydrogen bonds
A = hydrogen acceptor; D = hydrogen donor.

Since hydrogen bond interactions depend on both angle and distance, both angle cutoffs and distance cutoffs must be specified for a switching function (see Nonbond cutoffs). A switching, or spline, function (Figure 15) is needed to conserve energy by smoothing transitions over the cutoffs.

Specifying the criteria

In Cerius²·OFF, the hydrogen bond criteria can be changed by using the Hydrogen Bond Preferences control panel (accessed by selecting the Energy Terms/Hydrogen Bond menu item from the OPEN FORCE FIELD card). This control panel also allows specification of switching function ("spline") parameters.

In Discover, default hydrogen bond criteria are contained in the forcefield file (amber.frc). CDiscover allows you to use the BTCL forcefield scale command to scale hydrogen bond terms (if they exist). In FDiscover, they can be changed by editing the command input file to change the variables HBDIST and HBANGL.

In CHARMm, you can change the hydrogen bond criteria with the CHARMm Update Parameters dialog box, which is accessed from the CHARMm/Update Parameters menu item. This dialog box also allows specification of switching function parameters for hydrogen bonds. Setting the Update Frequency to 0 (the default) effectively omits the hydrogen bond term from the potential energy expression. You can also omit explicit hydrogen bond terms by using the CHARMm/Energy Terms menu item.

Bond-angle cross terms vs. Urey-Bradley terms

An alternative or supplement to bond-angle interactions is the Urey-Bradley term, which accounts for 1-3 interactions between two atoms that are bonded to a common atom.

In Cerius²·OFF, use the Energy Terms Selection control panel to specify whether to use the Urey-Bradley term (assuming it is available in the current forcefield).

In CHARMm, the Urey-Bradley term, if present, can be omitted from the energy expression (standalone only) or can be specified in the parameter file (ANGLE statement).

Applying constraints and restraints

Constraints and restraints allow you to focus the calculation on a region or conformation of interest and also to set up computational experiments. Such experiments are one of the primary uses of molecular modeling, allowing you control over a model at the atomic level. Several examples are described under When to use constraints/restraints.

Restraints vs. constraints

The seminal difference between a constraint and a restraint is that a constraint is an absolute restriction imposed on the calculation, while a restraint is an energetic bias that tends to force the calculation toward a certain restriction (even though many people use these terms as if they were interchangeable).

Table 9 . Constraints and restraints in MSI's simulation engines

constraint/restraint type engine¹ details
atom
fixed (constraints)
OFF ², FDiscover, CDiscover, CHARMm
here

template forcing
harmonic (Eq. 28) restraint
FDiscover
here

tethering and template forcing
quadratic (Eq. 29) restraint
CDiscover ³
here

tethering
harmonic (Eq. 28) restraint
FDiscover
here

tethering
mass-weighted harmonic (Eq. 30) restraint
CHARMm
here

quartic droplet
harmonic (Eq. 31) restraint
CHARMm
here

distance
harmonic (Eq. 32) restraint
OFF
here

distance
quadratic (Eq. 29), flat-bottomed (Eq. 34), or cosine (Eq. 36) restraint
CDiscover³
here

distance
harmonic (Eq. 32) or flat-bottomed (Eq. 33) restraint
FDiscover
here

distance
flat-bottomed (Eq. 35) restraint
CHARMm
here

dynamics
RATTLE algorithm (constraints)
CDiscover
here

dynamics
SHAKE algorithm (constraints)
CHARMm
here

dynamics
consensus dynamics (Eq. 28) (standalone only)
FDiscover, CDiscover
here

angle
harmonic (Eq. 37) restraint
OFF
here

angle
quadratic (Eq. 29), flat-bottomed (Eq. 34), or cosine (Eq. 36) restraint
CDiscover^c
here

torsion
harmonic (Eq. 38) restraint
OFF
here

torsion
quadratic (Eq. 29), flat-bottomed or J³ dihedral (Eq. 34), cosine (Eq. 36), cis (Eq. 39), trans (Eq. 40), or cis/trans (Eq. 41) restraint
CDiscover^c
here

torsion
flat-bottomed (Eq. 33) restraint (standalone only)
FDiscover
here

torsion
cosine (Eq. 42) or harmonic (Eq. 38) torque (one of these is standalone only)
FDiscover
here

torsion
harmonic (Eq. 38) restraint
CHARMm
here

inversion
harmonic (Eq. 43) restraint
OFF
here

chiral
flat-bottomed (Eq. 34) restraint
CDiscover³
here

out-of-plane
quadratic (Eq. 29), flat-bottomed or J³ dihedral (Eq. 34), or cosine (Eq. 36) restraint
CDiscover³
here

out-of-plane
harmonic (Eq. 43) restraint (standalone only)
FDiscover
here

¹ The standalone modes of running simulation engines may give access to additional constraints and restraints--please see the appropriate documentation.
² The Open Force Field module in Cerius².
³ Not available yet in the Cerius²·Discover module; restraints applied with CDiscover (in Insight and standalone modes) can also be scaled.

Table 9 . Constraints and restraints in MSI's simulation engines
constraint/restraint	type	engine¹	details
atom	fixed (constraints)	OFF ², FDiscover, CDiscover, CHARMm	here
template forcing	harmonic (Eq. 28) restraint	FDiscover	here
tethering and template forcing	quadratic (Eq. 29) restraint	CDiscover ³	here
tethering	harmonic (Eq. 28) restraint	FDiscover	here
tethering	mass-weighted harmonic (Eq. 30) restraint	CHARMm	here
quartic droplet	harmonic (Eq. 31) restraint	CHARMm	here
distance	harmonic (Eq. 32) restraint	OFF	here
distance	quadratic (Eq. 29), flat-bottomed (Eq. 34), or cosine (Eq. 36) restraint	CDiscover³	here
distance	harmonic (Eq. 32) or flat-bottomed (Eq. 33) restraint	FDiscover	here
distance	flat-bottomed (Eq. 35) restraint	CHARMm	here
dynamics	RATTLE algorithm (constraints)	CDiscover	here
dynamics	SHAKE algorithm (constraints)	CHARMm	here
dynamics	consensus dynamics (Eq. 28) (standalone only)	FDiscover, CDiscover	here
angle	harmonic (Eq. 37) restraint	OFF	here
angle	quadratic (Eq. 29), flat-bottomed (Eq. 34), or cosine (Eq. 36) restraint	CDiscover^c	here
torsion	harmonic (Eq. 38) restraint	OFF	here
torsion	quadratic (Eq. 29), flat-bottomed or J³ dihedral (Eq. 34), cosine (Eq. 36), cis (Eq. 39), trans (Eq. 40), or cis/trans (Eq. 41) restraint	CDiscover^c	here
torsion	flat-bottomed (Eq. 33) restraint (standalone only)	FDiscover	here
torsion	cosine (Eq. 42) or harmonic (Eq. 38) torque (one of these is standalone only)	FDiscover	here
torsion	harmonic (Eq. 38) restraint	CHARMm	here
inversion	harmonic (Eq. 43) restraint	OFF	here
chiral	flat-bottomed (Eq. 34) restraint	CDiscover³	here
out-of-plane	quadratic (Eq. 29), flat-bottomed or J³ dihedral (Eq. 34), or cosine (Eq. 36) restraint	CDiscover³	here
out-of-plane	harmonic (Eq. 43) restraint (standalone only)	FDiscover	here

Most restraints and constraints are available via the graphical UIFs:

Cerius²·OFF--Controls in the Restraints control panel, which is accessed from the Energy Terms/Restraints card menu item and in the Atom Constraints control panel, which is accessed from the Atom Constraints card menu item. The latter also allows you to color-code immovable atoms.
Cerius²·Minimizer--Controls in the Atom Constraints control panel, which is accessed from the Constraints/Atoms card menu item.
CDiscover--Controls in the Atom Constraints control panel of the Cerius²·Discover module and commands in the Specify pulldown of the Insight·Discover_3 module.
FDiscover--Commands in the Constraint pulldown of the Insight·Discover module.
QUANTA--Controls accessed by the CHARMm/Constraints Options and CHARMm/SHAKE Options menu items.

Discover and CHARMm offer additional restraint and constraint functionality when run in standalone mode.

When to use constraints/restraints

Constraints and restraints are often used to control and direct the minimization.

Fixed-atoms example

For example, you can fix some atoms in space, not allowing then to move. For example, part of the structure of a molecule may have been well solved experimentally, but the structures of other areas are less clear. Or you might want to keep parts of your model (e.g., solvent molecules) rigid to decrease computational costs.

Torsion-rotation example

You can add extra terms to the energy expression to restrain or bias the system in certain ways. For example, if you are investigating the adiabatic energy barrier to rotation about a bond, you would restrain the value of that torsion and minimize the structure. Repeating this procedure for a set of torsion values in the range 0°-360° yields a complete energy profile for rotation about the bond. A similar process is used to generate phi/psi maps and other multidimensional energy surfaces in studies of model conformation.

Docking example

If a substrate is being docked onto an enzyme and a specific hydrogen bond between the enzyme and the ligand is thought to be involved in binding, the donor and acceptor atoms can be pulled together to provide a docking coordinate. In this way, the results are not so dependent on the initial starting configuration, which may have been only a crude graphic alignment. In cases like this, the restraint is turned off at some point to make sure that the biased minimum is close to a true minimum.

Modeling incomplete models

Another example of the use of restraints is in modeling incomplete systems. Often, it is difficult or impossible to construct a realistic environment around parts of a model system. For example, only a partial structure of a large protein complex may be available, and some atoms must be restrained to stay near their initial crystal positions because they do not "feel" interactions with neighboring (missing) amino acids, membrane, or solvent. If the site of interest (for instance, a binding site for a competitive inhibitor) is well characterized but other parts of the enzyme are unknown or would require too much computation time if they were included, a limited study can still be carried out with the ends tethered to their crystal coordinates. Usually, these restraints are permanent parts of the model. The results of such calculations must be critically evaluated but can be valid if the ligand binding does not depend on interactions with missing pieces of the model or on conformational flexibility in the tethered regions.

Relaxing crystal structures

As a final example, tethering can be used to gently relax a crystal structure. Often, crystal coordinates, even if highly refined, have several strained interactions due to intrinsically disordered or poorly defined atomic positions, which, upon minimization, give rise to large initial forces. If these forces are not restrained, they can result in artifactual movement away from the original structure. The general approach is to progressively relax parts of the model in stages, starting with the least well determined atoms, until the entire system can minimize freely. The restraints are ultimately removed so that the final minimum represents an unperturbed conformation. It is usually not necessary to minimize to convergence at each stage--the object is to relax the most-strained parts of the system as quickly as possible without introducing artifacts.

Fixed atom constraints

Cost-saving

Fixed atoms are constrained to a given location in space; they cannot move at all. Fixed atoms reduce the expense of a calculation in two ways:

Terms in the energy expression involving only fixed atoms can be eliminated, because they merely add a constant to the total energy. Since the positions of fixed atoms cannot change, neither can the contribution of the terms that depend only on these positions. (Interactions between moving and fixed atoms are calculated.)
Fixing atoms reduces the number of degrees of freedom in the system, so minimizers converge in fewer steps and dynamics requires fewer steps to sweep out the available conformational space.

Important

The energy calculated by simulation engines is correct only to an arbitrary constant, depending on the model as well as the fixed atoms. Thus, only differences in energy between conformations of the same model having the same fixed atoms are meaningful.

Use atom constraints when you want to apply minimization or dynamics to part of a model, while keeping the remainder of the model fixed and rigid. For example, use atom constraints to quickly minimize a sorbate in a zeolite by fixing the atom positions of the zeolite frame and allowing only the sorbate atoms to move. Or fix all residues in a protein except for those in the active site.

Template forcing, tethering, quartic droplet restraints, and consensus conformations

Typical uses of these related types of restraints are to bias the conformation of one model towards that of another, to bias selected atoms towards their experimentally known positions, to restrain the core of a model while allowing its solvent-exposed constituents more freedom of movement, or to find an identical or close set of conformations that a group of related models can achieve.

Template forcing

To force the conformation of one model to be similar to that of a template model, a one-to-one correspondence between atoms in the template and in the moving structure is set up, and (for example) one of the following restraint terms is added to the energy expression:

The term in Eq. 27 is proportional to the root-mean-square (rms) deviation of the analog atoms from the template atoms. (This form cannot be used with the Newton-Raphson minimizer in FDiscover.) The values obtained for the energy and the rms function depend on the value of the forcing parameter K. Typical values for this constant are in the neighborhood of 5 kcal Å^-1. It is often instructive to look at the dependence of the energy and rms functions on the forcing parameter by making several determinations with different forcing parameters. If several runs (minimization or dynamics) are made, it may also be helpful to plot the energy as a function of the rms value. For tethered minimizations, a very large forcing constant (e.g., 2000.0 kcal Å^-1) is often used to prevent significant movement of any of the tethered atoms.

Eq. 28 represents a conceptually more straightforward restraint, with each atom restrained by an isotropic spring to the position of its template atom. In either form, the summation is over a list of pairs of atoms to restrain: one from the moving model, and one from the template model. FDiscover uses this quadratic form by default.

The K_i in Eq. 28 are determined by the distance of atom i from the atom defining the origin as:

where r_min is the distance at which the tethering turns on, k_min is the initial force constant at that distance, r_max is the distance where the force constant reaches its maximum allowed value, and k_maxis the maximum allowed force constant. If r_min and k_min are not given, the default values are zero. If r_max is zero, tethering uses a constant force constant of k_max.

In CDiscover, a simpler quadratic is used:

Eq. 29

where V is any appropriate internal (bond length, angle, etc.--the same functional form is used for several types of restraints).

Advantages of each type of template-forcing restraint

The first form (Eq. 27) gives the best rms fit for the least energetic cost, but individual atoms may remain quite far from their template position. The second form (Eq. 28) restrains each atom individually, so each atom is forced toward its template partner. The resulting rms fit is not as good as that from Eq. 27, but no one atom is allowed to deviate as much as is possible with Eq. 27. The form in Eq. 28 also allows for a different force constant for each pair, which means that different atoms or classes of atoms can be treated differently.

Tethering

Tethering is the same as template forcing, except that the atoms are restrained to their original positions rather than to positions in a template structure. Both Eq. 27 and quadratic forms are applicable for tethering; however, Eq. 28 is used by FDiscover and Eq. 29 by CDiscover, because tethering is usually used to keep atoms from moving too far from their original positions.

CHARMm allows mass-weighted tethering by calculating an additional energy term for all atoms that are to be restrained. This term has the form:

Eq. 30

Where E_cons is the constraint energy, k_i is the force constant, m_i is the mass of atom i (if mass weighing is used) or 1, r_i is the position of atom i, r₀ is the reference position about which the atom is to be centered, and n is an exponent.

Quartic droplet restraint

The quartic droplet restraint term in CHARMm is designed to put the entire model into a "cage" by constructing a restraining sphere around a model. The potential is scaled so that atom positions furthest from the center of mass or the geometric center of the model have the greatest restraining force applied.

The quartic droplet restraint term is based on the center of mass (COM) or the center of geometry of the model. No net force or torque is introduced by the center of mass term. The potential function is:

Eq. 31

FDiscover (standalone only) allows similar restraints within spherical shells.

Consensus dynamics

Consensus dynamics is used to find the consensus configuration of a set of analogs. In essence, all models in the set are treated as both moving molecules and templates.

Standalone FDiscover uses the harmonic template-forcing restraint (Eq. 28).

The database capability of CDiscover is used in settng up consensus dynamics calculations, using the restraint in Eq. 29.

General internal-coordinate restraints

In CHARMm, you can apply general internal coordinate restraints by applying restraints to all bonds, angles, and/or dihedral angles that have entries in an internal coordinates table. This facility is global, that is, not applicable to specific internal coordinates.

Distance and NOE restraints

Distance restraints are used to bias the distance between two atoms, bonded or not, toward a given value. Some uses are to cyclize linear models by bringing the ends closer together, dock different models, and fit distance data derived from NOE and other experiments.

Several functional forms for distance restraints: harmonic...

Several commonly used functional forms are supported.

One is a simple harmonic function, which in FDiscover has the form:

Eq. 32

where K is a force constant, R_ij is the current distance between the atoms, and R_targetis the target distance. A large force constant tends to force the distance to be close to the target distance; a smaller force constant results in a correspondingly smaller bias.

In CDiscover, the quadratic form is the same, except that scaling is enabled (Eq. 29).

In Cerius², the form is the same, except that K is multiplied by 0.5. R_target can be defined explicitly or automatically extracted from the model as the current distance between atoms.

...and flat-bottomed...

The second form is also harmonic, but it is separated into several piecewise continuous regions, resulting in a flat-bottomed potential (Figure 6). For FDiscover, the form is:

Eq. 33

Figure 6 . Distance restraint function
E as a function of R, the distance between two atoms or dihedral angles, defined as in Eq. 33.

For CDiscover, the flat-bottomed form is:

Eq. 34

where V is any appropriate internal (bond length, angle, etc.--the same functional form is used for several types of restraints).

The restraining potential used in CHARMm is:

Eq. 35

Where R_lim is the value of R where the force equals f_max.

...and cosine

CDiscover also allows a cosine form of restraint:

Eq. 36

where V is any appropriate internal (bond length, angle, etc.) and n is the periodicity.

Advantages of the flat-bottomed functional form

It is not necessary for the flat-bottomed potential (Figure 6) to be symmetric. By appropriate definition of the points R₁, R₂, etc., any of the regions may be eliminated. For Eq. 33, the important regions are those from R₁ to R₂ and from R₃ to R₄, where a harmonic potential is applied, and the flat bottom from R₂ to R₃.

This form of the restraint allows a range of acceptable distances and is particularly useful for incorporating experimental distance information, such as those from NOE experiments, into a calculation. The flat bottom allows for experimental error in the determined distance. The two outer regions (Figure 6) have a constant gradient, which is useful for avoiding unreasonably large forces if the initial structure is far from the target value.

Distance and angle constraints in dynamics simulations

The RATTLE and SHAKE algorithms effectively remove very-high-frequency vibrations from consideration during dynamics simulations. Use of these algorithms can allow for a larger time step during simulation.

In CDiscover, the BTCL rattle command is used before the dynamics command to set up constraints in bonds, angles, or water molecules in a molecular dynamics simulation. It can be used to constrain bonds or any atom pairs to user-defined distances. It can be used to constrain angles spanned by two constrained bonds. In addition, it can be used to fix the geometry of water molecules so that the fixed-geometry water models SPC and TIP3P can be used in a simulation. This functionality is available in a limited way via the Calculate/Dynamics parameter block in the Discover_3 module of Insight (click More to display the Rattle toggle).

In CHARMm, SHAKE is used to constrain bond lengths and angles spanned by two constrained bonds during dynamics runs. (However, its use is recommended only for constraining all bonds in which one of the bonded atoms is a hydrogen.) The SHAKE algorithm cannot be used with the Newton-Raphson or ABNR minimizers (see Minimization).

Angle restraints

In Cerius², an angle restraint can be applied to a group of any three atoms. The restraint is implemented such that:

Eq. 37

Where: K_a is the angle force constant;

is the angle between the selected atoms; and

₀ is the desired restrained angle of the selected atoms.

₀ can be defined explicitly or can be automatically extracted from the model as the current angle connecting selected atoms.

In CDiscover the default form of angle restraints is cosine (Eq. 36). Quadratic (Eq. 29) and flat-bottomed (Eq. 34) angle restraints can also be used.

Torsion restraints

Some uses of torsion restraints are to enforce chiral and prochiral centers, prevent cis-trans conversions, and fit NOE J-coupling constants from NMR experiments. Conversely, other uses are to force torsion rotation in order to perform phi/psi mapping, perform conformational searching, and induce conformational changes.

Functional forms

Several forms of torsion restraints are used in the literature and implemented in MSI's simulation engines.

Harmonic restraints, or periodic restraints (Eq. 42 with n = 1), are appropriate for forcing a torsion angle to a particular value. The periodic form with a periodicity greater than one is useful for restraining a torsion to one of several related angles. For instance, a threefold potential could keep a torsion either trans or at one of the two gauche conformations, depending on the starting conformation and the strength of the potential applied.

Implementation

In Cerius²·OFF, a torsion (dihedral) restraint can be defined among any group of four atoms. The restraint is implemented such that:

Eq. 38

Where K_t is the torsion force constant;

is the angle between the i-j-k and j-k-l planes; and

₀ is the desired restrained angle of the selected atoms, which can be defined explicitly or automatically extracted from the model as the current angle connecting selected atoms.

In CDiscover, you can specifically restrain dihedrals to be cis:

Eq. 39

or trans:

Eq. 40

or either cis or trans:

Eq. 41

You can also use the flat-bottomed function (Eq. 34) to apply J³ dihedral restraints to fit the results of NOE experiments. A plain cosine form (Eq. 36) and a quadratic form (Eq. 29) are also available. The torson involving any four atoms can be restrained.

In FDiscover, the functional forms include a simple harmonic form analogous to Eq. 32 and a piecewise continuous form like Eq. 33 with R interpreted as the angle, rather than the distance. Another form is the periodic function of Eq. 42:

Eq. 42

where V gives the strength of the restraint, n is an integer periodicity, and

₀ is the phase angle.

CHARMm uses a harmonic potential to restrict the motion of a dihedral angle to a value close to a reference position or to examine a series of different conformations when making potential energy maps.

Inversion, out-of-plane, and chiral restraints

Typical uses include prevention of changes in chirality or prochirality. (A molecule is chiral if no stable conformation of it can be superimposed on its mirror image--most chiral organic molecules can be described in terms of chiral centers, i.e., an atom that has four distinct substituents. Two chemically identical substituents on an otherwise chiral tetrahedral center are prochiral; in addition, sp² hybridised planar systems with three different substituents are considered prochiral.)

Implementation

In Cerius²·OFF, an inversion (improper torsion or out-of-plane angle) restraint can be defined among any four atoms i, j, k, l, where i defines the inversion center. The restraint is implemented such that:

Eq. 43

Where K_i is the force constant for the out-of-plane;

is the angle between the i-j-l and i-k-l planes; and

₀ is the desired restrained out-of-plane angle of the selected atoms, which can be defined explicitly or automatically extracted from the model as the current angle connecting selected atoms. There must be a real atomic center for the inversion.

The CDiscover program can impose a flat-bottomed chiral restraint (Eq. 34) to invert the chirality or force it to be R or S.

CDiscover can also impose a cosine (Eq. 36), quadratic (Eq. 29), or flat-bottomed (Eq. 34) out-of-plane restraint.

The FDiscover DSL language can be used to impose chirality and prochirality restraints having the same functional form as Eq. 43, where ₀ is the out-of-plane angle corresponding to R or S.

Plane and other geometrical constraints and restraints

The BTCL language of CDiscover allows sophisticated geometric manipulation of molecular and other objects, including constraints and restraints, by means of the geometry, molGeom, restraint and other commands. A subset of this functionality is accessible in the Calculate/Geometric parameter block in the Insight·Discover_3 module.

Modeling periodic systems

Periodic boundary conditions refers to the simulation of models consisting of a periodic lattice of identical subunits. By applying periodic boundaries to simulations, the influence, for example, of bulk solvent or crystalline environments can be included, thereby improving the rigor and realism of a model.

Table 10 . Periodic boundary methods in MSI's simulation engines

periodicity engine details
minimum image
OFF ¹, FDiscover, CHARMm
here

explicit image
FDiscover, CDiscover, CHARMm
here

crystal simulations
OFF, CDiscover, CHARMm
here

bonds across boundaries
OFF, CDiscover, CHARMm
here

¹ The Open Force Field module in Cerius².

Table 10 . Periodic boundary methods in MSI's simulation engines
periodicity	engine	details
minimum image	OFF ¹, FDiscover, CHARMm	here
explicit image	FDiscover, CDiscover, CHARMm	here
crystal simulations	OFF, CDiscover, CHARMm	here
bonds across boundaries	OFF, CDiscover, CHARMm	here

Most methods for controlling the treatment of periodic systems are available via the graphical UIFs:

CDiscover--The program detects whether a system is periodic (in Cerius²: fully automatic, depends only on which model is current; in Insight: semi-automatic, you need to execute the Setup/System parameter block) and displays the appropriate controls or parameters in the interface of the Cerius²·Discover or Insight·Discover_3 module.
FDiscover--You can choose the minimum-image or explicit-image convention in the Parameters/Variables parameter block of the Insight·Discover module. You have to specify whether a system is periodic by toggling PBC (periodic boundary conditions) in the Run/Run parameter block.
QUANTA--Use the CHARMm/Periodic Boundaries menu item to turn periodic boundary conditions on and off and to specify where to obtain this information.

Discover and CHARMm offer additional functionality when run in standalone mode.

Models are specified in Cartesian space

Some simulation engines accept only Cartesian coordinates, not crystal coordinates (others are able to convert between the two systems). This is important when using asymmetric space groups, since the symmetry operators assume that the input coordinates correspond to the standard asymmetric unit as defined in the International Tables for Crystallography (Reidl 1983).

For Discover, it is assumed that the x Cartesian axis corresponds to the a crystal axis and that the b axis lies in the x,y plane (see Figure 7)

Figure 7 . Relationship between Cartesian coordinate system (xyz) and periodic system (abc) in Discover and CHARMm

.

For Cerius²·OFF, by default the c lattice vector is parallel to the z Cartesian axis and the b lattice vector lies in the y,z plane (Figure 8).

Figure 8 . Relationship between Cartesian coordinate system (xyz) and periodic system (abc) in Cerius²·OFF

CHARMm can handle models that are defined in either crystal or Cartesian space. In converting from crystal to Cartesian axes, the a, b, c crystal axes are aligned with the x, y, z Cartesian axes (Figure 7).

Minimum-image model

Tip

For periodic systems in which nonbond interactions dominate, the Ewald sum method (Ewald sums for periodic systems) is preferred over the the minimum-image convention.

Simulation in bulk solvent

The left side of Figure 9 shows a solute molecule surrounded by enough solvent to occupy the volume (and shape) of a cube. A simulation carried out on this isolated cubic system is a poor approximation of what would happen in a true bulk solvent environment. For example, the solute can diffuse toward a surface or solvent molecules can evaporate. To remedy this, on the right of Figure 9 the cube is replicated in three dimensions to form a 3 X 3 X 3 lattice of identical cubes. This is a much better representation of bulk solvent for the interior cube, because molecules near the surfaces now interact with solvent in adjacent cubes. The imaged atoms are used to calculate energies and forces on the real atoms in the interior cube. The energies and forces on the imaged atoms themselves are not calculated because their motions are computed as symmetry operations on the real atoms, for example, by translations along the cubic axes.

Figure 9 . Solute surrounded by solvent
A solute surrounded by an isolated cube of solvent is replicated periodically in three dimensions in order to better represent a bulk or crystalline environment.

Implications of minimum-image model for calculating nonbond interactions

Consider the implications of this model for a specific case. In Figure 10, molecule A1 is located near an edge of the square. (For simplicity, this discussion focuses on a two-dimensional lattice.) In addition, eight images of A1 (A2-A9) are present in the adjacent symmetrically related squares. Consider the interactions of molecules A with molecules B. The closest image of B to A1 is actually not B1, but rather B5. If molecules in the interior cell are allowed to interact only with the molecule or molecular image closest to it, this is called a minimum-image model. Each molecule interacts only with those molecules and images within a distance of half the cell size. The advantage of this approach is its simplicity. It is straightforward to compute energy between a given pair of molecules without explicitly keeping track of the images in neighboring cells. All periodic boundary algorithms imply a cutoff criterion, but the minimum-image convention implies a maximum distance for this cutoff of no more than half the cell dimensions.

Figure 10 . Minimum-image model
Minimum-image model showing that each real molecule interacts with at most only one image of each real molecule.

For a description of the minimum-image convention, see also Allen and Tildesley (1987).

Explicit-image model

A more general approach--ghost molecules

Simulation engines (Discover and CHARMm) can also use a more general approach by generating explicit images of the interior molecules, also called ghost molecules, which interact with the interior molecules. These ghost molecules are replicated to as great a distance as necessary (but no farther than necessary) to satisfy the desired potential energy cutoff criteria.

The left side of Figure 11 shows molecule A1 interacting with several images of B (B1, B2, B3, B5) within the specified cutoff radius (shown as a shaded circle centered on A1). A1 interacts with several of its own images as well (A3, A5, A6, A8).

Figure 11 . Explicit-image model
Explicit-image model showing how a cutoff distance defines which molecules in adjacent unit cells are selected as ghost images. (Different cutoff distances are used in the left and right figures.) Left: explicit-image model--a larger cutoff including interactions with more images is possible than with the minimum-image convention; right: the shaded region identifies which molecules are selected as ghost images within the cutoff distance of any molecules in the unit cell.

Cutoff distances and nonbond interactions

The right side of Figure 11 shows which molecules in the adjacent unit cells become explicit ghost molecules for a given cutoff distance. Not every molecule in an adjacent cell becomes a ghost. However, if a cutoff distance that is longer than the cell length is used, ghosts from unit cells beyond the nearest neighbor cells may be included. As molecules (effectively, see below) move in and out of the boundaries, the molecules that are ghosts can change. Therefore, the ghost list is regenerated periodically.

Nonbond interactions do not have to be calculated between ghost atoms. This helps to significantly reduce computation time.

When group-based cutoffs (Charge groups and group-based cutoffs) are used, the nonbond potential is cut off on the basis of charge groups (i.e., only if two groups are within the cutoff is the interaction calculated), and only those groups in molecular ghosts that are within the cutoff distance of a real group are included in the ghost atom list.

How images and "real" molecules move

Ghost molecules follow their symmetrically related counterparts. However, when it comes time to move the molecules (in a dynamics step or minimization iteration), only the real molecules (A1 and B1) are actually moved according to the accumulated forces each molecule has felt. The ghost molecule positions are simply regenerated by applying the defined symmetry relations to the new positions of the molecules.

Perfect symmetry is maintained between the primary structure and all its image objects. For many applications, this condition is satisfactory. However, it is not possible to study, for example, cooperative changes between image objects.

To maintain all molecules in the central cell, image centering is used. Molecules that happen to migrate to an edge of the primary structure and would appear in one of its image objects instead reappear in the primary structure from the opposite direction. Thus a constant number of atoms is maintained and no molecules are lost, no matter how far they may diffuse during the calculation.

Crystal simulations

Energies of crystals can be calculated and the lattice parameters a, b, c,

, and

can be optimized with Cerius², CDiscover, and CHARMm:

In the Cerius²·OFF module, you can choose to optimize cell dimensions and angles in 2D or 3D periodic systems or to constrain some of these coordinates. From the MINIMIZER card (accessed from the OFF METHODS deck of cards), you can access cell constraints options with the Constraints/Cell menu item.

: Crystal simulations are also available in several Cerius²·OFF Instruments modules. For example, you can use the Crystal Packer module to optimize crystals or calculate their energy and can include minimization of periodic structures in a Mechanical Properties run.

In the Cerius²·Discover module, the Optimize Cell checkbox in the Discover Minimize control panel is automatically checked if the current model is periodic.
In the Insight·Discover_3 module, crystal optimization is requested by toggling Optimize_Cell in the Calculate/Minimize parameter block. Crystal optimization is also available within the Structure_Refine, Amorphous_Cell, and other Insight II modules.
In QUANTA, use the CHARMm/Periodic Boundaries menu item to turn periodic boundary conditions on and obtain crystal energies.

: Because crystal patching is not available in CHARMm, bonds between crystal images are not handled well. Similarly, hydrogen bond interactions described by an explicit hydrogen bond function cannot be used. The only forces that can be calculated between primary and image atoms in crystals are nonbond forces.

Bonds across boundaries

Allowing bonds (with additional energy terms including angles, dihedrals, and improper dihedrals) between the primary atoms and image atoms enables you to study polymers such as DNA or industrial polymers.

Cerius²·OFF, CDiscover, and CHARMm can handle bonds across cell boundaries. (However, CHARMm is best used only for linear polymers, since it does not handle 3D lattices or networks well.)

Handling nonbond interactions

Electrostatic (Coulombic) and van der Waals interactions together are referred to as nonbond interactions.

Nonbond terms can involve extensive calculation. To avoid a heavy calculation burden, some approximation scheme is often employed. Choosing the best method for your particular model can save computational expense without sacrificing accuracy.

In addition, you have some direct control over the functional terms for nonbond interactions:

You might be able to improve your simulation by changing the default combination rules for van der Waals interactions between non-identical atom types (OFF, Discover, Combination rules for van der Waals terms).
You can change the dielectric constant to account for nonaqueous solvents and/or solvent screening or make the dielectric "constant" a function of distance (OFF, Discover, CHARMm, The dielectric constant and the Coulombic term).

Table 11 . Nonbond methods in MSI's simulation engines

method type of system engine details
atom-based (single) cutoffs
periodic, nonperiodic
OFF ¹, FDiscover, CDiscover, CHARMm
here

group-based cutoffs
periodic, nonperiodic
FDiscover, FDiscover, CHARMm
here

double cuttoffs
periodic, nonperiodic
FDiscover
here

tail corrections
disordered periodic
CDiscover
here

cell-based cutoffs
periodic
CDiscover
here

cell multipole method
nonperiodic, periodic ²
CDiscover
here

Ewald sums
periodic
OFF, CDiscover
here

¹ The Open Force Field module in Cerius².
² Standalone only for periodic systems, cannot be used for constant-pressure or constant-stress dynamics.

Table 11 . Nonbond methods in MSI's simulation engines
method	type of system	engine	details
atom-based (single) cutoffs	periodic, nonperiodic	OFF ¹, FDiscover, CDiscover, CHARMm	here
group-based cutoffs	periodic, nonperiodic	FDiscover, FDiscover, CHARMm	here
double cuttoffs	periodic, nonperiodic	FDiscover	here
tail corrections	disordered periodic	CDiscover	here
cell-based cutoffs	periodic	CDiscover	here
cell multipole method	nonperiodic, periodic ²	CDiscover	here
Ewald sums	periodic	OFF, CDiscover	here

Most methods for specifying how to treat nonbond interactions are available via the graphical UIFs:

Cerius²·OFF--The Van Der Waals Preferences and Coulomb preferences control panels (accessed from the Energy Terms card menu item), and similar control panels in several OFF Instruments modules.
Cerius²·MMFF--The MMFF Nonbonded Preferences control panel.
CDiscover--Controls in the Discover Non-Bond control panel of the Cerius²·Discover module and commands in the Specify/Nonbonds parameter block of the Insight·Discover_3 module.
FDiscover--Commands in the Parameters pulldown of the Insight·Discover module.
QUANTA--Controls accessed through the CHARMm/Update Parameters menu item.

Discover and CHARMm also offer additional functionality when run in standalone mode.

You may use different methods for van der Waals and electrostatic interactions

Typically, both van der Waals and Coulombic interactions are calculated by the same method and (if by the nonbond cutoff method) with the same nonbond list. However, different methods and parameters may be used for van der Waals and Coulombic terms in CDiscover and (except for some operations) in Cerius²·OFF and CHARMm. This allows you, for instance, to use a large cutoff for electrostatic interactions and a smaller cutoff for van der Waals interactions.

The van der Waals interaction potential is relatively short range and dies out as 1 \xda r⁶. By 8-10 Å, the energy and forces are quite small. Thus, using cutoffs to bring the van der Waals potential to zero at about 10 Å can be a reasonable approximation. The Coulombic interactions, on the other hand, die off as 1 \xda r, so even at considerable distances the energy of interaction is not negligible. But this depends on the model: except for a few formally charged groups, most molecules are composed of neutral fragments with dipoles and quadrupoles. Thus, in most models the major component of the electrostatic interaction between molecules or parts of molecules is a dipole-dipole interaction, which falls off as 1 \xda r³.

To specify different methods for treating van der Waals and Coulombic interactions in the Cerius²·Discover module, select the Forcefield/Nonbond menu item in the DISCOVER card and check the Treat VDW and Coulomb Separately check box in the Discover Non-Bond control panel.
Cerius²·OFF does not allow you to select different cutoffs for Coulombic and van der Waals interactions if you are using a non-Ewald method for both. However, you may independently select an Ewald or non-Ewald method for either. If you do select Ewald for both, you can independently set the cutoff and convergence parameters for each.

Note

For models having 2D periodicity (e.g., built using the Cerius²·Surface Builder) the Ewald method is available for the Coulombic terms but not for the van der Waals terms.

To specify different methods for treating van der Waals and Coulombic interactions in the Insight·Discover_3 module, go the Specify/Nonbonds parameter block, click More and then set Define Cutoffs to Separate.
In QUANTA and in Cerius²·MMFF, you can use different switching functions for the van der Waals and electrostatic interactions.

Automatic exclusions

Van der Waals and Coulombic interactions are ordinarily calculated between all atom pairs that are not specifically excluded. Most forcefields exclude nonbond terms for atoms connected by bonds (1-2 interactions) and valence angles (1-3). Some forcefields also exclude nonbond terms between end atoms in torsion (1-4) interactions. These interactions are illustrated Figure 12.

Figure 12 , Types of interactions usually excluded from nonbond calculations

1-4 interactions and AMBER

If 1-4 nonbond interactions in torsions are included in the nonbond list, they may be scaled. For example, with the AMBER forcefield (as implemented in both Cerius²·OFF and Discover) these nonbond interactions must be scaled by 0.50.

Scaling by 0.5 occurs by default with Cerius² when AMBER is loaded.

In the Insight·Discover module, you need to toggle the p1_4 parameter in the Parameters/Scale_Terms parameter block on and enter 0.5 in the p14 parameter box,

The standalone version of the FDiscover program handles this scaling with the following DSL command:


>	scale 1-4 by 0.5

Important

This term is not set by default in Discover, even for the AMBER forcefield, so you must remember to set the p1_4 parameter either the first time that you run the Discover program from Insight when using AMBER or in your command input file for each standalone job that uses AMBER.

The equivalent BTCL command for the CDiscover program is forcefield scale with the vdw_1_4 keyword.

Combination rules for van der Waals terms

van der Waals radius combination rules

Any van der Waals interaction parameters that are actually defined for heterogenous atom pairs are called off-diagonal parameters. Off-diagonal parameters that are not available for such atom pairs are calculated by averaging those for each of the two atom types, using a geometric, arithmetic, or (in CDiscover and Cerius·OFF) 6th-power combination rule:

Eq. 44

Eq. 45

Eq. 46

In Discover and Cerius²·OFF, a choice of combination rule is available and is specified in the forcefield file (see the File Formats documentation).

Quality of results

The arithmetic mean gives marginally better equilibrium distances for van der Waals interactions than the geometric combination rule (Halgren 1992). The 6th-power rule (not available with all forcefields) yields even better results (Waldman and Hagler 1993).

van der Waals combination rules and Ewald sums

With the Ewald method (Karasawa & Goddard 1989) (Ewald sums for periodic systems), the geometric mean leads to faster convergence than the arithmetic mean.

In addition, because the Ewald sum calculation proceeds much faster when only diagonal parameters are used, the Cerius²·OFF Van Der Waals Preferences control panel includes an option to ignore off-diagonals even when they are present (they are not present in any of the Discover forcefields).

The dielectric constant and the Coulombic term

Role of the dielectric constant in modeling

The electrostatic potential is computed from the partial atomic charges associated with the model (Assigning charges). Approximate solvent-screening effects can be included by specifying a nondefault value for the dielectric constant if it is explicitly included in the forcefield. (The "dielectric constant" used in modeling is not the dielectric constant that most experimental chemists would think of--it is instead an empirical, dimensionless scaling factor.)

The dielectric constant reflects the polarizability of the solvent molecules. A polarizable solvent such as water has a greater dielectric constant than less polar liquids. Electrostatic interactions in polarizable solvents with high dielectric constants are greatly attenuated. In closely packed molecules, however, there are fewer solvent molecules to screen the charge interactions.

A relatively large dielectric constant can be used for simulating the aqueous environment of small systems. However, many calculations on models use a smaller dielectric constant. For example, a dielectric constant between 2.0 and 10.0 has been used for simulations in the interior of a protein. A typical value for water would be around 4.

Additional information

For a helpful review, please see Harvey (1989). A tutorial on dielectric constants in forcefields can be found at MSI's website:

http://www.msi.com/support/insight/insight/dielectric.html

A distance-dependent dielectric "constant"

The dielectric constant can be kept constant, or the Coulombic term can be made a shielded function, where the dielectric "constant" is a function of distance (r ·). This is useful for electrostatic interactions in closely packed molecules, where the number of solvent molecules between two interacting charges is usually fewer than in bulk solvent. A distance-dependent dielectric constant is also useful for models in which explicit solvent molecules are not included.

The distance-dependent dielectric function (also called a shielded dielectric) is generally used with the Dreiding and AMBER forcefields and may be used with others.

A shielded Coulombic term is faster to calculate than a non-shielded term because no square root has to be evaluated.

Important

A distance-dependent dielectric constant cannot be used on a periodic model with the Ewald sum method (Ewald sums for periodic systems).

The dielectric constant can be changed and/or made distance dependent in any of MSI's simulation engines.

In Cerius²·FFE, the Coulombic control panel allows you to choose the form of the Coulombic term--the term can be distance-dependent, not distance-dependent, or corrected by an erfc term (see Glass forcefield).

Special considerations for the AMBER forcefield

With the AMBER forcefield, in most applications a distance-dependent dielectric ( = f (r)) should be used.

In Cerius²·OFF, the Epsilon value is 1.0 unless you change it in the Coulomb Preferences control panel (which is accessed by selecting the Energy Terms/Coulomb card menu item).

In the Insight·Discover_3 module, in the Specify/Nonbonds parameter block you would click More, then set Dielectric to Dist_Dependent and enter 4.0 for the Dielectric Value.

The equivalent BTCL command for the CDiscover program is forcefield with the distance_dep keyword set to true and dielect set to 4.0.

In the Insight·Discover module, in the Parameters/Set parameter block you would enter 4.0 in the Dielectric parameter box and make sure the Dist_Dependent parameter is toggled on.

The standalone version of the FDiscover program handles this with the following DSL command:


>	set dielectric = 4.0*r

Nonbond cutoffs