Open godotalgorithm opened 2 years ago
Hi,
thanks for explaining this change clearly. One side-effect is that it causes a few issues when using a recent Conda build with the ASE MOPAC interface https://wiki.fysik.dtu.dk/ase/ase/calculators/mopac.html#module-ase.calculators.mopac
1SCF GRADIENTS
) calculation returns forces but no potential energy. This is surprising to e.g. optimisers and dynamics routines that expect to track the energy.So it looks like the ASE parser can be updated in a backward-compatible way. But I would like to ask if it seems reasonable from MOPAC's perspective for ASE to keep using the TOTAL ENERGY value. This will be used within optimisers, dynamics routines etc. Would you recommend, for example, that the dispersion energy is added to the total for this purpose? Is it likely to make any practical difference if these methods switch to using the FINAL HEAT OF FORMATION as the reported "potential energy"? Is there a structure-dependent component of ENERGY OF ATOMS or are they unchanging free-atom values?
Short Version: Versions of MOPAC prior to 2022 reported a
TOTAL ENERGY
,ELECTRONIC ENERGY
, andCORE-CORE REPULSION
in the output file. These outputs are now suppressed by default, but they are still accessible by using theDISP
keyword. It is also important to note that the geometric derivatives (e.g. forces & vibrations) computed by MOPAC are with respect to the heat of formation, not thisTOTAL ENERGY
.Medium Version: These outputs have been suppressed because they were observed to be a common source of confusion among MOPAC users. MOPAC's semiempirical models are designed to reproduce heats of formation, and their primary energetic output is a predicted heat of formation. The additional outputs of
TOTAL ENERGY
,ELECTRONIC ENERGY
, andCORE-CORE REPULSION
represent an arbitrary decomposition of the heat of formation that do not have any independent physical meaning and notably lack any dispersion corrections that have been introduced in recent models. TheDISP
keyword clearly reports the decomposition of the heat of formation into these terms and a dispersion correction, when applicable. This is in contrast with conventional quantum chemistry calculations that typically report a physically meaningful "total energy" and "total electronic energy".Long Version: Semiempirical thermochemistry and its implementation in MOPAC have long focused on directly modeling molecular heats of formation, rather than modeling the total energy of a molecule and relying on more expensive vibrational calculations to produce the vibrational and entropic contributions to a predicted heat of formation. This decision was highly beneficial at a time when computational resources were limited, as it enabled a direct connection to available reference data (i.e. experimental heats of formation) for fitting models and avoided the need to perform expensive vibrational calculations in the process of predicting heats of formation, which were of direct interest to experimentalists using MOPAC. The downside of this decision was that a minimal-basis Hartree-Fock model doesn't contain any terms that looked like a non-trivial vibrational contribution (a model limitation) and formal access to a purely electronic energy requires a cumbersome and expensive removal of an explicit vibrational contribution (a computational limitation). In practice, these downsides are not very relevant because the vibrational/entropic contributions that differentiate a heat of formation from a formation energy can be approximated as conformation-independent atomic corrections. This approximation is roughly comparable to the underlying accuracy of MOPAC's models, and it is a sensible, practical way to repurpose MOPAC's outputs as pure energies rather than heats (e.g. if MOPAC is used to drive molecular dynamics). As the accuracy of semiempirical models continues to improve, it will eventually be important to be more mindful and careful about the distinction between modeling formation energies and heats of formation.