openmopac / mopac

Molecular Orbital PACkage
http://openmopac.net
GNU Lesser General Public License v3.0
124 stars 32 forks source link

Missing TOTAL ENERGY, ELECTRONIC ENERGY, and CORE-CORE REPULSION #66

Open godotalgorithm opened 2 years ago

godotalgorithm commented 2 years ago

Short Version: Versions of MOPAC prior to 2022 reported a TOTAL ENERGY, ELECTRONIC ENERGY, and CORE-CORE REPULSION in the output file. These outputs are now suppressed by default, but they are still accessible by using the DISP keyword. It is also important to note that the geometric derivatives (e.g. forces & vibrations) computed by MOPAC are with respect to the heat of formation, not this TOTAL ENERGY.

Medium Version: These outputs have been suppressed because they were observed to be a common source of confusion among MOPAC users. MOPAC's semiempirical models are designed to reproduce heats of formation, and their primary energetic output is a predicted heat of formation. The additional outputs of TOTAL ENERGY, ELECTRONIC ENERGY, and CORE-CORE REPULSION represent an arbitrary decomposition of the heat of formation that do not have any independent physical meaning and notably lack any dispersion corrections that have been introduced in recent models. The DISP keyword clearly reports the decomposition of the heat of formation into these terms and a dispersion correction, when applicable. This is in contrast with conventional quantum chemistry calculations that typically report a physically meaningful "total energy" and "total electronic energy".

Long Version: Semiempirical thermochemistry and its implementation in MOPAC have long focused on directly modeling molecular heats of formation, rather than modeling the total energy of a molecule and relying on more expensive vibrational calculations to produce the vibrational and entropic contributions to a predicted heat of formation. This decision was highly beneficial at a time when computational resources were limited, as it enabled a direct connection to available reference data (i.e. experimental heats of formation) for fitting models and avoided the need to perform expensive vibrational calculations in the process of predicting heats of formation, which were of direct interest to experimentalists using MOPAC. The downside of this decision was that a minimal-basis Hartree-Fock model doesn't contain any terms that looked like a non-trivial vibrational contribution (a model limitation) and formal access to a purely electronic energy requires a cumbersome and expensive removal of an explicit vibrational contribution (a computational limitation). In practice, these downsides are not very relevant because the vibrational/entropic contributions that differentiate a heat of formation from a formation energy can be approximated as conformation-independent atomic corrections. This approximation is roughly comparable to the underlying accuracy of MOPAC's models, and it is a sensible, practical way to repurpose MOPAC's outputs as pure energies rather than heats (e.g. if MOPAC is used to drive molecular dynamics). As the accuracy of semiempirical models continues to improve, it will eventually be important to be more mindful and careful about the distinction between modeling formation energies and heats of formation.

ajjackson commented 2 years ago

Hi,

thanks for explaining this change clearly. One side-effect is that it causes a few issues when using a recent Conda build with the ASE MOPAC interface https://wiki.fysik.dtu.dk/ase/ase/calculators/mopac.html#module-ase.calculators.mopac

So it looks like the ASE parser can be updated in a backward-compatible way. But I would like to ask if it seems reasonable from MOPAC's perspective for ASE to keep using the TOTAL ENERGY value. This will be used within optimisers, dynamics routines etc. Would you recommend, for example, that the dispersion energy is added to the total for this purpose? Is it likely to make any practical difference if these methods switch to using the FINAL HEAT OF FORMATION as the reported "potential energy"? Is there a structure-dependent component of ENERGY OF ATOMS or are they unchanging free-atom values?