matthewcarbone / Crescendo

A greatly abstracted machine learning suite built for ease of use, powered by Hydra + Lightning
BSD 3-Clause "New" or "Revised" License
5 stars 1 forks source link

Create FEFF and VASP XANES models #58

Open matthewcarbone opened 11 months ago

matthewcarbone commented 11 months ago

FEFF/VASP transition metal models

Using the databases I will create, we want to train the following models (note specifically, when we say "X-O", we mean a REST query to the Materials Project v2 API in which we make queries like ["X-O", "X-O-*", "X-O-*-*"]).

FEFF

VASP

Transfer-learned FEFF-to-VASP models

Tasks

For each model, there will be two different train/validation/testing splits:

For each of these, and each model, we will do the following:

  1. Train and hyper parameter tune via Optuna (Crescendo has this functionality) to find the best model architecture and hyper-parameters.
  2. Evaluate on the testing set and analyze.
  3. Train a "production model" on all of the data (Crescendo also has this functionality).
  4. Put this production model into the model zoo (crescendo/extern/m3gnet/zoo).
  5. [Optional, not necessary for a paper] Train an ensemble model so we can accurately quantify uncertainty.

We also want to demonstrate the superiority of the transfer-learned FEFF-to-VASP models over the pure VASP models.

matthewcarbone commented 11 months ago

A quick update: we should also probably include model cards akin to Facebook's Llama model. Here's that example. At the very least, we should have the following fields: