choderalab / modelforge

Infrastructure to implement and train NNPs
https://modelforge.readthedocs.io/en/latest/
MIT License

Wrap PyTorch model in JAX/Flax #101

Closed · wiederm closed this 2 months ago

wiederm commented 2 months ago

Description

We want to use the trained models in different production environments and backends. A first step in this direction is to wrap the trained PyTorch model in a JAX function, using dlpack for zero-copy mapping between JAX and PyTorch tensors on the device in use. This makes a PyTorch model usable with JAX's autograd: forward calls run in PyTorch, and backward calls are driven from JAX.
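For illustration, here is a minimal sketch of the zero-copy exchange via dlpack; the helper names are placeholders rather than modelforge API, and the exact dlpack entry points may differ slightly between JAX/PyTorch versions:

```python
# Sketch: zero-copy tensor exchange between JAX and PyTorch via dlpack.
# Helper names (`jax_to_torch`, `torch_to_jax`) are placeholders.
import jax
import torch
import torch.utils.dlpack


def jax_to_torch(x):
    """Zero-copy JAX array -> PyTorch tensor on the same device."""
    return torch.utils.dlpack.from_dlpack(jax.dlpack.to_dlpack(x))


def torch_to_jax(t):
    """Zero-copy PyTorch tensor -> JAX array on the same device."""
    return jax.dlpack.from_dlpack(torch.utils.dlpack.to_dlpack(t.contiguous()))
```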

In general, we have two scenarios for the PyTorch-to-JAX conversion:

  1. Wrap a trained model in a Flax model. In this scenario, the modelforge pairlist calculation is used, and the input signature for the wrapped model is ($\vec{R}$, $Z$, $Q$).
  2. Convert a trained model to a Flax model. This scenario will use the chiron pairlist, and the input signature for the converted model is ($\vec{r}_{ij}$, $d_{ij}$, $Z$, $Q$).

This PR will implement scenario (1). This is analogous to using a pre-trained model provided by a third party (e.g., the MACE model), which takes derivatives with respect to coordinates and therefore uses coordinates as input, i.e., the input signature ($\vec{R}$, $Z$, $Q$).
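A minimal sketch of how scenario (1) could look, building on the dlpack helpers above and `jax.custom_vjp`; `torch_model` and `wrap_in_jax` are placeholders, not the PR's actual code. To keep the sketch short, $Z$ and $Q$ are closed over rather than passed per call, since gradients are only needed with respect to $\vec{R}$:

```python
# Sketch (assumed names, not modelforge API): wrap a trained PyTorch model
# so JAX can differentiate the energy with respect to the coordinates R.
import jax
import torch


def wrap_in_jax(torch_model, atomic_numbers, total_charge):
    """Return a JAX-differentiable energy function E(R) backed by `torch_model`."""
    # Z and Q carry no gradients, so convert them once and close over them.
    Z = jax_to_torch(atomic_numbers)
    Q = jax_to_torch(total_charge)

    @jax.custom_vjp
    def energy(positions):
        with torch.no_grad():
            E = torch_model(jax_to_torch(positions), Z, Q)  # forward runs in PyTorch
        return torch_to_jax(E)

    def energy_fwd(positions):
        R = jax_to_torch(positions).requires_grad_(True)
        E = torch_model(R, Z, Q)
        # Residuals keep the PyTorch graph alive for the backward pass.
        return torch_to_jax(E.detach()), (R, E)

    def energy_bwd(residuals, cotangent):
        R, E = residuals
        # PyTorch evaluates dE/dR; the result is handed back to JAX's autograd.
        (dE_dR,) = torch.autograd.grad(E, R, grad_outputs=jax_to_torch(cotangent))
        return (torch_to_jax(dE_dR),)

    energy.defvjp(energy_fwd, energy_bwd)
    return energy
```

With such a wrapper, forces could be obtained on the JAX side as `-jax.grad(energy_fn)(R)` while the model itself keeps running in PyTorch. One caveat of this pattern: the wrapped function cannot be traced by `jax.jit`, since the primal call leaves JAX and executes eagerly in PyTorch.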

Todos

Status

wiederm commented 2 months ago

Thanks for the review @chrisiacovella! Since we have restructured the models into a core class that we want to export, I will need to refactor this PR a bit. I will ping you as soon as it is ready.