Excessive padding used in `eri_primitives`

Note. For water STO-3G test-case LMAX is 1 instead of 4. This means the sizes of Ci Cj Ck become (4*4+1)^3=3375 instead of (1+1)^3=8.

As we discussed, I think there's a way to circumvent padding to L_MAX in Jax without resorting to C++.

Problem: Different primitives have different L (in our case L=0 for hydrogen and L=1 for oxygen). The resulting (Ci, Cj, Ck) have shapes 1 for L=0 and 3 for L=3. The output of the broadcast Ci Cj Ck can then take shapes (1,1,1), (1,1,3), (1,3,1), (3,1,1), (3,3,1), (3,1,3), ..., (3,3,3).

Current solution: Pad everything L=4. This works but increases memory/compute/?trace time? 400x.

Other solution: Batch together calls with the same shape. Example: do the (1,1,1) calls together, do the (1,1,3), (1,3,1) and (3,1,1) calls together, and so on. For inspiration, this is done here in ~50 lines of Jax.. The (counts,sizes) looks like [(13271, 1), (32711, 3), (57121, 9), ...] which correspond to the cases (1,1,1,1) then (1,1,3,1) and (1,3,1,1) and so on.

@awf Happy to clarify in person. TLDR: Looks like for this case we should be able to get performant Jax code.

graphcore-research / pyscf-ipu

Excessive padding used in `eri_primitives` #117