Closed Obs01ete closed 7 months ago
Certain modules of GPU4PySCF are still running on CPU, such as mf.get_hcore(), since it is not very expensive. More modules are moved into GPU in the recent release. The current release is gpu4pyscf-0.5. Please notice that the patch-mechanism may not be fully supported. You can explicitly import the module instead. Here is the example. https://github.com/pyscf/gpu4pyscf/blob/master/examples/01-h2o_dftd3.py
@wxj6000 Thank you for clarifying the state of the library. I'd be glad to see get_hcore() sped up in the future release of GPU4PySCF. For bigger molecules or crystal cells get_hcore runtime can go up to 10s of seconds. If course it is way faster than the DFT compute but it would be great to make use of my GPU.
Thank you for the feedback. I will add this to the future development.
Sweet, sweet, that HF iterations now run on GPU nicely. Here is my benchmarking:
CPU 48 cores: 40 s/molecule
GPU V100: 15 s/molecule
GPU A100: 15 s/molecule
@Obs01ete Glad to hear that! Do A100 and V100 have the similar performance in your benchmark?
@Obs01ete Glad to hear that! Do A100 and V100 have the similar performance in your benchmark?
Yes. I am not sure why, though.
I run the GPU-patched version of pyscf to compute the hamiltonian.
This code does not load my V100 unlike the example in README.md. What do I do to speed up this code with GPU?