deepmodeling / abacus-develop

An electronic structure package based on either plane wave basis or numerical atomic orbitals.
http://abacus.ustc.edu.cn
GNU Lesser General Public License v3.0
169 stars 129 forks source link

A Segmentation fault when calculating the vacancy energy of 256 iron atoms #4989

Open ieiue opened 1 month ago

ieiue commented 1 month ago

Describe the bug

When calculating the vacancy energy of 256 (actually 255) iron atoms by using 64 cpus and 2 processors with ABACUS(v3.7.3) , there is a Segmentation fault. And I tried 28 cpus and 4 processors, it can run. (1)64 cpus & 2 processors : image (2)28 cpus & 4 processors: image

Expected behavior

What's the relationship between calculation success and the number of cpus and the number of processes.

To Reproduce

Fe256_vacancy.zip

Environment

No response

Additional Context

No response

Task list for Issue attackers (only for developers)

xdzhu commented 1 month ago

I think there must be something went wrong in 3.7.x version, recently i also met problems but SIGNAL 9 AND 6. I suggest you to try version 3.6.5 in https://github.com/deepmodeling/abacus-develop/releases/tag/v3.6.5

Reach me if it works.

ieiue commented 1 month ago

I tired v.3.6.4 before. There was the same Segmentation. image @xdzhu

xdzhu commented 1 month ago

Is it OK for 1 mpi case?

ieiue commented 1 month ago

For 1 processor, it still has a Segmentation fault. image image

maki49 commented 1 month ago

There seems to be some bug with ELPA, could you please try ks_solver scalapack_gvx?

ieiue commented 1 month ago

Setting ks_solver= scalapack_gvx doesn't help. image

xdzhu commented 1 month ago

Can you try the latest version 3.7.5 to check whether it's solved or not?

xdzhu commented 1 month ago

with elpa