Cytnx-dev / Cytnx

Project Cytnx, A Cross-section of Python & C++,Tensor network library
Apache License 2.0
35 stars 14 forks source link

GPU Det and Directsum are ridiculously slow #485

Open jeffry1829 opened 1 month ago

jeffry1829 commented 1 month ago

GPU Det and Directsum are ridiculously slow

Det uses cusolver ?getrf

Currently not sure whether this only happens to these two methods

IvanaGyro commented 1 month ago

What did you compare them to, the CPU version? How large is the input tensor? For inspecting the reason, myebe the NVDIA profiler can help.

yingjerkao commented 1 month ago

GPU Det and Directsum are ridiculously slow

Det uses cusolver ?getrf

Currently not sure whether this only happens to these two methods

Are you benchmarking against CPU version? Or old magma version?

yingjerkao commented 1 week ago

I believe this issue was due to the fact that our DGX II has been hacked. Should perform the benchmark on some other machines.