Cytnx-dev / Cytnx

Project Cytnx, A Cross-section of Python & C++,Tensor network library
Apache License 2.0
35 stars 13 forks source link

GPU Det and Directsum are ridiculously slow #485

Open jeffry1829 opened 3 weeks ago

jeffry1829 commented 3 weeks ago

GPU Det and Directsum are ridiculously slow

Det uses cusolver ?getrf

Currently not sure whether this only happens to these two methods

IvanaGyro commented 2 weeks ago

What did you compare them to, the CPU version? How large is the input tensor? For inspecting the reason, myebe the NVDIA profiler can help.

yingjerkao commented 2 weeks ago

GPU Det and Directsum are ridiculously slow

Det uses cusolver ?getrf

Currently not sure whether this only happens to these two methods

Are you benchmarking against CPU version? Or old magma version?