Closed. KnightOfTheMoonlight closed this issue 4 years ago.
I just found that on a Tesla V100 the cupy library runs much slower than it should. When I run the same scripts on a Titan Xp or a Tesla P100, the running time is much more reasonable: only about 10 s.
Hi @hszhao, thanks for this great work.
I have tested the aggregation and subtraction scripts in the folder /lib/sa/functions/ with my setup as follows:
I find it takes around 10 minutes to finish. Here is the log:
Is this about the same time it takes for you? The input blocks are really small ([2, 8, 5, 5]), so it seems odd for the run to take 10 minutes. If anything else about my setup needs to be clarified, please let me know.
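One way to narrow this down is to time the first call separately from later calls. CuPy compiles kernels on first use and caches them, so a long first call followed by fast repeats would point at compilation rather than the computation itself. Whether that is what happens on the V100 here is only an assumption. The sketch below is not the repository's code: `time_first_and_steady` and `demo_with_cupy` are hypothetical helper names, and the `(x * 2).sum()` workload is a stand-in for the aggregation and subtraction functions.

```python
import time


def time_first_and_steady(fn, sync=lambda: None, repeats=5):
    """Return (first_call_seconds, mean_seconds_of_later_calls) for fn().

    sync is called before reading the clock so that asynchronous GPU
    work is finished; the default is a no-op for plain CPU callables.
    """
    sync()
    start = time.perf_counter()
    fn()  # first call: may include one-off kernel compilation
    sync()
    first = time.perf_counter() - start

    start = time.perf_counter()
    for _ in range(repeats):
        fn()  # later calls: should reuse any cached kernels
    sync()
    steady = (time.perf_counter() - start) / repeats
    return first, steady


def demo_with_cupy():
    """Hypothetical demo; needs CuPy and a CUDA GPU, so it is not run on import."""
    import cupy as cp

    # Same shape as the input blocks mentioned above; the workload is a stand-in.
    x = cp.ones((2, 8, 5, 5), dtype=cp.float32)
    sync = cp.cuda.Stream.null.synchronize
    first, steady = time_first_and_steady(lambda: (x * 2).sum(), sync)
    print("compute capability:", cp.cuda.Device().compute_capability)
    print(f"first call: {first:.3f} s, later calls: {steady:.6f} s each")
```

Calling `demo_with_cupy()` on each GPU and comparing the two numbers should show whether the extra time is spent only on the first call. Replacing the stand-in lambda with a call to the actual script function would give a more relevant measurement.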