Open GoogleCodeExporter opened 8 years ago
I just ran on my Ubuntu 11.10 system. It reminded me about what you probably
thought was your infinite loop: on linux, there is significant startup time for
driver kernel module, which gets loaded the first time you run some GPU code,
and unloaded automatically after a certain delay (not sure how long). There
are a couple of ways to eliminate most of this startup delay:
1. Open a new terminal (or tab) and run "nvidia-smi -l 15" to re-run nvidia-smi
every 15 seconds, which forces the driver kernel module to reload each time it
runs.
2. Run nvidia-smi in persistence mode. You need to run as root, so "sudo
nvidia-smi -pm 1". This will keep it loaded.
When I do this, simpleCUDPP runs immediately rather than waiting several
seconds (on my K20) to over a minute (on my GTX 680 -- that sounds excessive,
possibly a driver bug?).
But simpleCUDPP still shows over 700 ms of overhead (the kernels and memcopies
take < 100 us). So I will investigate a bit more.
Original comment by harr...@gmail.com
on 11 Dec 2012 at 2:06
Original issue reported on code.google.com by
nicolas....@gmail.com
on 10 Dec 2012 at 6:46