ForestClaw / forestclaw

Quadtree/octree adaptive PDE solver based on p4est.
http://www.forestclaw.org
BSD 2-Clause "Simplified" License

CudaClaw: Add timers to time CUDA operations #132

Closed donnaaboise closed 5 years ago

donnaaboise commented 6 years ago

We should add performance timers to time CUDA memory allocation and patch updates.

See fclaw_timer.h and fclaw_timer.c for templates on how to add timers.

I suggest the following timers: FCLAW2D_TIMER_CUDA_ALLOCATION, FCLAW2D_TIMER_CUDA_KERNEL1, FCLAW2D_TIMER_CUDA_KERNEL2, and FCLAW2D_TIMER_CUDA_KERNEL3.

donnaaboise commented 6 years ago

I've added the timers, but they still need to be started and stopped from the cudaclaw5 code. This will require updating the argument lists of step2_wrap and step2.
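For illustration, here is a minimal, standalone sketch of what starting and stopping a timer inside an update routine might look like once the timers are passed in through the argument list. It does not use the actual fclaw_timer.h API: everything prefixed `sketch_` (and the `SKETCH_TIMER_COUNT` sentinel) is hypothetical, `sketch_step2` is only a stand-in for the real step2, and the enum simply reuses the timer names suggested above.

```c
#include <stddef.h>
#include <time.h>

/* Timer indices named after the suggestion in this issue.
   SKETCH_TIMER_COUNT and all sketch_* names are hypothetical. */
typedef enum
{
    FCLAW2D_TIMER_CUDA_ALLOCATION = 0,
    FCLAW2D_TIMER_CUDA_KERNEL1,
    FCLAW2D_TIMER_CUDA_KERNEL2,
    FCLAW2D_TIMER_CUDA_KERNEL3,
    SKETCH_TIMER_COUNT
} sketch_timer_name_t;

typedef struct
{
    int running;        /* 1 between a start and the matching stop */
    double started;     /* wall-clock time of the last start, in seconds */
    double cumulative;  /* total seconds over all start/stop pairs */
} sketch_timer_t;

/* Wall-clock time in seconds (C11 timespec_get). */
double sketch_wtime(void)
{
    struct timespec ts;
    timespec_get(&ts, TIME_UTC);
    return (double) ts.tv_sec + 1e-9 * (double) ts.tv_nsec;
}

void sketch_timer_init(sketch_timer_t *t)
{
    t->running = 0;
    t->started = 0.0;
    t->cumulative = 0.0;
}

void sketch_timer_start(sketch_timer_t *t)
{
    if (!t->running)
    {
        t->started = sketch_wtime();
        t->running = 1;
    }
}

void sketch_timer_stop(sketch_timer_t *t)
{
    if (t->running)
    {
        t->cumulative += sketch_wtime() - t->started;
        t->running = 0;
    }
}

/* Hypothetical stand-in for a patch update. The timer array is the
   extra argument; the loop is a placeholder for the kernel launch. */
double sketch_step2(sketch_timer_t timers[], const double *q, size_t n)
{
    double sum = 0.0;
    sketch_timer_start(&timers[FCLAW2D_TIMER_CUDA_KERNEL1]);
    for (size_t i = 0; i < n; i++)
    {
        sum += q[i];
    }
    sketch_timer_stop(&timers[FCLAW2D_TIMER_CUDA_KERNEL1]);
    return sum;
}
```

One caveat for the real CUDA code: kernel launches return to the host before the kernel finishes, so a host-side timer like this only measures the kernel if the code synchronizes (for example with `cudaStreamSynchronize` or `cudaDeviceSynchronize`) before the stop call. Timing with CUDA events (`cudaEventRecord` and `cudaEventElapsedTime`) is an alternative.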

donnaaboise commented 5 years ago

@MelodyShih added cudaStreams, which are needed for asynchronous calls.