utcs-scea / altis

A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarking suites which are either insufficient or outdated.
https://utcs-scea.github.io/altis/
BSD 2-Clause "Simplified" License
37 stars 14 forks source link

Add missing synchronize #22

Closed Chr1s603 closed 1 year ago

Chr1s603 commented 1 year ago

Allows measuring the actual computation time of the kernels

Chr1s603 commented 1 year ago

Hey there, without this change I measure only 5ms for size 3 on my RTX 2080. With the synchronization, it's around 500ms, so this is definetely missing here.

BDHU commented 1 year ago

Thanks! I think it will be better to use CUDA event to measure latency in this case. I will go ahead and update them.