This variable is overwritten from multiple threads without any protection. Each thread should have separate copy of this variable. As we discussed about cuda streams, introduce a per-thread structure holding all data which are private to a single thread.
This variable is overwritten from multiple threads without any protection. Each thread should have separate copy of this variable. As we discussed about cuda streams, introduce a per-thread structure holding all data which are private to a single thread.