josephjaspers / blackcat_tensors

Matrix-vector library designed for neural network construction. CUDA (GPU) support, OpenMP (multithreaded CPU) support, partial BLAS support, an expression-template-based implementation with PTX code generation identical to hand-written kernels, and support for auto-differentiation.

Fix broadcasting with openmp #14

Closed josephjaspers closed 5 years ago

josephjaspers commented 5 years ago

Automatic tensor broadcasting combined with OpenMP may result in race conditions.
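The issue does not state the mechanism, so the following is only a sketch of one plausible way such a race could arise, assuming a broadcast operand appears on the written side of an elementwise update. The function names and the column-major layout are hypothetical and are not taken from BlackCat_Tensors.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helpers for illustration only; not the BlackCat_Tensors API.
// Both add every column of a column-major (rows x cols) matrix into `out`
// (length rows), i.e. `out` is broadcast across the columns of `mat`.

// Racy variant: the loop is parallelized over ALL matrix elements, so the
// broadcast index (i % rows) repeats once per column and two threads can
// update the same out[...] element without synchronization.
inline void broadcast_add_racy(std::vector<float>& out,
                               const std::vector<float>& mat,
                               std::size_t rows, std::size_t cols) {
    assert(out.size() == rows && mat.size() == rows * cols);
    const long long n = static_cast<long long>(rows * cols);
    #pragma omp parallel for
    for (long long i = 0; i < n; ++i) {
        const std::size_t idx = static_cast<std::size_t>(i);
        out[idx % rows] += mat[idx];  // unsynchronized read-modify-write
    }
}

// Safe variant: the loop is parallelized over the OUTPUT index, so each
// out[r] is read and written by exactly one thread.
inline void broadcast_add_safe(std::vector<float>& out,
                               const std::vector<float>& mat,
                               std::size_t rows, std::size_t cols) {
    assert(out.size() == rows && mat.size() == rows * cols);
    const long long nrows = static_cast<long long>(rows);
    #pragma omp parallel for
    for (long long r = 0; r < nrows; ++r) {
        const std::size_t row = static_cast<std::size_t>(r);
        float acc = out[row];
        for (std::size_t c = 0; c < cols; ++c) {
            acc += mat[c * rows + row];  // column-major element (row, c)
        }
        out[row] = acc;
    }
}
```

Built without OpenMP enabled, the pragmas are ignored and both variants run serially and agree. With OpenMP enabled, only the second variant is free of data races. Other remedies, such as `#pragma omp atomic` on the update or skipping parallelization when the destination is broadcast, are equally plausible, and the linked commit may take a different approach.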

josephjaspers commented 5 years ago

Fixed with: https://github.com/josephjaspers/BlackCat_Tensors/commit/51ddb5b16af92638c635cce605f954669ac1cda3