Open alexandermorozov opened 8 years ago
I've implemented the memory access API and synchronization based on bitmasks. Tensor/TensorView and decoupling aren't implemented.
Native and CUDA pass all tests. OpenCL compiles but segfaults on my machine, both with this PR and without it.
This PR isn't ready to be merged yet -- I'd like to fix the plugins and Leaf first to check that there are no unexpected problems.
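
For readers unfamiliar with the idea, here is a minimal sketch of what bitmask-based synchronization could look like. It is not the code from this PR: the type `SyncedCopies`, its methods, and the use of plain `Vec<f32>` buffers in place of real device memory are all assumptions made for illustration. Bit `i` of the mask is set when the copy on device `i` holds the latest data.

```rust
/// Hypothetical illustration only; names and layout are not taken from the PR.
/// One optional buffer per device, plus a bitmask of devices whose copy is current.
struct SyncedCopies {
    len: usize,
    copies: Vec<Option<Vec<f32>>>,
    up_to_date: u64, // bit i set => copies[i] holds the latest data
}

impl SyncedCopies {
    fn new(n_devices: usize, len: usize) -> Self {
        assert!(n_devices > 0 && n_devices <= 64, "mask holds at most 64 devices");
        SyncedCopies { len, copies: vec![None; n_devices], up_to_date: 0 }
    }

    fn is_current(&self, dev: usize) -> bool {
        self.up_to_date & (1u64 << dev) != 0
    }

    /// Bring the copy on `dev` up to date by copying from any current device.
    fn sync(&mut self, dev: usize) {
        if self.is_current(dev) {
            return;
        }
        assert!(self.up_to_date != 0, "no device holds initialized data");
        // Lowest set bit = first device with a current copy.
        let src = self.up_to_date.trailing_zeros() as usize;
        let data = self.copies[src].clone().expect("current copy must be allocated");
        self.copies[dev] = Some(data);
        self.up_to_date |= 1u64 << dev;
    }

    /// Read access: syncs if needed; other current copies stay valid.
    fn read(&mut self, dev: usize) -> &[f32] {
        self.sync(dev);
        self.copies[dev].as_deref().unwrap()
    }

    /// Read-write access: syncs, then marks every other copy as outdated.
    fn read_write(&mut self, dev: usize) -> &mut [f32] {
        self.sync(dev);
        self.up_to_date = 1u64 << dev;
        self.copies[dev].as_deref_mut().unwrap()
    }

    /// Write-only access: skips the sync because old contents are overwritten.
    fn write_only(&mut self, dev: usize) -> &mut [f32] {
        let len = self.len;
        self.up_to_date = 1u64 << dev;
        self.copies[dev].get_or_insert_with(|| vec![0.0; len]).as_mut_slice()
    }
}
```

In this sketch the kind of access decides how the mask changes: a read adds a bit, while any write resets the mask to the single writing device, so a later read elsewhere triggers a copy.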