lukeiwanski / tensorflow

OpenCL support for TensorFlow via SYCL
Apache License 2.0
65 stars 14 forks source link

[SYCL] Improve GatherOp #219

Closed Rbiessy closed 6 years ago

Rbiessy commented 6 years ago

GatherOp used to copy some large chunk of submatrices row by row using 2 chip operations making everything too slow. It is now using memcpy when possible.

This should be pushed to dev/eigen_mehdi or integration/eigen_mehdi as well.