openxla / xla

A machine learning compiler for GPUs, CPUs, and ML accelerators
Apache License 2.0
2.64k stars 418 forks source link

Allow vector size 8 in VectorizeLoadsAndStoresPass. #17927

Closed copybara-service[bot] closed 1 week ago

copybara-service[bot] commented 1 week ago

Allow vector size 8 in VectorizeLoadsAndStoresPass.

When we unroll loops, we sometimes unroll by a factor of 8. Then we will also compute a vector size of 8, so we should allow it, otherwise we need to rely on LLVM vectorization which may not happen in case one of the parameters is in-place.