halide / Halide

a language for fast, portable data-parallel computation
https://halide-lang.org
Other
5.83k stars 1.07k forks source link

Investigate using vector predication intrinsics for SVE2 architecture specific cases in CodeGen_ARM. #8088

Open zvookin opened 7 months ago

zvookin commented 7 months ago

There is architecture specific code in CodeGen_ARM.cpp that can be accomplished using LLVM's vector predication intrinsics. However doing so requires investigating the final code that is generated and making sure there are no failures inside LLVM. (At the time of filing the issue, this is almost certainly not going to work, hence the way the code was written.)

zvookin commented 7 months ago

79776e0d669869fd8fbb7a144b8b736479f6a07d