[x86-64] Broadcasting an element of a vector should not use `vpermb` or `vpshufb` #113396
Open
Validark opened 2 hours ago
I have code like so:
Here is the LLVM version:
Here is how it lowers on Zen 5:
Here is how I think it should lower:
The same applies to broadcasting into `xmm0`:
I would much rather avoid the trip to memory: