intel / ARM_NEON_2_x86_SSE

The platform independent header allowing to compile any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using SIMD up to AVX2 intrinsic functions
Other
430 stars 149 forks source link

vld1_u8_x3 not available #61

Open xxxxxxLD opened 2 years ago

xxxxxxLD commented 2 years ago

In original , vld1_u8_x3(Load multiple single-element structures to one, two, three, or four registers) are available, and it seems that in arm v7 also available. see at https://developer.arm.com/architectures/instruction-sets/intrinsics/ but in NEON_2_SSE.h, it is not available. NEON_2_SSE.h supports which version of arm processor?

Zvictoria commented 2 years ago

Hi, @xxxxxxLD My understanding is vld1 instruction is arm v7 aarch64, while I've implemented arm v7 original version only. Anyway, there are no fast way to implement it in x86 - you are free to use your own implementation - like calling vld1_u8 three times correspondingly.