Closed Yibo-He closed 1 hour ago
The indices taken by indexed load/store are in byte granularity, not element granularity; that is, you want to shift the index argument left by 2 (i.e. __riscv_vsll_vx_u64m4(__riscv_vle64_v_u64m4(ptr_idx, vl), 2, vl)
).
The indices taken by indexed load/store are in byte granularity, not element granularity; that is, you want to shift the index argument left by 2 (i.e.
__riscv_vsll_vx_u64m4(__riscv_vle64_v_u64m4(ptr_idx, vl), 2, vl)
).
Oh, i see. Thank you very much!
I try to use
vluxei
andvloxei
intrinsics and write a code snippet. However, the results are confusing. Is this a compiler bug or my misunderstanding?Code:
Just load and store the data. Here are results:
However, I think the results should be the data in
tmp_a[dataLen]
. Is this a compiler bug or my misunderstanding?Version: