stage2 performance regression regarding struct and packed struct vectorization

ziglang / zig

General-purpose programming language and toolchain for maintaining robust, optimal, and reusable software.

MIT License

Performance for this example can be recovered by capturing by pointer with for (dataset_packed) |*v, k|, but the copy of v.* has to be inserted in exactly the right place:

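for (dataset_packed) |*v, k| {
    // v.a is still read through the pointer, before any copy is made.
    dataset_packed[k].a +%= v.a;
    // Snapshot the element here, after the update of a and before b, c, d are written.
    const v_copy = v.*;
    dataset_packed[k].b = v_copy.c and v_copy.d;
    dataset_packed[k].c = v_copy.b and v_copy.d;
    dataset_packed[k].d = v_copy.b and v_copy.c;
}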

If v_copy is moved up a line and used for the entire loop body, performance is still bad.

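If v.* is not copied at all, the loop no longer computes the same result as the original code: the assignments to c and d then read fields that earlier lines of the same iteration have already overwritten.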
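The difference between copying and not copying can be checked with a small test. This is only a sketch: the struct S, its field widths, and the function names updateWithCopy and updateWithoutCopy are hypothetical stand-ins rather than the types from the original report. It uses a while loop with an explicit pointer instead of the for (dataset_packed) |*v, k| form, so that it does not depend on the for-loop capture syntax, which differs between Zig versions.

```zig
const std = @import("std");

// Hypothetical element type; the real field layout in the report may differ.
const S = packed struct { a: u5, b: bool, c: bool, d: bool };

// Mirrors the loop above: snapshot the element after updating a.
fn updateWithCopy(data: []S) void {
    var k: usize = 0;
    while (k < data.len) : (k += 1) {
        const v = &data[k];
        data[k].a +%= v.a;
        const v_copy = v.*;
        data[k].b = v_copy.c and v_copy.d;
        data[k].c = v_copy.b and v_copy.d;
        data[k].d = v_copy.b and v_copy.c;
    }
}

// Same loop without the snapshot: later lines see fields written earlier.
fn updateWithoutCopy(data: []S) void {
    var k: usize = 0;
    while (k < data.len) : (k += 1) {
        const v = &data[k];
        data[k].a +%= v.a;
        data[k].b = v.c and v.d;
        data[k].c = v.b and v.d; // v.b was just overwritten
        data[k].d = v.b and v.c; // v.b and v.c were just overwritten
    }
}

test "skipping the copy changes the result" {
    var with_copy = [_]S{.{ .a = 1, .b = true, .c = false, .d = true }};
    var without_copy = with_copy;

    updateWithCopy(&with_copy);
    updateWithoutCopy(&without_copy);

    // Both variants double a, but they disagree on c.
    try std.testing.expect(with_copy[0].a == 2);
    try std.testing.expect(without_copy[0].a == 2);
    try std.testing.expect(with_copy[0].c);
    try std.testing.expect(!without_copy[0].c);
}
```

For the input b = true, c = false, d = true, the copying variant computes c from the old b and so yields c = true, while the variant without the copy reads the freshly written b = false and yields c = false. The sketch says nothing about performance; it only illustrates why the copy cannot simply be dropped.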

stage2 performance regression regarding struct and packed struct vectorization #13373

Zig Version

Steps to Reproduce and Observed Behavior

Expected Behavior