kovidgoyal / kitty

Cross-platform, fast, feature-rich, GPU based terminal
https://sw.kovidgoyal.net/kitty/
GNU General Public License v3.0

upgrade SIMDe version to 0.8.0 #7261

Closed mr-c closed 7 months ago

kovidgoyal commented 7 months ago

Why? The deps kitty binaries are built against are periodically updated. Is there something specific in this simde update you need?

mr-c commented 7 months ago

> Is there something specific in this simde update you need?

No, I just wanted to be sure the SIMDe release I made didn't break anything for you all.

kovidgoyal commented 7 months ago

Ah, well, I am happy to test it out when I have the chance. Unfortunately, updating the JSON deps list is not quite enough: deps are not built from it automatically, so I have to trigger that manually.

If it does break something, I will be sure to ping you. Thanks very much for SIMDe. It has been a big help; indeed, it was what allowed me to even contemplate doing the SIMD code in kitty in the first place.

kovidgoyal commented 7 months ago

Oh, and I can tell you the kitty test suite passes on Linux x64 when built against simde 0.8.0. The other platforms it is used on are arm64 and x86, for which testing will have to wait.

mr-c commented 7 months ago

Nice, thanks!

kovidgoyal commented 6 months ago

I have updated simde and am getting a build failure in CI related to some vrnd functions not being found: https://github.com/kovidgoyal/kitty/actions/runs/8889251587/job/24407334906

The simde headers are included like this:

START_IGNORE_DIAGNOSTIC("-Wfloat-conversion")
START_IGNORE_DIAGNOSTIC("-Wpedantic")
#if  defined(__clang__) && __clang_major__ > 12
_Pragma("clang diagnostic push")
_Pragma("clang diagnostic ignored \"-Wbitwise-instead-of-logical\"")
#endif
#include <simde/x86/avx2.h>
#include <simde/arm/neon.h>
#if  defined(__clang__) && __clang_major__ > 12
_Pragma("clang diagnostic pop")
#endif
END_IGNORE_DIAGNOSTIC
END_IGNORE_DIAGNOSTIC

The file with these includes is compiled with:

clang -MMD -DDEBUG -DPRIMARY_VERSION=4000 -DSECONDARY_VERSION=34 -DXT_VERSION="0.34.1" -DGL_SILENCE_DEPRECATION -Wextra -Wfloat-conversion -Wno-missing-field-initializers -Wall -Wstrict-prototypes -std=c11 -pedantic-errors -Werror -g3 -Og -fwrapv -fstack-protector-strong -pipe -fvisibility=hidden -fno-plt -D_FORTIFY_SOURCE=2 -DKITTY_DEBUG_BUILD -fno-omit-frame-pointer -mbranch-protection=standard -pthread -I/Users/Shared/kitty-build/sw/sw/include/libpng16 -I/Users/Shared/kitty-build/sw/sw/include -I/Users/Shared/kitty-build/sw/sw/include -I/var/folders/wj/cqy1nsyn4wn94d96btkldt0r0000gn/T/t/openssl,x86_64,-5kbzhdrp/include -I/Users/Shared/kitty-build/sw/sw/include/harfbuzz -I/Users/Shared/kitty-build/sw/sw/python/Python.framework/Versions/3.11/include/python3.11 -c kitty/simd-string.c -o build/fast_data_types-kitty-simd-string.c.o

The errors are:

2024-04-30T04:45:45.5212300Z In file included from kitty/simd-string-128.c:9:
2024-04-30T04:45:45.5226380Z In file included from kitty/simd-string-impl.h:37:
2024-04-30T04:45:45.5238970Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:123860:12: error: call to undeclared function 'vrnd32x_f64'; ISO C99 and later do not support implicit function declarations [-Wimplicit-function-declaration]
2024-04-30T04:45:45.5260770Z     return vrnd32x_f64(a);
2024-04-30T04:45:45.5301430Z            ^
2024-04-30T04:45:45.5333490Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:123860:12: error: returning 'int' from a function with incompatible result type 'simde_float64x1_t' (aka 'float64x1_t')
2024-04-30T04:45:45.5348130Z     return vrnd32x_f64(a);
2024-04-30T04:45:45.5359250Z            ^~~~~~~~~~~~~~
2024-04-30T04:45:45.5369120Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:123920:12: error: call to undeclared function 'vrnd32xq_f64'; ISO C99 and later do not support implicit function declarations [-Wimplicit-function-declaration]
2024-04-30T04:45:45.5379550Z     return vrnd32xq_f64(a);
2024-04-30T04:45:45.5408000Z            ^
2024-04-30T04:45:45.5416520Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:123920:12: error: returning 'int' from a function with incompatible result type 'simde_float64x2_t' (aka 'float64x2_t')
2024-04-30T04:45:45.5439980Z     return vrnd32xq_f64(a);
2024-04-30T04:45:45.5440920Z            ^~~~~~~~~~~~~~~
2024-04-30T04:45:45.5442440Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124025:12: error: call to undeclared function 'vrnd32z_f64'; ISO C99 and later do not support implicit function declarations [-Wimplicit-function-declaration]
2024-04-30T04:45:45.5443830Z     return vrnd32z_f64(a);
2024-04-30T04:45:45.5444400Z            ^
2024-04-30T04:45:45.5445270Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124025:12: note: did you mean 'vrnd32z_f32'?
2024-04-30T04:45:45.5446940Z /Applications/Xcode_15.3.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/lib/clang/15.0.0/include/arm_neon.h:66616:18: note: 'vrnd32z_f32' declared here
2024-04-30T04:45:45.5448140Z __ai float32x2_t vrnd32z_f32(float32x2_t __p0) {
2024-04-30T04:45:45.5448780Z                  ^
2024-04-30T04:45:45.5449410Z In file included from kitty/simd-string-128.c:9:
2024-04-30T04:45:45.5450130Z In file included from kitty/simd-string-impl.h:37:
2024-04-30T04:45:45.5452010Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124025:12: error: returning 'int' from a function with incompatible result type 'simde_float64x1_t' (aka 'float64x1_t')
2024-04-30T04:45:45.5453220Z     return vrnd32z_f64(a);
2024-04-30T04:45:45.5453800Z            ^~~~~~~~~~~~~~
2024-04-30T04:45:45.5455150Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124085:12: error: call to undeclared function 'vrnd32zq_f64'; ISO C99 and later do not support implicit function declarations [-Wimplicit-function-declaration]
2024-04-30T04:45:45.5456380Z     return vrnd32zq_f64(a);
2024-04-30T04:45:45.5456950Z            ^
2024-04-30T04:45:45.5457730Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124085:12: note: did you mean 'vrnd32zq_f32'?
2024-04-30T04:45:45.5459130Z /Applications/Xcode_15.3.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/lib/clang/15.0.0/include/arm_neon.h:66600:18: note: 'vrnd32zq_f32' declared here
2024-04-30T04:45:45.5460320Z __ai float32x4_t vrnd32zq_f32(float32x4_t __p0) {
2024-04-30T04:45:45.5460960Z                  ^
2024-04-30T04:45:45.5461520Z In file included from kitty/simd-string-128.c:9:
2024-04-30T04:45:45.5462250Z In file included from kitty/simd-string-impl.h:37:
2024-04-30T04:45:45.5463440Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124085:12: error: returning 'int' from a function with incompatible result type 'simde_float64x2_t' (aka 'float64x2_t')
2024-04-30T04:45:45.5464470Z     return vrnd32zq_f64(a);
2024-04-30T04:45:45.5465060Z            ^~~~~~~~~~~~~~~
2024-04-30T04:45:45.5466370Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124190:12: error: call to undeclared function 'vrnd64x_f64'; ISO C99 and later do not support implicit function declarations [-Wimplicit-function-declaration]
2024-04-30T04:45:45.5467630Z     return vrnd64x_f64(a);
2024-04-30T04:45:45.5468110Z            ^
2024-04-30T04:45:45.5469250Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124190:12: error: returning 'int' from a function with incompatible result type 'simde_float64x1_t' (aka 'float64x1_t')
2024-04-30T04:45:45.5470270Z     return vrnd64x_f64(a);
2024-04-30T04:45:45.5470760Z            ^~~~~~~~~~~~~~
2024-04-30T04:45:45.5472160Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124250:12: error: call to undeclared function 'vrnd64xq_f64'; ISO C99 and later do not support implicit function declarations [-Wimplicit-function-declaration]
2024-04-30T04:45:45.5473330Z     return vrnd64xq_f64(a);
2024-04-30T04:45:45.5473890Z            ^
2024-04-30T04:45:45.5474970Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124250:12: error: returning 'int' from a function with incompatible result type 'simde_float64x2_t' (aka 'float64x2_t')
2024-04-30T04:45:45.5476130Z     return vrnd64xq_f64(a);
2024-04-30T04:45:45.5476650Z            ^~~~~~~~~~~~~~~
2024-04-30T04:45:45.5477980Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124355:12: error: call to undeclared function 'vrnd64z_f64'; ISO C99 and later do not support implicit function declarations [-Wimplicit-function-declaration]
2024-04-30T04:45:45.5479430Z     return vrnd64z_f64(a);
2024-04-30T04:45:45.5479860Z            ^
2024-04-30T04:45:45.5480720Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124355:12: note: did you mean 'vrnd64z_f32'?
2024-04-30T04:45:45.5482150Z /Applications/Xcode_15.3.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/lib/clang/15.0.0/include/arm_neon.h:66680:18: note: 'vrnd64z_f32' declared here
2024-04-30T04:45:45.5483300Z __ai float32x2_t vrnd64z_f32(float32x2_t __p0) {
2024-04-30T04:45:45.5483980Z                  ^
2024-04-30T04:45:45.5484560Z In file included from kitty/simd-string-128.c:9:
2024-04-30T04:45:45.5485220Z In file included from kitty/simd-string-impl.h:37:
2024-04-30T04:45:45.5486400Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124355:12: error: returning 'int' from a function with incompatible result type 'simde_float64x1_t' (aka 'float64x1_t')
2024-04-30T04:45:45.5487520Z     return vrnd64z_f64(a);
2024-04-30T04:45:45.5488040Z            ^~~~~~~~~~~~~~
2024-04-30T04:45:45.5489370Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124415:12: error: call to undeclared function 'vrnd64zq_f64'; ISO C99 and later do not support implicit function declarations [-Wimplicit-function-declaration]
2024-04-30T04:45:45.5490540Z     return vrnd64zq_f64(a);
2024-04-30T04:45:45.5491050Z            ^
2024-04-30T04:45:45.5491870Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124415:12: note: did you mean 'vrnd64zq_f32'?
2024-04-30T04:45:45.5493290Z /Applications/Xcode_15.3.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/lib/clang/15.0.0/include/arm_neon.h:66664:18: note: 'vrnd64zq_f32' declared here
2024-04-30T04:45:45.5494310Z __ai float32x4_t vrnd64zq_f32(float32x4_t __p0) {
2024-04-30T04:45:45.5494960Z                  ^
2024-04-30T04:45:45.5495450Z In file included from kitty/simd-string-128.c:9:
2024-04-30T04:45:45.5496070Z In file included from kitty/simd-string-impl.h:37:
2024-04-30T04:45:45.5497200Z /Users/Shared/kitty-build/sw/sw/include/simde/arm/neon.h:124415:12: error: returning 'int' from a function with incompatible result type 'simde_float64x2_t' (aka 'float64x2_t')
2024-04-30T04:45:45.5516830Z     return vrnd64zq_f64(a);
2024-04-30T04:45:45.5517300Z            ^~~~~~~~~~~~~~~
2024-04-30T04:45:45.5837100Z 16 errors generated.

This is on a macOS ARM system with Xcode 15.

Any clue what might cause this?

kovidgoyal commented 6 months ago

Never mind, I think I know the issue: the neon sub-folder is missing from the simde install.

kovidgoyal commented 6 months ago

Or actually, no: the Homebrew-installed simde has that folder, but the released amalgamated build doesn't. I am guessing its contents are folded into the amalgamation. So @mr-c, I am at a loss.

kovidgoyal commented 6 months ago

Ah, looks like this is: https://github.com/simd-everywhere/simde/issues/1146
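
The log above shows the arm_neon.h shipped with Xcode 15.3 declaring the _f32 variants (for example vrnd32z_f32) while the _f64 calls are undeclared. One general way to cope with such a gap is to call the native intrinsic only when a build-time check says it exists, and to use a portable fallback otherwise. The sketch below illustrates that pattern only; it is not taken from the SIMDe fix. EXAMPLE_HAVE_NATIVE_VRND32Z_F64 and the example_ functions are hypothetical names, and the fallback assumes the round-toward-zero behaviour documented for the underlying instruction, with out-of-range and NaN inputs mapping to the most negative 32-bit integer.

```c
#include <assert.h>
#include <stdint.h>

#if defined(EXAMPLE_HAVE_NATIVE_VRND32Z_F64) /* HYPOTHETICAL build-time switch */
#include <arm_neon.h>
/* Native path: only compiled when the toolchain is known to declare vrnd32z_f64. */
static double example_rnd32z_f64(double a) {
  return vget_lane_f64(vrnd32z_f64(vdup_n_f64(a)), 0);
}
#else
/* Portable path: truncate toward zero; values that do not fit in int32_t,
   and NaN, yield INT32_MIN as a double. Unlike the instruction, this sketch
   raises no floating-point exceptions and does not preserve the sign of zero. */
static double example_rnd32z_f64(double a) {
  if (!(a > -2147483649.0 && a < 2147483648.0)) {
    return -2147483648.0; /* also taken for NaN, which fails both comparisons */
  }
  return (double)(int32_t)a;
}
#endif

/* Small self-check of the behaviour described above. */
static void example_rnd32z_self_check(void) {
  assert(example_rnd32z_f64(2.7) == 2.0);
  assert(example_rnd32z_f64(-2.7) == -2.0);
  assert(example_rnd32z_f64(3.0e9) == -2147483648.0);
}
```

With the switch undefined, the file never mentions the missing intrinsic, so -Werror and -pedantic-errors builds like the one quoted earlier would not see an implicit declaration.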

kovidgoyal commented 6 months ago

I'd appreciate a new release of simde with the fix; for the moment I am reverting to simde 0.7.x.

mr-c commented 6 months ago

@kovidgoyal Thanks! Yes, a new release is in progress: https://github.com/simd-everywhere/simde/issues/1176

mr-c commented 6 months ago

@kovidgoyal SIMDe version 0.8.2 has been released: https://simd-everywhere.github.io/blog/2024/05/02/0.8.0-0.8.2-release.html

kovidgoyal commented 6 months ago

Thanks, updated, and all tests pass on all CI platforms.

kovidgoyal commented 6 months ago

Fails to build on x86 (32-bit Intel) with the following errors (this platform is not tested in CI, so I didn't catch it earlier).
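
A note for reading the log below. The macro line quoted in it declares the index vector as int##elem_size##_t SIMDE_VECTOR(vec_size). Assuming SIMDE_VECTOR(n) expands to the GCC vector_size(n) attribute, n is the vector's size in bytes, so a call such as SIMDE_SHUFFLE_VECTOR_(16, 4, ...) would get a two-lane int16_t index vector while supplying four indices. That reading matches the number of "excess elements" errors per call in the log (two for the four-index calls, four for the eight-index calls), but it is an inference from the log, not a confirmed diagnosis. The sketch is for GCC or Clang, and every example_ name in it is hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* GCC/Clang vector extension: the attribute argument is the vector's total
   size in BYTES, not its lane count. */
typedef int16_t example_i16x2 __attribute__((vector_size(4))); /* 4 bytes -> 2 lanes */
typedef int16_t example_i16x4 __attribute__((vector_size(8))); /* 8 bytes -> 4 lanes */

_Static_assert(sizeof(example_i16x2) / sizeof(int16_t) == 2, "4-byte vector has 2 lanes");
_Static_assert(sizeof(example_i16x4) / sizeof(int16_t) == 4, "8-byte vector has 4 lanes");

/* Lanes in a vector of elem_bits-wide integers that is vec_size_bytes long. */
static size_t example_lanes(size_t elem_bits, size_t vec_size_bytes) {
  return vec_size_bytes / (elem_bits / 8);
}

/* Number of initializer values that would not fit in such a vector. */
static size_t example_excess_indices(size_t elem_bits, size_t vec_size_bytes,
                                     size_t n_indices) {
  size_t lanes = example_lanes(elem_bits, vec_size_bytes);
  return n_indices > lanes ? n_indices - lanes : 0;
}

/* The two argument shapes that appear in the log. */
static void example_shuffle_self_check(void) {
  assert(example_excess_indices(16, 4, 4) == 2); /* (16, 4, ...) with 4 indices */
  assert(example_excess_indices(16, 8, 8) == 4); /* (16, 8, ...) with 8 indices */
}
```

Under the same assumption, passing the byte size (8 for four 16-bit lanes, 16 for eight) leaves no excess indices.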

/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcadd_rot270_f16’:
/sw/sw/include/simde/arm/neon.h:30269:77: error: excess elements in vector initializer
30269 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30269:77: note: (near initialization for ‘simde_shuffle_’)
30269 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30269:80: error: excess elements in vector initializer
30269 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                                ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30269:80: note: (near initialization for ‘simde_shuffle_’)
30269 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                                ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30269:19: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
30269 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                   ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcaddq_rot270_f16’:
/sw/sw/include/simde/arm/neon.h:30299:84: error: excess elements in vector initializer
30299 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 9, 0, 11, 2, 13, 4, 15, 6);
      |                                                                                    ^~
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30299:84: note: (near initialization for ‘simde_shuffle_’)
30299 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 9, 0, 11, 2, 13, 4, 15, 6);
      |                                                                                    ^~
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30299:88: error: excess elements in vector initializer
30299 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 9, 0, 11, 2, 13, 4, 15, 6);
      |                                                                                        ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30299:88: note: (near initialization for ‘simde_shuffle_’)
30299 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 9, 0, 11, 2, 13, 4, 15, 6);
      |                                                                                        ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30299:91: error: excess elements in vector initializer
30299 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 9, 0, 11, 2, 13, 4, 15, 6);
      |                                                                                           ^~
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30299:91: note: (near initialization for ‘simde_shuffle_’)
30299 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 9, 0, 11, 2, 13, 4, 15, 6);
      |                                                                                           ^~
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30299:95: error: excess elements in vector initializer
30299 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 9, 0, 11, 2, 13, 4, 15, 6);
      |                                                                                               ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30299:95: note: (near initialization for ‘simde_shuffle_’)
30299 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 9, 0, 11, 2, 13, 4, 15, 6);
      |                                                                                               ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30299:19: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
30299 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 9, 0, 11, 2, 13, 4, 15, 6);
      |                   ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcadd_rot90_f16’:
/sw/sw/include/simde/arm/neon.h:30463:77: error: excess elements in vector initializer
30463 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30463:77: note: (near initialization for ‘simde_shuffle_’)
30463 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30463:80: error: excess elements in vector initializer
30463 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                                ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30463:80: note: (near initialization for ‘simde_shuffle_’)
30463 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                                ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30463:19: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
30463 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                   ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcaddq_rot90_f16’:
/sw/sw/include/simde/arm/neon.h:30493:84: error: excess elements in vector initializer
30493 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 1, 8, 3, 10, 5, 12, 7, 14);
      |                                                                                    ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30493:84: note: (near initialization for ‘simde_shuffle_’)
30493 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 1, 8, 3, 10, 5, 12, 7, 14);
      |                                                                                    ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30493:87: error: excess elements in vector initializer
30493 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 1, 8, 3, 10, 5, 12, 7, 14);
      |                                                                                       ^~
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30493:87: note: (near initialization for ‘simde_shuffle_’)
30493 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 1, 8, 3, 10, 5, 12, 7, 14);
      |                                                                                       ^~
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30493:91: error: excess elements in vector initializer
30493 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 1, 8, 3, 10, 5, 12, 7, 14);
      |                                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30493:91: note: (near initialization for ‘simde_shuffle_’)
30493 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 1, 8, 3, 10, 5, 12, 7, 14);
      |                                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30493:94: error: excess elements in vector initializer
30493 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 1, 8, 3, 10, 5, 12, 7, 14);
      |                                                                                              ^~
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30493:94: note: (near initialization for ‘simde_shuffle_’)
30493 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 1, 8, 3, 10, 5, 12, 7, 14);
      |                                                                                              ^~
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:30493:19: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
30493 |       b_.values = SIMDE_SHUFFLE_VECTOR_(16, 8, -b_.values, b_.values, 1, 8, 3, 10, 5, 12, 7, 14);
      |                   ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmla_lane_f16’:
/sw/sw/include/simde/arm/neon.h:44221:74: error: excess elements in vector initializer
44221 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44221:74: note: (near initialization for ‘simde_shuffle_’)
44221 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44221:77: error: excess elements in vector initializer
44221 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44221:77: note: (near initialization for ‘simde_shuffle_’)
44221 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44221:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44221 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmla_laneq_f16’:
/sw/sw/include/simde/arm/neon.h:44282:74: error: excess elements in vector initializer
44282 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44282:74: note: (near initialization for ‘simde_shuffle_’)
44282 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44282:77: error: excess elements in vector initializer
44282 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44282:77: note: (near initialization for ‘simde_shuffle_’)
44282 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44282:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44282 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmlaq_lane_f16’:
/sw/sw/include/simde/arm/neon.h:44345:83: error: excess elements in vector initializer
44345 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44345:83: note: (near initialization for ‘simde_shuffle_’)
44345 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44345:86: error: excess elements in vector initializer
44345 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44345:86: note: (near initialization for ‘simde_shuffle_’)
44345 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44345:20: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44345 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                    ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44346:86: error: excess elements in vector initializer
44346 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44346:86: note: (near initialization for ‘simde_shuffle_’)
44346 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44346:89: error: excess elements in vector initializer
44346 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44346:89: note: (near initialization for ‘simde_shuffle_’)
44346 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44346:21: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44346 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                     ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmlaq_laneq_f16’:
/sw/sw/include/simde/arm/neon.h:44413:83: error: excess elements in vector initializer
44413 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44413:83: note: (near initialization for ‘simde_shuffle_’)
44413 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44413:86: error: excess elements in vector initializer
44413 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44413:86: note: (near initialization for ‘simde_shuffle_’)
44413 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44413:20: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44413 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                    ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44415:86: error: excess elements in vector initializer
44415 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44415:86: note: (near initialization for ‘simde_shuffle_’)
44415 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44415:89: error: excess elements in vector initializer
44415 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44415:89: note: (near initialization for ‘simde_shuffle_’)
44415 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44415:21: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44415 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                     ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmla_rot180_lane_f16’:
/sw/sw/include/simde/arm/neon.h:44761:74: error: excess elements in vector initializer
44761 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44761:74: note: (near initialization for ‘simde_shuffle_’)
44761 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44761:77: error: excess elements in vector initializer
44761 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44761:77: note: (near initialization for ‘simde_shuffle_’)
44761 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44761:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44761 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44762:75: error: excess elements in vector initializer
44762 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44762:75: note: (near initialization for ‘simde_shuffle_’)
44762 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44762:78: error: excess elements in vector initializer
44762 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44762:78: note: (near initialization for ‘simde_shuffle_’)
44762 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44762:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44762 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmlaq_rot180_lane_f16’:
/sw/sw/include/simde/arm/neon.h:44826:83: error: excess elements in vector initializer
44826 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44826:83: note: (near initialization for ‘simde_shuffle_’)
44826 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44826:86: error: excess elements in vector initializer
44826 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44826:86: note: (near initialization for ‘simde_shuffle_’)
44826 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44826:20: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44826 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                    ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44827:86: error: excess elements in vector initializer
44827 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44827:86: note: (near initialization for ‘simde_shuffle_’)
44827 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44827:89: error: excess elements in vector initializer
44827 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44827:89: note: (near initialization for ‘simde_shuffle_’)
44827 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44827:21: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44827 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                     ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44828:75: error: excess elements in vector initializer
44828 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44828:75: note: (near initialization for ‘simde_shuffle_’)
44828 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44828:78: error: excess elements in vector initializer
44828 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44828:78: note: (near initialization for ‘simde_shuffle_’)
44828 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44828:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44828 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmla_rot180_laneq_f16’:
/sw/sw/include/simde/arm/neon.h:44894:74: error: excess elements in vector initializer
44894 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44894:74: note: (near initialization for ‘simde_shuffle_’)
44894 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44894:77: error: excess elements in vector initializer
44894 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44894:77: note: (near initialization for ‘simde_shuffle_’)
44894 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44894:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44894 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 0, 0, 2, 2);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44895:75: error: excess elements in vector initializer
44895 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44895:75: note: (near initialization for ‘simde_shuffle_’)
44895 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44895:78: error: excess elements in vector initializer
44895 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44895:78: note: (near initialization for ‘simde_shuffle_’)
44895 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44895:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44895 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmlaq_rot180_laneq_f16’:
/sw/sw/include/simde/arm/neon.h:44959:83: error: excess elements in vector initializer
44959 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44959:83: note: (near initialization for ‘simde_shuffle_’)
44959 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44959:86: error: excess elements in vector initializer
44959 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44959:86: note: (near initialization for ‘simde_shuffle_’)
44959 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44959:20: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44959 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 0, 0, 2, 2);
      |                    ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44960:86: error: excess elements in vector initializer
44960 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44960:86: note: (near initialization for ‘simde_shuffle_’)
44960 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44960:89: error: excess elements in vector initializer
44960 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44960:89: note: (near initialization for ‘simde_shuffle_’)
44960 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44960:21: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44960 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 0, 0, 2, 2);
      |                     ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44961:75: error: excess elements in vector initializer
44961 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44961:75: note: (near initialization for ‘simde_shuffle_’)
44961 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44961:78: error: excess elements in vector initializer
44961 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44961:78: note: (near initialization for ‘simde_shuffle_’)
44961 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:44961:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
44961 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 0, 1, 2, 3);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmla_rot270_lane_f16’:
/sw/sw/include/simde/arm/neon.h:45306:74: error: excess elements in vector initializer
45306 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45306:74: note: (near initialization for ‘simde_shuffle_’)
45306 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45306:77: error: excess elements in vector initializer
45306 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45306:77: note: (near initialization for ‘simde_shuffle_’)
45306 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45306:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45306 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45307:75: error: excess elements in vector initializer
45307 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45307:75: note: (near initialization for ‘simde_shuffle_’)
45307 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45307:78: error: excess elements in vector initializer
45307 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45307:78: note: (near initialization for ‘simde_shuffle_’)
45307 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45307:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45307 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmlaq_rot270_lane_f16’:
/sw/sw/include/simde/arm/neon.h:45371:83: error: excess elements in vector initializer
45371 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45371:83: note: (near initialization for ‘simde_shuffle_’)
45371 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45371:86: error: excess elements in vector initializer
45371 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45371:86: note: (near initialization for ‘simde_shuffle_’)
45371 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45371:20: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45371 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                    ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45372:86: error: excess elements in vector initializer
45372 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45372:86: note: (near initialization for ‘simde_shuffle_’)
45372 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45372:89: error: excess elements in vector initializer
45372 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45372:89: note: (near initialization for ‘simde_shuffle_’)
45372 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45372:21: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45372 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                     ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45373:75: error: excess elements in vector initializer
45373 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45373:75: note: (near initialization for ‘simde_shuffle_’)
45373 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45373:78: error: excess elements in vector initializer
45373 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45373:78: note: (near initialization for ‘simde_shuffle_’)
45373 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45373:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45373 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmla_rot270_laneq_f16’:
/sw/sw/include/simde/arm/neon.h:45439:74: error: excess elements in vector initializer
45439 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45439:74: note: (near initialization for ‘simde_shuffle_’)
45439 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45439:77: error: excess elements in vector initializer
45439 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45439:77: note: (near initialization for ‘simde_shuffle_’)
45439 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45439:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45439 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45440:75: error: excess elements in vector initializer
45440 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45440:75: note: (near initialization for ‘simde_shuffle_’)
45440 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45440:78: error: excess elements in vector initializer
45440 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45440:78: note: (near initialization for ‘simde_shuffle_’)
45440 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45440:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45440 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmlaq_rot270_laneq_f16’:
/sw/sw/include/simde/arm/neon.h:45504:86: error: excess elements in vector initializer
45504 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45504:86: note: (near initialization for ‘simde_shuffle_’)
45504 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45504:89: error: excess elements in vector initializer
45504 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45504:89: note: (near initialization for ‘simde_shuffle_’)
45504 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45504:21: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45504 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                     ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45505:83: error: excess elements in vector initializer
45505 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45505:83: note: (near initialization for ‘simde_shuffle_’)
45505 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45505:86: error: excess elements in vector initializer
45505 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45505:86: note: (near initialization for ‘simde_shuffle_’)
45505 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45505:20: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45505 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                    ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45506:75: error: excess elements in vector initializer
45506 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45506:75: note: (near initialization for ‘simde_shuffle_’)
45506 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45506:78: error: excess elements in vector initializer
45506 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45506:78: note: (near initialization for ‘simde_shuffle_’)
45506 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45506:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45506 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 5, 0, 7, 2);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmla_rot90_lane_f16’:
/sw/sw/include/simde/arm/neon.h:45852:74: error: excess elements in vector initializer
45852 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45852:74: note: (near initialization for ‘simde_shuffle_’)
45852 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45852:77: error: excess elements in vector initializer
45852 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45852:77: note: (near initialization for ‘simde_shuffle_’)
45852 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45852:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45852 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45853:75: error: excess elements in vector initializer
45853 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45853:75: note: (near initialization for ‘simde_shuffle_’)
45853 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45853:78: error: excess elements in vector initializer
45853 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45853:78: note: (near initialization for ‘simde_shuffle_’)
45853 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45853:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45853 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmla_rot90_laneq_f16’:
/sw/sw/include/simde/arm/neon.h:45915:74: error: excess elements in vector initializer
45915 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45915:74: note: (near initialization for ‘simde_shuffle_’)
45915 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                          ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45915:77: error: excess elements in vector initializer
45915 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45915:77: note: (near initialization for ‘simde_shuffle_’)
45915 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                                                                             ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45915:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45915 |     a_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_.values, a_.values, 1, 1, 3, 3);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45916:75: error: excess elements in vector initializer
45916 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45916:75: note: (near initialization for ‘simde_shuffle_’)
45916 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45916:78: error: excess elements in vector initializer
45916 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45916:78: note: (near initialization for ‘simde_shuffle_’)
45916 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45916:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45916 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmlaq_rot90_lane_f16’:
/sw/sw/include/simde/arm/neon.h:45981:83: error: excess elements in vector initializer
45981 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45981:83: note: (near initialization for ‘simde_shuffle_’)
45981 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45981:86: error: excess elements in vector initializer
45981 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45981:86: note: (near initialization for ‘simde_shuffle_’)
45981 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45981:20: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45981 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                    ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45982:86: error: excess elements in vector initializer
45982 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45982:86: note: (near initialization for ‘simde_shuffle_’)
45982 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45982:89: error: excess elements in vector initializer
45982 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45982:89: note: (near initialization for ‘simde_shuffle_’)
45982 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45982:21: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45982 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                     ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45983:75: error: excess elements in vector initializer
45983 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45983:75: note: (near initialization for ‘simde_shuffle_’)
45983 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45983:78: error: excess elements in vector initializer
45983 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45983:78: note: (near initialization for ‘simde_shuffle_’)
45983 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:45983:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
45983 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                 ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h: In function ‘simde_vcmlaq_rot90_laneq_f16’:
/sw/sw/include/simde/arm/neon.h:46051:83: error: excess elements in vector initializer
46051 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:46051:83: note: (near initialization for ‘simde_shuffle_’)
46051 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                   ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:46051:86: error: excess elements in vector initializer
46051 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:46051:86: note: (near initialization for ‘simde_shuffle_’)
46051 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:46051:20: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
46051 |     a_low.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_low.values, a_low.values, 1, 1, 3, 3);
      |                    ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:46052:86: error: excess elements in vector initializer
46052 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:46052:86: note: (near initialization for ‘simde_shuffle_’)
46052 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                      ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:46052:89: error: excess elements in vector initializer
46052 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:46052:89: note: (near initialization for ‘simde_shuffle_’)
46052 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                                                                                         ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:46052:21: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
46052 |     a_high.values = SIMDE_SHUFFLE_VECTOR_(16, 4, a_high.values, a_high.values, 1, 1, 3, 3);
      |                     ^~~~~~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:46053:75: error: excess elements in vector initializer
46053 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:46053:75: note: (near initialization for ‘simde_shuffle_’)
46053 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                           ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:46053:78: error: excess elements in vector initializer
46053 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:46053:78: note: (near initialization for ‘simde_shuffle_’)
46053 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                                                                              ^
/sw/sw/include/simde/x86/avx2.h:7475:71: note: in definition of macro ‘SIMDE_SHUFFLE_VECTOR_’
 7475 |          int##elem_size##_t SIMDE_VECTOR(vec_size) simde_shuffle_ = { __VA_ARGS__ }; \
      |                                                                       ^~~~~~~~~~~
/sw/sw/include/simde/x86/avx2.h:7476:12: error: ‘__builtin_shuffle’ number of elements of the argument vector(s) and the mask vector should be the same
 7476 |            __builtin_shuffle(a, b, simde_shuffle_); \
      |            ^~~~~~~~~~~~~~~~~
/sw/sw/include/simde/arm/neon.h:46053:17: note: in expansion of macro ‘SIMDE_SHUFFLE_VECTOR_’
46053 |     b_.values = SIMDE_SHUFFLE_VECTOR_(16, 4, -b_.values, b_.values, 1, 4, 3, 6);
      |                 ^~~~~~~~~~~~~~~~~~~~~
mr-c commented 6 months ago

@kovidgoyal That is interesting. What is the compiler and version? Can you share a complete compiler invocation?

mr-c commented 6 months ago

For example, that codepath can only be reached with GCC older than 12.0, as the codepath listed above is the fallback for compilers that lack __builtin_shufflevector, which GCC gained in version 12: https://gcc.gnu.org/gcc-12/changes.html
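
To illustrate the two code paths, here is a minimal sketch; it is not SIMDe's actual code. It assumes that the second argument of `SIMDE_SHUFFLE_VECTOR_` is the vector size in bytes, as the `SIMDE_VECTOR(vec_size)` declaration in the log suggests. Under that assumption, `(16, 4, ...)` with four indices declares a mask of only two 16-bit lanes, which would explain both "excess elements in vector initializer" and the `__builtin_shuffle` element-count error. The names `v4i16`, `HAVE_SHUFFLEVECTOR` and `dup_odd_lanes` are hypothetical.

```c
#include <stdint.h>

/* Four 16-bit lanes need an 8-byte vector (GCC/Clang vector extension). */
typedef int16_t v4i16 __attribute__((vector_size(8)));

/* __has_builtin is itself missing on old GCC, so probe for it first. */
#if defined(__has_builtin)
#  if __has_builtin(__builtin_shufflevector)
#    define HAVE_SHUFFLEVECTOR 1 /* hypothetical helper macro */
#  endif
#endif

/* Returns { a[1], a[1], a[3], a[3] }. */
static v4i16 dup_odd_lanes(v4i16 a) {
#if defined(HAVE_SHUFFLEVECTOR)
    /* Clang, and GCC 12 or later: indices are passed as constants. */
    return __builtin_shufflevector(a, a, 1, 1, 3, 3);
#else
    /* Older GCC: the mask must be an integer vector with the same
       number of lanes and the same lane width as the inputs. */
    v4i16 mask = { 1, 1, 3, 3 };
    return __builtin_shuffle(a, a, mask);
#endif
}
```

With the mask declared as an 8-byte vector, the fallback branch accepts all four indices; a 4-byte mask type would hold only two of them.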

kovidgoyal commented 6 months ago

Compiler invocation

gcc -MMD -DNDEBUG -DHAS_COPY_FILE_RANGE -I/sw/sw/include -DPRIMARY_VERSION=4000 -DSECONDARY_VERSION=34 -DXT_VERSION="0.34.1" -Wextra -Wfloat-conversion -Wno-missing-field-initializers -Wall -Wstrict-prototypes -std=c11 -pedantic-errors -Werror -O3 -fwrapv -fstack-protector-strong -pipe -fvisibility=hidden -fno-plt -fPIC -I/sw/sw/include -D_FORTIFY_SOURCE=2 -flto -pthread -I/sw/sw/include/libpng16 -I/sw/sw/include -I/sw/sw/include -I/sw/sw/include -I/sw/sw/include/freetype2 -I/sw/sw/include/libpng16 -I/sw/sw/include -I/usr/include/uuid -I/sw/sw/include/harfbuzz -I/sw/sw/include/freetype2 -I/sw/sw/include/libpng16 -I/sw/sw/include -I/sw/sw/include/python3.11 -c kitty/simd-string-128.c -o build/fast_data_types-kitty-simd-string-128.c.o

Compiler version

gcc --version
gcc (Ubuntu 7.5.0-3ubuntu1~18.04) 7.5.0
Copyright (C) 2017 Free Software Foundation, Inc.
This is free software; see the source for copying conditions.  There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

Yes, it is pretty old. I build kitty binaries on old distros to maximize compatibility.

mr-c commented 6 months ago

FYI: for SIMDe we recommend adding -fopenmp-simd -DSIMDE_ENABLE_OPENMP for gcc, though I don't think that changes anything for this particular issue.
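
As a rough sketch of what those flags are for (not SIMDe's real macros): an opt-in define can turn a loop annotation into `#pragma omp simd`, which GCC honours under -fopenmp-simd without pulling in the OpenMP runtime. `EXAMPLE_ENABLE_OPENMP_SIMD`, `EXAMPLE_VECTORIZE` and `add_u8` are hypothetical names used only for illustration.

```c
#include <stddef.h>
#include <stdint.h>

/* Only emit the pragma when the build opts in, so compilers invoked
   without -fopenmp-simd do not warn about an unknown pragma. */
#if defined(EXAMPLE_ENABLE_OPENMP_SIMD)
#  define EXAMPLE_VECTORIZE _Pragma("omp simd")
#else
#  define EXAMPLE_VECTORIZE
#endif

/* Element-wise byte addition with wrap-around; the annotation asks
   the compiler to vectorize the loop. */
static void add_u8(uint8_t *dst, const uint8_t *a, const uint8_t *b, size_t n) {
    EXAMPLE_VECTORIZE
    for (size_t i = 0; i < n; i++) {
        dst[i] = (uint8_t)(a[i] + b[i]);
    }
}
```

The result is the same with or without the define; only the hint given to the compiler changes.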