fastlib / fCWT

The fast Continuous Wavelet Transform (fCWT) is a library for fast calculation of the CWT.
Apache License 2.0

ensure proper 32 byte alignment for avx2 instruction usage #14

Closed lschneiderbauer closed 1 year ago

lschneiderbauer commented 1 year ago

On some systems and with certain compiler options, _mm256_mul_ps segfaults because the memory it operates on is not 32-byte aligned. This pull request changes the relevant memory allocations so that they are always 32-byte aligned.
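For illustration only, here is a minimal sketch of what a 32-byte-aligned allocation can look like in C++17. It is not the code changed in this pull request: the names `kAvxAlignment`, `is_aligned`, `FreeDeleter`, `AlignedFloats` and `alloc_aligned_floats` are hypothetical, and it assumes a platform that provides `std::aligned_alloc` (MSVC, for example, does not).

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#include <memory>

// Alignment expected by aligned 256-bit AVX loads and stores.
constexpr std::size_t kAvxAlignment = 32;

// Hypothetical helper: true if p sits on an `alignment`-byte boundary.
inline bool is_aligned(const void* p, std::size_t alignment = kAvxAlignment) {
    return reinterpret_cast<std::uintptr_t>(p) % alignment == 0;
}

// Memory from std::aligned_alloc must be released with std::free.
struct FreeDeleter {
    void operator()(void* p) const noexcept { std::free(p); }
};

using AlignedFloats = std::unique_ptr<float[], FreeDeleter>;

// Hypothetical helper: allocate `count` floats on a 32-byte boundary.
// Returns an empty pointer if the allocation fails.
inline AlignedFloats alloc_aligned_floats(std::size_t count) {
    // std::aligned_alloc requires the size to be a multiple of the
    // alignment, so round the byte count up to the next multiple of 32.
    std::size_t bytes = count * sizeof(float);
    bytes = (bytes + kAvxAlignment - 1) / kAvxAlignment * kAvxAlignment;
    if (bytes == 0) bytes = kAvxAlignment;

    void* raw = std::aligned_alloc(kAvxAlignment, bytes);
    assert(raw == nullptr || is_aligned(raw));
    return AlignedFloats(static_cast<float*>(raw));
}
```

A buffer obtained this way can be checked with `is_aligned(buf.get())` before it is handed to AVX code. Plain `malloc` or `new float[n]` typically guarantees only 16-byte alignment on common 64-bit platforms, which would explain why the crash shows up only on some systems.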

Fixes #11 and possibly #8.