Closed by ilyaraz 7 years ago
Very nice! Then we can get rid of the FHTHelper as well (https://github.com/FALCONN-LIB/FALCONN/blob/master/src/include/falconn/core/polytope_hash.h#L80).
To what extent is the code generated now? Could it potentially target AVX-512 for the new Core i9 CPUs? :-)
Yes, you simply need to re-implement the first several functions in gen.py. The same applies to SSE and to doubles.
Done in v1.1.
I wrote a new version of FFHT [1], which is quite a bit faster and does not require the array to be aligned. Here are the results for floats with AVX (doubles are not yet supported).
[1] https://github.com/ilyaraz/ffht_codegen
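For anyone following the thread who wants a reference point for what the transform computes, here is a minimal scalar sketch of the unnormalized fast Hadamard transform in Python. It is not the FFHT code and says nothing about how gen.py emits its AVX kernels; the function name `fht` is hypothetical and chosen only for illustration. A slow version like this can be handy for checking the output of a vectorized implementation on small inputs.

```python
def fht(a):
    """In-place, unnormalized fast Hadamard transform of a list.

    Illustrative scalar reference only (hypothetical helper, not part of FFHT).
    len(a) must be a power of two.
    """
    n = len(a)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    h = 1
    while h < n:
        # Butterfly step: combine elements that are h apart within blocks of 2*h.
        for start in range(0, n, 2 * h):
            for i in range(start, start + h):
                x, y = a[i], a[i + h]
                a[i], a[i + h] = x + y, x - y
        h *= 2
    return a
```

Because the transform is unnormalized, applying it twice returns the input scaled by the array length, which gives a cheap sanity check.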