WojciechMula / toys

Storage for my snippets, toy programs, etc.
BSD 2-Clause "Simplified" License
316 stars 38 forks source link

avx512vbmi-remove-spaces #7

Open smallnamespace opened 5 years ago

smallnamespace commented 5 years ago

Instead of generating the addmask by repeatedly adding to the existing addmask in a loop, isn't the final mask simply the cumulative horizontal sum of the space mask?

Looks like a prefix sum can be computed the usual way, or can also be parallelized?

WojciechMula commented 5 years ago

@smallnamespace thank you very much! It indeed looks similar, I'll check this,