measured on an intel platform (avx2 build, gcc 9.2.0)
hopefully, it might be useful for someone to investigate where the bottlenecks are
paq8px.exe -8 data.dat // default CM
overview
paq8px.exe -8l data.dat // default CM
overview (ltsm)
paq8px.exe -8 data.dat // default CM
tree
paq8px.exe -8 data.dat // default CM
tree (ltsm)
here's a profile from https://software.intel.com/content/www/us/en/develop/tools/vtune-profiler.html
measured on an intel platform (avx2 build, gcc 9.2.0)
hopefully, it might be useful for someone to investigate where the bottlenecks are
paq8px.exe -8 data.dat // default CM overview paq8px.exe -8l data.dat // default CM overview (ltsm) paq8px.exe -8 data.dat // default CM tree paq8px.exe -8 data.dat // default CM tree (ltsm)