Add more realistic Text HashTable tests

rurban / smhasher

Hash function quality and speed tests

https://rurban.github.io/smhasher/

Other

1.84k stars 177 forks source link

Add more realistic Text HashTable tests #61

Closed rurban closed 4 years ago

rurban commented 5 years ago

Find the distribution of the most common chars in programming language names (ean...) Test against randomly generated names from those with typical short lengths (see rurban/perl-hash-stats).

And test a more common hash table speed scenario (I-Cache), to verify why FNV1A and Spooky32 are so much better there than the others.

rurban commented 4 years ago

Also see Yann's new benchmark suite for xxh3: https://github.com/Cyan4973/xxHash/tree/dev/tests/bench

dumblob commented 4 years ago

Also see Yann's new benchmark suite for xxh3: https://github.com/Cyan4973/xxHash/tree/dev/tests/bench

Oh, I'm sorry, that's the one I meant in all my comments in other threads (didn't notice it's still just in the dev branch).

@sanmayce @wangyi-fudan please note the linked Yann's benchmark from my comments seems to be the old one, but I actually meant the new one which sits in the dev branch. Feel free to take a look.

Sanmayce commented 4 years ago

It's confusing, I took the bench suite from the latest 'xxHash-0.7.2.zip' from October, what makes this 'dev' more interesting, it is much older?

dumblob commented 4 years ago

@Sanmayce can't tell about the release archive, but the current master<->dev branch doesn't differ much - see https://github.com/Cyan4973/xxHash/compare/master...dev

rurban commented 4 years ago

the benchmarks are the same: https://github.com/Cyan4973/xxHash/compare/v0.7.2...dev

rurban commented 4 years ago

Merged branch HashMapTest with b1ad36aaf820cb5d267500c270e0bd3fb2195aa0