data61 / clkhash

CLK hash: hash pii for entity matching
Apache License 2.0
47 stars 9 forks source link

Feature/cache hashed tokens #664

Closed hardbyte closed 1 year ago

hardbyte commented 1 year ago
Workers Hashes Hash Rate Before changes (KH/s) Hash Rate after changes (KH/s)
0 10k 6.06 15.64
2 10k 11.68 28.86
2 50k 11.65 28.91
2 100k 11.54 29.21
4 10k 20.74 44.54
4 50k 21.03 49.00
4 100k 21.58 49.10
8 10k 29.94 57.55
8 50k 34.17 65.49
8 100k 36.06 70.50
16 10k 40.21 70.50
16 50k 45.87 80.57
16 100k 47.90 84.17
codecov[bot] commented 1 year ago

Codecov Report

Merging #664 (704367e) into master (a836c0e) will increase coverage by 0.06%. The diff coverage is 98.18%.

Additional details and impacted files ```diff @@ Coverage Diff @@ ## master #664 +/- ## ========================================== + Coverage 94.35% 94.42% +0.06% ========================================== Files 13 13 Lines 957 969 +12 ========================================== + Hits 903 915 +12 Misses 54 54 ```
wilko77 commented 1 year ago

Cache stats for the test file: blake_encode_token cache info: CacheInfo(hits=14723511, misses=23030, maxsize=32768, currsize=23030) _blake_encode_ngrams cache info: CacheInfo(hits=1471916, misses=800153, maxsize=131072, currsize=131072)