Open hellrich opened 6 years ago
Uses fixed-seed probabilistic downsampling; should be weighted instead.
Processing used google_books_parts2counts, and it is not a good idea to have two similar tools. The same change applies here too: https://github.com/hellrich/hyperwords/blob/master/hyperwords/google_books_parts2counts.py
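
A minimal sketch of the difference, assuming "weighted" means scaling each count by its keep probability instead of randomly dropping occurrences. The function names, the `threshold` default, and the word2vec-style keep probability `sqrt(threshold / freq)` are illustrative assumptions, not code taken from either script:

```python
import math
import random


def keep_probability(freq, threshold=1e-5):
    """Assumed word2vec-style keep probability for a relative frequency."""
    if freq <= threshold:
        return 1.0
    return math.sqrt(threshold / freq)


def probabilistic_downsample(count, p_keep, rng):
    """Keep each of `count` occurrences independently with probability p_keep.

    The result depends on the random generator, so a fixed seed only makes
    one particular random outcome reproducible.
    """
    return sum(1 for _ in range(count) if rng.random() < p_keep)


def weighted_downsample(count, p_keep):
    """Deterministic alternative: scale the count by the keep probability."""
    return count * p_keep


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed, as in the current behaviour
    p = keep_probability(4e-5)
    print(probabilistic_downsample(1000, p, rng))  # varies with the seed
    print(weighted_downsample(1000, p))            # same value on every run
```

With the weighted variant the resulting counts are fractional, so anything downstream that expects integer counts would need to accept floats.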