castorini / pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
http://pyserini.io/
Apache License 2.0
1.57k stars 349 forks source link

Which tokenization technique is employed by BM25? #1875

Closed lxx1220 closed 2 months ago