castorini / pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
http://pyserini.io/
Apache License 2.0
1.57k stars 349 forks source link

Onboarding tasks billycz8 #1882

Closed billycz8 closed 2 months ago

billycz8 commented 2 months ago

System setup:

OS: macOS Ventura 13.4 Memory: 16GB Chip: Apple M1 Pro Python Version: 3.10.13 Java Version: 21.0.3 Maven: 3.9.2

Everything worked perfectly!

Suggestion: When I went through Pyserini: BGE-base Baseline for NFCorpus / Data Prep, I was curious why we added Q0 instead of 0 as the artifact in Anserini. It seems to be used in trec_eval.py, and I think explaining it would be helpful for others to understand the reason of the change. (I have added the change in the PR, we could reword or ignore that change if it is not useful.)