castorini / anserini

Anserini is a Lucene toolkit for reproducible information retrieval research
http://anserini.io/
Apache License 2.0
1.01k stars 444 forks source link

Simplify options for HNSW indexes #2533

Closed lintool closed 2 months ago

lintool commented 2 months ago

Continuation of #2531

HNSW regressions for MS MARCO regressions: revert to "default settings". (previous PR was for BEIR)

codecov[bot] commented 2 months ago

Codecov Report

All modified and coverable lines are covered by tests :white_check_mark:

Project coverage is 67.18%. Comparing base (afa6ab0) to head (ff1d7f8).

Additional details and impacted files ```diff @@ Coverage Diff @@ ## master #2533 +/- ## ========================================= Coverage 67.18% 67.18% Complexity 1481 1481 ========================================= Files 219 219 Lines 12645 12645 Branches 1528 1528 ========================================= Hits 8495 8495 Misses 3624 3624 Partials 526 526 ```

:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Have feedback on the report? Share it here.

lintool commented 2 months ago

Hey @MXueguang @tteofili - this is ready for review. I've re-run regressions and everything checks out.

I need to go back and readjust tolerances for scores, but will circle back with a follow-up PR there.