This PR is the smallest change I could make to show that the changes in apache/lucene/pull/13149 work (the number of LongNrq queries could probably be reduced further).
I aimed to reproduce this on `wikimediumall`. The benchmark needs to be run with `optimize = True` for indexing and `commitPoint = 'single'` for the competition; otherwise it is hard to see the performance difference. The reason is that the BkdTree IntsWriter otherwise chooses too compact an encoding, since each segment contains too few documents.
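A rough sketch of why segment size matters here. This is not Lucene's actual encoding logic, only the underlying arithmetic, under the assumption that doc IDs are local to a segment, so a smaller segment needs fewer bits per doc ID. The function name and both segment sizes are made up for illustration.

```python
def bits_per_doc_id(max_doc: int) -> int:
    """Smallest number of bits that can hold every doc ID in [0, max_doc)."""
    if max_doc < 1:
        raise ValueError("max_doc must be at least 1")
    # The largest doc ID in the segment is max_doc - 1.
    return max(1, (max_doc - 1).bit_length())


# Hypothetical segment sizes, chosen only for illustration.
small_segment = 50_000        # one of many small segments
merged_segment = 30_000_000   # a single segment after a force merge

small_bits = bits_per_doc_id(small_segment)
merged_bits = bits_per_doc_id(merged_segment)

print(f"small segment:  {small_bits} bits per doc ID")
print(f"merged segment: {merged_bits} bits per doc ID")
```

With many small segments every doc ID fits in a narrow encoding, so the wider code path that this PR is meant to exercise may rarely be reached.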
I'm not sure if this should be merged - the PR is mostly here for reference.