Heliosearch / heliosearch

The next generation of open source search
http://heliosearch.org
91 stars 19 forks source link

DocSetCollector that works with unordered ids + duplicates #19

Closed yonik closed 10 years ago

yonik commented 10 years ago

Essentially, an alternate way to collect ids when there aren't enough of them to justify setting bits in a bitset. Seems like we should sort the list of ids, then copy to a new array (while removing duplicates). A more efficient sort (hybrid MSD radix sort?) would be a bonus.