pisa-engine / pisa

PISA: Performant Indexes and Search for Academia
https://pisa-engine.github.io/pisa/book
Apache License 2.0
925 stars 64 forks source link

Reorder by URL #446

Open amallia opened 3 years ago

amallia commented 3 years ago

For some weird reason reordering by URL does not work when using https://github.com/pisa-engine/pisa/blob/master/tools/reorder_docids.cpp

It does work if we use this external script instead: https://github.com/pisa-engine/pisa/blob/master/script/generate_sorted_docids_mapping.py

elshize commented 2 years ago

If you remember, can you clarify "does not work"? The only difference between these two I can see is that the python script lowercases the values. Could that be the discrepancy?