Closed NohTow closed 4 months ago
The original implementation of ColBERT leverage a skiplist to not use certain tokens (mostly punctuation) for representing the text (they are used for the encoding, but are then discarded).
Solved in #15.
The original implementation of ColBERT leverage a skiplist to not use certain tokens (mostly punctuation) for representing the text (they are used for the encoding, but are then discarded).