Closed xhluca closed 1 month ago
BM25.retrieve
Tokenized
ids
vocab
List[List[int]]
List[List[str]]
Tokenizer.tokenize
Tokenizer.streaming_tokenize
TODO
BM25.retrieve
under each possible condition:Tokenized
namedtupleids
andvocab
Tokenized
namedtuple) of ids and vocabList[List[int]]
List[List[str]]
Tokenizer.tokenize
, where it generates:Tokenized
namedtupleids
andvocab
Tokenized
namedtuple) of ids and vocabList[List[int]]
List[List[str]]
Tokenizer.streaming_tokenize