Open Muennighoff opened 3 months ago
Looking at where self.repo_name
is defined: https://github.com/embeddings-benchmark/arena/blob/64a8780d596018912905523406621eed62a9a417/retrieval/bm25_index.py#L16
Maybe model_name
has spaces in it, which is not alphanumeric?
I think the problem is that sometimes the model name is turned into ### model a: bm25
rather than bm25
and this leads to this error; I'm not sure when exactly
Maybe we can directly feed bm25
as the model_name here?
def retrieve(self, query, corpus, model_name, topk=1):
corpus_format = CORPUS_TO_FORMAT[corpus]
if "BM25" in model_name:
index = self.load_bm25_index(model_name, corpus)
Not sure what happened but saw this in the logs: