Open CosimoRulli opened 10 months ago
Hi @CosimoRulli
Sorry for the late reply! I think the issue is that the model checkpoint is not being loaded correctly. According to the README, if you only want to evaluate the model from an existing checkpoint, you should add the init_dict.model_type_or_dir line and run:
python3 -m splade.beir_eval \
init_dict.model_type_or_dir=naver/splade-cocondenser-ensembledistil \
config.pretrained_no_yamlconfig=true \
+beir.dataset=$dataset \
+beir.dataset_path=data/beir \
config.index_retrieve_batch_size=100
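Since the question is about all of BEIR, the command above could be wrapped in a loop over dataset names. This is only a minimal sketch: the dataset list is an illustrative subset I am assuming (edit it to match what you have under data/beir), beir_eval_cmd is a hypothetical helper name, and the script only prints the commands rather than running them.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Assumption: an illustrative subset of BEIR dataset names; edit as needed.
datasets=(arguana scifact nfcorpus)

# Hypothetical helper: prints the evaluation command from this thread
# for a single dataset (it does not execute anything).
beir_eval_cmd() {
  local dataset="$1"
  printf '%s ' python3 -m splade.beir_eval \
    init_dict.model_type_or_dir=naver/splade-cocondenser-ensembledistil \
    config.pretrained_no_yamlconfig=true \
    "+beir.dataset=${dataset}" \
    +beir.dataset_path=data/beir \
    config.index_retrieve_batch_size=100
  printf '\n'
}

# Dry run: one command line per dataset is written to stdout.
for dataset in "${datasets[@]}"; do
  beir_eval_cmd "$dataset"
done
```

Once the printed commands look right, piping the script's output to bash would run them one after another. It may be worth checking that the first dataset gives a sensible NDCG before launching the rest.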
Let me know if that works! Best
Hi, thank you for sharing and maintaining this repo! I would like to generate the SPLADE representations of both documents and queries for all the datasets in BEIR, similar to what the create_anserini script does for the MSMARCO dataset. I would like to do this for both splade-cocondenser-ensembledistil and efficient-splade-V-large.
I tried to run the following script, but I get NDCG=0.001 on the arguana dataset (I then stopped the script because I guess something is wrong). What am I doing wrong? Also, does this script save the embeddings of each dataset? If not, how can I force it to save them?