Georgetown-IR-Lab / cedr

Code for CEDR: Contextualized Embeddings for Document Ranking, accepted at SIGIR 2019.
MIT License
156 stars 28 forks source link

Question about Robust04 #15

Closed Deriq-Qian-Dong closed 4 years ago

Deriq-Qian-Dong commented 4 years ago

Hi, I am intersted to your work. Could you release the codes about how to preprocess the raw TREC Robust04 dataset? I want to reproduce the results in your paper. However I failed, I think the problem must in the step of data-preprocessing. Appreciate for your help~

seanmacavaney commented 4 years ago

Hi @DQ0408,

Unfortunately the collection license agreement prevents us from distributing it. However, I'm currently in the process of putting together code that should make it easier to reproduce the results. I'll keep you posted.

Deriq-Qian-Dong commented 4 years ago

Hi, thank you for your reply. I already have the data of collection. Is it possible to release the data-preprocessing codes only? If not, could you give me some hints about this step? Really appreciate for your help~

seanmacavaney commented 4 years ago

Yup-- it will include code for all the pre-processing/indexing/etc.

Deriq-Qian-Dong commented 4 years ago

Looking forward to your next works!