Closed chsasank closed 5 years ago
@chsasank I am working on open-sourcing the scripts that were used to generate the entire dataset. Keep an eye out next week.
Hey, just an update. I have uploaded the script that we used to generate the ranking triples. It's a scope script, but it should be pretty easy to understand. https://github.com/dfcf93/MSMARCOV2/blob/master/Ranking/GenerateData.script
Hi,
It's not clear how triples were generated. Your documentation says:
But it also says
How are negative passages generated if is_selected:0 is not a true negative? Could you please open-source the code used to generate these triples?
I think the documentation for the dataset needs work. Given how useful the dataset is, it would be a shame if people were unable to use it because of the documentation.