Closed ikuyamada closed 5 years ago
+1 Same problem in the triples.train.small.tsv file:
sed -n '43427,43427p' triples.train.small.tsv When you're on a call or listening to voicemail on your iPhone, you might not be able to hear a person's voice clearly. Or you might hear crackling, static, or generally poor sound quality. Follow the steps below to resolve the issue.
Update this should be fixed later on today.
I've downloaded the train triples small file but it seems that the problem persists:
https://msmarco.blob.core.windows.net/msmarcoranking/triples.train.small.tar.gz
The md5 checksum is still the same from the old version.
36e27d06e66b85957eb774b5504723a6
Describe the bug
A lot of invalid line breaks are contained in the top1000 TSV files of the reranking datasets. For example, line 234472 in the top1000.dev.tsv does not start with the IDs.
To Reproduce