issues
search
allenai
/
ir_datasets
Provides a common interface to many IR ranking datasets.
https://ir-datasets.com/
Apache License 2.0
306
stars
40
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
How to construct invert index using ir_datasets tools?
#215
VincentXWD
closed
1 year ago
2
Remove duplicate bib
#214
heinrichreimer
closed
1 year ago
0
Clueweb22
#213
heinrichreimer
closed
2 months ago
25
Fix typo
#212
heinrichreimer
closed
1 year ago
0
Touché 2022
#211
heinrichreimer
closed
1 year ago
6
ClueWeb22
#210
heinrichreimer
opened
1 year ago
9
File structure stated in msmarco_passage.py is not aligned with downloaded top1000.dev.tar.gz
#209
yuenherny
opened
1 year ago
1
UnicodeDecodeError: 'charmap' codec can't decode byte 0x9d in position 3417: character maps to <undefined> when trying to decode docs
#208
yuenherny
opened
1 year ago
6
Dataset download started but no docs were downloaded
#207
yuenherny
closed
1 year ago
1
Permissions error on /tmp/ir_dataset directory due to multiple users on the same server
#206
mitgosp
opened
1 year ago
5
Istella22 update links
#205
seanmacavaney
closed
1 year ago
0
istella22/source moved
#204
seanmacavaney
closed
1 year ago
0
Remove redundant query arg
#203
heinrichreimer
closed
1 year ago
1
Fix Touché file URLs
#202
heinrichreimer
closed
1 year ago
1
Fix and add html extractor
#201
grodino
closed
1 year ago
5
trec-dl-2022 topics and scoreddocs
#200
seanmacavaney
closed
1 year ago
0
Istella22
#199
seanmacavaney
closed
1 year ago
0
Add clueweb12 diversity task datasets
#198
grodino
closed
2 years ago
3
Add ClueWeb09/ClueWeb12 diversity track data
#197
grodino
closed
2 years ago
3
trec fair ranking 2022
#196
seanmacavaney
closed
2 years ago
0
trec-ct-2022 topics
#195
seanmacavaney
closed
2 years ago
0
trec-ct-2022 topics
#194
seanmacavaney
closed
2 years ago
0
A Dataset for Sentence Retrieval for Open-Ended Dialogues
#193
seanmacavaney
opened
2 years ago
0
codec v1 release
#192
seanmacavaney
closed
2 years ago
0
handling .z files as gzip
#191
seanmacavaney
opened
2 years ago
3
Hotfix for #188
#190
ArthurCamara
closed
2 years ago
2
TrecDocs: .Z and .z files are different.
#189
ArthurCamara
opened
2 years ago
7
Disks45 cannot read docs from plain text files.
#188
ArthurCamara
closed
2 years ago
1
DuReader
#187
seanmacavaney
opened
2 years ago
0
Allow downloads to resume for all MSMARCO dataset resources larger than 500MB
#186
kaglowka
closed
2 years ago
2
Fix TREC Genomics Track 2005 description
#185
cakiki
closed
2 years ago
1
Direct access to all doc_ids
#184
ArthurCamara
opened
2 years ago
5
WANDS
#183
seanmacavaney
opened
2 years ago
0
Use bibtex from [dblp, acl anthology, ir anthology, acm dl, elsewhere?]
#182
seanmacavaney
opened
2 years ago
0
neuMARCO
#181
seanmacavaney
closed
2 years ago
0
CURE
#180
seanmacavaney
opened
2 years ago
0
NeuCLIR Collection 1 (documents and HC4-filtered subset)
#179
eugene-yang
closed
2 years ago
1
wikiclir
#178
seanmacavaney
closed
2 years ago
0
WikiCLIR
#177
seanmacavaney
closed
2 years ago
0
cache hc4 topics/qrels
#176
seanmacavaney
closed
2 years ago
0
local datasets
#175
seanmacavaney
opened
2 years ago
0
fixed and tested issue affecting some clueweb lookups
#174
seanmacavaney
closed
2 years ago
0
improved HTML/XML parser, TREC 7 and 8
#173
seanmacavaney
closed
2 years ago
0
CODEC
#172
seanmacavaney
closed
2 years ago
0
some trec 2021 qrels released
#171
seanmacavaney
closed
2 years ago
0
TREC Health Misinformation 2022
#170
seanmacavaney
opened
2 years ago
0
TREC Fair Ranking 2022
#169
seanmacavaney
opened
2 years ago
0
TREC Deep Learning 2022
#168
seanmacavaney
closed
1 year ago
5
TREC CrisisFacts 2022
#167
seanmacavaney
opened
2 years ago
0
TREC CAsT 2022
#166
seanmacavaney
opened
2 years ago
0
Previous
Next