issues
search
allenai
/
ir_datasets
Provides a common interface to many IR ranking datasets.
https://ir-datasets.com/
Apache License 2.0
306
stars
39
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
MS MARCO v2.1 and v2.1 segmented for TREC 2024 RAG
#267
mam10eks
opened
2 weeks ago
1
add trec-cast/v2 and v3 metadata
#266
andreaschari
closed
1 month ago
0
Fix/count
#265
bpiwowar
closed
2 weeks ago
3
add trec ikat 2023
#264
SimonLupart
opened
2 months ago
0
Update tsv.py
#263
tonellotto
closed
2 months ago
0
fix wrong urls for neuclir docs
#262
andreaschari
closed
2 months ago
1
detect google colab and use gsutil (for NQ)
#261
cmacdonald
closed
2 months ago
0
TREC iKAT 2023/2024
#260
SimonLupart
opened
2 months ago
6
Make pyautocorpus an optional dependency
#259
cmacdonald
closed
2 months ago
0
neuclir23
#258
seanmacavaney
closed
2 months ago
1
fix location of msmarco source files and bump version
#257
seanmacavaney
closed
5 months ago
0
MSMARCO URLs moved to another domain
#256
TheMrSheldon
closed
4 months ago
4
TREC CaST
#255
bpiwowar
closed
1 month ago
26
Unified getter for the relevance level
#254
TheMrSheldon
opened
5 months ago
1
BioASQ
#253
MathVast
opened
5 months ago
0
Use provided id_field
#252
bpiwowar
closed
6 months ago
1
Cannot read LoTTE docs
#251
ftvalentini
opened
7 months ago
0
Add BioASQ dataset to the list of supported BEIR datasets
#250
MathVast
opened
9 months ago
2
Add test dataset for trec tip of the tongue dataset
#249
mam10eks
opened
10 months ago
1
MIRACL
#248
seanmacavaney
closed
11 months ago
0
fix: fix the bug of considering the Path as a str when loading from TREC dataset
#247
yzong12138
closed
5 months ago
4
IO error related to the dataset compressed with .Z
#244
yzong12138
closed
5 months ago
2
MS MARCO Passage v2 deduplicated version
#243
seanmacavaney
closed
11 months ago
0
TREC DL 2023 Topics
#242
seanmacavaney
closed
11 months ago
0
TIPSTER corpus
#241
breuert
opened
1 year ago
0
Switch from namedtuple to dataclasses
#240
bpiwowar
closed
1 year ago
5
trec-dl-2022 qrels
#239
seanmacavaney
closed
1 year ago
0
TREC tip-of-the-tongue
#238
seanmacavaney
closed
1 year ago
2
`ir_datasets` writes to `$HOME` which makes reproducibility hard
#237
MangoIV
opened
1 year ago
0
Downloading Natural Questions Dev also grabs Train
#236
kyleclo
opened
1 year ago
0
TREC 2023 Tip-of-the-Tongue
#235
mam10eks
opened
1 year ago
3
LongEval Retrieval (used at CLEF 2023)
#234
mam10eks
opened
1 year ago
6
Update crisisfacts.py
#233
richardmcc
closed
1 year ago
0
what is a "`streamer`"
#232
MangoIV
opened
1 year ago
1
[MINOR:TYPO] Update msmarco-passage.yaml
#231
cakiki
closed
1 year ago
1
Dataset definition (in `ir_datasets/datasets/[topid].py`)
#230
seanmacavaney
closed
1 year ago
0
t2ranking
#229
seanmacavaney
opened
1 year ago
0
Lock for writing to the cache files
#228
eugene-yang
opened
1 year ago
2
A question about WikiIR dataset
#227
TheTahaaa
closed
1 year ago
3
defaulttext
#226
seanmacavaney
closed
1 year ago
0
Adding the SARA dataset
#225
JackMcKechnie
closed
1 year ago
0
Add args.me default text
#224
heinrichreimer
closed
1 year ago
1
ERROR: Failed building wheel for zlib-state
#223
km5ar
opened
1 year ago
3
lz4f_decompress failed with code: ERROR_frametype_unknown
#222
yogeswarl
closed
1 year ago
2
BEIR cqadupstack
#221
jobergum
closed
1 year ago
2
msmarco-passage/dev/2
#220
seanmacavaney
closed
1 year ago
0
beir evaluation quirks
#219
seanmacavaney
opened
1 year ago
0
Touche 2023
#218
heinrichreimer
opened
1 year ago
3
S3 or other file IO backends
#217
heinrichreimer
opened
1 year ago
1
Disable MetadataComponent for local development
#216
heinrichreimer
opened
1 year ago
0
Next