allenai / ir_datasets

Provides a common interface to many IR ranking datasets.
https://ir-datasets.com/
Apache License 2.0
317 stars, 42 forks

Use bibtex from [dblp, acl anthology, ir anthology, acm dl, elsewhere?] #182

Open seanmacavaney opened 2 years ago

seanmacavaney commented 2 years ago

Right now, the ir-datasets.bib file is a bit messy, with inconsistent IDs, fields, and formatting across records. It's probably best to take the entries from an established source, such as DBLP, the ACL Anthology, the ACM DL, and/or the IR Anthology.

I'm inclined to go with DBLP when possible. I believe it covers most (all?) of the papers we need, and its IDs look nice. The IR Anthology also takes its bibtex from DBLP.

In cases where DBLP doesn't cover the paper, we can just write the entry by hand and mimic DBLP's look/fields/etc.
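
To see how much of the file still needs attention, something like the following could list the entries whose IDs don't yet look like DBLP's. This is only a rough sketch: the function name `non_dblp_keys` is made up for illustration, and the key pattern is an assumption based on how DBLP-exported keys usually look (`DBLP:<type>/<venue>/<id>`), not an official specification.

```python
import re

# Assumption: DBLP-exported keys have the shape "DBLP:<type>/<venue>/<id>",
# e.g. "DBLP:conf/<venue>/<id>" or "DBLP:journals/<venue>/<id>".
DBLP_KEY = re.compile(r"^DBLP:[a-z]+/[^/\s]+/[^\s,]+$")

# Matches the start of a BibTeX record: "@type{key,"
ENTRY_START = re.compile(r"^@(\w+)\s*\{\s*([^,\s]+)\s*,", re.MULTILINE)


def non_dblp_keys(bibtex_text):
    """Return (entry_type, key) pairs whose key does not match the assumed DBLP pattern."""
    return [
        (match.group(1).lower(), match.group(2))
        for match in ENTRY_START.finditer(bibtex_text)
        if not DBLP_KEY.match(match.group(2))
    ]
```

Calling `non_dblp_keys(open("ir-datasets.bib").read())` would give the records to replace or restyle by hand. It only looks at IDs, so field and formatting differences would still need a manual pass.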

I don't think there's a need to pull the bibtex automatically -- I'm happy just to copy it over.