IllDepence / unarXive

A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network
MIT License
259 stars 19 forks source link

For some papers, references are only matched up to part of the bib_entries list #11

Open IllDepence opened 1 year ago

IllDepence commented 1 year ago

An example is the first line in arXiv_src_2105_034.jsonl (paper 2105.05883) for which only the first 14 out of 20 entries in bib_entries were extended with an ids field (i.e. processed by the matching script).

Notes: