IllDepence / unarXive

A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network
MIT License

Questions about the authors in this dataset #20

Closed Zivenzhu closed 1 year ago

Zivenzhu commented 1 year ago

Dear developers, I have a question: does the dataset include any information about the authors of each paper, such as author descriptions?

IllDepence commented 1 year ago

Hi,

as stated in the data format documentation, each paper’s metadata field contains metadata from kaggle.com/datasets/Cornell-University/arxiv. If you look at it there, you’ll find there are two fields: authors and authors_parsed.
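
For illustration, here is a minimal sketch of reading those two fields from one paper record. It assumes each record is a single JSON object per line with a top-level metadata key, and that authors_parsed holds [last, first, suffix] lists as in the Kaggle arXiv metadata. The helper name extract_authors and the sample record are made up for this example and are not part of unarXive.

```python
import json


def extract_authors(jsonl_line):
    """Return (authors_string, parsed_names) for one paper record.

    Hypothetical helper: assumes the record is a JSON object with a
    "metadata" dict holding "authors" and "authors_parsed".
    """
    paper = json.loads(jsonl_line)
    metadata = paper.get("metadata") or {}
    authors = metadata.get("authors", "")
    parsed = metadata.get("authors_parsed") or []

    names = []
    for entry in parsed:
        # Assumed layout per entry: [last name, first name, suffix]
        last = entry[0] if len(entry) > 0 else ""
        first = entry[1] if len(entry) > 1 else ""
        names.append(" ".join(part for part in (first, last) if part))
    return authors, names


# Made-up record for demonstration only; not real data from the dataset.
sample_line = json.dumps({
    "paper_id": "0000.00000",
    "metadata": {
        "authors": "Ada Example and Bob Sample",
        "authors_parsed": [["Example", "Ada", ""], ["Sample", "Bob", ""]],
    },
})

authors, names = extract_authors(sample_line)
print(authors)
print(names)
```

Note that both fields only carry names; neither contains author descriptions such as affiliations or biographies.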

Zivenzhu commented 1 year ago

Oh, I see. Thanks for your help. Hope you have a nice day.

IllDepence commented 1 year ago

thanks — you too