allenai / s2orc

S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/
800 stars 64 forks source link

How to identify the category of each paper #18

Closed iriscxy closed 4 years ago

iriscxy commented 4 years ago

In figure 2 we can see the Distribution of papers by Microsoft Academic field of study. How to identify this category information in the provided dataset?

kyleclo commented 4 years ago

@yingtaomj See the metadata.csv files have provided fields of study per paper

kyleclo commented 4 years ago

For example, each row looks like

{"paper_id": "77490025", "title": "State... "year": 1975, "mag_field_of_study": ["Medicine"], ...}