earlng / academic-pdf-scrap

Code that scrapes the contents of the PDF papers submitted to NeurIPS 2020
MIT License
4 stars, 2 forks

Include paper link in dataframe #7

Closed: paulsedille closed this issue 3 years ago

paulsedille commented 3 years ago

Is your feature request related to a problem? Please describe.
The current dataframe only includes the "paper identifier" (an alphanumeric string). That identifier could easily be used to also provide the URL of the paper itself, which would be a useful addition.

Describe the solution you'd like
Add a "paper link" column to the dataframe by concatenating a fixed prefix and a fixed suffix around the paper identifier, as follows: "https://proceedings.neurips.cc/paper/2020/file/" + {paper identifier} + "-Paper.pdf" (all paper links are built like this).
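
To make the request concrete, here is a minimal sketch of the link construction. It is not taken from the repository: the helper name `build_paper_link`, the keys `paper_id` and `paper_link`, and the identifier `abc123` are hypothetical placeholders. Only the prefix and suffix strings come from the description above.

```python
# Prefix and suffix quoted from the issue description.
LINK_PREFIX = "https://proceedings.neurips.cc/paper/2020/file/"
LINK_SUFFIX = "-Paper.pdf"


def build_paper_link(paper_id: str) -> str:
    """Return the PDF URL for a paper identifier (hypothetical helper)."""
    # Strip stray whitespace so the identifier slots cleanly into the URL.
    return f"{LINK_PREFIX}{paper_id.strip()}{LINK_SUFFIX}"


# Hypothetical rows standing in for the dataframe; "abc123" is a
# placeholder, not a real paper identifier.
rows = [{"paper_id": "abc123"}]
for row in rows:
    row["paper_link"] = build_paper_link(row["paper_id"])

print(rows[0]["paper_link"])
# https://proceedings.neurips.cc/paper/2020/file/abc123-Paper.pdf
```

If the dataframe is a pandas DataFrame with the identifier in a column (assumed here to be named `paper_id`), the same helper could probably be applied with `df["paper_link"] = df["paper_id"].map(build_paper_link)`.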