Merge dataframe with second data (authors, institutions, countries)

Is your feature request related to a problem? Please describe.
The PDF formatting makes it difficult to scrape the authors and their institutions from the XML. Fortunately, there is another repository of the articles that makes this easier, and, even more fortunately, someone has already done the hard work of scraping it with Python and has added the country of affiliation for many institutions. The result is here: https://github.com/nd7141/icml2020

Describe the solution you'd like
Can the authors, institutions, and countries data scraped by the above GitHub user be merged into our dataframe and output as a single CSV file?

Describe alternatives you've considered
None yet. Will need to look into this!
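
To make the request concrete, here is a minimal sketch of the merge using pandas. Everything about the data layout is an assumption: I have not checked the schema of the nd7141/icml2020 output, so the column names (`title`, `authors`, `institutions`, `countries`), the choice of the paper title as the join key, the one-row-per-paper layout, and the helper names `normalize_title` and `merge_author_data` are all hypothetical and would need adjusting to the real files.

```python
import pandas as pd


def normalize_title(title: str) -> str:
    """Lowercase and collapse whitespace so minor formatting differences still match."""
    return " ".join(str(title).lower().split())


def merge_author_data(
    papers: pd.DataFrame, authors: pd.DataFrame, key: str = "title"
) -> pd.DataFrame:
    """Left-join author/institution/country columns onto the papers dataframe.

    Assumes both frames have a `key` column (hypothetical: the paper title)
    and that `authors` holds one row per paper.
    """
    papers = papers.copy()
    authors = authors.copy()

    # Build a normalized join key on both sides.
    papers["_key"] = papers[key].map(normalize_title)
    authors["_key"] = authors[key].map(normalize_title)

    # Keep the title from our own dataframe; keep one author row per paper.
    authors = authors.drop(columns=[key]).drop_duplicates(subset="_key")

    # how="left" keeps every paper; the indicator column records whether
    # a match was found ("both") or not ("left_only").
    merged = papers.merge(authors, on="_key", how="left", indicator="author_match")
    return merged.drop(columns="_key")


if __name__ == "__main__":
    # Made-up rows for illustration only, not real data from either source.
    papers = pd.DataFrame(
        {"title": ["Deep  Learning for X", "A Study of Y"], "n_pages": [9, 12]}
    )
    authors = pd.DataFrame(
        {
            "title": ["deep learning for x"],
            "authors": ["A. Author"],
            "institutions": ["Example University"],
            "countries": ["Exampleland"],
        }
    )
    merged = merge_author_data(papers, authors)
    merged.to_csv("merged.csv", index=False)  # single CSV output
```

The left join is deliberate: papers with no match in the second dataset stay in the output with empty author columns, and the `author_match` column makes those rows easy to find and fix by hand.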

earlng / academic-pdf-scrap (issue #3)