Open ChristinaGitHub1 opened 3 years ago
well that never happens, quick google, found the right package, vignette examples super easy to follow and worked out how to automagically pull all the meta data needed. Package = rcrossref
https://ciakovx.github.io/rcrossref.html#Searching_by_DOI
Only problem is the it keeps the "author" field as a list column to allow for there to be more than 1 author. There is an example in the vignette of how to unnest it, but it creates a row for every author, which is not really what we want. In the list format, it wont write to csv, because csv is a "flat" file format.
Your task- what format do you want author info in? just the first author? all of them in a single cell? can you work out how to turn that list column into something useful?
Jenny
Amazing, thanks Jenny, what a good find!!
I'll look into the list problem and see what I can come up with!
Hi Jenny!
I've created a csv sample of the metadata for 1B. It's called sample_metadata_1B and its in the data_files folder. Let me know if this link works!
https://github.com/jenrichmond/christina_hons/blob/1a863e4b1a1fb94cdc572fab3c6d491756f78a1d/data_files/sample_metadata_1B.csv