jenrichmond / same_page

Sample of metadata for 1B #3

Open ChristinaGitHub1 opened 3 years ago

ChristinaGitHub1 commented 3 years ago

Hi Jenny!

I've created a csv sample of the metadata for 1B. It's called sample_metadata_1B and it's in the data_files folder. Let me know if this link works!

https://github.com/jenrichmond/christina_hons/blob/1a863e4b1a1fb94cdc572fab3c6d491756f78a1d/data_files/sample_metadata_1B.csv

jenrichmond commented 3 years ago

https://github.com/jenrichmond/christina_hons/blob/5f2c4bee928312db9ba860660894c446b3c7ded2/meta.R#L1

Well, that never happens: a quick Google, found the right package, the vignette examples were super easy to follow, and I worked out how to automagically pull all the metadata we need. Package = rcrossref https://ciakovx.github.io/rcrossref.html#Searching_by_DOI

The only problem is that it keeps the "author" field as a list column, to allow for more than one author. There is an example in the vignette of how to unnest it, but that creates a row for every author, which is not really what we want. In the list format it won't write to csv, because csv is a "flat" file format.

Your task: what format do you want the author info in? Just the first author? All of them in a single cell? Can you work out how to turn that list column into something useful?
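
If it helps as a starting point, here is a rough sketch of both options in base R. It runs on a made-up toy table rather than real rcrossref output, and it assumes each entry in the author list column is a small data frame with `given` and `family` columns. The DOIs, the names, and the helper functions `first_author` and `all_authors` are all hypothetical, and real records may have missing name parts that would need extra handling.

```r
# Toy stand-in for the metadata table: one row per article.
# The DOIs and author names are invented for illustration only.
meta <- data.frame(
  doi = c("10.0000/example.1", "10.0000/example.2"),
  stringsAsFactors = FALSE
)

# "author" is a list column: one small data frame per article.
# Assumption: each data frame has `given` and `family` columns.
meta$author <- list(
  data.frame(given = c("Ada", "Grace"), family = c("Lovelace", "Hopper"),
             stringsAsFactors = FALSE),
  data.frame(given = "Alan", family = "Turing", stringsAsFactors = FALSE)
)

# Option 1: keep only the first author as a single string.
first_author <- function(a) paste(a$given[1], a$family[1])

# Option 2: put every author into one cell, separated by "; ".
all_authors <- function(a) paste(paste(a$given, a$family), collapse = "; ")

# vapply returns one string per article, so the new columns are flat.
meta$first_author <- vapply(meta$author, first_author, character(1))
meta$all_authors  <- vapply(meta$author, all_authors, character(1))

# Drop the list column; what is left is flat and can be written to csv.
flat <- meta[, c("doi", "first_author", "all_authors")]
out_path <- tempfile(fileext = ".csv")
write.csv(flat, out_path, row.names = FALSE)
```

Either way there is still one row per article, unlike the unnest example, and the list column is dropped before writing.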

Jenny

ChristinaGitHub1 commented 3 years ago

Amazing, thanks Jenny, what a good find!!

I'll look into the list problem and see what I can come up with!