Closed maybay21 closed 9 months ago
@maybay21 It sounds like by now you might have already resolved this wrt your "hack script together" + my delayed reply? Sorry for the delay here.
Closing in lieu of requestor reply
Hi, @maybay21 , would you like to share your hack script? Thanks!
Hi All,
First, thank you so much for all of your hard work on the Semantic Scholar dataset. It's been invaluable to my research.
I have a quick question wrt the S2ORC dataset. I completed a bulk dataset download through the API, however the files from the API are not in the same format as the output from S2ORC-doc2json. I imagine this is intentional. Is there a helper script to translate the annotations into the S2ORC-doc2json format? I received files in the format below:
My goal is to translate to the original S2ORC format:
If there's not something on-hand, I'll hack a script together. Thanks!