Closed PedroSena closed 8 months ago
This is due to paper merging, if you search the IDs by inputting semanticscholar.org/paper/[IDHERE] they are rerouting to the same paper. Hopefully this clarifies a bit!
thank you
Hi @cfiorelli ,
Thanks for the clarification.
Is there a way to identify that a paper was merged with another (Besides counting on this rerouting) ? We are using the batch API and requesting multiple papers at once, having the ability to identify those cases would be ideal for us.
Thanks
I'd like to bump this question, I think I am also experiencing this. I'm trying to get the publication years for a list of papers, and I can't match the result up with a list of other information I have about those papers because not all of the paperId's that return are the same as the ones I requested.
I'm not sure how I would be able to get around this; I have about ~6,000 paperIds that no longer match up, so I can't reasonably manually plug them in to the URL to check.
@PedroSena - I dont currently have a way to identify paper merges. There might be a clever solution - but there is nothing to do this directly supported by Semantic Scholar.
@serenalotreck - I've got some history of these issues which i worked on internally and found an expectation that updates to clustering (on a multi month scale) would be drawing to a conclusion. We are well past delivery of these updates and hopefully your 6k is much lower.
I think high level we are operating with the thinking that we try to minimize errors, but the models at this scale push us to have some tolerances.
Thank you all for your ongoing participation and usage of the API !
Describe the bug When I search for a specific paperId a different paperId is returned.
To Reproduce Simply run:
Expected behavior
Current behavior Instead of returning the paperId I requested it returns another
Additional Context The
e87...
paperId I obtained was from Semantic Scholar and it was pointing to a related PubMed:While
93d...
returns:Notice the same DOI but different CorpusId, PubMed and PubMedCentral.
I'd like to better understand if paperId is really a unique identifier for a publication and how to ensure that whenever I search for paper X I get paper X in return.