Open TimothyElder opened 1 year ago
Thanks for the report! Definitely not ideal, but it's likely the same situation also reported in https://github.com/ropensci/openalexR/issues/115
For what it's worth, in my experience with snowball searching it's pretty common to have mismatches between the cited-by number in a paper's records vs. its discoverable connections (even within the same database). You can just think of the number of articles returned by backward-searching in oa_snowball() as the absolute lower bound estimate of cited-by (which doesn't account for older papers, retracted papers, inaccessible papers, etc.).
The discrepancy doesn't seem to be very sizable fortunately, less than 10 or so per article in my estimation. Thanks!
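For anyone who wants to measure the gap on their own results, here is a minimal sketch of the comparison in base R. It uses made-up toy data, not real OpenAlex output, and it assumes the column names from, to, id, and cited_by_count mirror the edges and nodes data frames returned by oa_snowball(), with each edge row read as "from cites to". Check those assumptions against your own object before relying on it.

```r
# Toy data only: the column names `from`, `to`, `id`, and `cited_by_count`
# are assumed to mirror the edges/nodes data frames from oa_snowball().
focal_id <- "W1"  # hypothetical short ID for the focal article

nodes <- data.frame(
  id = c("W1", "W2", "W3", "W4"),
  cited_by_count = c(5, 0, 1, 2),
  stringsAsFactors = FALSE
)

# Assumption: each row means "from cites to".
edges <- data.frame(
  from = c("W2", "W3", "W1"),
  to   = c("W1", "W1", "W4"),
  stringsAsFactors = FALSE
)

# Citing works that were actually retrieved for the focal article.
n_incoming <- sum(edges$to == focal_id)

# Count stored on the focal article's own record.
recorded <- nodes$cited_by_count[nodes$id == focal_id]

# Positive gap: citations counted on the record but not returned as edges.
gap <- recorded - n_incoming

print(c(incoming_edges = n_incoming, cited_by_count = recorded, gap = gap))
```

With real data, swapping in the edges and nodes elements of the snowball result (if they are named that way in your version) should give the per-article discrepancy discussed above.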
When returning all the works that are cited by and that cite a focal article, the number of edges in the returned edges data frame that go to the focal article should match the cited_by_count of the focal article, but it seems that they usually do not.
I am trying to figure out whether this is an artifact in the data or whether I have misunderstood precisely what oa_snowball returns.
Here is an example of where I think the edges should match but they don't: