pybliometrics-dev / pybliometrics

Python-based API-Wrapper to access Scopus
https://pybliometrics.readthedocs.io/en/stable/

AbstractRetrieval: Author IDs in FULL view and number of references in REF view #271

Closed egonzales2000 closed 1 year ago

egonzales2000 commented 1 year ago

Hi,

I was wondering why the FULL view of AbstractRetrieval doesn't provide the author IDs for each reference. The list shown here is not really helpful. When I switch to the REF view I get the IDs, but only a maximum of 40 references are returned.

Any help is much appreciated.

Michael-E-Rose commented 1 year ago

This is something only Scopus can tell you...

I raised this issue many times. But my feeling is that developing the Scopus API is not high on the agenda.

What I typically do to work around this limitation: I use the REF view information if there are fewer than 41 references, and otherwise I retrieve the cited documents individually if the type of the reference in the FULL view is resolved (non-resolved documents are not indexed).
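A minimal sketch of that workaround, not the maintainer's actual code. It assumes that references expose `id`, `type` and `authors_auid` fields, that resolved references carry the type string "resolvedReference", and that `authors_auid` is a ";"-separated string. The `fetch` callable is a hypothetical stand-in so the logic can be tried without an API key.

```python
REF_VIEW_LIMIT = 40  # per this thread, the REF view returns at most 40 references


def _split(auids):
    """Turn a ';'-separated string of author IDs into a clean list."""
    return [a.strip() for a in (auids or "").split(";") if a.strip()]


def reference_author_ids(identifier, fetch):
    """Map each reference ID to a list of author IDs.

    `fetch(identifier, view)` is a hypothetical callable returning an object
    with `.references` (and `.authors` for cited documents).
    """
    full = fetch(identifier, "FULL")
    refs = full.references or []
    if len(refs) <= REF_VIEW_LIMIT:
        # Few enough references: read the author IDs from the REF view.
        ref_view = fetch(identifier, "REF")
        return {r.id: _split(r.authors_auid) for r in ref_view.references or []}
    # Too many references: look up each resolved reference as its own document.
    result = {}
    for r in refs:
        if r.type != "resolvedReference":  # assumed type label
            continue  # non-resolved references are not indexed
        doc = fetch(r.id, "FULL")
        result[r.id] = [str(a.auid) for a in doc.authors or []]
    return result
```

With pybliometrics, `fetch` could be something like `lambda i, view: AbstractRetrieval(i, view=view)`. Note that the second branch costs one request per resolved reference.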

egonzales2000 commented 1 year ago

Hi, thanks for your quick reply. My current approach is to paginate the results with the REF view. The problem, however, is that only one slice of data is stored in the cache, so with cached data the startref and refcount parameters have no effect. Using refresh=True obviously solves the problem, but it would be nice to use cached data, as this is a very neat feature ;-)
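The pagination described here could look roughly like the sketch below. The `fetch` callable and the helper names are hypothetical, and whether `startref` counts from 1 or 0 is an assumption, so it is exposed as the `first` parameter. `refresh=True` is passed on every call because, as noted above, the cache holds only one slice.

```python
def page_offsets(total, page_size=40, first=1):
    """Return the startref value of every page needed to cover `total` references."""
    return list(range(first, first + total, page_size))


def all_references(identifier, total, fetch, page_size=40, first=1):
    """Collect references page by page from the REF view.

    `fetch` is a hypothetical callable accepting the keyword arguments
    view, startref, refcount and refresh.
    """
    refs = []
    for start in page_offsets(total, page_size, first):
        page = fetch(identifier, view="REF", startref=start,
                     refcount=page_size, refresh=True)
        refs.extend(page.references or [])
    return refs
```

Passing `AbstractRetrieval` itself as `fetch` should work if it forwards `startref` and `refcount` to the API, as the thread suggests. `total` could come from the reference count of a FULL view retrieval.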

egonzales2000 commented 1 year ago

Found a nasty solution: I simply changed line 44 in retrieval.py to:

stem = str(identifier.replace('/', '_')) + "_" + str(kwds['startref'])

so that the cache files now have the startref value in their filename. This, however, will lead to problems if you don't pass startref.
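A slightly more defensive variant of the same idea might append the offset only when it is given, since `kwds['startref']` raises a `KeyError` otherwise. This is a sketch with a hypothetical helper name, `cache_stem`, not code from pybliometrics. The underscore separator is an assumption.

```python
def cache_stem(identifier, **kwds):
    """Build a cache filename stem, adding the startref offset only if present."""
    stem = str(identifier).replace('/', '_')
    startref = kwds.get('startref')  # None when the caller does not paginate
    if startref is not None:
        stem += "_" + str(startref)
    return stem
```

Without `startref` the stem is just the identifier with slashes replaced, so existing cache files would still be found. With `startref`, each page gets its own file.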