scholarly-python-package / scholarly

Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!
https://scholarly.readthedocs.io/
The Unlicense
1.39k stars 299 forks source link

Scholarly retrieves 20 co-authors at maximum #316

Closed AlessandroSpallina closed 3 years ago

AlessandroSpallina commented 3 years ago

Searching for an author and requesting for the co-authors list with scholarly.fill returns only the first 20 co-authors, even if the author have more co-authors. This behaviour isn't good and should be fixed, otherwise using the library to fetch co-author will return a partial information.

Code to replicate the issue:

from scholarly import scholarly

def main():
    search_query = scholarly.search_author('Carlos Lima Azevedo')
    author = next(search_query)
    author_dict = scholarly.fill(author, sections=['basics', 'indices', 'coauthors'])
    scholarly.pprint(author_dict)

if __name__ == '__main__':
    main()

Looking the output of that code you'll see only 20 co-authors, but in the scholar page of that author there are 40+!

arunkannawadi commented 3 years ago

I tested the changes in PR #322 with the scholar in your example and was able to fetch all 44 coauthors from this profile.

arunkannawadi commented 3 years ago

Fixed in v1.4