scholarly-python-package / scholarly

Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!
https://scholarly.readthedocs.io/
The Unlicense
1.37k stars, 298 forks

BibTeX Export issue #441

Closed Biotopp closed 1 year ago

Biotopp commented 2 years ago

The feature for exporting a publication to BibTeX doesn't work. Thank you so much for your effort, I really appreciate it! (Screenshot attached.)

arunkannawadi commented 2 years ago

Thanks for the report. Could you post the full error message in text format here? There are some other open issues related to bibtex, so this might be a duplicate.

Biotopp commented 2 years ago

It's my pleasure! Here it is:

Traceback (most recent call last):
  File "C:/Users/Science/Downloads/Test scholarly/main.py", line 2, in <module>
    query = scholarly.search_pubs("A density-based algorithm for discovering clusters in large spatial databases with noise")
  File "C:\Users\Science\AppData\Local\Programs\Python\Python37\lib\site-packages\scholarly\_scholarly.py", line 156, in search_pubs
    return self.__nav.search_publications(url)
  File "C:\Users\Science\AppData\Local\Programs\Python\Python37\lib\site-packages\scholarly\_navigator.py", line 283, in search_publications
    return _SearchScholarIterator(self, url)
  File "C:\Users\Science\AppData\Local\Programs\Python\Python37\lib\site-packages\scholarly\publication_parser.py", line 53, in __init__
    self._load_url(url)
  File "C:\Users\Science\AppData\Local\Programs\Python\Python37\lib\site-packages\scholarly\publication_parser.py", line 59, in _load_url
    self._soup = self._nav._get_soup(url)
  File "C:\Users\Science\AppData\Local\Programs\Python\Python37\lib\site-packages\scholarly\_navigator.py", line 226, in _get_soup
    html = self._get_page('https://scholar.google.com{0}'.format(url))
  File "C:\Users\Science\AppData\Local\Programs\Python\Python37\lib\site-packages\scholarly\_navigator.py", line 177, in _get_page
    raise MaxTriesExceededException("Cannot Fetch from Google Scholar.")
scholarly._proxy_generator.MaxTriesExceededException: Cannot Fetch from Google Scholar.

Process finished with exit code 1

arunkannawadi commented 1 year ago

The error seems to be coming from the search_pubs method, not from the bibtex method. This is a known issue. You'll need to set up a proxy to do this successfully, since Google Scholar blocks any automated request to search for publications in its database. There's a section in our documentation on how to set it up: https://scholarly.readthedocs.io/en/stable/quickstart.html#using-proxies
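
For anyone landing here with the same traceback, this is a rough sketch of what the proxy setup could look like, not an official recipe. The helper names fetch_bibtex and run_with_free_proxies are made up for illustration. The scholarly calls (ProxyGenerator, FreeProxies, use_proxy, search_pubs, bibtex) are the ones described in the quickstart, but free proxies are unreliable, so the search may still fail with MaxTriesExceededException.

```python
def fetch_bibtex(title, api, proxy=None):
    """Return the BibTeX entry for the first search hit, or None if no hit.

    `api` is expected to behave like the `scholarly` object: it must offer
    use_proxy(), search_pubs() and bibtex(). Passing it in as a parameter
    (a choice made for this sketch) keeps the function easy to test offline.
    """
    if proxy is not None:
        # Route requests through the proxy before searching.
        api.use_proxy(proxy)
    results = api.search_pubs(title)
    first = next(results, None)  # search_pubs yields results lazily
    if first is None:
        return None
    return api.bibtex(first)


def run_with_free_proxies(title):
    """Hypothetical wrapper: wire fetch_bibtex to the real library."""
    # Imported here so the sketch above does not need scholarly installed.
    from scholarly import scholarly, ProxyGenerator

    pg = ProxyGenerator()
    pg.FreeProxies()  # free proxies are flaky; other proxy options exist
    return fetch_bibtex(title, scholarly, proxy=pg)
```

Calling run_with_free_proxies("A density-based algorithm for discovering clusters in large spatial databases with noise") should print-ready return a BibTeX string if a working proxy is found. If it still raises MaxTriesExceededException, the proxy was most likely blocked as well, and one of the other proxy options in the documentation may work better.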