greenelab / preprint-similarity-search

A web app that uses machine learning to recommend the most suitable journals based on the text content of your preprint
https://greenelab.github.io/preprint-similarity-search/
Other
20 stars 4 forks source link

DOI Search Not Working #156

Open Jeffrey-A-Lewis opened 1 month ago

Jeffrey-A-Lewis commented 1 month ago

I tested a couple of bioRxiv doi's, and received a "Couldn't get results" message. Tried both just the shortened (e.g., 10.1101/2024.08.06.606807) and long forms (e.g., https://www.biorxiv.org/content/10.1101/2024.08.06.606807v1).

falquaddoomi commented 1 month ago

Hey @Jeffrey-A-Lewis, thanks for the bug report! It looks like something's changed on bioRxiv's side that seems to be preventing non-humans (e.g., the preprint-similarity-search API server) from pulling PDFs. I get 403s with curl and Python's requests library, but can access the same URLs fine interactively in a browser. I think it's related to some kind of new CloudFlare protection, but I'm unsure.

I'll continue to debug it and post updates; hopefully it won't take too long to resolve.

falquaddoomi commented 1 month ago

@Jeffrey-A-Lewis: So, it seems BioRxiv has resolved whatever the issue was on their end; your query appears to be going through fine now. Feel free to test it and close the issue if it's working for you, too.