Open Jeffrey-A-Lewis opened 1 month ago
Hey @Jeffrey-A-Lewis, thanks for the bug report! It looks like something's changed on bioRxiv's side that prevents non-humans (e.g., the preprint-similarity-search API server) from pulling PDFs. I get 403s with curl and Python's requests library, but can access the same URLs fine interactively in a browser. I think it's related to some kind of new Cloudflare protection, but I'm unsure.
I'll continue to debug it and post updates; hopefully it won't take too long to resolve.
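For anyone who wants to reproduce the 403 outside a browser, here's a minimal sketch using only Python's standard library. `fetch_status` and its `opener` parameter are hypothetical names for illustration and are not part of the preprint-similarity-search code. Whether a browser-like User-Agent changes the outcome is an assumption that would need testing, and the Cloudflare explanation above is still a guess.

```python
import urllib.error
import urllib.request


def fetch_status(url, user_agent=None, opener=urllib.request.urlopen, timeout=10):
    """Return the HTTP status code for a GET request to `url`.

    `opener` is injectable so the function can be exercised without
    touching the network (hypothetical design, for illustration only).
    """
    headers = {"User-Agent": user_agent} if user_agent else {}
    request = urllib.request.Request(url, headers=headers)
    try:
        with opener(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # urllib raises on 4xx/5xx responses; report the code instead.
        return err.code


# Example (performs a real request, so it is left commented out):
# url = "https://www.biorxiv.org/content/10.1101/2024.08.06.606807v1"
# print(fetch_status(url))                            # default urllib User-Agent
# print(fetch_status(url, user_agent="Mozilla/5.0"))  # browser-like User-Agent
```

Comparing the two status codes would at least show whether the block depends on the User-Agent header; it wouldn't rule out other bot checks.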
@Jeffrey-A-Lewis: So, it seems bioRxiv has resolved whatever the issue was on their end; your query appears to be going through fine now. Feel free to test it and close the issue if it's working for you, too.
I tested a couple of bioRxiv DOIs and received a "Couldn't get results" message. I tried both the short form (e.g., 10.1101/2024.08.06.606807) and the long form (e.g., https://www.biorxiv.org/content/10.1101/2024.08.06.606807v1).
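To illustrate that the two forms above refer to the same preprint, here's a rough sketch that pulls the DOI and optional version suffix out of either one. `split_doi` and the regular expression are assumptions made for this example, based only on the two inputs quoted above; this is not how the similarity-search server actually parses its input.

```python
import re

# Assumed pattern: the 10.1101 prefix seen in both examples, then either a
# dated suffix (YYYY.MM.DD.NNNNNN) or a plain numeric one, then optional "vN".
DOI_PATTERN = re.compile(r"(10\.1101/(?:\d{4}\.\d{2}\.\d{2}\.)?\d+)(?:v(\d+))?")


def split_doi(text):
    """Return (doi, version) found in `text`, or None if nothing matches."""
    match = DOI_PATTERN.search(text)
    if match is None:
        return None
    doi, version = match.groups()
    # The version is only present in the long, URL-style form.
    return (doi, int(version) if version else None)


short_form = "10.1101/2024.08.06.606807"
long_form = "https://www.biorxiv.org/content/10.1101/2024.08.06.606807v1"
print(split_doi(short_form))
print(split_doi(long_form))
```

Both inputs yield the same DOI string, so if both fail with "Couldn't get results", the failure is more likely in fetching the preprint than in the form of the query.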