lukasschwab / arxiv.py

Python wrapper for the arXiv API
MIT License
1.11k stars 123 forks source link

Use export.arxiv.org for non-API requests #42

Closed lukasschwab closed 3 years ago

lukasschwab commented 4 years ago

Is your feature request related to a problem? Please describe. A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]

Programmatically slamming arxiv.org is bad for their site performance, and their documentation asks us to please avoid doing so.

Describe the solution you'd like A clear and concise description of what you want to happen.

Non-API requests––e.g. the arxiv.download functionality––should hit export.arxiv.org instead of arxiv.org: https://arxiv.org/help/bulk_data#play-nice

Describe alternatives you've considered A clear and concise description of any alternative solutions or features you've considered.

None

Additional context Add any other context or screenshots about the feature request here.

Before making the switch, benchmark download performance.

lukasschwab commented 3 years ago

I've chewed on this a bit, and I'm not convinced this is worth the readability cost of modifying a PDF URL received from the API (matching arxiv.org and replacing it with export.arxiv.org).