Very slow to extract more than 1000 articles

Can you share your client usage and an example query? Can you qualify "very slow," or describe what kind of performance you want to see?

Depending on the client configuration, this operation could be doing anything from

Sending one request for a 1000-result page, or
Sending 1000 requests, with one result on each page.

Waiting on a large page may mean the API itself is slow. This is only a client library; it can't change the performance of the hosted arXiv API service.

Sending many requests means repeatedly incurring the overhead cost of an HTTP round-trip. Each round trip after the first also waits (Client).delay_seconds before running: https://github.com/lukasschwab/arxiv.py#client

The default client will get ten pages of 100 results each, and wait three seconds between each request. This seems like the most likely cause of the performance you're seeing, but it's intended behavior. See arXiv's API Terms of Use: https://arxiv.org/help/api/tou#rate-limits

lukasschwab / arxiv.py

Very slow to extract more than 1000 articles #89