Arxiv has a problem with paging which means that it downloads many duplicates of data.
The max_results parameter defines how many results per page and the start parameter defines which result to start on not which page.
We also don't test for duplicates and filter them out like we do with EuropePMC which we could consider doing.
Arxiv has a problem with paging which means that it downloads many duplicates of data. The max_results parameter defines how many results per page and the start parameter defines which result to start on not which page.
We also don't test for duplicates and filter them out like we do with EuropePMC which we could consider doing.