ContentMine / getpapers

Get metadata, fulltexts or fulltext URLs of papers matching a search query
MIT License
197 stars, 37 forks

Failure when EuPMC doesn't correctly report the number of hits #162

Closed tarrow closed 7 years ago

tarrow commented 7 years ago

At the time of writing, queries like

http://www.ebi.ac.uk/europepmc/webservices/rest/search/query=FIRST_PDATE%3A2017-05-22%20OPEN_ACCESS%3Ay&resulttype=core&pageSize=1000&cursorMark=*

report m hits but actually return only m-n results.

This leaves getpapers trying to load the next page, but that page is empty. Because the number of results collected never reaches the reported hit count, we keep trying forever.

We shouldn't do this. We should also start writing tests to cover these increasingly odd issues with the API, so that when we fix one we don't break normal functionality or an earlier fix.
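
A minimal sketch of the kind of guard and test I have in mind, written in Python for brevity rather than in getpapers' own code. Everything here is hypothetical: `fetch_all`, `make_fake_api` and the `(hit_count, results, next_cursor)` page shape are illustration only, not the EuPMC response format or any existing getpapers function. The idea is to treat the reported hit count as a hint and to stop as soon as a page comes back empty or the cursor stops advancing.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical page shape: (reported hit count, results on this page, next cursor)
Page = Tuple[int, List[dict], Optional[str]]


def fetch_all(fetch_page: Callable[[str], Page], max_pages: int = 10_000) -> List[dict]:
    """Collect results page by page without trusting the reported hit count."""
    results: List[dict] = []
    cursor = "*"  # initial cursorMark, as in the query above
    for _ in range(max_pages):  # hard upper bound as a last line of defence
        hit_count, page, next_cursor = fetch_page(cursor)
        if not page:
            break  # empty page: nothing more to fetch, whatever hit_count says
        results.extend(page)
        if len(results) >= hit_count:
            break  # normal case: we have everything that was promised
        if next_cursor is None or next_cursor == cursor:
            break  # cursor did not advance, so asking again cannot help
        cursor = next_cursor
    return results


def make_fake_api(reported: int, actual: int, page_size: int) -> Callable[[str], Page]:
    """Stub API that reports `reported` hits but only ever serves `actual` results."""
    items = [{"id": i} for i in range(actual)]

    def fetch_page(cursor: str) -> Page:
        start = 0 if cursor == "*" else int(cursor)
        chunk = items[start:start + page_size]
        return reported, chunk, str(start + len(chunk))

    return fetch_page
```

A test for this issue would then call `fetch_all(make_fake_api(reported=10, actual=7, page_size=3))` and check that it returns the 7 available results instead of looping, alongside a test where `reported` equals `actual` to make sure the normal case still works.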