Closed thebluepotato closed 1 month ago
Thanks! I'd hold off on any further changes here for a sec because I do think we want to get #3168 merged. I'll try to do that this week. (The oai2
endpoint is really, really slow right now, but I don't remember that always being the case...)
That said, if you could rebase on #3168, we could just do everything here.
I've found the oai2 endpoint the least reliable arXiv API option for quite some time, so it'd be nice to switch away from it. Last time I looked, data quality wasn't exactly the same, but that was quite some time back.
I've found the oai2 endpoint the least reliable arXiv API option for quite some time, so it'd be nice to switch away from it. Last time I looked, data quality wasn't exactly the same, but that was quite some time back.
In terms of data quality, it seems that for at least one of the test cases, the OAI endpoint contained a "published" DOI whereas the Atom endpoint did not
This is looking great. I'm getting more and more timeouts from the old export endpoint, so I'd love to get it merged.
@adam3smith, what do you think?
Note that https://github.com/zotero/utilities/blob/e00d98d3a11f6233651a052c108117cf44873edc/utilities.js#L435 should be updated after this PR is merged since the new endpoint explicitly does support versions.
OK, I think this is ready. @dstillman or @adam3smith, would appreciate a third opinion before we merge.
Thank you so, so much! This is a huge improvement.
Based on various tests, the currently used
oai2
endpoint is very slow (up to 20s for a single query). Conversely, the endpoint documented by arXiv is much faster. This is currently a WIP.Seems to be the similar idea as #3168