karpathy / arxiv-sanity-preserver

Web interface for browsing, search and filtering recent arxiv submissions
http://www.arxiv-sanity.com/
MIT License
5.14k stars 1.33k forks source link

"Could not render: incorrectly formatted pdf" error on every recent paper #135

Closed jasonjmcghee closed 4 years ago

jasonjmcghee commented 4 years ago

image

markus-flicke commented 4 years ago

It appears to me that pdftotext is not working for recent papers anymore. Hence ASP cannot parse PDFs...

black-puppydog commented 4 years ago

so is that a change on the arxiv side (how they build the pdfs) or in the pdftotext version (if that change on the server?)

jasonjmcghee commented 4 years ago

This issue appears to be fixed.

black-puppydog commented 4 years ago

do we know what caused this in the end?

karpathy commented 4 years ago

The issue was with thumbnail generation, convert was failing with an error. Had to adjust system configuration files, no code changes. Still slightly mysterious why it happened rather suddenly without any (manual) system update or anything.

black-puppydog commented 4 years ago

you wouldn't happen to be running unattended-upgrades or such? :)