acl-org / aclpubcheck

Tools for checking ACL paper submissions
MIT License

Pages that don't contain text cause an error #22

Closed pkavumba closed 2 years ago

pkavumba commented 2 years ago

I tried to check a paper in which one page contains only an image and no text, and aclpubcheck/formatchecker.py raises an error. The error comes from line 400 in check_page_num, text = page.extract_text().split("\n"): page.extract_text() returns None when a page has no textual content, so the subsequent call to .split("\n") fails.
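A minimal sketch of a guard that would avoid this failure, assuming the page object only needs to expose extract_text(). The names page_lines and FakePage are hypothetical and used for illustration only; this is not the checker's actual code or fix.

```python
from typing import List, Optional


def page_lines(page) -> List[str]:
    """Return the page's text split into lines, or [] if the page has no text.

    `page` is assumed to be any object with an extract_text() method that
    returns a string, or None when the page has no textual content.
    """
    text: Optional[str] = page.extract_text()
    # Treat both None and "" as "no text on this page".
    if not text:
        return []
    return text.split("\n")


class FakePage:
    """Hypothetical stand-in for a PDF page object, for demonstration only."""

    def __init__(self, text: Optional[str]) -> None:
        self._text = text

    def extract_text(self) -> Optional[str]:
        return self._text


text_page = FakePage("Some body text\n1")
image_only_page = FakePage(None)

print(page_lines(text_page))        # ['Some body text', '1']
print(page_lines(image_only_page))  # []
```

With a guard like this, a page-number check could skip or flag image-only pages instead of crashing on them.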

crux82 commented 2 years ago

Would it be possible to share your PDF (even in a private message) to replicate the issue?

Thanks

Danilo

pkavumba commented 2 years ago

Sent a PM

crux82 commented 2 years ago

I checked your case, but I was not able to replicate the issue. Moreover, I do not see that code at line 400. Can you help me find the problematic code?

pkavumba commented 2 years ago

I reinstalled the checker, and the error didn't happen this time.