Closed LiutongZhou closed 2 years ago
Hi @LiutongZhou, thanks for raising this issue. I think the issue you are facing is a duplicate of https://github.com/jsvine/pdfplumber/issues/383.
@jsvine, should the PDF and the code example shared in this issue be added as a test case in #388?
Thanks @samkit-jain. In this case, I don't think we need to add a new PDF or test case beyond what's already in #338. The issue isn't really specific to any particular PDF; it stems from the fact that `pdfminer.six`'s `LTAnno` objects (extracted when users pass `laparams` to `pdfplumber.open(...)`) do not have bounding boxes.
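To illustrate why objects without bounding boxes break position-based text extraction, here is a minimal, self-contained sketch. It does not use pdfplumber or pdfminer.six: `CharBox`, `Anno`, `has_bbox`, and `extract_text_sketch` are hypothetical stand-ins invented for this example, and the filtering approach shown is just one plausible way to handle such objects, not necessarily what the library does.

```python
from dataclasses import dataclass


@dataclass
class CharBox:
    """Hypothetical stand-in for a positioned character (has a bounding box)."""
    text: str
    x0: float
    top: float
    x1: float
    bottom: float


@dataclass
class Anno:
    """Hypothetical stand-in for an annotation-like object: text only, no coordinates."""
    text: str


def has_bbox(obj) -> bool:
    # An object is positionable only if it carries all four coordinates.
    return all(hasattr(obj, attr) for attr in ("x0", "top", "x1", "bottom"))


def extract_text_sketch(objs) -> str:
    # Drop objects without coordinates, then order the rest top-to-bottom, left-to-right.
    positioned = [o for o in objs if has_bbox(o)]
    ordered = sorted(positioned, key=lambda o: (o.top, o.x0))
    return "".join(o.text for o in ordered)


objs = [CharBox("b", 10, 0, 15, 10), Anno(" "), CharBox("a", 0, 0, 5, 10)]

# Sorting everything by position fails as soon as a coordinate-less object is hit.
try:
    sorted(objs, key=lambda o: (o.top, o.x0))
    naive_failed = False
except AttributeError:
    naive_failed = True

result = extract_text_sketch(objs)
```

In this sketch the coordinate-less object is simply skipped; a real implementation might instead want to preserve the whitespace such objects carry.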
How to reproduce the Error
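The reporter's original snippet is not preserved here. Based on the discussion in this thread, a reproduction would presumably look something like the sketch below. The file name `example.pdf` and the function name `reproduce` are placeholders, and passing an empty `laparams` dict is an assumption about how layout analysis was enabled.

```python
def reproduce(pdf_path: str = "example.pdf") -> str:
    # Imported inside the function so the sketch can be defined without pdfplumber installed.
    import pdfplumber

    # Passing laparams turns on pdfminer.six layout analysis,
    # which is what introduces LTAnno objects into the page.
    with pdfplumber.open(pdf_path, laparams={}) as pdf:
        first_page = pdf.pages[0]
        # On affected versions, this call is where the error would be expected to surface.
        return first_page.extract_text()
```

Calling `reproduce()` with any local PDF should be enough to check whether a given pdfplumber version is affected, since the issue is not specific to a particular file.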
Expected Behavior
Return the text of the page
Working Fix
https://github.com/jsvine/pdfplumber/blob/694f9193cc13c3757dbe21af9c817dca32d9d5fc/pdfplumber/utils.py#L423