when i use the "PDFPlumberLoader" provided by langchain,a bug occurs.
the bug is
"File "D:\annaconda\envs\fastapitest\lib\site-packages\pdfplumber\utils\pdfinternals.py", line 16, in
return "".join(PDFDocEncoding[o] for o in ords)
IndexError: string index out of range
"
Describe the bug
when i use the "PDFPlumberLoader" provided by langchain,a bug occurs. the bug is "File "D:\annaconda\envs\fastapitest\lib\site-packages\pdfplumber\utils\pdfinternals.py", line 16, in
return "".join(PDFDocEncoding[o] for o in ords)
IndexError: string index out of range
"
Have you tried repairing the PDF?
I run the code
but the bug is also occurs.
Code to reproduce the problem
This is the code occur erros.
PDF file
this is the error file.
Linux运维趋势_第10期_日志分析技巧分享.pdf
If applicable, add screenshots to help explain your problem.
Environment