jorisschellekens / borb

borb is a library for reading, creating and manipulating PDF files in python.
https://borbpdf.com/

some of pdf not working #73

Closed ghost closed 2 years ago

ghost commented 2 years ago

  File "C:\Users\admin\Documents\1.py", line 20, in <module>
    main()
  File "C:\Users\admin\Documents\1.py", line 13, in main
    d = PDF.loads(pdf_in_handle, [l])
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\pdf\pdf.py", line 54, in loads
    return ReadAnyObjectTransformer().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\any_object_transformer.py", line 100, in transform
    return super().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\transformer.py", line 123, in transform
    out = h.transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\reference\xref_transformer.py", line 139, in transform
    trailer = self.get_root_transformer().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\any_object_transformer.py", line 100, in transform
    return super().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\transformer.py", line 123, in transform
    out = h.transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\object\dictionary_transformer.py", line 46, in transform
    v = self.get_root_transformer().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\any_object_transformer.py", line 100, in transform
    return super().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\transformer.py", line 123, in transform
    out = h.transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\reference\reference_transformer.py", line 103, in transform
    transformed_referenced_object = self.get_root_transformer().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\any_object_transformer.py", line 100, in transform
    return super().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\transformer.py", line 123, in transform
    out = h.transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\page\root_dictionary_transformer.py", line 84, in transform
    transformed_root_dictionary = t.transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\object\dictionary_transformer.py", line 46, in transform
    v = self.get_root_transformer().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\any_object_transformer.py", line 100, in transform
    return super().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\transformer.py", line 123, in transform
    out = h.transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\object\dictionary_transformer.py", line 46, in transform
    v = self.get_root_transformer().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\any_object_transformer.py", line 100, in transform
    return super().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\transformer.py", line 123, in transform
    out = h.transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\object\array_transformer.py", line 46, in transform
    object_to_transform[i] = self.get_root_transformer().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\any_object_transformer.py", line 100, in transform
    return super().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\transformer.py", line 123, in transform
    out = h.transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\reference\reference_transformer.py", line 103, in transform
    transformed_referenced_object = self.get_root_transformer().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\any_object_transformer.py", line 100, in transform
    return super().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\transformer.py", line 123, in transform
    out = h.transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\object\dictionary_transformer.py", line 46, in transform
    v = self.get_root_transformer().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\any_object_transformer.py", line 100, in transform
    return super().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\transformer.py", line 123, in transform
    out = h.transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\reference\reference_transformer.py", line 103, in transform
    transformed_referenced_object = self.get_root_transformer().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\any_object_transformer.py", line 100, in transform
    return super().transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\transformer.py", line 123, in transform
    out = h.transform(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\page\page_dictionary_transformer.py", line 100, in transform
    CanvasStreamProcessor(page_out, canvas, []).read(
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\pdf\canvas\canvas_stream_processor.py", line 251, in read
    obj = canvas_tokenizer.read_object()
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\tokenize\high_level_tokenizer.py", line 190, in read_object
    token = self.next_non_comment_token()
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\tokenize\low_level_tokenizer.py", line 109, in next_non_comment_token
    t = self.next_token()
  File "C:\Users\admin\AppData\Local\Programs\Python\Python310\lib\site-packages\borb\io\read\tokenize\low_level_tokenizer.py", line 154, in next_token
    assert ch == ">"
AssertionError

Other libraries extract the text from this PDF without problems, but I can't find a solution for this error in borb.
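For readers trying to interpret the last frame: the trace ends at `assert ch == ">"` inside `next_token`. In PDF syntax, `<<` and `>>` delimit dictionaries, while a single `<` ... `>` pair delimits a hex string, so one plausible reading is that the tokenizer met a lone `>` where it expected the second half of `>>`. The sketch below is not borb's code; `read_delimiter` is a hypothetical helper written only to show how such an assertion can trip on unexpected bytes in a content stream.

```python
def read_delimiter(data: str, pos: int) -> tuple[str, int]:
    """Hypothetical helper: return (token, next_pos) for a '<' or '>' at data[pos]."""
    ch = data[pos]
    if ch == "<":
        # "<<" opens a dictionary; a single "<" opens a hex string.
        if data[pos + 1 : pos + 2] == "<":
            return "<<", pos + 2
        end = data.index(">", pos)  # raises ValueError if the hex string never closes
        return data[pos : end + 1], end + 1
    if ch == ">":
        # Outside a hex string, ">" is only valid as the first half of ">>".
        nxt = data[pos + 1 : pos + 2]
        assert nxt == ">", f"expected '>' after '>' at offset {pos}, got {nxt!r}"
        return ">>", pos + 2
    raise ValueError(f"not a delimiter at offset {pos}: {ch!r}")


if __name__ == "__main__":
    print(read_delimiter("<< /Type /Page >>", 0))
    print(read_delimiter("<4E6F>", 0))
    try:
        read_delimiter("> Tj", 0)  # a stray ">" with no partner
    except AssertionError as exc:
        print("AssertionError:", exc)
```

Whether this is what actually happens with the PDF in question cannot be told from the trace alone; the input file would be needed to confirm it.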

jorisschellekens commented 2 years ago

Hi there,

You posted a giant stack trace, but this does not help me in solving this issue. I would like to know:

In short, please read these guidelines before submitting a bug: https://github.com/jorisschellekens/borb/blob/master/CONTRIBUTING.md#submit

Kind regards, Joris Schellekens