Closed: dantetemplar closed this issue 3 months ago
Could you share an example page, please? Alternatively, try the parameter "write_images=True". This option should exclude vector graphics and images from text extraction, write the respective areas to image files (PNG), and insert Markdown references to these images in the produced text.
This is how I do it now: I find all areas containing images and non-table graphics clusters and skip text processing in those areas. I then take a "screenshot" of each area (create a pixmap), pass it to Tesseract OCR, and insert Markdown references as well.
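For readers who want the gist before any snippets are posted, here is a minimal sketch of that approach. It is not the poster's actual code and uses no PDF library: rectangles are plain tuples, and `page_to_markdown`, `describe_area`, and the `area-N.png` file names are hypothetical names made up for illustration. In a real pipeline the rectangles would come from the PDF library, each area would be rendered to a pixmap and saved, and `describe_area` would wrap Tesseract or a vision model.

```python
from typing import Callable, List, Tuple

Rect = Tuple[float, float, float, float]  # (x0, y0, x1, y1) in page coordinates


def intersects(a: Rect, b: Rect) -> bool:
    """Return True if the two rectangles overlap with non-zero area."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def page_to_markdown(
    text_blocks: List[Tuple[Rect, str]],
    graphic_areas: List[Rect],
    describe_area: Callable[[int, Rect], str],
) -> str:
    """Build Markdown for one page, skipping text that lies in graphic areas.

    describe_area is a hypothetical hook: it receives the area index and
    rectangle and returns the text to insert (OCR output, a vision-model
    description, or an empty string).
    """
    items: List[Tuple[float, float, str]] = []  # (top, left, markdown)

    # Keep only text blocks that do not overlap any graphic area.
    for rect, text in text_blocks:
        if not any(intersects(rect, area) for area in graphic_areas):
            items.append((rect[1], rect[0], text.strip()))

    # Replace each graphic area with an image reference plus its description.
    for i, area in enumerate(graphic_areas):
        parts = [f"![](area-{i}.png)"]  # hypothetical file name
        description = describe_area(i, area).strip()
        if description:
            parts.append(description)
        items.append((area[1], area[0], "\n\n".join(parts)))

    # Emit in reading order: top to bottom, then left to right.
    items.sort(key=lambda item: (item[0], item[1]))
    return "\n\n".join(markdown for _, _, markdown in items)


# Example page: a title, a label inside a diagram, and a footer.
blocks = [
    ((50, 40, 500, 60), "Slide title"),
    ((120, 150, 200, 170), "box label"),  # inside the diagram, so skipped
    ((50, 400, 500, 420), "Footer note"),
]
areas = [(100, 100, 400, 300)]
markdown = page_to_markdown(blocks, areas, lambda i, rect: "(OCR text here)")
print(markdown)
```

Passing the OCR step in as a callable keeps the skip-and-replace logic independent of the recognizer, so the same hook could later point at a vision model instead of Tesseract.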
Actually, we can close the issue. I will post my code snippets later.
I encountered a problem processing diagrams while reading presentations at my university.
Can you tell me where in the code I could intercept the diagram processing? I would like to add a fallback to a computer vision model so that the model's response is inserted instead of the diagram block.
By diagram I mean something like this:
So far I have gotten a complete mess: