Open laleet-avaiya opened 4 months ago
With GPT-4o, we send an image of your page to the model. However, GPT-4o resizes every image to at most 2048 px in width and height while preserving the aspect ratio. Because the page in your sample PDF is very long, the image gets resized to a height of 2048 px, which results in an image where no text is legible.
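To make the arithmetic concrete, here is a rough sketch of the downscaling described above. The function name `fit_within` and the 1200 x 30000 px page size are made up for illustration (the real dimensions of the sample PDF are not known here), and the 2048 px limit is taken from the description above rather than checked against the model's documentation.

```python
def fit_within(width, height, max_side=2048):
    """Scale (width, height) so neither side exceeds max_side, keeping the aspect ratio.

    Hypothetical helper for illustration; it only mirrors the resize rule
    described in this thread and does not call any real API.
    """
    # Never upscale: the factor is capped at 1.0.
    scale = min(1.0, max_side / width, max_side / height)
    return round(width * scale), round(height * scale), scale


# Assumed example: a very tall single-page render, 1200 px wide and 30000 px high.
new_w, new_h, scale = fit_within(1200, 30000)
print(new_w, new_h, round(scale, 3))
```

With those assumed dimensions the page shrinks to roughly 7% of its original size, so a 1200 px wide column of text ends up well under 100 px wide, which is far too small to read.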
It seems that you generated this PDF from an HTML file with ImageMagick. Try sending the HTML to LlamaParse instead; it will get split into multiple 'pages', allowing GPT-4o to do its job. This may work.
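To illustrate why splitting into pages helps, the sketch below computes the vertical ranges a tall page would be cut into. This is only an illustration of the idea, not how LlamaParse splits HTML internally; `split_into_pages`, the 30000 px page height, and the 1600 px page height are all assumptions.

```python
def split_into_pages(total_height, page_height):
    """Return (top, bottom) pixel ranges covering a tall page in page-sized slices.

    Hypothetical helper for illustration; the last slice may be shorter
    than page_height.
    """
    if page_height <= 0:
        raise ValueError("page_height must be positive")
    return [
        (top, min(top + page_height, total_height))
        for top in range(0, total_height, page_height)
    ]


# Assumed example: a 30000 px tall page cut into slices of at most 1600 px.
pages = split_into_pages(30000, 1600)
print(len(pages), pages[0], pages[-1])
```

Each slice is already below the 2048 px limit, so it would be sent to the model without being shrunk and the text would stay at its original size.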
I have an 8 MB PDF that has only one page, and parsing it to Markdown returns the error below.
I am doing this with Python and running into this issue.
laleet_test_wp_image.pdf