getomni-ai / zerox

Zero shot pdf OCR with gpt-4o-mini
https://getomni.ai/ocr-demo
MIT License
4.72k stars 259 forks source link

Support for JPG Files in Zerox #49

Open shawn8888 opened 2 weeks ago

shawn8888 commented 2 weeks ago

I appreciate the work done on the Zerox project for converting PDF documents to Markdown. However, I've encountered a limitation that I believe could enhance the usability of the tool.

Sometimes, I come across documents in JPG format, and it can be inconvenient to convert them to PDF just to use Zerox for conversion. Would it be possible to add support for directly processing .jpg files? This feature would streamline the workflow and make Zerox even more versatile for users dealing with various document formats.

Thank you for considering this enhancement!

CarterMcClellan commented 1 week ago

Also noted this while constructing a benchmark of the the zerox model against other providers like AWS Textract + Google OCR.

Would be great to have image support!

pradhyumna85 commented 5 days ago

duplicate of #67, #58