-
Hey @Nutlope! Cool project. 😎
I saw your [post on X](https://x.com/nutlope/status/1856402928086725020) and went straight to a global install (without reading any docs 🙈)
```
npm i -g llama-ocr…
-
```
[](https://localhost:8080/#) in extract_data_from_pdf(pdf_path)
57 # Function to extract text using the unstructured library
58 def extract_data_from_pdf(pdf_path):
---> 59 eleme…
-
Hello!
I see in the supported languages that the Greek one is not included
https://github.com/run-llama/llama_parse/blob/main/llama_parse/utils.py#L16
Is there a roadmap or are you going to s…
-
Before we implement try to implement and embed the ML model to the quick entry form, we should firstly find out how the OCR works. It will be ideal for us to synchronize our implementation with OCR's.…
-
same than #678 but for OCR
not familiar yet what are the best way to run your own OCR API, probably plenty of solutions
also maybe this could be a thing: https://github.com/microsoft/OmniParse…
-
How feasible would it be to use manga ocr to power a manga reading app?
Would a typical smartphone be able to run ocr on a line of text in a reasonable amount of time?
How big would the installatio…
-
### Issues
- [X] I have browsed through the Issues. 我已浏览过Issues,确定没有重复的建议。
### Expected behavior 预期的功能
我做的这个插件WeChatOCR 插件主要是分为两个版本:
1:插件自带微信本地 OCR 模型版本(WechatOCR_umi_plugin_zidai ocr)
插件内置了关键文件:…
-
hi,想请教下ocr data在预训练和sft阶段的具体label是怎样产生的?
看了前面很多问题提到ocr的监督为'\ntext1\ntext2\ntext3', 比如[#536](https://github.com/OpenGVLab/InternVL/issues/536)、[#49](https://github.com/OpenGVLab/InternVL/issues/49),…
-
![翻译错误](https://github.com/user-attachments/assets/6b239150-396e-46c2-b1bf-f6f442c50933)
当pdf文件均为图像,而不是可编辑(复制)状态时,翻译完全失败,具体见图
-
Hi Eduard,
Thank you for creating such a powerful package!
I wonder if you plan to extend the PDF extraction functionality in `llm_message()` to automatically detect whether the PDF is multi-col…