-
Hi, can you explain why you used bbox indexes 0, 1, 4, 5 instead of 0, 1, 2, 3 in the try_math function? I ask because I rewrote the ocr_result function according to the README file, which returns maximum 4 ind…
-
6 Inline Representations
```
6.1 Classes for Inline Representation
6.1.1 ocr_glyph
6.1.2 ocr_glyphs
6.1.3 ocr_dropcap
6.1.4 ocr_chem
6.1.5 ocr_math
6.1.6 Non-breaking space
…
-
Huggingface Model: https://huggingface.co/microsoft/Phi-3.5-vision-instruct
Fine-tuned Dataset: https://huggingface.co/datasets/linxy/LaTeX_OCR
Usually, fine-tuning a multimodal large model invo…
-
Hi, I'm currently developing a PDF parser specialised for math PDFs. Non-OCR solutions offer great accuracy for text because the text is simply extracted, not detected optically. So, is it possible to…
-
I tested these files using iBooks (EPUB), the Kindle Mac app (MOBI), and Calibre (both); they're complete garbage. Apparently those files were created by extracting OCR'd text from the PDF file, incl…
-
When formulas are parsed, some characters, such as the square root sign √, are deleted.
Characters that should be lowered (ₐ) or raised (²) are not positioned correctly.
The input:
![X4gv1P60K…
-
What does it mean?
C:\PROGRAMOWANIE\Python_OCR\Convert-own-data-to-MNIST-format-master\Convert-own-data-to-MNIST-format-master>python convert_to_mnist_format.py C:\PROGRAMOWANIE\Python\OCR_Math…
-
Can we add a text conversion mode to https://mitex-rs.github.io/mitex/?
I think it would be very helpful and straightforward. I appreciate your help!
-
Using the paddle ocr rec v3 model (converted to an nb model with opt 2.10), inference with the paddle lite 2.10 library produces the following error (intermittently):
Caused by:Attempted to dereference garbage 0x....
void paddle::lite::arm::math::gemv_int8(signed char const*, signed c…
-
https://github.com/AstarLight/Lets_OCR/blob/cce8f8b857e9f8e196bf82cfd0c1ff3656cd445e/detector/ctpn/lib/generate_gt_anchor.py#L23
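This would reduce the amount of filtering needed in the next step, because the value of right_anchor_num should be the one located by math.floor rather than by ceil.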
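To make the floor/ceil difference concrete, here is a minimal sketch. It assumes fixed-width anchors of 16 pixels; the helper names and the sample coordinate are hypothetical and are not taken from the linked file.

```python
import math

ANCHOR_WIDTH = 16  # assumed anchor width in pixels


def anchor_index_floor(x, anchor_width=ANCHOR_WIDTH):
    """Index of the anchor column that contains pixel coordinate x."""
    return int(math.floor(x / anchor_width))


def anchor_index_ceil(x, anchor_width=ANCHOR_WIDTH):
    """Index rounded up: one column past the column containing x,
    unless x is an exact multiple of the anchor width."""
    return int(math.ceil(x / anchor_width))


right_edge = 100  # hypothetical right edge of a text box, in pixels
print(anchor_index_floor(right_edge))  # 6 (column 6 spans pixels 96..111)
print(anchor_index_ceil(right_edge))   # 7 (the column after the box edge)
```

With ceil, the right-hand index points at a column the box does not reach, so one extra anchor per box would have to be filtered out later.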