-
Thank you for your work. I want to ask about the GPU memory requirements to train
-
### Description
There are occasions where text is not meant to be selected or searched for, such as watermarks (see below).
I don't know how easy this is to implement with PDF, but it would be a h…
-
All indexed comments will be appreciated with a heart reaction.
Remember to edit your GPTs setting!
![image](https://github.com/xipowe/GPTs_list/assets/145952479/0e4c46ce-27ab-4ca7-a8bf-f3113125…
-
Hi, guys,
I am trying to use the scripts in this repo to preprocess the im2latex dataset, but I ran into this error:
> 2020-08-26 17:16:23,199 root INFO Script being executed: scripts/preprocessin…
-
PyTorch 2.1.0 cannot be installed with the latest Python, 3.12.4.
```
ERROR: Could not find a version that satisfies the requirement torch~=2.1.0 (from latex-ocr-server) (from versions: 2.2.0, 2.2.…
-
Huggingface Model: https://huggingface.co/microsoft/Phi-3.5-vision-instruct
Fine-tuned Dataset: https://huggingface.co/datasets/linxy/LaTeX_OCR
Usually, fine-tuning a multimodal large model invo…
-
How does this differ from https://github.com/lukas-blecher/LaTeX-OCR?
-
PDF was designed for printing and has limited support for devices of different sizes. HTML goes in another direction: it originates from a reflowable plain-text stream.
[This pag…
-
I'd like to see an enhancement added to the SDAPS project that would allow optical character recognition (OCR) of the text in freeform fields.
The Gamera Project may be a good place to start:
http:/…
-
Hi, we find that GROBID cannot parse inline formulas without discarding the spacing, superscript, and subscript information. Could you suggest a pathway to improve accuracy in these scenarios?
…