-
![Screenshot 2024-08-24 160058](https://github.com/user-attachments/assets/d5e6993a-982d-4377-b9a7-2698a50a1340)
Debug log shows "non empty source txt list"
OG comic language is Dutch (Netherlan…
-
@MenghaoGuo @idansc @Gsunshine @PengtaoJiang @uyzhang Can you please provide paper refence for code /spatial_attentions/ocr.py
I guess it is for text recognition with attention but I could not gate…
-
Hi!
I found the 7B [ckpt](https://huggingface.co/renjiepi/BPO-Lora-LLaVA-7B) file you provided in a previous issue. After running my tests, I obtained the following results. Could you help me figu…
-
Description
In order to create synthetic data for OCR, we try out the approach of font style transfer using deep learning.
Model will transfer font style onto an image of text given. Now research on v…
-
您好!这份工作真的很棒,我正在寻找OCR的SOTA离线模型。如题所述,我想在自己的demo中使用这个模型,就之前我尝试的模型中,Azure的OCR效果最好,请问VimTS与Azure的对比效果如何呢?https://portal.vision.cognitive.azure.com/demo/extract-text-from-images
如果我进行领域内微调的话,可以直接使用Azure的OC…
-
## Reference
- [VLR - Code - Data](https://www.vlrlab.net/code)
- [TextDetection文本检测数据集汇总](https://www.cnblogs.com/Tom-Ren/p/11054728.html)
- [Awesome-Scene-Text-Recognition](https://github.com/c…
-
![image](https://user-images.githubusercontent.com/36793626/173314098-dac0bdb7-88cd-45b1-9c86-bde409d1fcaf.png)
Q1: Hello, in section5.1 of your paper, I notice you used Pytesseract V3.02.02, as show…
-
### Finetuninng on RVLCDIP
Download RVLCDIP first and change the path
For OCR, you might need to customize your code
```
bash scripts/finetune_rvlcdip.sh # Finetuning on RVLCDIP
```
Q1. wh…
-
The link:
This list provides basically everything we have at and even has additional nice features. And unlike MathSciNet it is free to use
While it has overall more publications than we do, i…
-
The Bitcoin Politicians project is woefully out of date due to the hundreds of hours required to manually sift through financial disclosures from 535 members of Congress.
I'm offering a bounty of $…
jlopp updated
3 months ago