-
@MenghaoGuo @idansc @Gsunshine @PengtaoJiang @uyzhang Can you please provide the paper reference for the code in /spatial_attentions/ocr.py?
I guess it is for text recognition with attention but I could not gate…
-
Hello,
I am working with the VRDU dataset, and I am attempting to normalize the bounding boxes for use with LayoutLMv3. In your [paper](https://arxiv.org/abs/2211.15421), I see that OCR is used, an…
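
For the bounding-box normalization mentioned in the question above, here is a minimal sketch. It assumes the boxes are absolute `(x0, y0, x1, y1)` pixel coordinates and that the page width and height are known; neither is confirmed by the snippet. The helper name `normalize_bbox` is hypothetical. The 0 to 1000 integer range is the one LayoutLM-family models expect.

```python
def normalize_bbox(bbox, page_width, page_height, scale=1000):
    """Scale an absolute (x0, y0, x1, y1) box to integers in [0, scale].

    Assumption: bbox is in the same units as page_width / page_height.
    """
    x0, y0, x1, y1 = bbox

    def clamp(value):
        # Keep coordinates inside the valid range even if the OCR box
        # slightly overshoots the page boundary.
        return max(0, min(scale, value))

    return [
        clamp(int(scale * x0 / page_width)),
        clamp(int(scale * y0 / page_height)),
        clamp(int(scale * x1 / page_width)),
        clamp(int(scale * y1 / page_height)),
    ]


print(normalize_bbox((50, 100, 200, 150), page_width=1000, page_height=2000))
# → [50, 50, 200, 75]
```

If the dataset already stores boxes as fractions of the page size, passing `page_width=1` and `page_height=1` gives the same rescaling.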
-
![Screenshot 2024-08-24 160058](https://github.com/user-attachments/assets/d5e6993a-982d-4377-b9a7-2698a50a1340)
Debug log shows "non empty source txt list"
OG comic language is Dutch (Netherlan…
-
![image](https://user-images.githubusercontent.com/36793626/173314098-dac0bdb7-88cd-45b1-9c86-bde409d1fcaf.png)
Q1: Hello, in Section 5.1 of your paper, I notice you used Pytesseract V3.02.02, as show…
-
## Reference
- [VLR - Code - Data](https://www.vlrlab.net/code)
- [TextDetection文本检测数据集汇总](https://www.cnblogs.com/Tom-Ren/p/11054728.html)
- [Awesome-Scene-Text-Recognition](https://github.com/c…
-
### Finetuning on RVLCDIP
Download RVLCDIP first and change the path.
For OCR, you might need to customize your code.
```
bash scripts/finetune_rvlcdip.sh # Finetuning on RVLCDIP
```
Q1. wh…
-
Description
To create synthetic data for OCR, we try out font style transfer using deep learning.
The model transfers a font style onto a given image of text. Now research on v…
-
Many websites fall into disrepair and go offline even after being published in a conference paper. One way we can ensure that at least the pages describing a project (if not the corpora, etc) produced…
-
Nice images, but could someone elaborate on what the plan is here? Probably something like (1) make images more readable (2) convert image to text (3) run something on a simulator. Which directory first?…
-
The Bitcoin Politicians project is woefully out of date due to the hundreds of hours required to manually sift through financial disclosures from 535 members of Congress.
I'm offering a bounty of $…
jlopp updated
5 months ago