-
## Reference
- [paper - 2017 - EAST: An Efficient and Accurate Scene Text Detector](https://arxiv.org/pdf/1704.03155.pdf)
- [OCR EAST: An Efficient and Accurate Scene Text Detector: a detailed explanation of the text detection algorithm for natural scenes](ht…
-
Hi - Apologies for the avalanche of questions posted. I have read your paper https://arxiv.org/pdf/1912.13318.pdf and also studied FUNSD, previously using it to train a Faster R-CNN and predict Q …
-
Description
In order to create synthetic data for OCR, we try out the approach of font style transfer using deep learning.
The model will transfer a font style onto a given image of text. Now research on v…
-
Great work!
I'm now trying to **train nougat from scratch**.
**The data** I use is generated from arXiv using the script the author provided; I have **130w** (1.3 million) in total so far.
**The machine** …
-
Hi! It seems I'm not able to find a REST API endpoint for generating embeddings for an image, in other words, providing functionality for using models like CLIP which can take both text and images as …
-
Thank you for the excellent node. I created a workflow about it on [OpenArt](https://openart.ai/workflows/fish_intent_33/pdftoslides-in-comfyui/7rIk4LKwjsKx8xzTyiEU).
It works well with OCR. Howeve…
-
Great work indeed!
From the description in the paper, I do not find any special OCR module. I am curious how LLaVA obtains the ability to understand the text in the image (e.g., the famous examples…
-
### Finetuning on RVLCDIP
Download RVLCDIP first and change the path
For OCR, you might need to customize your code
```
bash scripts/finetune_rvlcdip.sh # Finetuning on RVLCDIP
```
Q1. wh…
-
[The format of the issue]
Paper name/title:
Project link:
Paper link:
Code link:
-
Is there an example code integrating this with FasterRCNN as mentioned in the paper?
It seems to me like this solution has potential impact on localization tasks applied to OCR. I'd be curious to s…