-
Adding Vision Transformer for scene text recognition. I am currently working on this (with a Hugging Face ViT backbone). Once I am done and have solid results, I would be delighted to add this model if you inte…
-
https://www.gushiciku.cn/dc_tw/101260970
https://www.163.com/dy/article/GK7B1FN905328ZJ2.html
-
Can this code handle images of different sizes with your pretrained model? I found that you use a pretrained model from DeiT and resize each image to 224 × 224. So can I set imgH and imgW to another number …
-
My graphics card does not have 32 GB of memory. After I reduced the batch size, the loss became NaN and I could not obtain the best model. Do you have any suggestions?
-
Hello,
I used a single-GPU environment with python==3.8, torch==1.8.1, and torchvision==0.9.1.
I followed the instructions on GitHub and ran the following command:
```
python3 infer.py --gpu --image demo_image/demo_2.jpg…
```
-
### 🚀 The feature
Add new models, or fine-tune existing ones, to support OCR for `Handwritten Text`.
As a first step, we can start with detection/prediction models that work specifically for Han…
-
The forward function of OCR.vitstr contains the following code:
features_1d_concat = torch.ones(x.shape[0] // 3, 201, x.shape[2]).type_as(x)
features_1d_concat[:, :69, :] = ap[:, 0, :69…
-
### 🔎 Search before asking
- [X] I have searched the PaddleOCR [Docs](https://paddlepaddle.github.io/PaddleOCR/) and found no similar bug report.
- [X] I have searched the PaddleOCR [Issues](https…
-
Hi. I created an LMDB dataset from my own data by running create_lmdb_dataset.py. Then I ran the train command on it and got the following output:
CUDA_VISIBLE_DEVICES=0 python3 train.py --train_data re…
-
Please provide the following information to quickly locate the problem
- System Environment: Windows 10
- Version: Paddle: PaddleOCR: Related components:
- Comma…