-
1. text
   - paraphrase
   - q/a
   - translation
   - summarization
2. image
3. audio
-
I have a FOTS checkpoint, trained from scratch, which gives pretty good results on my database.
I want to finetune the recognition branch of this checkpoint. The finetuning runs smoothly but results are…
-
I have tried to load the table transformer detection and segmentation model in the doctection analyzer as indicated in the repo but I get the following error:
Model not found in ModelCatalog. Make s…
-
Right now the extension is tried first, and if a match is found, the result is returned. This prevents actual type recognition from consulting the magic database and thus provides v…
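As a rough illustration of the ordering issue described above, here is a minimal sketch that checks content signatures first and only falls back to the extension when no signature matches. The function name `detect_type` and both lookup tables are hypothetical and deliberately tiny; they are not taken from the project in question, and a real magic database covers far more formats.

```python
from pathlib import Path

# Hypothetical, deliberately small signature table (leading bytes -> MIME type).
MAGIC_SIGNATURES = [
    (b"%PDF-", "application/pdf"),
    (b"\x89PNG\r\n\x1a\n", "image/png"),
    (b"\xff\xd8\xff", "image/jpeg"),
]

# Hypothetical extension table, used only as a fallback.
EXTENSION_TYPES = {
    ".pdf": "application/pdf",
    ".png": "image/png",
    ".jpg": "image/jpeg",
    ".txt": "text/plain",
}


def detect_type(name: str, head: bytes) -> str:
    """Return a MIME type, consulting content signatures before the extension."""
    for signature, mime in MAGIC_SIGNATURES:
        if head.startswith(signature):
            return mime
    # No signature matched: fall back to the file extension, then to a generic type.
    return EXTENSION_TYPES.get(Path(name).suffix.lower(), "application/octet-stream")
```

With this ordering, a PDF that was saved with a `.txt` extension is still reported as a PDF, while a plain text file with no recognizable signature falls through to the extension lookup.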
-
I installed and deployed marker-pdf locally. The output was successful in GPU + CUDA mode, but model loading (load_all_models() from the source code) is very slow. Why? Is this normal? Or i…
-
Input images from cameras etc. often have a much higher resolution than is needed to read the text. Downscaling the image can often produce the same output in much less time. This is because all of th…
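The downscaling idea above can be sketched as a small helper that computes a reduced target size while preserving the aspect ratio. The name `downscaled_size` and the default `max_side` of 1600 pixels are assumptions chosen for illustration, not values from the original post; the right limit depends on how small the text in the image is.

```python
def downscaled_size(width: int, height: int, max_side: int = 1600) -> tuple[int, int]:
    """Return (width, height) scaled so the longer side is at most max_side."""
    longest = max(width, height)
    if longest <= max_side:
        # Already small enough; never upscale.
        return width, height
    scale = max_side / longest
    # Keep both dimensions at least 1 pixel after rounding.
    return max(1, round(width * scale)), max(1, round(height * scale))
```

The returned size can then be passed to whatever imaging library is in use (for example, Pillow's `Image.resize`) before the image is handed to the OCR engine.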
-
- I have trained EAST on my own dataset, and in this dataset we have text in multiple orientations (upright, 90°, -90°, upside down);
- The model is detecting the text no matter the orientation;
- Af…
-
1) We are adding Korean as an additional language in config.py when training, in order to do Korean OCR. Does FOTS support multi-language features?
2) Even though we completed the recognition test t…
-
Hi,
Text Recognition can't recognize the font used in embossed credit cards for the card number:
![creditcard_digits1](https://user-images.githubusercontent.com/479928/129242654-c56e3ad0-c07a-4d87…
-
Modern Android devices have a built-in scam call screening system that automatically determines if a call may be a scam call, and then shows you a "screen call" option. When you screen a call it announ…