ocr-paper Search Results

1000+ results
for ocr-paper

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

microsoft/i-Code #86

In 'Finetuninng on RVLCDIP', which one is the dataset ?

### Finetuninng on RVLCDIP Download RVLCDIP first and change the path For OCR, you might need to customize your code ``` bash scripts/finetune_rvlcdip.sh # Finetuning on RVLCDIP ``` Q1. wh…

CheungZeeCn updated 1 year ago
1
OpenPecha/Synthetic-Data-Creation-using-Diffusion #1

OCR0042: Synthetic Data Creation Research using Diffusion

Description: Following up after gaining valuable knowledge from '[OCR0027 Font Style Transfer Research](https://github.com/orgs/OpenPecha/projects/56/views/7?pane=issue&itemId=65728918)', we learn the…

Norbu-Jamling updated 1 month ago
14
AIGText/GlyphControl-release #8

benchmark needed

lwb2099 updated 8 months ago
3
stephanecharette/DarkPlate #12

ALPR Unconstrained

Hello Dear Mr. Stéphane Charette, I tried the DarkPlate executable app with the config files and weghts you provided and it works! But if I replace them with the files and weights from this url (h…

Baxulio updated 8 months ago
3
HRNet/HRNet-Semantic-Segmentation #55

How can i use the best model

I found the best performance in Cityscapes leaderboard(83.7). How can I reproduce this result?

XuShoweR updated 4 years ago
3
mkocabas/CoordConv-pytorch #10

Object Detection Example

Is there an example code integrating this with FasterRCNN as mentioned in the paper? It seems to me like this solution has potential impact on localization tasks applied to OCR. I'd be curious to s…

aribornstein updated 6 years ago
1
Unstructured-IO/unstructured #2939

Text Extraction Issue: Greek Language PDFs Rendered with Inc…

**Describe the bug** I am evaluating the UnstructuredClient for processing PDF documents and am encountering an issue with the Greek language text extraction. When I attempt to extract text from PDF …

DarioBernardo updated 4 months ago
3
microsoft/unilm #616

The skills about DocVQA results

**Describe** when I use the LayoutLM v2 base model to reproduce the result reported in [LayoutLMv2 paper, Table 6, row 7](https://arxiv.org/pdf/2012.14740.pdf),the result is 74,and I find there is ov…

dongxuewang-123 updated 2 years ago
1
QwenLM/Qwen2-VL #105

How to use bbox in SFT/微调中的定位框问题

我在观察Qwen2-vl的SFT数据格式时发现似乎和Qwen-vl的格式差别比较大,重点是没有给出定位框的标注示例了,还有就是0-1000的归一化问题,再Qwen2-vl中还需要操作么?求解答 When observing the SFT data format of Qwen2-vl , I found that there seems to be a big difference with …

mokby updated 2 days ago
7
Yuliang-Liu/VimTS #4

能否在领域内进行多语言OCR微调

您好！这份工作真的很棒，我正在寻找OCR的SOTA离线模型。如题所述，我想在自己的demo中使用这个模型，就之前我尝试的模型中，Azure的OCR效果最好，请问VimTS与Azure的对比效果如何呢？https://portal.vision.cognitive.azure.com/demo/extract-text-from-images 如果我进行领域内微调的话，可以直接使用Azure的OC…

Mistsink updated 1 month ago
3

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for ocr-paper

1000+ results
for ocr-paper