-
A common use case for OCR evaluation (e.g. for search engine indexing, text- and data mining, asf.) is to omit stopwords from the word evaluation to get an understanding of the correctness of "signifi…
cneud updated
3 years ago
-
Ref: https://groups.google.com/d/msgid/tesseract-ocr/8cc88ed2-99c3-445e-b758-83ade0f680aa%40googlegroups.com?utm_medium=email
copied below
----------
Good day!
Recently I was using tesseract (…
-
Hi
I’m conducting research regarding OCR corpuses, and I would like to use this project for evaluation of how differences on the training corpus effects the quality of the post-processing.
But, I ha…
-
# URL
- https://arxiv.org/abs/2310.16809
# Affiliations
- Yongxin Shi, N/A
- Dezhi Peng, N/A
- Wenhui Liao, N/A
- Zening Lin, N/A
- Xinhong Chen, N/A
- Chongyu Liu, N/A
- Yuyi Zhang, N/A…
-
Hi @ChWick @andbue!
Thanks for this amazing project. I am using calamari as part of a data extraction task for tables in mid 20th century documents. Specifically I run calamari on the (single line)…
-
## Reference
- [paper - 2018 Calamari A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition](https://arxiv.org/ftp/arxiv/papers/1807/1807.02004.pdf)
- [paper …
-
Can be added to the list of datasets.
* MiBio
- https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6197712/
- https://github.com/jie-mei/MiBio-OCR-dataset
-
I notice you have a request for anonymising pixel data using OCR. I have been working on this, but in a separate code base, not as modifications to deid. It turns out that the hardest part is the eval…
-
If you upload a PDF, it seems that the project is set as a OCR correction project even if you don't check ocr correction.
We should reproduce to make sure.
-
### Describe the issue
Issue:
Command:
```
#!/bin/bash
python -m llava.eval.model_vqa_loader \
--model-path liuhaotian/llava-v1.5-7b \
--question-file ./playground/data/eval/textvqa…