-
The results of LLaVA-v1.5 on the TextVQA benchmark reported in the paper are much lower than those in the LLaVA-v1.5 paper.
-
Hi, great work and thanks for sharing the code and weights!
I tried the OCR on your sample and it works well. However, may I ask how we could perform VQA? Something similar to your paper exa…
-
Thanks for creating this package!
As discussed in https://github.com/robertknight/ocrs/issues/14, it would be nice to add some evaluation benchmarks, and maybe optionally compare with Tesseract or s…
-
I can't find the training data files "BLIVA/bliva/data/llava/bliva_llava_150k.json" and "BLIVA/bliva/data/ocrVQA/cleaned_train_dataset.json". Can you tell me how to download …
-
Thank you very much for your work; I have been following it since the beginning. I would really like to know if you have reproduced the results from the original paper. I also created a multilingual O…
-
### Describe your problem
Hi there, during my testing it became more and more clear that something is quite wrong with the parsing/OCR method. When e.g. inputting a 30-page scientific paper, and setti…
-
Jotting down notes on this idea, brainstormed w/ Josh D:
Unclear if it's feasible, viable, or desirable, but interesting to consider as like half-SIV to intro skeptics:
------
SIV backed by P…
-
### feature
I tested OCR on scanned documents in Brazilian Portuguese and saw that LLaVA makes a lot of mistakes on them.
#### result from https://huggingface.co…
-
Hi,
Do you know how I should change the dataset creation for the OCR task?
Is it just the concatenation of bbox special tokens with the text, or do I need to do more?
Thanks for the finetuni…
-
Hello, I ran into a problem when using a custom build of PyTorch on an Android device. The errors are as follows:
FAILED: ../../../../build/intermediates/cmake/debug/obj/arm64-v8a/libocrKit.so
: && /Use…