Closed jeong-tae closed 5 months ago
“OCR with grounding:” Hello, you can use this prompt in the demo(https://github.com/Yuliang-Liu/Monkey/blob/main/demo_textmonkey.py).
I used this: http://vlrlab-monkey.xyz:7681/
it doesn't give OCR results. is this intended in the provided demo? should I run demo_textmonkey.py
myself to get OCR results?
I'm sorry, the demo http://vlrlab-monkey.xyz:7681/ is for Monkey, and for TextMonkey you should run the code (https://github.com/Yuliang-Liu/Monkey/blob/main/demo_textmonkey.py)
In the paper, textMonkey in Figure 5, it claims the textMonkey can generate bounding boxes but they always say "use OCR model". How can I get the OCR visualized result from the model?