VamosC / CLIP4STR

An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
Apache License 2.0
115 stars 14 forks source link

clip4str attention graph #6

Closed dle666 closed 7 months ago

dle666 commented 9 months ago

This is an excellent work and I am very much interested in the advanced effects of clip attention maps in it.

Can you share the code used to generate the clip attention graph in the paper.

Thank you very much.

VamosC commented 9 months ago

cam.zip

This should work. The grad-cam code is built upon https://huggingface.co/spaces/njanakiev/gradio-openai-clip-grad-cam.

Have fun with it!

VamosC commented 7 months ago

Close due to in-activity. Feel free to re-open.