-
- [ ] [awesome-reasoning/README.md at master · neurallambda/awesome-reasoning](https://github.com/neurallambda/awesome-reasoning/blob/master/README.md?plain=1)
# awesome-reasoning
…
-
(clip4str) root@Lab-PC:/workspace/Project/OCR/CLIP4STR# bash scripts/vl4str_base.sh
abs_root: /home/shuai
model:
_convert_: all
img_size:
- 224
- 224
max_label_length: 25
charset_t…
-
[From Visual Prompt Learning to Zero-Shot Transfer](https://arxiv.org/pdf/2303.05266.pdf)
[Learning to Prompt for Vision-Language Models](https://link.springer.com/article/10.1007/s11263-022-01653…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
Hi there!
I am curious about how to handle PowerPoints that contain images and gr…
-
- [ ] [DeepSeek-VL: Towards Real-World Vision-Language Understanding](https://arxiv.org/html/2403.05525v2)
# DeepSeek-VL: Towards Real-World Vision-Language Understanding
**Abstract**
We present De…
-
### Paper
[Learning Transferable Visual Models From Natural Language Supervision](https://arxiv.org/abs/2103.00020) (a.k.a. CLIP)
### Speaker
@joosun7l
-
Paper: "Prompting Visual-Language Models for Efficient Video Understanding"
Link: https://voide1220.github.io/distillation_collaboration/
Surprisingly, this is what the paper's code link actually points to...
-
### Problem
We want to add support for this new model, which, unlike the previous ones, also supports vision. The model's README is reproduced below:
---
language:
- en
- de
- fr
- it
- pt…
-
With large language models becoming popular for generating dynamic content, it is important to investigate the feasibility of bringing AI into Vocabhub. Some points that come to mind on how we can integra…
-
- [ ] [LLaVA/README.md at main · haotian-liu/LLaVA](https://github.com/haotian-liu/LLaVA/blob/main/README.md?plain=1)
# LLaVA/README.md at main · haotian-liu/LLaVA
## 🌋 LLaVA: Large Language and Vi…