-
@stevebottos Thanks for the really great code! I am learning a lot. One quick issue: I am not able to get check_zero_shot_results.ipynb working (it may be based on an older version of your code). C…
-
Hi!
Thanks for the great work!
I encountered an issue during the pretraining stage.
I was fine-tuning the vision tower, the linear adapter, and the Large Language Model (LLM) in the pretraining sta…
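For reference, here is a minimal PyTorch sketch of the trainability setup described above. The module names (`vision_tower`, `adapter`, `llm`) are hypothetical placeholders, not the repo's actual classes; note that many LLaVA-style recipes freeze the vision tower and LLM during pretraining and train only the adapter, which differs from the fully unfrozen setup in this issue:

```python
import torch.nn as nn

# Toy stand-in for a vision-language model; the submodule names
# (vision_tower, adapter, llm) are hypothetical placeholders.
class ToyVLM(nn.Module):
    def __init__(self):
        super().__init__()
        self.vision_tower = nn.Linear(16, 16)
        self.adapter = nn.Linear(16, 16)
        self.llm = nn.Linear(16, 16)

model = ToyVLM()

# A common pretraining recipe: freeze the vision tower and the LLM,
# and train only the linear adapter.
for p in model.vision_tower.parameters():
    p.requires_grad = False
for p in model.llm.parameters():
    p.requires_grad = False
for p in model.adapter.parameters():
    p.requires_grad = True

trainable = [n for n, p in model.named_parameters() if p.requires_grad]
```

Comparing `trainable` between the two recipes is a quick way to confirm which parameters actually receive gradients during the pretraining stage.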
-
**Suggested steps:**
* [ ] Define unsupervised learning tasks, i.e., learning tasks that don't require truth-level labels but instead rely solely on the reconstruction-level data. This is the same…
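To make the distinction concrete, here is a minimal, hypothetical sketch of a label-free reconstruction objective (a linear autoencoder via PCA) using only NumPy; it stands in for whatever reconstruction-level quantities the project actually uses, and none of the names below come from the repo:

```python
import numpy as np

# Hypothetical example: a reconstruction objective needs no truth-level
# labels -- the input data itself is the training target.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))      # stand-in for reconstruction-level data
Xc = X - X.mean(axis=0)            # center the features

# PCA as a simple linear "autoencoder": encode to k dims, decode back.
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
k = 2
X_rec = (Xc @ Vt[:k].T) @ Vt[:k]   # encode then decode

# The unsupervised loss: mean squared reconstruction error.
mse = float(np.mean((Xc - X_rec) ** 2))
```

The same pattern (input in, input as target, reconstruction error as loss) carries over to nonlinear autoencoders trained with gradient descent.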
-
While I think of a more appropriate medium for the report below, I will log everything that has been done in the research since July.
### Sprint 1
#24
Basically, I reviewed what …
-
- https://arxiv.org/abs/2102.03334
- 2021
Vision-and-Language Pre-training (VLP) improves performance on a variety of joint vision-and-language downstream tasks.
Current VLP approaches rely heavily on the image feature extraction process, most of which involves region supervision (e.g., object detection) and convolutional architectures (e.g., ResNet).
Ignored in the literature…
e4exp updated
3 years ago
-
-
Please add the following four papers which use transformer backbones:
1. Egocentric video-language pre-training, which addresses video-text retrieval, video classification, text-guided video grounding, t…
-
I am trying to use TrOCR for recognizing Urdu text from images. For the feature extractor, I am using DeiT, with bert-base-multilingual-cased as the decoder. I can't figure out what the requirements will be if I…
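Wiring an arbitrary vision encoder to a text decoder is what the `transformers` `VisionEncoderDecoderModel` class does. Below is a hedged, offline sketch using toy-sized configs so it runs without downloading checkpoints; the real setup would instead call `VisionEncoderDecoderModel.from_encoder_decoder_pretrained` with a DeiT checkpoint and `bert-base-multilingual-cased`:

```python
from transformers import (
    BertConfig,
    DeiTConfig,
    VisionEncoderDecoderConfig,
    VisionEncoderDecoderModel,
)

# Toy-sized configs so this runs offline; a real run would pass checkpoint
# names (a DeiT encoder and bert-base-multilingual-cased) to
# VisionEncoderDecoderModel.from_encoder_decoder_pretrained instead.
enc_cfg = DeiTConfig(
    image_size=32, patch_size=8, hidden_size=32,
    num_hidden_layers=2, num_attention_heads=2, intermediate_size=64,
)
dec_cfg = BertConfig(
    hidden_size=32, num_hidden_layers=2, num_attention_heads=2,
    intermediate_size=64,
    is_decoder=True,            # BERT must act as a causal decoder here
    add_cross_attention=True,   # so it can attend to the image features
)
cfg = VisionEncoderDecoderConfig.from_encoder_decoder_configs(enc_cfg, dec_cfg)
model = VisionEncoderDecoderModel(config=cfg)
```

The key requirements are on the decoder side: it must be configured with `is_decoder=True` and `add_cross_attention=True`, and you still need an Urdu-capable tokenizer plus paired image/text data to fine-tune.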
-
Curated selection of Weibo content
-
# 🐛 Bug
I'm trying to create a 1:1 config that can train a stable ViT-B with the MAE config (from appendix A.2).
Maybe I'm missing something (highly plausible), but when I use xformers instead …
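For comparison while debugging the 1:1 config, here are the ViT-B pre-training hyper-parameters as I read them from the MAE paper's recipe; this is a hedged summary for cross-checking, not the repo's actual config, so verify each value against the appendix you are reproducing:

```python
# MAE ViT-B pre-training hyper-parameters as I read them from the paper's
# recipe; double-check against the appendix before relying on them.
mae_pretrain = {
    "optimizer": "AdamW",
    "betas": (0.9, 0.95),
    "base_lr": 1.5e-4,          # scaled linearly by batch_size / 256
    "weight_decay": 0.05,
    "batch_size": 4096,
    "lr_schedule": "cosine",
    "warmup_epochs": 40,
    "epochs": 800,
    "mask_ratio": 0.75,
    "augmentation": "RandomResizedCrop + horizontal flip",
    "norm_pix_loss": True,      # normalized-pixel reconstruction target
}

# Effective learning rate under the linear scaling rule.
effective_lr = mae_pretrain["base_lr"] * mae_pretrain["batch_size"] / 256
```

If the xformers run diverges where the reference run is stable, comparing each of these values (especially the warmup length and the lr scaling) between the two configs is a cheap first check before suspecting the attention backend itself.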