-
Hey, thanks for creating these notebooks! But I am trying to run Idefics_FT and, unfortunately, it isn't working: I run into an out-of-memory error when calling trainer.train() even though I am runn…
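A back-of-envelope check can show why trainer.train() runs out of memory even when the bare model fits on the GPU: full fine-tuning also holds gradients and Adam optimizer states. The sketch below is generic, not Idefics-specific, and the parameter count used in the example is an illustrative assumption:

```python
# Rough sketch: GPU memory for full fine-tuning = weights + gradients
# + Adam moment buffers, ignoring activations (which add more on top).
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "bf16": 2}

def training_memory_gb(n_params, dtype="fp16"):
    weights = n_params * BYTES_PER_PARAM[dtype]
    grads = n_params * BYTES_PER_PARAM[dtype]  # one gradient per weight
    adam_states = n_params * 4 * 2             # two fp32 moments per weight
    return (weights + grads + adam_states) / 1024**3

# e.g. a 9B-parameter model in fp16 already needs ~100 GB before activations,
# which is why parameter-efficient methods (LoRA/QLoRA) are common here.
print(round(training_memory_gb(9e9), 1))
```

If the estimate exceeds your card's memory, no batch-size tweak will save a full fine-tune; freezing most weights or quantized adapters is the usual way out.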
-
When I execute the following code, I cannot connect, but other models load fine. What causes this?
OSError: We couldn't connect to 'https://huggingface.co' to load this file, couldn't find it…
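One common workaround, assuming the model files were already downloaded into the local cache at some earlier point: force offline mode so the library reads from disk instead of contacting the Hub. A minimal sketch (the env vars are the standard huggingface_hub mechanism; the repo id in the comment is a placeholder, and the variables must be set before importing transformers):

```python
import os

# Sketch of a workaround: if the files are already in the local cache,
# offline mode makes the library read from disk instead of contacting
# https://huggingface.co. Set these BEFORE importing transformers.
os.environ["HF_HUB_OFFLINE"] = "1"
os.environ["TRANSFORMERS_OFFLINE"] = "1"

# from transformers import AutoModel
# model = AutoModel.from_pretrained("org/model-name")  # placeholder repo id
```

If the file was never cached, offline mode will still fail; in that case check proxy/firewall settings or download the repo manually and pass a local path to from_pretrained.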
-
I am reproducing the model on a V100 GPU. If anyone is doing the same, I hope we can communicate and exchange ideas. My WeChat: Anymake_ren
1. Flickr30k:
http://shannon.cs.illinois.edu/D…
-
Hi, thank you so much for the great work and for releasing the code. I would like to study the video QA ability of this model, specifically on MSVD-QA or TGIF-Frame. Is it possible for us to download the …
-
[paper](https://arxiv.org/pdf/2310.03744.pdf)
see the LLaVA notes here: https://github.com/long8v/PTIR/issues/128#issue-1749571159
## TL;DR
- **I read this because.. :** aka LLaVA1.5 / in ShareGPT4V, LL…
-
Why is OBELICS generally better than MINT-1T (HTML)? Is the main advantage of MINT-1T over OBELICS primarily related to PDFs?
-
[paper](https://arxiv.org/abs/2311.04257)
## TL;DR
- **I read this because.. :** a very recent VLM
- **task :** VLM + LLM
- **problem :** multi-modal work tends to freeze the LLM and, in effect, focus on getting V+L right…
-
Thanks for your great code!
In your paper, running the pre-training experiments requires 64 V100 GPUs, which is too heavy for research purposes.
If a small batch size is used instead, the performance would dr…
-
**Describe**
Model I am using (UniLM, MiniLM, LayoutLM ...): BEiT-3
I want to evaluate the fine-tuned BEiT-3 model on VQAv2.
[https://github.com/microsoft/unilm/blob/master/beit3/get_started/get_start…
-
### Question
Hi @haotian-liu, thanks for your great project.
As mentioned in the paper, the statistics of llava_v1_5_mix665k.json are shown in Table 7:
That is, 158+40+83+72+9+80+50+22+30+86…