-
Is the vision_tower used in the code "clip-vit-large-patch14-336"? I can't load the vision_tower when I use the "Gamma-MoD-llava-hr-7b-0.34" checkpoint you provided. Even if I download "clip-vit-large-patch14-336…
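One workaround worth trying (just a sketch, not the authors' procedure) is to download the CLIP tower ahead of time and point the checkpoint's config at the local copy; the `mm_vision_tower` key and the local paths below are assumptions based on LLaVA-style configs, so check the actual config.json first:
```python
# Sketch: pre-fetch openai/clip-vit-large-patch14-336 and rewrite the (assumed)
# vision-tower entry in the checkpoint's config.json to the local directory.
import json
from pathlib import Path

from huggingface_hub import snapshot_download

clip_dir = snapshot_download("openai/clip-vit-large-patch14-336")

ckpt_dir = Path("./Gamma-MoD-llava-hr-7b-0.34")   # hypothetical local checkpoint path
config_path = ckpt_dir / "config.json"

config = json.loads(config_path.read_text())
config["mm_vision_tower"] = clip_dir              # assumed key name; verify it in your config.json
config_path.write_text(json.dumps(config, indent=2))
```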
-
# 🌟 New model addition
We recently proposed OFA, a unified model for multimodal pretraining, which achieves multiple SoTAs on downstream tasks, including image captioning, text-to-image generation, r…
-
### Question
## Motivation
I am trying to replace clip with chinese-clip.
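For reference, transformers already ships Chinese-CLIP classes, so a first attempt can look like the sketch below (the `OFA-Sys/chinese-clip-vit-base-patch16` checkpoint and the example image/captions are placeholders, not the actual setup in this issue):
```python
# Sketch: image-text matching with Chinese-CLIP through the transformers API.
import torch
from PIL import Image
from transformers import ChineseCLIPModel, ChineseCLIPProcessor

ckpt = "OFA-Sys/chinese-clip-vit-base-patch16"   # example checkpoint
model = ChineseCLIPModel.from_pretrained(ckpt)
processor = ChineseCLIPProcessor.from_pretrained(ckpt)

image = Image.open("example.jpg")                # any local image
texts = ["一只猫", "一只狗"]                      # candidate Chinese captions

inputs = processor(text=texts, images=image, return_tensors="pt", padding=True)
with torch.no_grad():
    outputs = model(**inputs)

# Higher probability means a better image-text match.
print(outputs.logits_per_image.softmax(dim=-1))
```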
## Environment
```bash
$ uname -a
Linux localhost.localdomain 3.10.0-1160.80.1.el7.x86_64 #1 SMP Tue Nov 8 15:48:59 UTC…
-
### Describe the issue
Issue:
When trying to load `liuhaotian/llava-v1.6-mistral-7b` or `liuhaotian/llava-v1.6-34b` into my container:
```
MODEL_PATH = "liuhaotian/llava-v1.6-mistral-7b"
US…
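# A minimal sketch, assuming the liuhaotian/LLaVA package is installed in the
# container: llava-v1.6 checkpoints are typically loaded through the repo's own
# builder rather than plain Auto classes. Names below come from that repo.
from llava.mm_utils import get_model_name_from_path
from llava.model.builder import load_pretrained_model

tokenizer, model, image_processor, context_len = load_pretrained_model(
    model_path=MODEL_PATH,
    model_base=None,
    model_name=get_model_name_from_path(MODEL_PATH),
)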
-
### Description
Deep Learning Module(s)
### Purpose
Allow using 80% of deep learning modelling via an intuitive and beautiful GUI, just like JASP's ML module.
### Use-case
Someone wh…
-
### Model description
Hi! I'm the author of ["Prismatic VLMs"](https://github.com/TRI-ML/prismatic-vlms), our upcoming ICML paper that introduces and ablates design choices of visually-conditioned …
-
Why were the following actions taken? Is there anything special about cc12m that I missed?
https://github.com/OFA-Sys/OFA/blob/a36b91ce86ff105ac8d9e513aa88f42b85e33479/data/pretrain_data/unify_dataset.…
-
Is there a quantized version available somewhere?
Code:
```
import torch
import requests
from PIL import Image
from transformers import Blip2Processor, Blip2ForConditionalGeneration
processor = Blip2…
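# A minimal sketch, assuming there is no official quantized checkpoint: the fp16
# model can be loaded in 8-bit via bitsandbytes. The "Salesforce/blip2-opt-2.7b"
# id is an example; this needs the bitsandbytes and accelerate packages installed.
processor = Blip2Processor.from_pretrained("Salesforce/blip2-opt-2.7b")
model = Blip2ForConditionalGeneration.from_pretrained(
    "Salesforce/blip2-opt-2.7b",
    load_in_8bit=True,
    device_map="auto",
    torch_dtype=torch.float16,
)

image = Image.open("example.jpg").convert("RGB")  # any local image

inputs = processor(images=image, return_tensors="pt").to(model.device, torch.float16)
out = model.generate(**inputs, max_new_tokens=30)
print(processor.batch_decode(out, skip_special_tokens=True)[0].strip())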
-
Hey Amir, nice job with your work. You just forgot the requirements.txt in the source; I'm having a problem here that may be caused by the installed requirement versions.
The error:
while loading with…
-
## Paper link
https://arxiv.org/abs/2103.00020
## Publication date (yyyy/mm/dd)
2021/01/05
## Overview
The paper on CLIP (Contrastive Language-Image Pre-training), which was also used for reranking in OpenAI's DALL·E.
From text on the Web, special a…
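As a quick reminder of the core training objective (my own minimal sketch in PyTorch, following the pseudocode in the paper rather than any official implementation): each batch of N image-text pairs is scored with a cosine-similarity matrix, and a symmetric cross-entropy pushes the N matching pairs above the mismatched ones.
```python
# Sketch of CLIP's symmetric contrastive loss for one batch of N image-text pairs.
import torch
import torch.nn.functional as F

def clip_loss(image_features: torch.Tensor,
              text_features: torch.Tensor,
              temperature: float = 0.07) -> torch.Tensor:
    # L2-normalize so the dot product becomes a cosine similarity.
    image_features = F.normalize(image_features, dim=-1)
    text_features = F.normalize(text_features, dim=-1)

    # [N, N] similarity matrix, scaled by the temperature.
    logits = image_features @ text_features.t() / temperature

    # The matching text for image i sits in column i.
    labels = torch.arange(logits.size(0), device=logits.device)

    # Symmetric cross-entropy over image->text and text->image directions.
    loss_i = F.cross_entropy(logits, labels)
    loss_t = F.cross_entropy(logits.t(), labels)
    return (loss_i + loss_t) / 2

# Toy usage with random features (batch of 8, embedding dim 512).
print(clip_loss(torch.randn(8, 512), torch.randn(8, 512)).item())
```
In the paper the temperature is a learned logit scale; it is fixed here for brevity.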