-
### 0.5B response is normal but 7B is wrong
The same image; the only change I make to the code is: pretrained = "/home/shihongyu/MMLM_models/lmms-lab/llava-onevision-qwen2-7b-ov"
model_name = "llava_qwen"
device = "…
-
### System Info
In the current implementation of VLMs, the `_supports_sdpa` attribute checks for and activates SDPA attention only for the language model. For example, in [Llava](https://github.com/huggingf…
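The gating pattern the issue describes can be illustrated with a simplified, pure-Python sketch. This is not the actual transformers code: `_supports_sdpa` is the real attribute name, but the class and method names below are made up for illustration.

```python
# Simplified sketch (NOT transformers internals) of how a class-level
# `_supports_sdpa` flag can gate which attention implementation is used.
class PreTrainedStub:
    _supports_sdpa = False  # conservative default: eager attention

    @classmethod
    def pick_attn_implementation(cls):
        # Classes that do not opt in fall back to eager attention.
        return "sdpa" if cls._supports_sdpa else "eager"


class LanguageModelStub(PreTrainedStub):
    _supports_sdpa = True   # the language tower opts in


class VisionTowerStub(PreTrainedStub):
    _supports_sdpa = False  # the vision tower never opts in -> always eager


print(LanguageModelStub.pick_attn_implementation())  # sdpa
print(VisionTowerStub.pick_attn_implementation())    # eager
```

The point of the issue is that only the language-model class flips the flag, so the vision tower ends up on the eager path even when SDPA would work for it.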
-
Hello again!
Would it be possible to modify the GMP fine-tuning script to train a LoRA with PEFT for the CLIP ViT-G model, and then merge the LoRA into the model to get a new CLIP-G model?
Chat-GPT se…
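For the merge step, PEFT's `merge_and_unload` folds each adapter into its target linear layer; numerically this is just `W_merged = W + (alpha / r) * B @ A`. A toy NumPy sketch of that arithmetic (all sizes and variable names here are illustrative, not ViT-G's actual dimensions):

```python
import numpy as np

# Toy illustration (not PEFT internals) of what "merge the LoRA into the
# model" means numerically: W_merged = W + (alpha / r) * B @ A.
rng = np.random.default_rng(0)
d_out, d_in, r, alpha = 16, 16, 4, 8  # hypothetical sizes

W = rng.standard_normal((d_out, d_in))  # frozen base projection weight
A = rng.standard_normal((r, d_in))      # LoRA down-projection (trained)
B = rng.standard_normal((d_out, r))     # LoRA up-projection (trained)

x = rng.standard_normal(d_in)

# Adapter-time forward: base path plus scaled low-rank path.
y_adapter = W @ x + (alpha / r) * (B @ (A @ x))

# Merged forward: fold the adapter into the weight, then drop A and B.
W_merged = W + (alpha / r) * (B @ A)
y_merged = W_merged @ x

print(np.allclose(y_adapter, y_merged))  # True
```

Because the merged weight has the same shape as the original, the result can be saved as an ordinary CLIP-G checkpoint with no adapter code needed at inference time.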
-
Hi,
I am trying to load the model `all-mpnet-base-v2` with `device_map="auto"`.
Following [this closed issue](https://github.com/UKPLab/sentence-transformers/issues/2435), I understand that it is possibl…
-
Hi, I noticed that you submitted a paper titled “Masked Attention as a Mechanism for Improving Interpretability of Vision Transformers” to Medical Imaging with Deep Learning 2024. Do you plan to integ…
-
- [Dilated Neighborhood Attention Transformer](https://arxiv.org/abs/2209.15001)
- [Neighborhood Attention Transformer](https://arxiv.org/abs/2204.07143)
- [Stand-Alone Self-Attention in Vision Mode…
-
### Model description
Dear Hugging Face team,
The FAIR team published an improved version of DINOv2, [Vision Transformers Need Registers](https://arxiv.org/abs/2309.16588). The models and checkpoi…
-
### Feature request
Currently, if fp16 is used with Grounding DINO via https://huggingface.co/docs/transformers/main/en/model_doc/grounding-dino, the following error occurs:
```
...
Fi…
```
-
### Model description
Hey there! I was looking to use nomic-ai/nomic-embed-vision-v1.5, since I'm already using the text version, so that I could support image/text queries in the same semantic space, but gettin…
-
- I am trying to run inference with Cambrian-1-34B.
- I have RTX 6000 GPUs with 48 GB each.
- I am following [this inference script](https://github.com/cambrian-mllm/cambrian/blob/main/inference.py).
The…