-
Hi,
I'm trying to constrain the generation of my VLMs using this repo; however, I can't figure out how to customize the pipeline to handle inputs (query + image). Whereas it is documented as …
-
Could you please also add vision/video transformer models? Thanks in advance.
-
Hello, we encountered the error `Cannot import name '_init_vit_weights' from 'timm.models.vision_transformer'` while trying to replicate your method. This might be due to changes in the timm version tha…
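In case it helps while waiting for a fix: a hedged workaround sketch, assuming the error comes from a timm API change. Newer timm releases dropped the private `_init_vit_weights` helper, so one option is to try the old name and fall back to the newer public name (`init_weights_vit_timm` is an assumption based on the timm >= 0.6 API); another is to pin an older timm release that still exports it (e.g. `pip install "timm==0.4.12"`, version pin is an assumption — check this repo's requirements).

```python
# Hedged workaround: try the old private helper first, then the newer public
# name; both names here are assumptions about timm's evolving API, so verify
# against the timm version this repo was developed with.
try:
    from timm.models.vision_transformer import _init_vit_weights as init_vit_weights  # timm <= 0.4.x
except ImportError:
    try:
        from timm.models.vision_transformer import init_weights_vit_timm as init_vit_weights  # timm >= 0.6 (assumed)
    except ImportError:
        init_vit_weights = None  # timm not installed, or the API changed again

print(init_vit_weights)
```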
-
### Describe the bug
`transformers` added `sdpa` and FA2 support for the CLIP model in https://github.com/huggingface/transformers/pull/31940. It now initializes the vision model like https://github.com/huggingf…
-
Hi, I'm wondering whether the model weights are convertible between the HF checkpoint and the open_clip checkpoint.
HF model weight: https://huggingface.co/wkcn/TinyCLIP-ViT-40M-32-Text-19M-LAION400M
open_clip model: htt…
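Not an answer for TinyCLIP specifically, but conversion between the two formats is usually a key-rename pass over the state dict, since HF CLIP and open_clip name the same tensors differently. A minimal sketch; the prefix table below is purely hypothetical (a real TinyCLIP conversion needs the full mapping, which would have to be derived by diffing both checkpoints' keys).

```python
def remap_keys(state_dict, rename):
    """Rename state-dict keys by longest-prefix rewrite; unmatched keys pass through."""
    out = {}
    for key, value in state_dict.items():
        for old, new in rename.items():
            if key.startswith(old):
                key = new + key[len(old):]
                break
        out[key] = value
    return out

# Hypothetical prefix mapping (illustrative only, not the real TinyCLIP table).
rename = {
    "vision_model.encoder.": "visual.transformer.",
    "text_model.encoder.": "transformer.",
}

converted = remap_keys({"vision_model.encoder.layers.0.weight": 1}, rename)
print(converted)  # → {'visual.transformer.layers.0.weight': 1}
```

After remapping, loading with `strict=True` (or comparing the two key sets) is a quick way to check that the mapping is complete.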
-
Hi, I get this error when preprocessing text with the mSigLIP model. Any idea what may be wrong? I didn't change anything in the [demo colab](https://colab.research.google.com/github/google-research…
-
How does GLM-4V handle high-resolution image inputs, and how does it differ from CogVLM?
![image](https://github.com/user-attachments/assets/ee3e5f1b-7a4f-4ab6-9926-1bfddef3ba83)
Where in the project code is the High-Resolution Cross-Module shown in the figure implemented?
Thanks!
-
I see that the multi-modal models in the examples all use TensorRT directly to deploy the vision encoders; why not use TensorRT-LLM? Are there known issues or challenges associated with integrating Context…
-
Hi friends!
I'd like to share our recent project, embodied-agents (https://github.com/mbodiai/embodied-agents), which makes it easy to integrate large multi-modal models into existing robot stacks wi…
-
![image](https://github.com/user-attachments/assets/7bef6dbb-ffb4-4037-add0-7035c2909867)