-
Llama 3.2 Vision is great work!
I am doing some interesting work based on Llama 3.2 Vision. I have read the paper about Llama 3.2 Vision, but I have a very important question to ask.
Below is an image…
-
Hi,
Thanks for the great work. Is there any example of how this could be used with standard (PyTorch) Vision Transformers?
Many thanks,
Sid
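The repository this question is about isn't identified in the excerpt, so only as a point of reference, here is a minimal sketch of what a "standard (PyTorch) Vision Transformer" backbone looks like using torchvision's ViT-B/16; nothing in it comes from the project being asked about.

```python
# Minimal sketch (not from the project in question): instantiate a standard
# torchvision ViT and run a forward pass, i.e. the kind of off-the-shelf
# Vision Transformer the question refers to.
import torch
from torchvision.models import vit_b_16, ViT_B_16_Weights

model = vit_b_16(weights=ViT_B_16_Weights.IMAGENET1K_V1)
model.eval()

dummy = torch.randn(1, 3, 224, 224)  # ViT-B/16 expects 224x224 RGB input
with torch.no_grad():
    logits = model(dummy)            # (1, 1000) ImageNet class logits
print(logits.shape)
```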
-
I’m trying to fine-tune Phi 3.5 Vision using transformers. However, I’m running into an issue trying to save the model during or after training. See below for a minimal reproducible example.
My examp…
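The reporter's reproducible example is cut off above, so purely as a hedged sketch of the step being discussed: loading Phi 3.5 Vision with transformers and calling `save_pretrained`, which is the save path the issue reports as failing. The checkpoint name `microsoft/Phi-3.5-vision-instruct` and output directory are assumptions, not taken from the original example.

```python
# Hedged sketch, not the reporter's actual reproduction: load Phi 3.5 Vision
# with transformers and attempt the save step that the issue says fails.
# The checkpoint id below is an assumption.
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "microsoft/Phi-3.5-vision-instruct"
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    trust_remote_code=True,  # Phi vision checkpoints ship custom modeling code
)
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)

# The problematic step: persisting the (fine-tuned) weights to disk.
model.save_pretrained("phi35-vision-finetuned")
processor.save_pretrained("phi35-vision-finetuned")
```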
-
Hello,
Could you provide the vision transformer backbone used for the model?
I am using DINO's vision_transformer.py code for a ViT-Giant (https://github.com/facebookresearch/dino/blob/main/visi…
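As a hedged sketch of the approach the questioner describes: DINO's vision_transformer.py only ships `vit_tiny`/`vit_small`/`vit_base` constructors, so a larger model has to be built from its `VisionTransformer` class directly. The ViT-Giant hyperparameters below (patch 14, width 1536, depth 40, 24 heads) are an assumption based on common ViT-g/14 configurations, not something confirmed by the model authors.

```python
# Hedged sketch: constructing a larger ViT from DINO's vision_transformer.py.
# The ViT-Giant hyperparameters are assumed (patch 14, dim 1536, depth 40,
# 24 heads); dino/vision_transformer.py must be importable on your path.
from functools import partial

import torch
import torch.nn as nn
import vision_transformer as vits  # dino/vision_transformer.py

vit_giant = vits.VisionTransformer(
    patch_size=14,
    embed_dim=1536,
    depth=40,
    num_heads=24,
    mlp_ratio=4,
    qkv_bias=True,
    norm_layer=partial(nn.LayerNorm, eps=1e-6),
)

feats = vit_giant(torch.randn(1, 3, 224, 224))  # DINO's forward returns the [CLS] embedding
print(feats.shape)  # (1, 1536)
```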
-
### Feature request
This request proposes one of three changes (see **Motivation** for background, and **Your contribution** for more thoughts on possible solutions) in order to allow saving of a certa…
-
### 🚀 The feature
Implement the CrossViT model for fine-grained classification
### Motivation, pitch
CrossViT integrates multi-scale feature representations, enabling it to efficiently process images o…
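As a hedged sketch of what the requested workflow could look like from the user side, here is CrossViT instantiated through timm with a fine-grained label set; the `crossvit_small_240` model name and the 200-class setting are assumptions for illustration, not part of this feature request.

```python
# Hedged sketch: using an existing CrossViT implementation from timm with a
# fine-grained classification head. Model name and class count are assumed.
import timm
import torch

num_fine_grained_classes = 200  # e.g. a CUB-200-style fine-grained label set
model = timm.create_model(
    "crossvit_small_240",
    pretrained=False,            # set True to start from ImageNet weights
    num_classes=num_fine_grained_classes,
)
model.eval()

x = torch.randn(1, 3, 240, 240)  # the *_240 CrossViT variants expect 240x240 input
with torch.no_grad():
    logits = model(x)
print(logits.shape)  # (1, 200)
```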
-
### Issue Type
Documentation Bug
### Source
source
### Keras Version
2.14
### Custom Code
Yes
### OS Platform and Distribution
Ubuntu 22.04
### Python version
3.10
…
-
Hello, I am very interested in your work, but why can't I find your paper: Pre-training Vision Transformers for Visual Times Series Forecasting?
-
I run the program in PyCharm, and the error listed below occurs. How can I solve it?
ValueError: Unrecognized model in weights/icon_caption_florence. Should have a `model_type` key in its config.json, or co…
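The error message itself points at the missing `model_type` entry, so as a hedged sketch of the usual workaround: make sure the checkpoint's config.json declares a `model_type` and load it with `trust_remote_code=True`. The `"florence2"` value is an assumption about this particular checkpoint, not something stated in the traceback.

```python
# Hedged sketch of a typical workaround for this ValueError: ensure the local
# checkpoint's config.json carries a model_type key, then reload it with
# trust_remote_code=True. The "florence2" value is an assumption.
import json
from pathlib import Path

from transformers import AutoModelForCausalLM

ckpt = Path("weights/icon_caption_florence")
cfg_path = ckpt / "config.json"
cfg = json.loads(cfg_path.read_text())
if "model_type" not in cfg:
    cfg["model_type"] = "florence2"  # assumed; must match the real architecture
    cfg_path.write_text(json.dumps(cfg, indent=2))

model = AutoModelForCausalLM.from_pretrained(ckpt, trust_remote_code=True)
```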
-
Image with abnormal inference results:
![ddc921571d121084892ded80d3c6b573](https://github.com/user-attachments/assets/a8cbf24e-bbfb-494e-8ecc-d31136e4e4f0)
cmake -DCMAKE_SYSTEM_NAME=Linux \
-DMNN_BUILD_DIFFUSION=ON -DM…