vision-transformers Search Results

1000+ results
for vision-transformers

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

reyllama/paper-reviews #149

Do Vision Transformers See Like Convolutional Neural Network…

## TL; DR - ViT feature representations are *less hierarchical*. - Early tr blocks learn both local and global dependencies provided with large enough dataset. - Skip connections play much more i…

reyllama updated 2 years ago
2
huggingface/huggingface-llama-recipes #37

Llama-Vision FT Error: The number of images in each batch [1…

Dear all, Thank you so much for sharing the llama3.2 vision model fine-tuning script so fast! I got the following error when running the demo ``` The model weights are not tied. Please use t…

JunMa11 updated 1 week ago
3
keras-team/keras-io #1006

ViT cannot detect multiple objects in one image

The Object Detections with Vision Transformers can only detect one object per image. I tried to run the model prediction on an image containing many same objects, only 1 big bounding box covering all …

galax-count updated 3 weeks ago
3
Paitesanshi/LLM-Agent-Survey #28

Seamlessly integrate state-of-the-art transformer models int…

Hi friends! I'd like to share our recent project embodied-agents: https://github.com/mbodiai/embodied-agents, which makes it easy to integrate large multi-modal models into existing robot stacks wi…

nqyy updated 1 month ago
1
microsoft/computervision-recipes #678

[FEATURE_REQUEST] Add vision transformers model to image cla…

### Description The [transformer-based image classification model](https://arxiv.org/abs/2010.11929) is becoming popular. It will be nice to include it in this repo. ### Expected behavior with the…

kbjiang updated 7 months ago
1
huggingface/trl #1972

VLM dpo bug

trl/trainer/dpo_trainer.py line 542 The tokenizer for _super().init ()_ should be set to _self.tokenizer_ instead of _tokenizer_, otherwise the previous _is_vision_model_ will be invalid.

liuchaohu updated 1 month ago
2
huggingface/candle #2525

[QUESTION] Protocol of adding a new model (Stella_en_<*>_v5 …

Hi, I have a working implementation of [Stella_en__v5](https://huggingface.co/dunzhang/stella_en_1.5B_v5) family of models which is one of the top ranking model in the MTEB leaderboard for rerankin…

AnubhabB updated 5 days ago
2
MinusZoneAI/ComfyUI-CogVideoX-MZ #14

生成这两个视频后我安装了更新，然后就不好用了

https://github.com/user-attachments/assets/8d02dc13-42d0-469e-b86c-46ccd24a6b5a https://github.com/user-attachments/assets/9de83f0d-a301-4aa0-90d4-fd8d6337ca07 你好，事情是这样的。当时我在测试如何放大视频，生成这两个…

Marksusu updated 1 week ago
8
pytorch/vision #8598

crossvit vs vision transformer

### 🚀 The feature Implement CrossVIT model for Fine grained classification ### Motivation, pitch CrossViT integrates multi-scale feature representations, enabling it to efficiently process images o…

Navoditamathur updated 1 month ago
2
laclouis5/uform-coreml-converters #1

New optimizations for ANE

Hello, Louis. Currently, I've been using uform-coreml-converters to convert uform models, and they're running great. uform-coreml-converters is indeed a fantastic project, and I'm very grateful for…

aalexlee updated 7 months ago
1

上一页 1...6 7 8 9 10 11 12...100 下一页

1000+ results for vision-transformers

1000+ results
for vision-transformers