vision-transformer Search Results

1000+ results for vision-transformer


laclouis5/uform-coreml-converters #1

New optimizations for ANE

Hello, Louis. Currently, I've been using uform-coreml-converters to convert uform models, and they're running great. uform-coreml-converters is indeed a fantastic project, and I'm very grateful for…

aalexlee updated 6 months ago
1
PaddlePaddle/models #5281

how to use paddlepaddle vit(vision transformer) to video cla…

dotsonliu updated 6 months ago
1
lucidrains/vit-pytorch #166

ViT-Dino for Medical images

Hi! I would like to thank you first for such a good and updated repo regarding Vision Transformers. I want to know if I can pretrain the ViT using 3D medical images. D…

Mushtaqml updated 1 month ago
3
NVlabs/VILA #122

Fine tuning and --evaluation_strategy argument

I'm trying to get fine-tuning working through the 3_sft.sh script but am encountering an error: ``` Traceback (most recent call last): File "/root/VILA/llava/train/train_mem.py", line 36, in …

lyluh updated 2 weeks ago
1
google-research/vision_transformer #17

minimum GPU memory required for running vision_transformer

Hi, authors. What is the minimum GPU memory required for running vision_transformer during inference and training, respectively?

amiltonwong updated 3 years ago
1
huggingface/transformers #33294

"Qwen2-VL FP16 inference results in errors or gibberish outp…

### System Info base this pull request :https://github.com/huggingface/transformers/pull/33211 python: Python 3.10.12 ### infer code: ``` from PIL import Image import requests import torch f…

GeLee-Q updated 1 week ago
3
letme-hj/dl-papers #7

[7] MAGVLT: Masked Generative Vision-and-Language Transforme…

MAGVLT: based on **non-autoregressive** mask prediction. - enables bidirectional context encoding, fast decoding by parallel token predictions in an iterative refinement - extended editing capabilit…

letme-hj updated 1 year ago
2
LargeWorldModel/LWM #77

Error while running bash command: run_sample_video.sh | Erro…

I receive this error when I run this bash command: !bash LWM/scripts/run_sample_video.sh. I have followed all the directions listed in the repo. ``` /usr/local/lib/python3.10/dist-packages/hug…

samitm-123 updated 1 month ago
6
opendatalab/MinerU #249

Can't load pretrained model

### Description of the bug Bug about loading pretrained model: I can't load the pretrained model although I assigned a path containing config.json and pytorch_model.bin. Error ``` Traceback…

Holmes2002 updated 1 month ago
9
gusdlf93/Paper_Survey #17

[2022 arXiv] EfficientFormer : Vision Transformers at Mobile…

One-line review: our model is fast and light, so use it. Many Transformer-related models have been released. The authors combined only their strengths to build the most efficient model. Observation 1: Patch Embedding -> Convolution Stem. With a larger kernel and stride, the Pat…

gusdlf93 updated 2 years ago
1
