-
Hello, Louis.
Currently, I've been using uform-coreml-converters to convert uform models, and they're running great. uform-coreml-converters is indeed a fantastic project, and I'm very grateful for…
-
-
HI!
I would like to thank you first for such a good and updated repo regarding Vision Transformers.
I want to know if I can use 3d medical images to pretrain the ViT using 3D medical images?. D…
-
I'm trying to get fine-tuning working through the 3_sft.sh script but am encountering an error:
```
Traceback (most recent call last):
File "/root/VILA/llava/train/train_mem.py", line 36, in
…
lyluh updated
2 weeks ago
-
Hi, authors,
What are the minimum GPU memory required for running vision_transformer during inference and training, respectively?
-
### System Info
base this pull request :https://github.com/huggingface/transformers/pull/33211
python: Python 3.10.12
### infer code:
```
from PIL import Image
import requests
import torch
f…
-
MAGVLT: based on **non-autoregressive** mask prediction.
- enables bidirectional context encoding, fast decoding by parallel token predictions in an iterative refinement
- extended editing capabilit…
-
I receive this error when i run this bash command: !bash LWM/scripts/run_sample_video.sh. I have followed all the direction listed in the repo.
```
/usr/local/lib/python3.10/dist-packages/hug…
-
### Description of the bug | 错误描述
Bug about loading pretrained model
I can't load pretrained-model although I had to assign path containing config.json and pytorch_model.bin
Error
```
Traceback…
-
한줄 평 : 우리 모델, 빠름. 가벼움. 쓰셈
Transformer와 관련해서 다양한 모델들이 나왔습니다.
이들 중에서 장점만을 모아서, 가장 Efficiency가 좋은 모델을 만들었습니다.
Observation 1 :
Patch Embedding -> Convolution Stem
Larger Kernel과 stride를 사용하는 Pat…