-
### Links
- Paper: https://arxiv.org/abs/2111.11418
- GitHub: https://github.com/sail-sg/poolformer
### One-line summary
![image](https://user-images.githubusercontent.com/39791467/197780827-b34eb533-e2…
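Below is a minimal sketch (assuming PyTorch) of the pooling token mixer the paper proposes: attention is replaced by simple average pooling, with the input subtracted so the mixer only propagates differences from neighbouring tokens. See the repo for the exact implementation.

```python
import torch.nn as nn

class Pooling(nn.Module):
    """PoolFormer-style token mixer: average pooling instead of attention."""
    def __init__(self, pool_size=3):
        super().__init__()
        self.pool = nn.AvgPool2d(
            pool_size, stride=1, padding=pool_size // 2, count_include_pad=False)

    def forward(self, x):
        # x: (B, C, H, W); subtracting x keeps only the neighbourhood difference
        return self.pool(x) - x
```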
-
## CLIP
* [\[Blog\]](https://openai.com/blog/clip/)
* [\[Paper\]](https://arxiv.org/abs/2103.00020)
* [\[code\]](https://github.com/openai/CLIP)
* [\[Model Card\]](https://github.com/openai/CL…
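For reference, a minimal zero-shot sketch using the API from the repo linked above (the image path and prompt strings are placeholders):

```python
import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

# "cat.png" and the prompts are placeholders for illustration
image = preprocess(Image.open("cat.png")).unsqueeze(0).to(device)
text = clip.tokenize(["a photo of a cat", "a photo of a dog"]).to(device)

with torch.no_grad():
    logits_per_image, _ = model(image, text)
    probs = logits_per_image.softmax(dim=-1)

print(probs)  # probability of each prompt matching the image
```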
-
# Interesting papers
- [Davison 2018 - FutureMapping: The Computational Structure of Spatial AI Systems](https://arxiv.org/abs/1803.11288)
- By A…, a professor at the Dyson Robotics Lab at Imperial College London
-
How can we use it for NLP transformers?
-
Dear Unsloth,
Based on the initialization used in [your sample notebook](https://colab.research.google.com/drive/1Ys44kVvmeZtnICzWz0xgpRnrIOjZAuxp?usp=sharing) I created [my own notebook](https://c…
-
This is the detailed error:
(VIT) llb@raypc:~/codes/vision_transformer$ python3 -m vit_jax.train --name ViT-B_16-cifar10_`date +%F_%H%M%S` --model ViT-B_16 --logdir /tmp/vit_logs --dataset cifar10
2…
-
E.g. https://huggingface.co/laion/CLIP-ViT-H-14-frozen-xlm-roberta-large-laion5B-s13B-b90k/tree/main
This needs more config, adapting the weights, and also changing the model at https://github.com/huggingfa…
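For what it's worth, a sketch of how that checkpoint can be loaded with open_clip today, independent of adding support in transformers. The model and pretrained tags below are taken from the LAION model card and should be double-checked:

```python
import torch
import open_clip

# names as listed on the LAION model card; verify before relying on them
model, _, preprocess = open_clip.create_model_and_transforms(
    "xlm-roberta-large-ViT-H-14", pretrained="frozen_laion5b_s13b_b90k")
tokenizer = open_clip.get_tokenizer("xlm-roberta-large-ViT-H-14")

text = tokenizer(["a photo of a cat", "eine Katze"])
with torch.no_grad():
    text_features = model.encode_text(text)
print(text_features.shape)
```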
-
Excuse me, I noticed your work. It gives me a lot of encouragement to explore more work on the neighborhood-attention transformer. I read your Python file, but I have a problem with the relative position encoding. Can you…
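In case it helps others with the same question, here is a minimal sketch (PyTorch assumed) of Swin-style relative position bias for a k×k window: a learnable table with one entry per possible relative offset is indexed by the query-key offsets and added to the attention logits. Neighborhood attention uses the same idea, but the exact implementation in the NAT repo differs.

```python
import torch
import torch.nn as nn

class RelativePositionBias(nn.Module):
    def __init__(self, window_size=7, num_heads=4):
        super().__init__()
        k = window_size
        # one learnable bias per head for every possible relative offset: (2k-1)^2 entries
        self.table = nn.Parameter(torch.zeros((2 * k - 1) ** 2, num_heads))
        coords = torch.stack(torch.meshgrid(
            torch.arange(k), torch.arange(k), indexing="ij")).flatten(1)  # (2, k*k)
        rel = coords[:, :, None] - coords[:, None, :]                     # (2, k*k, k*k)
        rel = rel.permute(1, 2, 0) + (k - 1)                              # shift offsets to be >= 0
        self.register_buffer("index", rel[..., 0] * (2 * k - 1) + rel[..., 1])

    def forward(self):
        # returns bias of shape (num_heads, k*k, k*k) to add to the attention logits
        return self.table[self.index.view(-1)].view(
            *self.index.shape, -1).permute(2, 0, 1)
```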
-
This is what I'm getting:
```
load checkpoint from https://storage.googleapis.com/sfr-vision-language-research/BLIP/models/model_base_caption_capfilt_large.pth
100%|██████████████████████████████…
```
-
Hi, in `momentum_clip` an optimizer is defined that stores its state in half precision. I guess this is for quantization reasons. But this optimizer is nothing like the ones I am familiar with, and in …
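My understanding of the general idea, as a hand-rolled NumPy sketch (not the actual code in `momentum_clip`): SGD with momentum plus global-norm gradient clipping, where the momentum buffer is stored in half precision to halve optimizer memory while the arithmetic is done in float32.

```python
import numpy as np

def clipped_momentum_step(params, grads, momentum, lr=0.1, beta=0.9, clip_norm=1.0):
    """One update step; `momentum` buffers are stored as float16."""
    # global-norm gradient clipping across all parameter tensors
    global_norm = np.sqrt(sum(np.sum(g.astype(np.float32) ** 2) for g in grads))
    scale = min(1.0, clip_norm / (global_norm + 1e-6))

    new_params, new_momentum = [], []
    for p, g, m in zip(params, grads, momentum):
        g = g.astype(np.float32) * scale
        m32 = beta * m.astype(np.float32) + g        # update momentum in fp32
        new_params.append(p - lr * m32)              # apply the update in fp32
        new_momentum.append(m32.astype(np.float16))  # store state back in fp16
    return new_params, new_momentum
```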