-
The changes include QK normalization, parallel layers, etc. It would be cool to see how CLIP performs when those changes are applied to ViT-B, ViT-L, and ViT-H.
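For concreteness, here is a minimal PyTorch sketch of those two changes: LayerNorm applied to the queries and keys before the attention softmax, and the attention and MLP branches computed in parallel from a shared pre-norm, then summed into a single residual update. All module names and dimensions below are illustrative, not taken from any released ViT-22B code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ParallelBlockQKNorm(nn.Module):
    """Transformer block sketch with QK normalization and parallel
    attention/MLP branches, in the style described for ViT-22B."""

    def __init__(self, dim: int, num_heads: int, mlp_ratio: int = 4):
        super().__init__()
        self.num_heads = num_heads
        self.head_dim = dim // num_heads
        self.norm = nn.LayerNorm(dim)               # one pre-norm shared by both branches
        self.qkv = nn.Linear(dim, 3 * dim)
        self.q_norm = nn.LayerNorm(self.head_dim)   # QK normalization: LN on queries...
        self.k_norm = nn.LayerNorm(self.head_dim)   # ...and keys, per head, before attention
        self.attn_out = nn.Linear(dim, dim)
        self.mlp = nn.Sequential(
            nn.Linear(dim, mlp_ratio * dim),
            nn.GELU(),
            nn.Linear(mlp_ratio * dim, dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, n, d = x.shape
        h = self.norm(x)

        # attention branch with normalized queries and keys
        q, k, v = self.qkv(h).chunk(3, dim=-1)
        q = q.view(b, n, self.num_heads, self.head_dim).transpose(1, 2)
        k = k.view(b, n, self.num_heads, self.head_dim).transpose(1, 2)
        v = v.view(b, n, self.num_heads, self.head_dim).transpose(1, 2)
        q, k = self.q_norm(q), self.k_norm(k)  # keeps attention logits bounded at scale
        attn = F.scaled_dot_product_attention(q, k, v)
        attn = self.attn_out(attn.transpose(1, 2).reshape(b, n, d))

        # parallel layers: attention and MLP both read the same normed input,
        # and their outputs are added in one residual step
        return x + attn + self.mlp(h)
```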
-
Ability to write prompts in more than 100 languages.
Kandinsky 2.0
https://github.com/ai-forever/Kandinsky-2.0
https://huggingface.co/sberbank-ai/Kandinsky_2.0
Model architecture:
It is a late…
-
Thanks for your code! Have you added [ViT-22B-384](https://arxiv.org/abs/2302.05442) to the PyTorch model zoo? I haven't found it.
-
### Question
Has anyone carried out pretraining with Mixtral 8×7B? When I run the pretraining script, a problem occurs, as shown in the figure below. I just added a llava_mixtral.py to the ll…
-
Could you please share the process?
-
The ViT-22B paper reports knowledge distillation experiments (see [Table 8](https://openreview.net/pdf?id=Lhyy8H75KA)), demonstrating that it is not only a large-scale model but also an excellent teacher…
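For anyone wanting to try this, distilling such a teacher into a smaller student typically combines hard-label cross-entropy with a soft-label KL term against the teacher's logits. A minimal sketch below; `temperature` and `alpha` are assumed hyperparameters, not the paper's settings.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits: torch.Tensor,
                      teacher_logits: torch.Tensor,
                      labels: torch.Tensor,
                      temperature: float = 2.0,
                      alpha: float = 0.5) -> torch.Tensor:
    """Blend hard-label cross-entropy with soft-label KL to the teacher.
    temperature and alpha are illustrative values, not from the paper."""
    # soften both distributions; scale KL by T^2 so gradient magnitudes
    # stay comparable across temperatures
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    kd = F.kl_div(log_soft_student, soft_teacher,
                  reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1 - alpha) * ce
```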
-
Hi,
Do you plan to release the model and checkpoint of ViT-22B presented in "Scaling Vision Transformers to 22 Billion Parameters"?
-
It would be nice to have this one here (https://arxiv.org/abs/2311.01906).
-
Hi,
first of all, thanks for your great contributions to open research!
I am confused about how the model architecture influences model performance. I note that the Pythia model's layer block looks like the pseud…