-
I tried to run llava-v1.6-34b-hf-awq and succeeded, but how can I run the test for Llava-v1.5 ConditionalGeneration?
https://github.com/casper-hansen/AutoAWQ/pull/250
The bug in the example is likely:
1. ma…
-
Hello, I cannot reproduce the results of the paper on the miniImagenet dataset. Is there any solution? (Thanks very much.)
I used the pretrained and finetuned model data (backbone-wrn) you provided in …
-
Hi, may I know whether the code for training the Sinkhorn network is available? So far, I have found that only test_region_set.py uses the pretrained Sinkhorn network, but I'm interested to know how it was trai…
-
Hello, I think there is something wrong with the arch. If I get the args for the mbart50.ft.n1 model, it says "arch": "denoising_large". But as far as I can see, denoising_large is not available in Fairseq.
**T…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Current Behavior
I am trying to use 2 GPUs to run the chatglm2-6b model with the same script that could run chat…
-
### 🚀 The feature, motivation and pitch
https://blackforestlabs.ai/#get-flux
FLUX models are the new SOTA open-source text-to-image models. I am wondering if this slightly different architecture mod…
-
### Model description
Hi! I'm the author of ["Prismatic VLMs"](https://github.com/TRI-ML/prismatic-vlms), our upcoming ICML paper that introduces and ablates design choices of visually-conditioned …
siddk updated
5 months ago
-
From the code, it seems the embedding is not initialized with a pre-trained embedding (i.e. word2vec), although the paper says it is. Am I right, or did I miss something? Many thanks!
relevant code …
-
Hello, team
KeyError Traceback (most recent call last)
~\AppData\Local\Temp\ipykernel_6004\1581778792.py in
----> 1 model = beit_base_patch16_224(pretrained = False…
-
I spent 5 hours getting the program running, which was a great waste of time. I hereby summarize all the changes necessary for this project to run in a Python 3.x and TensorFlow r1.x environment.
I a…