-
Unexpected key(s) in state_dict: "layers.0.estimation_gate.FC1.weight", "layers.0.estimation_gate.FC1.bias", "layers.0.estimation_gate.FC2.weight", "layers.0.estimation_gate.FC2.bias", "layers.0.inh_l…
-
Excellent work! When do you plan to release your pretrained models?
-
In hte paper is "The pretraining corpus are available at https://drive.google.com/drive/folders/1ST0WD1-hX9XtiPWwCceZbgZlBV0fKPbe. (Supplementary Materials)". The link is not valid (404), is there an…
-
We tried to pretrain a 421M Samba, but after pretraining, we find that you did not open source the evaluation script.
-
Hi, there. Thanks for your work. Could you share the commands for pretraining? I'm not sure how to use cgat.py to do it.
-
Hello! First of all, I think your idea is very good. I read your blog, and I want to Recurrence results myself, but I didn't find the pretraining model. Can you provide the pre training model on Baidu…
-
## ❓ Questions and Help
First of all, thanks for the sharing BART model checkpoints and codes to run.
#### What is your question?
Could you provide a pertaining script used for BART models?
I …
-
It appears that config/pretrain-alldata-base.json is not your paper pretraining configuration. There is no cls_concat setting in this configuration file, so it uses the default value. As a result, unl…
-
Dear author,
since the model depends on pretraining on AudioSet to reach the highest score, why not to share the dataset and pretrained-model file? For the audioset dataset always become partiall…
-
您在文中并没有提到在imagenet上训练跑了多少epoch以及用了几块GPU,batch size是多大,还有优化器和学习率等参数。是否方便公布一下这些参数或者将预训练的log放出来?这会极大的帮助我们复现您的工作,非常感谢!