The models that you provided (~270GB) is currently too large for me to handle right now and using my current resource to re-train the best model according to your report is not possible for me right now either(I had to reduce the attention heads to 3 for the model to train without the out-of-memory error). From what I understand, the models that you provided in the onedrive is all the models that you trained during the ablation study so is it possible for you to upload only your best models (preferably somewhere I can wget my linux server cause using onedrive force me to first download it to my local then reupload to the server) ?
Thank you a lot.
Hi @chenzimin ,
The models that you provided (~270GB) is currently too large for me to handle right now and using my current resource to re-train the best model according to your report is not possible for me right now either(I had to reduce the attention heads to 3 for the model to train without the out-of-memory error). From what I understand, the models that you provided in the onedrive is all the models that you trained during the ablation study so is it possible for you to upload only your best models (preferably somewhere I can
wget
my linux server cause using onedrive force me to first download it to my local then reupload to the server) ? Thank you a lot.Best regards, Dang