Open A11en0 opened 11 months ago
Hi, thanks for your great work. I notice that Adalora adopts a different pre-train model from LoRA, the hyper-parameters must need to research, and how do you find the optimal parameters for the new model?
Any update on this? @A11en0
Nope, but I guess they choose the initialized parameters from the paper of DeBerta-v3.
Hi, thanks for your great work. I notice that Adalora adopts a different pre-train model from LoRA, the hyper-parameters must need to research, and how do you find the optimal parameters for the new model?