Open fei407 opened 7 months ago
Hello, frd
is implemented for LoRA, which regulaize A and B orthogonal but does not normalize. We present an ablation to compare AdaLoRA and LoRA with orthogonal regularization in Section 4.4 to show both components of AdaLoRA are important. For AdaLoRA, svd
should be selected which regulalize P and Q both orthogonal and normalized.
Hi, thanks for this awesome work! I have a question regarding the two different orthogonal regularization methods, frd and svd, as they appear to have distinct impacts. How is this aspect considered in AdaLoRa?