Closed callanwu closed 6 months ago
Hello, Great work! I want to reproduce the reported LoRA(16bit) result(36.9) on GSM8K dataset in your paper. Could you provide the correct script or more detailed hyper-parameters? Thx a lot!
Hi @callanwu, please try https://github.com/yxli2123/LoftQ/blob/main/scripts/train_gsm8k.sh#L62
Thx a lot~ I will try it!
Hello, Great work! I want to reproduce the reported LoRA(16bit) result(36.9) on GSM8K dataset in your paper. Could you provide the correct script or more detailed hyper-parameters? Thx a lot!