Open ghost opened 1 year ago
Hi, I noticed that you tried scaled rope. How could i use the scaled code for training a qlora with this?
Scaled rope results were mixed. it works well on simple prompts. You can try scaled rope with finetune.py with use_rope_scaled flag. Though you will need to modify generate.py similarly
Hi, I noticed that you tried scaled rope. How could i use the scaled code for training a qlora with this?