HP Tuning - Githubissues

kbressem / medAlpaca

LLM finetuned for medical question answering

GNU General Public License v3.0

491 stars 58 forks source link

Open kbressem opened 1 year ago

kbressem commented 1 year ago

tune the LR
increase val batch size
tune dropout (0.2 in Instruct GPT) | most important
tune --lora_r 8 (rank, the bigger, the heavier the lora is (more params to tune)) maybe 16
tune --lora_alpha 16 (the smaller it is, the bigger the retraining amount)
less epochs (2-3, higher batch size)