issues
search
mzbac
/
llama2-fine-tune
Scripts for fine-tuning Llama2 via SFT and DPO.
189
stars
37
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
dpo training
#4
Vignesh199821
opened
1 year ago
0
Dataset format
#3
rajivpoddar
opened
1 year ago
2
After training with DPOTrainer of trl, and saving, loading error when using AutoPeftModelForCausalLM
#2
MS-YUN
opened
1 year ago
1
A question about formatting_prompts_func
#1
sunzx8
closed
1 year ago
4