Closed llauraa23 closed 9 months ago
@llauraa23 Take a look at huggingface/trl#1083 (comment) and https://huggingface.co/docs/trl/sft_trainer#advanced-usage
Thanks! I will have a look over the weekend. Have to work on the dpo and post writing.
close this PR because it is also in https://github.com/CambioML/pykoi/pull/101. The future work is to use https://github.com/CambioML/pykoi/pull/101 into two PR for SFT
and DPO
.
out of memory on 24G single gpu during training