Support SFT training on d2l

CambioML / pykoi-rlhf-finetuned-transformers

pykoi: Active learning in one unified interface

https://www.cambioml.com

Apache License 2.0

407 stars, 43 forks

Closed llauraa23 closed 9 months ago

llauraa23 commented 9 months ago

Training runs out of memory on a single 24 GB GPU.
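For context, a rough back-of-the-envelope estimate of why this can happen. This is only a sketch: the model size and adapter size below are hypothetical (neither is stated in this thread), the per-parameter byte counts are the usual rule of thumb for mixed-precision Adam, and activation memory is ignored entirely.

```python
# Rough training-memory estimate. All sizes below are illustrative assumptions,
# not measurements from this PR. Activations and framework overhead are ignored.
GIB = 1024 ** 3


def full_finetune_bytes(n_params: int) -> int:
    """Mixed-precision Adam: fp16 weights (2) + fp16 grads (2)
    + fp32 master weights (4) + Adam first and second moments (4 + 4)."""
    return n_params * (2 + 2 + 4 + 4 + 4)


def lora_4bit_bytes(n_params: int, n_adapter_params: int) -> int:
    """Frozen 4-bit base weights (0.5 byte each) plus fully trained adapters."""
    return n_params // 2 + full_finetune_bytes(n_adapter_params)


n_params = 7_000_000_000   # hypothetical 7B-parameter model
n_adapter = 20_000_000     # hypothetical LoRA adapter size
gpu_bytes = 24 * GIB       # the 24 GB card mentioned above

full = full_finetune_bytes(n_params)
lora = lora_4bit_bytes(n_params, n_adapter)

print(f"full fine-tune: {full / GIB:.1f} GiB, fits: {full <= gpu_bytes}")
print(f"4-bit + LoRA:   {lora / GIB:.1f} GiB, fits: {lora <= gpu_bytes}")
```

Under these assumptions, weights and optimizer state alone exceed the card for full fine-tuning, while a quantized base model with small adapters leaves headroom for activations.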

CambioML commented 9 months ago

@llauraa23 Take a look at huggingface/trl#1083 (comment) and https://huggingface.co/docs/trl/sft_trainer#advanced-usage

llauraa23 commented 9 months ago

Thanks! I will have a look over the weekend. I have to work on the DPO and post writing.

CambioML commented 9 months ago

Closing this PR because its changes are also included in https://github.com/CambioML/pykoi/pull/101. Future work is to split https://github.com/CambioML/pykoi/pull/101 into two PRs, one for SFT and one for DPO.