TideDra / VL-RLHF

A RLHF Infrastructure for Vision-Language Models
Apache License 2.0
85 stars 5 forks source link

Support for SFT InternLM-XComposer2? #4

Closed ZhihaoAIRobotic closed 3 months ago

ZhihaoAIRobotic commented 3 months ago

Does this repo support for SFT InternLM-XComposer2 ?

There is no sft_InternLM_XC.sh in the scripts. Therefore I want to check if the repo support for SFT InternLM-XComposer2.

Thank you!

TideDra commented 3 months ago

There is a bug when SFT InternLM-XComposer2 with gradient checkpointing, which seems a bug from the upstream dependency trl/sft_trainer. I recommend to use the official sft script of this model for best practice currently.