Closed xiaomao19970819 closed 2 months ago
This has been raised mutiple times in the issues; please refer to those issues first.
We advise you to use training frameworks, including Axolotl, Llama-Factory, Swift, etc., to finetune your models with SFT, DPO, PPO, etc.
若使用官方给定的transformers的trainner进行训练,在计算ce loss时,prompt对应的token的loss没有被mask掉,我的理解是对的吗?