Closed: tangbo-sh closed this issue 2 months ago
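I'd like to do continued pretraining of CodeQwen on domain-specific data. How should the data be formatted: as [bos]content[eos], or is appending only eos enough? Also, what value do you recommend for the Adam optimizer's epsilon parameter? Thanks.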
FYI https://qwen.readthedocs.io/en/latest/training/SFT/llama_factory.html