官方能否提供continue pretrain（增量预训练）的脚本呢？

TigerResearch / TigerBot

TigerBot: A multi-language multi-task LLM

https://www.tigerbot.com

Apache License 2.0

2.24k stars 194 forks source link

Open listwebit opened 10 months ago

listwebit commented 10 months ago

有几个问题请大佬指导一下： 1.官方能否提供continue pretrain（增量预训练）的脚步呢？ 2.如果不能话，我想在领域数据上持续预训练，需要怎么做呢？将微调代码改一下？请大佬详细说一下，谢谢 3.如果用70B的模型持续增量预训练(非lora，全量参数更新)，至少需要多少个机器呢？

感谢大佬的回复，祝愿大佬的大模型全球第一

chentigerye commented 10 months ago

感谢支持，