Closed IT-five closed 6 months ago
换一个load model就行了?
tokenizer可能也需要注意下有没有什么区别
https://huggingface.co/baichuan-inc/Baichuan2-7B-Base/blob/main/modeling_baichuan.py 你填DecoderLayer就行吧
换一个load model就行了?