yuanzhoulvpi2017 / zero_nlp

中文nlp解决方案(大模型、数据、模型、训练、推理)
MIT License
2.95k stars 363 forks source link

`use_cache=True` is incompatible with gradient checkpointing. Setting `use_cache=False`...报这个信息,保存不了模型文件 #50

Closed Chenzongchao closed 1 year ago

yuanzhoulvpi2017 commented 1 year ago

这不是错误