yuunnn-w / RWKV_Pytorch

This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation is overly complex and lacks extensibility. Let's join the flexible PyTorch ecosystem and open-source it together!
GNU General Public License v3.0
97 stars 7 forks source link

修正模型保存bug,删除初始化时对ctx_len的要求,给head_size_a和head_size_divisor提供默认值 #25

Closed WuTianyi321 closed 1 month ago