yuunnn-w / RWKV_Pytorch

This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation is overly complex and lacks extensibility. Let's join the flexible PyTorch ecosystem and open-source it together!
GNU General Public License v3.0
97 stars 7 forks source link

Onnx quantize #14

Closed uniartisan closed 2 months ago

uniartisan commented 2 months ago

onnx量化功能,可加可不加(只适合于x64平台的CPU推理)