A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependencies
303
stars
19
forks
source link
Split cpp code from the Python comment && Add a little refactor to code #7
Closed
nenkoru closed 1 year ago
Made a little refactor to a converter module to be a little more Pythonic