RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
我运行RWKV-v4neo下的train.py文件,但是它启动不起来,自己断 还没反应 显示的界面 自己的参数设置如图: 头秃哇。。。