Open Alchemy5 opened 1 year ago
On the corresponding paper there are references to RWKV, but in the codebase I don't see any references to experiments with RWKV?
On the corresponding paper there are references to RWKV, but in the codebase I don't see any references to experiments with RWKV?