RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Fix below issue:
--- Logging error ---
Traceback (most recent call last):
File "/usr/local/miniconda3/envs/zxy/lib/python3.8/logging/init.py", line 1085, in emit
msg = self.format(record)
File "/usr/local/miniconda3/envs/zxy/lib/python3.8/logging/init.py", line 929, in format
return fmt.format(record)
File "/usr/local/miniconda3/envs/zxy/lib/python3.8/logging/init.py", line 668, in format
record.message = record.getMessage()
File "/usr/local/miniconda3/envs/zxy/lib/python3.8/logging/init.py", line 373, in getMessage
msg = msg % self.args
TypeError: not all arguments converted during string formatting
Call stack:
File "train.py", line 288, in
train_data = MyDataset(args)
File "/root/zhuangxy/RWKV-LM/RWKV-v4neo/src/dataset.py", line 60, in init
rank_zero_info("Current vocab size =", self.vocab_size, "(make sure it's correct)")
File "/usr/local/miniconda3/envs/zxy/lib/python3.8/site-packages/lightning_utilities/core/rank_zero.py", line 27, in wrapped_fn
return fn(*args, kwargs)
File "/usr/local/miniconda3/envs/zxy/lib/python3.8/site-packages/lightning_utilities/core/rank_zero.py", line 54, in rank_zero_info
_info(*args, stacklevel=stacklevel, *kwargs)
File "/usr/local/miniconda3/envs/zxy/lib/python3.8/site-packages/lightning_utilities/core/rank_zero.py", line 48, in _info
log.info(args, kwargs)
Message: 'Current vocab size ='
Arguments: (50277, "(make sure it's correct)")
Data has 30220625 tokens.
Fix below issue: --- Logging error --- Traceback (most recent call last): File "/usr/local/miniconda3/envs/zxy/lib/python3.8/logging/init.py", line 1085, in emit msg = self.format(record) File "/usr/local/miniconda3/envs/zxy/lib/python3.8/logging/init.py", line 929, in format return fmt.format(record) File "/usr/local/miniconda3/envs/zxy/lib/python3.8/logging/init.py", line 668, in format record.message = record.getMessage() File "/usr/local/miniconda3/envs/zxy/lib/python3.8/logging/init.py", line 373, in getMessage msg = msg % self.args TypeError: not all arguments converted during string formatting Call stack: File "train.py", line 288, in
train_data = MyDataset(args)
File "/root/zhuangxy/RWKV-LM/RWKV-v4neo/src/dataset.py", line 60, in init
rank_zero_info("Current vocab size =", self.vocab_size, "(make sure it's correct)")
File "/usr/local/miniconda3/envs/zxy/lib/python3.8/site-packages/lightning_utilities/core/rank_zero.py", line 27, in wrapped_fn
return fn(*args, kwargs)
File "/usr/local/miniconda3/envs/zxy/lib/python3.8/site-packages/lightning_utilities/core/rank_zero.py", line 54, in rank_zero_info
_info(*args, stacklevel=stacklevel, *kwargs)
File "/usr/local/miniconda3/envs/zxy/lib/python3.8/site-packages/lightning_utilities/core/rank_zero.py", line 48, in _info
log.info(args, kwargs)
Message: 'Current vocab size ='
Arguments: (50277, "(make sure it's correct)")
Data has 30220625 tokens.