Describe the bug
When I run the start up code in README.md, in step 4 "Train the model" I can't properly run the code. Google colab reported "TypeError: linear(): argument 'input' (position 1) must be Tensor, not NoneType" when running logits = model(input_ids, attention_mask)
Describe the bug When I run the start up code in README.md, in step 4 "Train the model" I can't properly run the code. Google colab reported "TypeError: linear(): argument 'input' (position 1) must be Tensor, not NoneType" when running
logits = model(input_ids, attention_mask)
Minimal steps to reproduce
Just follow the README.md and find the bug TT.