OpenBMB / ModelCenter

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain
https://modelcenter.readthedocs.io
Apache License 2.0

[BUG] TypeError: linear(): argument 'input' (position 1) must be Tensor, not NoneType when running get started code #45

Open jiangzizi opened 9 months ago

Describe the bug

When I run the getting-started code from README.md, step 4 ("Train the model") fails. On Google Colab, the call logits = model(input_ids, attention_mask) raises "TypeError: linear(): argument 'input' (position 1) must be Tensor, not NoneType".

Minimal steps to reproduce

Follow the steps in README.md as written; the error appears at step 4.
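
Possible first check

The message says that something reached a linear layer as None instead of a Tensor. A minimal sketch for narrowing this down, assuming the None comes from one of the arguments passed to model(...) rather than from inside the model itself. The helper name none_arguments and the sample batch below are hypothetical and not part of ModelCenter or the README.

```python
from typing import Any, Dict, List


def none_arguments(named_args: Dict[str, Any]) -> List[str]:
    """Return the names of the arguments whose value is None."""
    return [name for name, value in named_args.items() if value is None]


# Hypothetical stand-in for the batch built in step 4 of the README.
# In a real run these values would be tensors produced by the data loader.
batch = {
    "input_ids": [[101, 2023, 102]],
    "attention_mask": None,  # deliberately None to show what the check reports
}

missing = none_arguments(batch)
if missing:
    print("None passed for:", ", ".join(missing))
else:
    print("All call-site arguments are set; the None may originate inside the model.")
```

If this reports nothing, the call-site arguments are fine and the None is more likely produced inside the forward pass, which this check cannot see.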