tunib-ai / parallelformers

Parallelformers: An Efficient Model Parallelization Toolkit for Deployment
https://tunib-ai.github.io/parallelformers
Apache License 2.0
776 stars 61 forks source link

Bug about `AlbertModel` #5

Closed hyunwoongko closed 3 years ago

hyunwoongko commented 3 years ago
File "/opt/conda/lib/python3.8/site-packages/transformers/models/albert/modeling_albert.py", line 368, in forward
  self.dense.weight.t()
RuntimeError: shape '[6, 64, 384]' is invalid for input of size 294912

Environment