kingoflolz / mesh-transformer-jax

Model parallel transformers in JAX and Haiku
Apache License 2.0
6.29k stars 892 forks source link

limitation min_length=1024 #155

Closed ghost closed 2 years ago

ghost commented 2 years ago

how to apply this fix https://github.com/microsoft/DeepSpeed/pull/1114/commits/5e9b420e543e83d6fa1c64f86f9c44a708f37125 it is imported this issue, because ai starts to explain complex topic problems and then it stops at GitHub I was find some issues at this topic https://github.com/microsoft/DeepSpeed/issues/1112#issuecomment-856942487 Bildschirmfoto 2021-11-18 um 20 27 24

kingoflolz commented 2 years ago

I cannot support other implementations of GPT-J, please file an issue in repository which you are using.