bytedance / ParaGen

ParaGen is a PyTorch deep learning framework for parallel sequence generation.
Other
186 stars 23 forks source link

Auto avg ckpt #10

Closed JiangtaoFeng closed 2 years ago

JiangtaoFeng commented 2 years ago

add auto avg ckpt