databricks / megablocks

Apache License 2.0
1.11k stars 154 forks source link

Unsharding scripts for megablocks models #94

Open mayank31398 opened 5 months ago

mayank31398 commented 5 months ago

The base Megatron-LM repo provides unsharding scripts for the models which can be used after training. I didn't find any such scripts in the repo. Would it be possible to provide the same?