OpenBMB / BMTrain

Efficient Training (including pre-training and fine-tuning) for Big Models
Apache License 2.0

Can BMTrain work with Megatron-LM? #97

Closed marscrazy closed 1 year ago

marscrazy commented 1 year ago

We want to train a large language model (>30B parameters). Are there any examples of how to do that?

Achazwl commented 1 year ago

ModelCenter, which is built on BMTrain, supports LLaMA. First convert the checkpoint files using https://github.com/OpenBMB/ModelCenter/blob/main/transfer/hugLLaMa_bmtrainLLaMa.py, then run inference (https://github.com/OpenBMB/ModelCenter/blob/main/tests/test_llama.py) or fine-tune (https://github.com/OpenBMB/ModelCenter/blob/main/examples/llama/finetune_llama.py).
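At its core, a conversion script like this remaps HuggingFace parameter names onto the target model's naming scheme. A minimal sketch of that idea, assuming hypothetical key names on both sides (the real mapping is defined in `transfer/hugLLaMa_bmtrainLLaMa.py`, not here):

```python
import re

# Hypothetical rename rules for illustration only; the actual
# correspondence between HuggingFace LLaMA keys and ModelCenter
# keys is defined by the conversion script in the repo.
RENAME_RULES = [
    (r"^model\.embed_tokens\.weight$", "input_embedding.weight"),
    (r"^model\.layers\.(\d+)\.self_attn\.q_proj\.weight$",
     r"encoder.layers.\1.self_att.project_q.weight"),
    (r"^model\.norm\.weight$", "encoder.output_layernorm.weight"),
]

def convert_state_dict(hf_state):
    """Return a new state dict with keys renamed per RENAME_RULES.

    Keys that match no rule are carried over unchanged.
    """
    converted = {}
    for key, tensor in hf_state.items():
        new_key = key
        for pattern, repl in RENAME_RULES:
            if re.match(pattern, key):
                new_key = re.sub(pattern, repl, key)
                break
        converted[new_key] = tensor
    return converted
```

In practice the converted dict would then be saved with `torch.save` and loaded into the ModelCenter LLaMA implementation.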

If you want pre-training, you can write a new model config and a new model built from the layers provided in ModelCenter.
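A model config in this style is essentially a set of architecture hyperparameters. A hypothetical sketch of what one for a 30B-class LLaMA might contain (the field names here are assumptions; check ModelCenter's existing config classes for the actual schema, and note the dimensions below are the published LLaMA-30B shapes):

```python
# Hypothetical config fragment; field names are illustrative,
# not ModelCenter's actual config schema.
llama_30b_config = {
    "vocab_size": 32000,   # LLaMA tokenizer vocabulary
    "dim_model": 6656,     # hidden size
    "num_heads": 52,       # attention heads
    "dim_head": 128,       # per-head dimension (6656 / 52)
    "dim_ff": 17920,       # feed-forward inner dimension
    "num_layers": 60,      # transformer blocks
}
```

The new model class would then instantiate ModelCenter's embedding, attention, and feed-forward layers from these values.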