A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
I have multiple checkpoints produced after running examples/nlp/language_modeling/megatron_gpt_continue_training.py.
However, I am unable to use examples/nlp/language_modeling/megatron_ckpt_to_nemo.py to convert it to a .nemo object. It's probably because the environment in which I want to do the conversion is the not the same as the one used for training. Is there some way to do the conversion only on CPU or just 1 GPU?
I have tried using two different sets of parameters:
I have multiple checkpoints produced after running
examples/nlp/language_modeling/megatron_gpt_continue_training.py
.However, I am unable to use
examples/nlp/language_modeling/megatron_ckpt_to_nemo.py
to convert it to a.nemo
object. It's probably because the environment in which I want to do the conversion is the not the same as the one used for training. Is there some way to do the conversion only on CPU or just 1 GPU?I have tried using two different sets of parameters:
Input:
Output:
Input:
Output: