NVIDIA / Megatron-LM

Ongoing research training transformer models at scale
https://docs.nvidia.com/megatron-core/developer-guide/latest/user-guide/index.html#quick-start
Other
9.23k stars 2.08k forks source link

[BUG] the argument of parser.add_argument is wrong in tools/checkpoint/convert.py #866

Open adoda opened 2 weeks ago

adoda commented 2 weeks ago

Describe the bug

https://github.com/NVIDIA/Megatron-LM/blob/main/tools/checkpoint/convert.py#L115

It must be 'choices=['GPT', 'BERT'],'

not 'choice=['GPT', 'BERT'],'