NVIDIA / Megatron-LM

Ongoing research training transformer models at scale
https://docs.nvidia.com/megatron-core/developer-guide/latest/user-guide/index.html#quick-start

[BUG] validate_yaml() isn't in sync with arguments check #1297

Open pierric opened 1 day ago

pierric commented 1 day ago

Describe the bug The check in validate_yaml() isn't in sync with the corresponding check in arguments.py.

To Reproduce

Expected behavior The setting pipeline_model_parallel_size = 2 should be accepted.

Proposed fix Sync up the check in the two places.
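
One way to keep the two checks from drifting apart again is to have both code paths call a single shared helper. The following is only a rough sketch of that idea, not Megatron-LM's actual code: the helper name, the two `*_sketch` functions, the threshold of 2, and the nested `model_parallel` layout of the YAML config are all assumptions made for illustration, since the exact condition isn't quoted in this issue.

```python
from types import SimpleNamespace

# Hypothetical threshold, chosen only so that pipeline_model_parallel_size = 2
# passes; the real condition in arguments.py is not quoted in this issue.
MIN_PIPELINE_PARALLEL_SIZE = 2


def check_pipeline_parallel_size(size):
    """Hypothetical shared helper: the single place where the rule is defined."""
    assert size >= MIN_PIPELINE_PARALLEL_SIZE, (
        f"pipeline_model_parallel_size must be >= "
        f"{MIN_PIPELINE_PARALLEL_SIZE}, got {size}"
    )


def validate_args_sketch(args):
    # Stand-in for the command-line path (arguments.py), assuming a flat namespace.
    check_pipeline_parallel_size(args.pipeline_model_parallel_size)
    return args


def validate_yaml_sketch(cfg):
    # Stand-in for the YAML path, assuming the value sits under a nested
    # model_parallel section. It calls the same helper, so the rule cannot diverge.
    check_pipeline_parallel_size(cfg.model_parallel.pipeline_model_parallel_size)
    return cfg


# Both paths accept the same setting because they share one rule.
validate_args_sketch(SimpleNamespace(pipeline_model_parallel_size=2))
validate_yaml_sketch(
    SimpleNamespace(model_parallel=SimpleNamespace(pipeline_model_parallel_size=2))
)
```

With this shape, changing the rule means editing one function, and a test can run the same set of values through both entry points to confirm they agree.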