epfLLM / Megatron-LLM

distributed trainer for LLMs
Other
504 stars 73 forks source link

Update hf_to_megatron.py #59

Closed AleHD closed 10 months ago

AleHD commented 10 months ago

Removed accidental "llama2_divisible_by_128"