Closed PeganovAnton closed 9 months ago
This PR is stale because it has been open 21 days with no activity. Remove stale label or comment or this will be closed in 7 days.
Hi Anton, I'm closing this PR as your changes are now on main branch. Thanks for taking look at this example and contribution.
What does this PR do?
Modifications done only to Megatron multinode example.
end_strings
parametermin_length
to20
. It is done to avoid empty responses.end_strings
case).nemo
checkpointNLPDDPStrategy
instead ofNLPDDPPlugin
.TritonConfig
into NeMo Megatron multinode example--model-name
parameter to NeMo Megatron multinode example--workspace
parameter to NeMo Megatron multinode example--model-path
for loading local checkpoints instead of HuggingFace checkpoints