Closed passaglia closed 4 months ago
In the new MoE Token Drop code, the code and documentation expect drop_policy to be either prob or position, but the current default argument is probs. This fixes that typo.
drop_policy
prob
position
probs
@yanring
Close due to internal MR solving this issue: https://github.com/NVIDIA/Megatron-LM/issues/812#issuecomment-2097991119
In the new MoE Token Drop code, the code and documentation expect
drop_policy
to be eitherprob
orposition
, but the current default argument isprobs
. This fixes that typo.@yanring