Closed mushan09 closed 2 months ago
check Nemo: https://github.com/NVIDIA/NeMo/pull/9849 they are already developed but not released yet,
- Experiments and Results We use the NVIDIA Megatron-LM framework [45] to implement our pruning and distillation algorithms for compression and retraining. https://arxiv.org/pdf/2407.14679
Does it support Knowledge Distillation?