NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html
Apache License 2.0
11.63k stars 2.44k forks source link

Converting Script for Mamba2 Hybrid to HF/Pytorch #10268

Open SkanderBS2024 opened 1 month ago

SkanderBS2024 commented 1 month ago

Is your feature request related to a problem? Please describe.

When fine-tuning for a mamba2 hybrid model we can convert it to a .nemo format but we cannot convert back to HF/Pytorch.

Describe the solution you'd like

Converting script for a nemo mamba2 Hybrid model to HF / Pytorch format.

Describe alternatives you've considered

-None

Additional context

-None

github-actions[bot] commented 1 day ago

This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.