NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html
Apache License 2.0

Support converting the mistral-nemo huggingface weights to nemo format #9909

Closed by dementrock 2 months ago

dementrock commented 2 months ago

Is your feature request related to a problem? Please describe.

Currently, the only provided conversion script is https://github.com/NVIDIA/NeMo/blob/main/scripts/checkpoint_converters/convert_mistral_7b_hf_to_nemo.py. This script does not support converting the latest https://huggingface.co/mistralai/Mistral-Nemo-Base-2407 model. I encountered the following issues:

Describe the solution you'd like

ethanhe42 commented 2 months ago

@ericharper @akoumpa for visibility

akoumpa commented 2 months ago

Thanks, I'll have a fix soon.

akoumpa commented 2 months ago

https://github.com/NVIDIA/NeMo/pull/9953

berserkr commented 2 months ago

How about the other way around? :) NeMo to HF for Mistral Instruct and Base.