sharing weights of the pretrained stage model

NVIDIA / audio-flamingo

PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.

MIT License

Hi Zhifeng,

Since there are a few datasets that I cannot obtain, is it possible to share the weights of the model after the pretraining stage?

Also, this is the training loss of the pretraining stage (quite high variance):

and this is the training loss of the SFT stage:

Do they look right to you?

My reproduced results are a bit far from the reported results, and having the weights of the model after the pretraining stage would greatly help me narrow down the issue.

Appreciate your time and effort!

Puyuan

sharing weights of the pretrained stage model #13