Open ToluClassics opened 7 months ago
Man, I can't believe this wasn't done by the MLX team months ago! Thank you for this, should come in handy. I was going to suggest Mixtral as #1 but then I found this and screamed like a little girl
https://github.com/Goekdeniz-Guelmez/Mixtral-from-scratch-in-MLX
Seems pretty thorough -- sooo if ya see this...thanks @Goekdeniz!
Also donno if there are any tweaks necessary for Llama-3/Mixtral 8x22B but I'd throw those in too. I'll be finding out in a little bit I suppose...and if I catch that/fix it I'll put in a PR
I think Mixtral would be a useful addition for sure!
Creating this issue to keep track of Models that will be nice to have ported:
--> Fuyu : https://huggingface.co/adept/fuyu-8b --> OpenELM --> CLIP --> Whisper --> SeamlessMT: facebook/seamless-m4t-medium --> TROCR: microsoft/trocr-large-handwritten --> LLAVA --> T5 --> Mistral --> Idefics: https://huggingface.co/HuggingFaceM4/idefics-9b-instruct/tree/main