Open EricLBuehler opened 2 months ago
Ok, great.
Ok, great.
You can check my #422 . I hope you don't mind me modify the API of Nonzero🙉
Not a problem 😄
@NeroHin
IBM's Granite series Code Models. Granite Code Models
The
3b
and8b
variants should already be supported as they are just based on thellama
architecture.The
20b
and34b
variants are based on theGPTBigCode
architecture which currently isn't implemented inmistral.rs
.
The 3b and 8b variants do not work out of the box, they rely on tie word embeddings (which I was able to get working in mistral.rs
), but the BPE tokenizer breaks because there are some tokens in the vocab list that are > 255 characters.
+1 to getting support for GPTBigCode and other starcoder model variants.
@EricLBuehler I'm stil working on LLaVA. Meanwhile, with so much experience with rust and Candle, have you ever encountered any problem about memory usage? I have some kinds of confusion. https://github.com/huggingface/candle/issues/2273#issue-2360380212
@chenwanqq, that is great, let me know if I can help!
I replied to the discussion 2272. However, I discovered that the shadowing does mean that the big tensor will not get dropped! See this playground and my comment for more details.
I'll add a clippy lint here to avoid this on our end.
@EricLBuehler What is missing for GGUF quantized Qwen2?
Hi @bachp, that should be relatively easy to add, it would take inspiration from the other GGUF models such as quantized_phi3.rs
. Do you think you would be able to add this?
We will be adding the Gemma 2 models shortly, see #486!
@francis2tm @chelbos @yongkangzhao we just merged LLaVA and LLaVA Next support. Kudos to @chenwanqq for their great work!
For vision models we now have:
I may be able to provide an implementation for whisper asr. If there is interest in that
It doesn't look like it's been mentioned yet but DeepSeek Coder v2 (lite) support would be amazing given it's probably the best coding model out there.
@csicar that would be amazing!
It doesn't look like it's been mentioned yet but DeepSeek Coder v2 (lite) support would be amazing given it's probably the best coding model out there.
@sammcj that would be great, I can add that.
Please let us know what model architectures you would like to be added!
Up to date todo list below. Please feel free to contribute any model, a PR without device mapping, ISQ, etc. will still be merged!
Language models
Multimodal models
Embedding models