The new Cohere model is #1 trending on Hugging Face right now. It excels at RAG, tool use (JSON generation), etc. It is a 35B-parameter model, so AWQ quantization support would be nice.
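For a rough sense of why quantization matters at this size, here is a back-of-the-envelope estimate of weight memory. It assumes 16-bit weights for the unquantized model and 4-bit weights for AWQ, and it ignores the per-group scales and zero points, activations, and the KV cache, so real usage would be somewhat higher. The helper name `weight_memory_gb` is made up for illustration.

```python
def weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes), weights only."""
    return num_params * bits_per_weight / 8 / 1e9


NUM_PARAMS = 35e9  # parameter count stated in this issue

fp16_gb = weight_memory_gb(NUM_PARAMS, 16)  # assumed unquantized precision
int4_gb = weight_memory_gb(NUM_PARAMS, 4)   # assumed 4-bit AWQ weights

print(f"fp16: {fp16_gb:.1f} GB, 4-bit: {int4_gb:.1f} GB")
```

Under these assumptions the weights alone shrink by about 4x, which is the difference between needing multiple GPUs and possibly fitting on one.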
This model seems incredibly useful. Since this is a dense model, adding quantization support should be a bit easier. I will experiment soon with this model to see if we can quantize it.
https://huggingface.co/CohereForAI/c4ai-command-r-v01