-
@danielhanchen
In the unsloth Gemma intro [blogpost](https://unsloth.ai/blog/gemma), you mention a VRAM increase due to the larger `MLP` size in `Gemma` compared to `Llama` and `Mistral`, and show a [gr…
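The MLP-size difference can be made concrete with a quick back-of-the-envelope calculation. The hidden/intermediate sizes below are taken from the public Hugging Face configs for these models and are assumptions for illustration, not figures from the blogpost:

```python
# Rough per-layer MLP parameter counts for a gated-MLP block
# (gate, up, and down projections => 3 weight matrices).
# Sizes are assumptions pulled from the public HF configs.
def mlp_params(hidden_size: int, intermediate_size: int) -> int:
    return 3 * hidden_size * intermediate_size

models = {
    "gemma-7b":   (3072, 24576),
    "llama-2-7b": (4096, 11008),
    "mistral-7b": (4096, 14336),
}

for name, (h, m) in models.items():
    print(f"{name}: {mlp_params(h, m) / 1e6:.1f}M params per MLP layer")
```

Under these assumed sizes, Gemma's per-layer MLP (~226M params) is roughly 1.7x Llama-2-7B's (~135M), which is consistent with a noticeable VRAM increase for activations and gradients.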
-
### Operating System
macOS
### Version Information
not relevant
### Steps to reproduce
https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/mistral/litellm.ipynb
@san…
-
Please let us know which model architectures you would like added!
**Up-to-date todo list below. Please feel free to contribute any model; a PR without device mapping, ISQ, etc. will still be …
-
### What is the issue?
Currently Ollama can [import GGUF files](https://github.com/ollama/ollama/blob/main/docs/import.md). However, larger models are sometimes split into separate files. Ollama shou…
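Split GGUF files follow a predictable naming convention (the `<prefix>-<index>-of-<count>.gguf` scheme produced by llama.cpp's `gguf-split` tool), so detecting and grouping shards is mechanical. A minimal sketch, assuming that naming scheme with 5-digit zero-padded fields:

```python
import re
from collections import defaultdict

# Shards produced by llama.cpp's gguf-split follow the pattern
# <prefix>-<index>-of-<count>.gguf (assumption: 5-digit zero-padded fields).
SHARD_RE = re.compile(r"^(?P<prefix>.+)-(?P<idx>\d{5})-of-(?P<total>\d{5})\.gguf$")

def group_shards(filenames):
    """Group split-GGUF shard filenames by model prefix, sorted by index."""
    groups = defaultdict(list)
    for name in filenames:
        m = SHARD_RE.match(name)
        if m:
            groups[m.group("prefix")].append((int(m.group("idx")), name))
    # Keep only complete shard sets and return shard names in order.
    complete = {}
    for prefix, shards in groups.items():
        shards.sort()
        total = int(SHARD_RE.match(shards[0][1]).group("total"))
        if len(shards) == total:
            complete[prefix] = [n for _, n in shards]
    return complete

files = [
    "llama-70b-00002-of-00002.gguf",
    "llama-70b-00001-of-00002.gguf",
    "single-model.gguf",
]
print(group_shards(files))
# -> {'llama-70b': ['llama-70b-00001-of-00002.gguf', 'llama-70b-00002-of-00002.gguf']}
```

Incomplete sets are skipped rather than imported, since a missing shard would produce a corrupt model.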
-
I tried to load a T5 model, but it does not seem to be supported.
```
---------------------------------------------------------------------------
NotImplementedError Traceback (most re…
-
I'm trying to load Mistral 7B 32K. I've chunked the 4.3GB model and uploaded it to Hugging Face.
When the download is seemingly complete, there is a warning about being out of memory:
It's a …
-
Parent issue to track new models/endpoints/providers to add to litellm; comment below to request new ones.
- [x] Vertex AI Mistral - https://github.com/BerriAI/litellm/issues/4874
- [x] Vertex AI Codestr…
-
I finetuned a model on a custom dataset. The output should be in JSON format. All the keys are the same for each output, i.e. the structure of the response JSON is the same, while the values need to be e…
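With a fixed key set, it is easy to validate each generation before using it downstream. A minimal sketch; the key names here are hypothetical placeholders for whatever your dataset actually uses:

```python
import json

# Hypothetical key set -- substitute the keys from your fine-tuning data.
EXPECTED_KEYS = {"name", "category", "summary"}

def check_response(raw: str) -> dict:
    """Parse a model response and verify it matches the fixed JSON structure."""
    try:
        obj = json.loads(raw)
    except json.JSONDecodeError as e:
        raise ValueError(f"response is not valid JSON: {e}") from e
    if not isinstance(obj, dict):
        raise ValueError("response JSON must be an object")
    if set(obj) != EXPECTED_KEYS:
        raise ValueError(f"unexpected keys: got {sorted(obj)}, want {sorted(EXPECTED_KEYS)}")
    return obj

good = '{"name": "x", "category": "y", "summary": "z"}'
print(check_response(good)["name"])  # -> x
```

A check like this also gives a concrete retry signal: if validation fails, re-prompt or re-sample instead of passing malformed output downstream.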
-
### Python Version
```shell
python 3.10.9
```
### Pip Freeze
```shell
annotated-types==0.7.0
anyio==4.4.0
argon2-cffi==23.1.0
argon2-cffi-bindings==21.2.0
arrow==1.3.0
asttokens @ f…
-
In the Hugging Face "eager" Mistral implementation, a sliding window of size 2048 will mask 2049 tokens. This is also true for flash attention. In the current vLLM implementation, a window of 2048 will mas…
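The off-by-one can be illustrated with a toy mask. This is a sketch, not the actual HF or vLLM code: with window `w`, an inclusive mask lets each query attend to itself plus the previous `w` tokens (`w + 1` total), while an exclusive variant permits only `w` tokens in total:

```python
# Toy illustration of the sliding-window off-by-one (window=4 rather
# than 2048, so the result is easy to inspect by hand).
def visible_positions(i: int, window: int, inclusive: bool) -> list[int]:
    """Positions a causal sliding-window mask lets query i attend to.

    inclusive=True  -> self plus the previous `window` tokens (window+1 total),
                       the behavior described for HF eager/flash attention.
    inclusive=False -> only `window` tokens in total.
    """
    lo = i - window if inclusive else i - window + 1
    return [j for j in range(max(lo, 0), i + 1)]

i, w = 10, 4
print(len(visible_positions(i, w, inclusive=True)))   # -> 5  (w + 1 tokens)
print(len(visible_positions(i, w, inclusive=False)))  # -> 4  (w tokens)
```

Scaled up, the inclusive convention with `w = 2048` yields 2049 visible tokens, matching the discrepancy described above.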