-
## ⚙️ Request New Models
- Link to an existing implementation (e.g. Hugging Face/GitHub):
## Additional context
-
# Language Model Overview
## OpenAI
| | gpt-4o | gpt-4o-mini …
-
### Your current environment
The output of `python collect_env.py`
```text
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N…
```
-
Ollama gave me this error message when I tried to run mistral-large. It's huge.
Please make a note of this in the README here and on the library page at https://ollama.com/library.
-
On `"@slack/bolt": "^3.19.0"`, I hit a strange bug where a `static_select` element from Block Builder does not display the `initial_option` correctly in my Slack app.
I'm using `"slack-block-builder"…
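A frequent cause of this, independent of the builder library, is that Slack silently ignores `initial_option` unless it is deep-equal to one of the entries in `options` (identical text object and value). A minimal sketch in plain Python (the `static_select` helper is hypothetical, no Slack SDK involved) that avoids the mismatch by reusing the exact option object:

```python
import json

def static_select(action_id, options, initial_value=None):
    """Build a Block Kit static_select element as plain dicts.

    Slack only renders initial_option when it is deep-equal to one of
    the entries in `options`, so we reuse the matching option object
    instead of constructing a new, slightly different dict.
    """
    element = {
        "type": "static_select",
        "action_id": action_id,
        "options": options,
    }
    if initial_value is not None:
        matches = [o for o in options if o["value"] == initial_value]
        if not matches:
            raise ValueError(f"no option with value {initial_value!r}")
        # Exact same object => text and value match byte-for-byte.
        element["initial_option"] = matches[0]
    return element

opts = [
    {"text": {"type": "plain_text", "text": "Red"}, "value": "red"},
    {"text": {"type": "plain_text", "text": "Blue"}, "value": "blue"},
]
sel = static_select("color_select", opts, initial_value="blue")
print(json.dumps(sel["initial_option"]))
```

If the rendered JSON shows `initial_option` differing from every option in even one field (e.g. an extra `emoji` key in the text object), the dropdown falls back to the placeholder.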
-
# Motivation
I wanted to participate more in solving the listed issues, but I already spent more than $30 on debugging with the ChatGPT API, lol.
Recently, Mistral announced that they have reduced…
-
### What happened?
Was running Mistral Large 2 with partial offload on an AMD 5600X + RTX 3090.
Provided the same ~28k prompt to each, llama.cpp produced output that was coherent and similar to non q…
-
I am working with the “magnum-v2-123b-Q4_K_L” model (I also tried “magnum-v2-123b-iQ4_K_M”; no difference). I've noticed that the context shift mechanism with this model works somehow wrong, if not to sa…
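For reference, the context shift being described works roughly like this sketch: once the token buffer exceeds the context window, the first `n_keep` tokens are pinned and a chunk of the oldest remaining tokens is discarded so generation can continue. The function name and the exact discard ratio here are illustrative, not llama.cpp internals:

```python
def context_shift(tokens, n_ctx, n_keep):
    """Sketch of a llama.cpp-style context shift.

    Keep the first n_keep tokens (e.g. the system prompt), drop the
    oldest half of everything after them, and shift the rest left.
    """
    if len(tokens) <= n_ctx:
        return tokens
    overflow = tokens[n_keep:]
    n_discard = (len(tokens) - n_keep) // 2
    return tokens[:n_keep] + overflow[n_discard:]

history = list(range(100))  # pretend token ids
shifted = context_shift(history, n_ctx=80, n_keep=10)
print(len(shifted))  # 10 kept + 45 of the remaining 90 -> 55
```

A model can behave badly after such a shift because the discarded middle removes context the attention mechanism was relying on, which may be what is happening here.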
-
### Misc discussion on performance
Hi all, I'm having trouble maximizing batch-inference performance for big models on vLLM 0.6.3
(Llama 3.1 70B, 405B, Mistral Large).
My command…
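For models this size, batch throughput is usually bounded by KV-cache memory rather than compute, so a useful first step is estimating how many sequences actually fit. A back-of-envelope sketch using Llama 3.1 70B's public configuration (80 layers, 8 KV heads under GQA, head dim 128, fp16 cache); the 40 GiB cache budget and 28k context are arbitrary assumptions for illustration:

```python
def kv_bytes_per_token(n_layers, n_kv_heads, head_dim, dtype_bytes=2):
    # Per token: one key and one value vector per layer per KV head.
    return 2 * n_layers * n_kv_heads * head_dim * dtype_bytes

# Llama 3.1 70B: 80 layers, 8 KV heads (GQA), head_dim 128, fp16 cache.
per_tok = kv_bytes_per_token(80, 8, 128)
print(per_tok)  # 327680 bytes, i.e. 320 KiB per cached token

# Assume ~40 GiB remains for KV cache after weights; at a 28k-token
# context that admits only a handful of concurrent sequences.
budget = 40 * 1024**3
seqs = budget // (per_tok * 28_000)
print(seqs)  # 4
```

If the arithmetic shows only a few sequences fit, knobs like `--max-num-seqs`, `--gpu-memory-utilization`, and tensor parallelism across more GPUs matter far more than anything else in the launch command.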
-
### Have you searched for similar requests?
Yes
### Is your feature request related to a problem? If so, please describe.
According to the console logs, Mistral Large 2 support in current ST:
- is…