-
Hi,
After upgrading to 1.78 today, I can't load Mixtral-based 8x7B models anymore.
Other models such as 30B/70B Llama-type models work.
I get the same error whether I use Vulkan or CLBlast, a…
-
### Python -VV
```shell
N/A
```
### Pip Freeze
```shell
N/A
```
### Reproduction Steps
N/A
### Expected Behavior
N/A
### Additional Context
We have a Mixtral implementation on JAX which wor…
-
Mixtral-8x7B-Instruct-v0.1 doesn't work: when I load the model in chat mode, the load starts but never completes and then breaks.
Maybe they changed the model on Hugging Face, or something else.
https:/…
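If an upstream change on the Hub is the suspect, pinning the load to a fixed revision rules it out. A minimal sketch, assuming a transformers-based loader (the post doesn't say which loader is used; the revision string is a placeholder):
```python
# Hedged sketch: pin the Hub snapshot so upstream edits can't change what loads.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "mistralai/Mixtral-8x7B-Instruct-v0.1"
rev = "REPLACE_WITH_COMMIT_HASH"  # placeholder: a specific commit from the repo history

tokenizer = AutoTokenizer.from_pretrained(repo, revision=rev)
model = AutoModelForCausalLM.from_pretrained(repo, revision=rev, device_map="auto")
```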
-
Hi, I tried to reproduce your baseline on Mixtral-8x7B, but the accuracy I get on gsm8k is 72.00 instead of 73.66. Can you reproduce it?
Also, which version of transformers did you use?
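For comparison, here is one way to re-run the eval so the score and the transformers version can be reported together. This is a sketch assuming EleutherAI's lm-evaluation-harness and 5-shot gsm8k; the original post doesn't say which harness, model revision, or shot count produced 73.66, so treat those arguments as assumptions.
```python
# Hedged sketch: re-running gsm8k with lm-evaluation-harness (assumed setup).
import lm_eval
import transformers

print("transformers:", transformers.__version__)  # report this with the score

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=mistralai/Mixtral-8x7B-v0.1,dtype=bfloat16",
    tasks=["gsm8k"],
    num_fewshot=5,        # gsm8k is commonly reported 5-shot (assumption here)
    batch_size="auto",
)
print(results["results"]["gsm8k"])
```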
-
Hi,
I see you have built an example for Mistral models, which I could build successfully. However, when I try to benchmark such models using GPTSessionBenchmark, I get errors like:
`[TensorRT-LLM][ERR…
-
Mistral's e2e demo perf (with tracing, embedding/argmax on host, untilizing on device) is 15.2 t/s/u.
Device perf is 22.3 t/s/u. e2e:device perf ratio = 68%
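For reference, the 68% figure is just the e2e rate divided by the device rate:
```python
e2e_tps = 15.2     # end-to-end tokens/sec/user (from above)
device_tps = 22.3  # device-only tokens/sec/user (from above)
print(f"e2e:device = {e2e_tps / device_tps:.0%}")  # e2e:device = 68%
```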
Dispatch times for 1 decoder layer are…
-
## 🚀 Feature
Mixtral 8x7B is a mixture-of-experts LLM that splits its parameters into 8 distinct expert groups, and I would like to do both training and inference with Thunder.
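As a reference for what the routing involves, here is a minimal top-2 mixture-of-experts layer in plain PyTorch. All names and sizes are illustrative toys, not Thunder or Mixtral code; like Mixtral, the sketch renormalizes the top-k gate logits with a softmax.
```python
# Minimal sketch of Mixtral-style top-2 expert routing (toy sizes, not real code).
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyMoE(nn.Module):
    def __init__(self, d_model=64, d_ff=128, n_experts=8, top_k=2):
        super().__init__()
        self.gate = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )
        self.top_k = top_k

    def forward(self, x):                          # x: (n_tokens, d_model)
        logits = self.gate(x)                      # (n_tokens, n_experts)
        weights, expert_idx = logits.topk(self.top_k, dim=-1)
        weights = F.softmax(weights, dim=-1)       # renormalize over the top-k
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):  # route each token to its top-k experts
            for k in range(self.top_k):
                mask = expert_idx[:, k] == e
                if mask.any():
                    out[mask] += weights[mask, k].unsqueeze(-1) * expert(x[mask])
        return out

tokens = torch.randn(4, 64)
print(TinyMoE()(tokens).shape)  # torch.Size([4, 64])
```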
### Work items
- [x] Run `t…
-
```python
from edsl import Model
import time
models_list = [['Austism/chronos-hermes-13b-v2', 'deep_infra', 0], ['BAAI/bge-base-en-v1.5', 'together', 1], ['BAAI/bge-large-en-v1.5', 'together', …
```
-
"mixtral-8x22b" is gone ;-(
You can see it in the dynamic list of models (ordered by number of providers):
```python
import g4f

all = []
for model in g4f.models._all_models:
    m = g4f.models.ModelUtils.c…
```
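If the goal is the provider-ordered listing, a sketch of the counting part follows. It assumes g4f's `ModelUtils.convert` is a name-to-Model mapping and that a model's `best_provider` may be a retry wrapper exposing a `.providers` list; both are assumptions to verify against your g4f version.
```python
# Hedged sketch: count providers per model (g4f internals assumed, verify locally).
import g4f

counts = {}
for name in g4f.models._all_models:
    model = g4f.models.ModelUtils.convert[name]  # assumed: dict of name -> Model
    best = model.best_provider
    providers = getattr(best, "providers", [best] if best else [])
    counts[name] = len(providers)

for name, n in sorted(counts.items(), key=lambda kv: -kv[1]):
    print(f"{n:2d}  {name}")
```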
-
Putting this here; the latency change seems very substantial:
https://github.com/vllm-project/vllm/pull/2090