-
I am getting "float division by zero" error whenever I try to quantize mixtral related models with autogptq,
and here is my code.
```
from transformers import AutoTokenizer, TextGenerationPipeli…
```
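For reference, a full Mixtral quantization script with AutoGPTQ typically follows the pattern below. This is a sketch, not the truncated code above: the model ID, calibration texts, and output directory are all placeholders.

```python
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

model_id = "mistralai/Mixtral-8x7B-Instruct-v0.1"  # placeholder model ID

tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=True)

quantize_config = BaseQuantizeConfig(
    bits=4,          # 4-bit weights
    group_size=128,  # quantization group size
    desc_act=False,  # skip activation-order reordering
)

# Calibration data. One commonly reported trigger for division-by-zero
# errors with MoE models is a calibration set that routes no tokens to
# some expert, so use enough varied samples that every expert sees data.
examples = [
    tokenizer("This is a calibration sample for GPTQ quantization."),
    tokenizer("Mixture-of-experts layers route each token to a few experts."),
]

model = AutoGPTQForCausalLM.from_pretrained(model_id, quantize_config)
model.quantize(examples)
model.save_quantized("mixtral-gptq-4bit")  # placeholder output directory
```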
-
Hi,
I was able to run the _TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ_ model on two A10 GPUs on AWS SageMaker, using the _ml.g5.12xlarge_ instance type.
Command to run the code:
`python3 -m vllm.ent…
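Since the command is cut off, here is the same setup expressed through vLLM's Python API; the argument values are my assumptions for a two-GPU box, not the original flags:

```python
from vllm import LLM, SamplingParams

# Tensor-parallel across the two A10 GPUs; GPTQ weights run in half precision.
llm = LLM(
    model="TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ",
    quantization="gptq",
    tensor_parallel_size=2,
    dtype="half",
)

outputs = llm.generate(["What is MoE routing?"], SamplingParams(max_tokens=64))
print(outputs[0].outputs[0].text)
```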
-
Since there are already a few models out there quantized with Half-Quadratic Quantization (HQQ), vLLM should support them as well:
```sh
api_server.py: error: argument --quantization/-q: invalid choice: 'hqq' (choose from …
```
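The list of valid choices is cut off above; one way to check what a given build accepts is to inspect the registry directly (a sketch, assuming a vLLM version that exposes `QUANTIZATION_METHODS` at this path):

```python
# Prints the quantization method names this vLLM build supports,
# e.g. to confirm whether 'hqq' is present.
from vllm.model_executor.layers.quantization import QUANTIZATION_METHODS

print(sorted(QUANTIZATION_METHODS))
```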
-
Looking forward to support for Mixtral_8x7b MoE.
-
It displays this error message:
"cupy_backends.cuda.libs.nccl.NcclError: NCCL_ERROR_INVALID_USAGE: invalid usage"
This error happens with vllm==0.3.2, while vllm==0.2.7 works fine.
To reproduce it:
…
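Since the reproduction steps are truncated, a minimal two-GPU script that exercises the same cupy/NCCL initialization path would look roughly like this (my sketch; the model is a placeholder):

```python
# Initializing tensor parallelism is enough to trigger the NCCL setup
# that this report says fails on vllm==0.3.2.
from vllm import LLM

llm = LLM(
    model="mistralai/Mixtral-8x7B-Instruct-v0.1",  # placeholder model
    tensor_parallel_size=2,                        # needs NCCL across 2 GPUs
)
print(llm.generate(["hello"])[0].outputs[0].text)
```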
-
**LocalAI version:**
2.5.1-cublas-cuda12
**Environment, CPU architecture, OS, and Version:**
Ubuntu 22.04 with 2 RTX A5000 24 GB GPUs
**Describe the bug**
My problem is that this model mixt…
-
{"message": "Error in _stream_synthesis_task\nTraceback (most recent call last):\n File \"/root/pythonenv/enve/lib/python3.10/site-packages/livekit/agents/utils/log.py\", line 16, in async_fn_logs\n …
-
Hello,
I'm looking to reproduce some of the open-source model results from the VWA paper:
(1) the Mixtral-8x7B model as the LLM backbone for the Caption-augmented model
(2) CogVLM for the Multimodal Mode…
-
While loading Mixtral I get:
"AssertionError: Insufficient space in device allocation"
Command I used:
`python ericLLM.py --model ./models/mistralai_Mixtral-8x7B-Instruct-v0.1 --gpu_split 24,24,24,24,…`
-
Hello,
I wonder why my Doc().query requests often return random and poor-quality answers in terms of relevance.
Papers are sometimes relevant, sometimes not... Citations are, most of the time, co…
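For context, a minimal query flow with the paper-qa library (which this question appears to be about) looks like the sketch below; the file name and question are placeholders:

```python
from paperqa import Docs

docs = Docs()
docs.add("my_paper.pdf")  # placeholder path; add each source PDF to the index

answer = docs.query("What methods does this paper propose?")
print(answer.formatted_answer)  # answer text followed by its citations
```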