-
Hello,
I'm trying to use paper-qa with "mixtral-8x7b-instruct-v0.1.Q4_K_M" on a local network. The llamafile LLM executable is launched with the "-cb -np 4 -a my-llm-model --embedding" options as des…
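For context, llamafile serves an OpenAI-compatible HTTP API (by default on port 8080). A minimal stdlib-only sketch of querying such a server over the local network; the host address here is made up, and the `my-llm-model` name simply mirrors the `-a` alias from the launch flags:

```python
import json
from urllib import request

def build_chat_request(base_url, model, prompt):
    """Build the (url, payload) pair for an OpenAI-style /chat/completions call."""
    url = f"{base_url}/chat/completions"
    payload = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return url, payload

def ask(base_url, model, prompt):
    """POST the request to the llamafile server and return the reply text."""
    url, payload = build_chat_request(base_url, model, prompt)
    req = request.Request(url, data=payload,
                          headers={"Content-Type": "application/json"})
    with request.urlopen(req) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]

# Example (hypothetical LAN address, matching the "-a my-llm-model" alias):
# ask("http://192.168.1.10:8080/v1", "my-llm-model", "Summarize this paper.")
```

Pointing paper-qa at the same base URL and model name should route its completion and embedding calls through this endpoint.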
-
**Describe the bug**
When I am using the most recent Megatron-LM fork, I get the following error:
```
make: Entering directory '/workspace/megatron-lm/megatron/core/datasets'
g++ -O3 -Wall -sha…
-
### Feature request
I would like to implement the Mixtral model in Flax.
### Motivation
I am in the process of learning Flax and have almost finished converting the model to Flax.
### Your contri…
-
Original Repository: https://github.com/ml-explore/mlx-examples/
Listed below are examples from there that would be nice to have. We don't expect the models to work the moment they are translated to …
-
Thanks for the great work here!
I'm following the guide here to fine-tune the Mixtral MoE version of SPHINX: https://github.com/Alpha-VLLM/LLaMA2-Accessory/tree/main/SPHINX#finetune-sphin…
-
### ⚠️ Please check that this feature request hasn't been suggested before.
- [X] I searched previous [Ideas in Discussions](https://github.com/OpenAccess-AI-Collective/axolotl/discussions/categori…
-
After I modified the code, there was a problem with the gate size of the LoRA weights: after loading, I found that lora_a was the same as base_layer, and a size-mismatch error occurred. Thanks!
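For reference, a plain-Python sketch (with hypothetical shapes) of the shape invariant a LoRA adapter must satisfy against its base layer: lora_A should map in_features to the rank r and lora_B should map r to out_features. If lora_A instead carries the base layer's full shape, loading produces exactly this kind of size mismatch:

```python
# Hypothetical shapes for a Mixtral gate projection with a rank-16 LoRA.
# base weight: (out_features, in_features); lora_A: (r, in_features);
# lora_B: (out_features, r), so that lora_B @ lora_A matches the base shape.
def check_lora_shapes(base_shape, lora_a_shape, lora_b_shape):
    out_features, in_features = base_shape
    a_rows, a_cols = lora_a_shape
    b_rows, b_cols = lora_b_shape
    if a_cols != in_features:
        return f"size mismatch: lora_A expects input dim {in_features}, got {a_cols}"
    if b_rows != out_features:
        return f"size mismatch: lora_B expects output dim {out_features}, got {b_rows}"
    if a_rows != b_cols:
        return "size mismatch: lora_A rank != lora_B rank"
    return "ok"

# A gate layer of shape (8, 4096) with a rank-16 adapter is consistent:
print(check_lora_shapes((8, 4096), (16, 4096), (8, 16)))  # ok
# The situation described above: lora_A saved with the base layer's shape
# fails the rank check instead of pairing with lora_B:
print(check_lora_shapes((8, 4096), (8, 4096), (8, 16)))
```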
-
Running into issues when serving Mixtral 8x7B on 4 x H100 (TP=4) with deepspeed-mii v0.2.3, with all other arguments left at their defaults, in the base image `nvidia/cuda:12.3.1-devel-ubuntu22.04`.
The …
-
While loading Mixtral I get:
"AssertionError: Insufficient space in device allocation".
The command I used:
"python ericLLM.py --model ./models/mistralai_Mixtral-8x7B-Instruct-v0.1 --gpu_split 24,24,24,24,…
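A back-of-envelope sketch of the check behind such allocation errors: whether a `--gpu_split` can hold the model weights at all. The figures here are assumptions for illustration (Mixtral 8x7B has roughly 46.7B parameters, and the 2 GiB per-GPU overhead is an arbitrary placeholder; real runtime overhead from KV cache and activations is workload-dependent):

```python
# Estimate whether model weights fit the per-GPU split (assumed figures only).
def fits_in_split(n_params, bytes_per_param, gpu_split_gb, overhead_gb_per_gpu=2.0):
    """Return (fits, weights_gib): weight size in GiB vs. usable split capacity."""
    weights_gib = n_params * bytes_per_param / 1024**3
    usable_gib = sum(g - overhead_gb_per_gpu for g in gpu_split_gb)
    return weights_gib <= usable_gib, weights_gib

# ~46.7B params at fp16 (2 bytes/param) is ~87 GiB of weights,
# against a 24,24,24,24 split:
ok, need = fits_in_split(46.7e9, 2, [24, 24, 24, 24])
print(f"need ~{need:.0f} GiB of weights, fits: {ok}")
```

The margin is thin even before the KV cache, which is why a split that looks sufficient on paper can still fail at load time.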