-
### Is this your first time submitting a feature request?
- [X] I have searched the existing issues, and I could not find an existing issue for this feature
- [X] I am requesting a straightforward…
-
### Bug Report
I just compiled the updated Python bindings v2.7.0.
When terminating my GUI, the whole model now needs to be loaded again, which may take a long time.
In previous versions only the firs…
-
[Groq](https://groq.com) provides an [OpenAI compatible API](https://console.groq.com/docs/openai) to several LLMs e.g. LLaMA3 8b, LLaMA3 70b, Mixtral 8x7b, Gemma 7b (documented on the [models page](h…
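For reference, a minimal sketch of what a request to that OpenAI-compatible endpoint looks like. The model id and temperature here are illustrative assumptions; check the models page for the current ids:

```python
import json

# Groq's OpenAI-compatible base URL (from the docs linked above).
GROQ_BASE_URL = "https://api.groq.com/openai/v1"

def build_chat_request(model: str, prompt: str, temperature: float = 0.2) -> dict:
    """Build an OpenAI-style chat-completion payload for Groq's API."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
    }

# "mixtral-8x7b-32768" is one of the model ids listed in Groq's docs;
# substitute whichever model you actually want.
payload = build_chat_request("mixtral-8x7b-32768", "Hello!")
print(json.dumps(payload))
```

To actually send it, POST the payload to `{GROq_BASE_URL}/chat/completions`... rather, `f"{GROQ_BASE_URL}/chat/completions"` with an `Authorization: Bearer <GROQ_API_KEY>` header, or point the official `openai` client at `GROQ_BASE_URL` via its `base_url` argument.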
-
Looking forward to support for the Mixtral 8x7B MoE model.
-
**NOTE: ~~It~~ Mixtral can at times be... Fragile. Let's call it that. Keep the temperature *LOW*. You can indeed drive it nuts, at least with the system prompt I was using.**
I intend to make a fo…
-
**LocalAI version:**
2.5.1-cublas-cuda12
**Environment, CPU architecture, OS, and Version:**
Ubuntu 22.04 with 2 RTX A5000 24 GB GPUs
**Describe the bug**
My problem is that this model mixt…
-
Since the paper uses 4 expert adaptors trained with LoRA SFT, the next question is: why not try an MoE approach like Mixtral-8x7B?
-
**Problem**
Jan is great, but I'm limited in the number of models I can run on my 16GB GPU. I saw there is a project called [mixtral-offloading](https://github.com/dvmazur/mixtral-offloading) that cou…
-
### System Info
transformers version: 4.42.4
### Who can help?
@Gante
### Information
- [ ] The official example scripts
- [X] My own modified scripts
### Tasks
- [ ] An officially supported tas…
-
I am using the latest vllm Docker image, trying to run the Mixtral 8x7B model quantized in AWQ format. I got the error message below:
```
INFO 12-24 09:22:55 llm_engine.py:73] Initializing an LLM engine …