-
I want to use the Mixtral 8x7B model for inference, but currently it only supports autoTP. How can I add support so that it can use more forms of parallelism (e.g. EP, DP)?
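For context, a minimal sketch of the autoTP path the question refers to (illustrative only; the checkpoint name and `tp_size` are placeholders, and exact kwargs may differ between DeepSpeed versions):

```python
# Sketch of DeepSpeed autoTP inference: linear layers are sharded across
# GPUs automatically when no kernel-injection policy is used.
import torch
import deepspeed
from transformers import AutoModelForCausalLM

model_name = "mistralai/Mixtral-8x7B-v0.1"  # illustrative checkpoint
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float16)

# autoTP: tp_size would normally equal the number of GPUs in the job.
model = deepspeed.init_inference(
    model,
    tensor_parallel={"tp_size": 2},
    dtype=torch.float16,
    replace_with_kernel_inject=False,  # no injection policy -> autoTP
)
```

By contrast, EP would shard the per-layer expert weights of the MoE blocks across ranks, and DP would replicate the whole model; neither is wired up by autoTP, which is what this request asks for.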
-
Update the Mixtral `demo_with_prefill.py` demo script with prompts of up to 16k tokens.
We support KV cache sizes up to 32K. If we make the prompt 32k tokens and prefill that, we cannot generate any more…
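The arithmetic behind this limit is simply `prompt_len + max_new_tokens <= kv_cache_size`. A small illustrative check (the helper name and constant are mine, not from the demo script):

```python
# With a 32K-token KV cache, a fully prefilled 32k-token prompt
# leaves zero budget for generated tokens.
KV_CACHE_SIZE = 32 * 1024

def max_new_tokens(prompt_len: int, kv_cache_size: int = KV_CACHE_SIZE) -> int:
    """Tokens that can still be generated after prefilling the prompt."""
    return max(kv_cache_size - prompt_len, 0)

assert max_new_tokens(16 * 1024) == 16 * 1024  # 16k prompt leaves 16k headroom
assert max_new_tokens(32 * 1024) == 0          # full prefill, nothing left
```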
-
### System Info
Ubuntu 20.04
tensorrt 10.0.1
tensorrt-cu12 10.0.1
tensorrt-cu12-bindings 10.0.1
tensorrt-cu12-libs 10.0.1
tensorrt-llm 0.10.…
-
https://github.com/nomic-ai/llama.cpp
GPT4All runs Mistral and Mixtral q4 models over 10x faster on my 6600M GPU
-
**Is your feature request related to a problem? Please describe.**
The doc refers to Ollama with the mixtral model.
**Describe the solution you'd like**
Update the doc.
**Describe alternativ…
-
### Before submitting your bug report
- [X] I believe this is a bug. I'll try to join the [Continue Discord](https://discord.gg/NWtdYexhMs) for questions
- [X] I'm not able to find an [open issue]…
-
### ⚠️ Check for existing issues before proceeding. ⚠️
- [X] I have searched the existing issues, and there is no existing issue for my problem
### Where are you using SuperAGI?
Linux
### …
-
There's a new cache technique mentioned in the paper https://arxiv.org/abs/2312.17238. (github: https://github.com/dvmazur/mixtral-offloading)
They introduced an LRU cache to cache experts based on patt…
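A minimal sketch of the idea as I understand it (my own illustration, not code from that repo): keep a fixed number of experts resident on the GPU and evict the least recently used one whenever an uncached expert is routed to.

```python
from collections import OrderedDict

class ExpertLRUCache:
    """Illustrative LRU cache for MoE expert weights.

    Holds at most `capacity` experts "on device"; loading an uncached
    expert evicts the least recently used one.
    """

    def __init__(self, capacity: int, load_expert):
        self.capacity = capacity
        self.load_expert = load_expert  # e.g. copies weights host -> GPU
        self._cache = OrderedDict()

    def get(self, expert_id):
        if expert_id in self._cache:
            self._cache.move_to_end(expert_id)  # mark as most recently used
        else:
            if len(self._cache) >= self.capacity:
                self._cache.popitem(last=False)  # evict least recently used
            self._cache[expert_id] = self.load_expert(expert_id)
        return self._cache[expert_id]
```

The policy pays off because consecutive tokens tend to route to overlapping sets of experts, so keeping recently used experts resident cuts host-to-GPU offloading traffic.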
-
I am not getting an error, but running Q&A generation took 20 minutes and produced empty datasets.
Please let me know what the cause could be.
(granite1) sankar@Sankars-MacBook-Pro test1 % ilab dat…
-