-
I am running llama_cpp version 0.2.68 on Ubuntu 22.04 LTS in a conda environment. Attached are two Jupyter notebooks with ONLY one line changed (use CPU vs. GPU). As you can see, for the exact same environ…
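For context, a minimal sketch of the kind of one-line difference described here, assuming the notebooks use llama-cpp-python's `Llama` constructor (the model path is illustrative):
```python
from llama_cpp import Llama

# CPU-only notebook: keep every layer on the CPU
llm_cpu = Llama(model_path="./model.gguf", n_gpu_layers=0)

# GPU notebook: the single changed line, offloading all layers to the GPU
llm_gpu = Llama(model_path="./model.gguf", n_gpu_layers=-1)
```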
-
```
nproc_per_node=4
CUDA_VISIBLE_DEVICES=0,1,2,3 \
NPROC_PER_NODE=$nproc_per_node \
swift sft \
  --model_id_or_path "AI-ModelScope/llava-v1.6-mistral-7b" \
  --template_type "llava-mistral-inst…
```
-
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-chat-hf")
model = AutoModelForCausalLM.from_pretrained("met…
```
-
### System Info
```
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.54.15 Driver Version: 550.54.15 CUDA Version: 12.4…
```
-
### The Feature
Please add the method and proxy support for the NVIDIA API, which has this example code:
```python
from openai import OpenAI

client = OpenAI(
    base_url="https://integrate.api.nvidia.com/v1",
    api_ke…
```
-
## Describe the bug
If the number of device layers exceeds the number of layers in the model, the number of host layers to assign seems to wrap/overflow instead of being the expected `0`, as sketched below.
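A minimal sketch of the suspected arithmetic, assuming the host-layer count comes from an unsigned subtraction; the names and the 64-bit width here are assumptions, not the project's actual code:
```python
total_layers = 32    # layers in the model
device_layers = 40   # user requested more device layers than the model has

# Expected behaviour: clamp the host-layer count at zero
host_layers = max(total_layers - device_layers, 0)  # -> 0

# Suspected behaviour: the subtraction wraps like unsigned 64-bit arithmetic
wrapped = (total_layers - device_layers) % 2**64    # -> 18446744073709551608
```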
**NOTE:** With `llama-cpp` you can confi…
-
https://github.com/huggingface/transformers/blob/965cf677695dd363285831afca8cf479cf0c600c/src/transformers/models/mistral/modeling_mistral.py#L120-L121
https://github.com/huggingface/transformers/blo…
-
### Duplicates
- [X] I have searched the existing issues
### Summary 💡
Currently, the AutoGPT app assumes the underlying LLM supports OpenAI-style function calling. Even though there is a config var…
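For reference, the OpenAI-style function-calling request shape being assumed looks roughly like this (a minimal sketch; the model name and tool schema are illustrative):
```python
from openai import OpenAI

client = OpenAI()

# The app assumes the backend accepts a `tools` schema and returns
# structured `tool_calls`, per the OpenAI chat completions API.
resp = client.chat.completions.create(
    model="gpt-4o-mini",  # illustrative model name
    messages=[{"role": "user", "content": "What's the weather in Berlin?"}],
    tools=[{
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a city",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }],
)
print(resp.choices[0].message.tool_calls)
```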
-
**Describe the bug**
This is not a bug, but there are no other headings (e.g. usage) to select for this issue.
The response time is long; are there any settings that can make it respond faster?
*…
-
With no change, I run out of memory (A100 w/ 24GB). Setting it to anything other than the default causes the following error:
```
Exception in ModelRpcClient:
Traceback (most recent call last):
…
```