-
I'm not sure if this is a bug or a feature: modifying the normalizer of a pretrained tokenizer sometimes works and sometimes doesn't.
For example, it works for `"mistralai/Mistral-7B-v0.1"` but not `"m…
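For reference, here's a minimal sketch of the kind of normalizer swap in question, assuming a fast tokenizer backed by the `tokenizers` library (the NFKC/StripAccents combination is just an illustration, not the normalizer from the original report):
```
from transformers import AutoTokenizer
from tokenizers import normalizers
from tokenizers.normalizers import NFKC, StripAccents

tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")

# Swap the backend normalizer in place; whether this takes effect can depend
# on how the checkpoint's tokenizer.json was serialized.
tokenizer.backend_tokenizer.normalizer = normalizers.Sequence([NFKC(), StripAccents()])

print(tokenizer.tokenize("café"))
```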
-
```
export MODEL=mistralai/Mistral-7B-v0.1
python3 -m vllm.entrypoints.openai.api_server --model $MODEL \
--tensor-parallel-size=1 \
--enable-prefix-caching --max-model-len=4096 --trust-re…
```
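Once the server is up, a quick sanity check against the OpenAI-compatible endpoint looks roughly like this (assuming vLLM's default host and port, localhost:8000):
```
import requests

# Query the /v1/completions route exposed by the api_server started above.
resp = requests.post(
    "http://localhost:8000/v1/completions",
    json={
        "model": "mistralai/Mistral-7B-v0.1",
        "prompt": "Hello, my name is",
        "max_tokens": 32,
    },
)
print(resp.json()["choices"][0]["text"])
```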
-
Checking the constraints for adding Mistral 7B to the list of models.
It seems it has been benchmarked with AutoAWQ:
https://github.com/casper-hansen/AutoAWQ
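For context, quantizing Mistral 7B with AutoAWQ looks roughly like the sketch below, based on the project's documented API; the `quant_config` values are the usual 4-bit defaults, not settings taken from any benchmark:
```
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "mistralai/Mistral-7B-v0.1"
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

# Load the FP16 model, quantize to 4-bit AWQ, and save the result.
model = AutoAWQForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path)
model.quantize(tokenizer, quant_config=quant_config)
model.save_quantized("mistral-7b-awq")
tokenizer.save_pretrained("mistral-7b-awq")
```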
-
### Prerequisites
- [X] I am running the latest code. Mention the version if possible as well.
- [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md)…
-
In the **"5. Preference Alignment"** section, the link in **"Fine-tune Mistral-7b with DPO"** refers to a Huggingface article about how to fine-tune llama2 with RLHF NOT Mistral-7b, I guess the correc…
-
Any reason why mistralai_mistral-7b-instruct-v0.2 does not offload to the GPU?
```
load INSTRUCTOR_Transformer
max_seq_length 512
Starting get_model: llama
Failed to listen to n_gpus: No modu…
```
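The truncated `No modu…` line looks like a failed import rather than a CUDA problem; as a first debugging step (my suggestion, not from the original report), it's worth confirming that PyTorch itself can see the GPUs:
```
import torch

# If this prints False / 0, the offload failure is an environment issue
# (driver or a CPU-only torch build), not a model-loading one.
print(torch.cuda.is_available())
print(torch.cuda.device_count())
```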
-
Nice work.
I am trying to use FastChat to train a Mistral model. However, I wonder why the following code is hard-coded for Vicuna only.
[https://github.com/lm-sys/FastChat/blob/main/fastchat/train/…
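For what it's worth, FastChat does expose named conversation templates, so selecting one instead of hard-coding it might look like the sketch below ("vicuna_v1.1" is a long-standing template name; availability of a Mistral template depends on the FastChat version):
```
from fastchat.conversation import get_conv_template

# Build a prompt from a registered template rather than a hard-coded one.
conv = get_conv_template("vicuna_v1.1")
conv.append_message(conv.roles[0], "Hello!")
conv.append_message(conv.roles[1], None)
print(conv.get_prompt())
```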
-
Error with 4.39.3:
```
Traceback (most recent call last):
  File "/home/arda/kai/webui/text-generation-webui/modules/callbacks.py", line 61, in gentask
    ret = self.mfunc(callback=_callback, *args…
```
-
### System Info
8× NVIDIA A100 GPUs
Linux OS
```
❯ /usr/local/cuda/bin/nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2023 NVIDIA Corpor…
```
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) and didn't find any similar reports.
…