-
```
root@ad966f70d032:/workspace/upvllama/VideoLLaMA2# sh scripts/custom/finetune_lora.sh
[2024-07-08 09:54:08,665] [INFO] [real_accelerator.py:191:get_accelerator] Setting ds_accelerator to cuda …
```
-
- [x] Extract model costs (per M request token + per M response token + per request + per response) and write them into CSV reports (sketched below)
- [x] Check other API providers too (@bauersimon knows about that e.g. M…
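A minimal sketch of the first item, assuming a hypothetical price table (the model name and figures below are placeholders, not real quotes; actual prices come from each provider's docs):

```python
import csv

# Hypothetical price table: USD per M request tokens, USD per M response
# tokens, USD per request, USD per response. Placeholder numbers only.
PRICES = {
    "example-provider/example-model": (2.50, 10.00, 0.0, 0.0),
}

def cost_usd(model, req_tokens, resp_tokens, n_requests=1, n_responses=1):
    per_m_in, per_m_out, per_req, per_resp = PRICES[model]
    return (req_tokens / 1e6 * per_m_in
            + resp_tokens / 1e6 * per_m_out
            + n_requests * per_req
            + n_responses * per_resp)

with open("model_costs.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["model", "request_tokens", "response_tokens", "cost_usd"])
    writer.writerow(["example-provider/example-model", 1200, 350,
                     cost_usd("example-provider/example-model", 1200, 350)])
```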
-
Any help on getting multi-GPU support running? vLLM fails to load with `tensor_parallel_size=2`.
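For reference, the documented way to enable tensor parallelism in vLLM is the `tensor_parallel_size` argument; a minimal sketch, assuming two visible GPUs and a placeholder model name:

```python
from vllm import LLM, SamplingParams

# Shards the model across 2 GPUs; requires 2 visible CUDA devices.
# The model name is a placeholder, not the one from the report.
llm = LLM(model="meta-llama/Meta-Llama-3-8B-Instruct", tensor_parallel_size=2)

outputs = llm.generate(["Hello, my name is"], SamplingParams(max_tokens=16))
print(outputs[0].outputs[0].text)
```

If this fails to load, confirming that both devices are actually visible (e.g. via `CUDA_VISIBLE_DEVICES`) is a common first check.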
-
Congrats on Flash Attention in the latest version, or, to be precise, on having your storage limit on PyPI.org increased so you could upload the release that was ready weeks ago. Here are some benchmarks fo…
-
### Your current environment
```text
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu 20.04.6 LTS (x86_64)
GCC …
```
-
**Description:**
I'm encountering an error while trying to merge models using the `merge.py` script. The process loads the models and processes the layers correctly, but when it attempts to save the m…
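The `merge.py` script itself isn't shown, so as a stand-in, here is a minimal sketch of a merge-then-save flow (hypothetical filenames, and a plain linear average in place of whatever merge logic the script actually uses) that isolates the step where the reported failure occurs:

```python
import torch

# Load the two checkpoints onto CPU to keep GPU memory out of the picture.
a = torch.load("model_a.bin", map_location="cpu")
b = torch.load("model_b.bin", map_location="cpu")

# Toy merge: average every tensor the two state dicts share.
merged = {k: (a[k] + b[k]) / 2
          for k in a if k in b and a[k].shape == b[k].shape}

# Saving is where the reported error appears; insufficient free disk space
# or an unwritable path are two common culprits at this step.
torch.save(merged, "merged.bin")
```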
-
## 🐛 Bug
## To Reproduce
- After running the server, wait for a period of time.
- model: mistral-large-instruct-2407-q4f16_1
- "tensor_parallel_shards": 4,
```
(mlcllm) a@aserver:~$ mlc…
```
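For context, `tensor_parallel_shards` is a field in the compiled model's `mlc-chat-config.json`; a minimal sketch of setting it, assuming a hypothetical path to the compiled model directory:

```python
import json

# Hypothetical path; the config lives in the compiled model's directory.
cfg_path = "dist/mistral-large-instruct-2407-q4f16_1-MLC/mlc-chat-config.json"

with open(cfg_path) as f:
    cfg = json.load(f)

cfg["tensor_parallel_shards"] = 4  # shard across 4 GPUs, as in the report

with open(cfg_path, "w") as f:
    json.dump(cfg, f, indent=2)
```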
-
### 🐛 Describe the bug
I have a small script to reproduce how a toy model and the following three features lead to an error when combined (a sketch of the first two appears after the list):
1. torch.compile
2. FSDP1 with CPU offloading
3. PyTorch …
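The repro script isn't included above and the third item is cut off, so the following is only a minimal sketch of combining the first two features, on a single rank so it runs standalone (`use_orig_params=True` is the setting usually recommended when mixing FSDP with `torch.compile`):

```python
import os
import torch
import torch.distributed as dist
from torch.distributed.fsdp import CPUOffload, FullyShardedDataParallel as FSDP

# Single-rank process group so the snippet runs without a launcher.
os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
os.environ.setdefault("MASTER_PORT", "29501")
dist.init_process_group("nccl" if torch.cuda.is_available() else "gloo",
                        rank=0, world_size=1)

device = "cuda" if torch.cuda.is_available() else "cpu"
model = torch.nn.Sequential(
    torch.nn.Linear(16, 32), torch.nn.ReLU(), torch.nn.Linear(32, 1)
).to(device)

# Feature 2: FSDP1 with CPU offloading of parameters.
model = FSDP(model,
             cpu_offload=CPUOffload(offload_params=True),
             use_orig_params=True)

# Feature 1: torch.compile on top of the FSDP-wrapped module.
model = torch.compile(model)

out = model(torch.randn(4, 16, device=device))
out.sum().backward()
dist.destroy_process_group()
```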
-
### OS
Linux
### GPU Library
CUDA 12.x
### Python version
3.11
### Describe the bug
When running exllamav2's inference_speculative.py example with llama 3.1 8B 2.25bpw as draft and 70B 4.5bpw a…
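The report is cut off above; for readers unfamiliar with the draft/target setup, here is a library-free toy of one common greedy speculative-decoding scheme (draft proposes, target verifies). The two `*_logits_fn` callables are stand-ins, not exllamav2's API, and this is not necessarily the exact scheme `inference_speculative.py` uses:

```python
import torch

def greedy_speculative_step(target_logits_fn, draft_logits_fn, ctx, k=4):
    """One round of greedy speculative decoding over token ids in `ctx`.

    Both *_logits_fn(ids) -> FloatTensor of shape (len(ids), vocab_size);
    they stand in for the real draft (small) and target (large) models.
    """
    # 1. Draft model proposes k tokens autoregressively (greedy argmax).
    draft = ctx.clone()
    for _ in range(k):
        nxt = draft_logits_fn(draft)[-1].argmax()
        draft = torch.cat([draft, nxt.view(1)])
    proposed = draft[len(ctx):]

    # 2. Target model scores the whole proposed block in one forward pass;
    #    logits at position i predict token i + 1.
    tgt_logits = target_logits_fn(draft)
    tgt_choice = tgt_logits[len(ctx) - 1 : len(draft) - 1].argmax(dim=-1)

    # 3. Accept the longest prefix where draft and target agree, then take
    #    the target's own token at the first disagreement (or one bonus
    #    token if the whole block was accepted).
    n_ok = 0
    while n_ok < k and proposed[n_ok] == tgt_choice[n_ok]:
        n_ok += 1
    if n_ok < k:
        accepted = torch.cat([proposed[:n_ok], tgt_choice[n_ok].view(1)])
    else:
        accepted = torch.cat([proposed, tgt_logits[-1].argmax().view(1)])
    return torch.cat([ctx, accepted])

# Toy usage: the same fixed random projection serves as both "models", so
# draft and target always agree and the whole block is accepted.
torch.manual_seed(0)
emb = torch.randn(100, 100)
logits_fn = lambda ids: emb[ids]
print(greedy_speculative_step(logits_fn, logits_fn, torch.tensor([1, 2, 3])))
```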
-