mistral-large Search Results

1000+ results
for mistral-large

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

embeddings-benchmark/arena #6

Models to support for launch

Should we do these models for the launch of the Arena? We need them all to be loadable via MTEB, so it'd be great if you can help add them to https://github.com/embeddings-benchmark/mteb/tree/main/mte…

Muennighoff updated 4 months ago
14
sgl-project/sglang #268

flushing cache effect on throughput

When running a model with ```--model-mode flashinfer``` (I have tested ```mistralai/Mistral-7B-Instruct-v0.2```), for a large batch (eg 50,000 text input), I usually see that the throughput is high th…

amirarsalan90 updated 3 months ago
6
zjysteven/lmms-finetune #47

Long Output After Finetuning

Have anyone ever ran into the issue where after finetuning the output doesn't know when to end, only ends until max new token is reached? Does it has to do with the tokenizer is not adding an eos toke…

TonyJiang17 updated 4 days ago
52
ggerganov/llama.cpp #8988

Bug: Long sample times with --top-k 0

### What happened? Sample times are greatly increased with --top-k 0, especially with Gemma models. ### Name and Version version: 3570 (4134999e) built with Apple clang version 15.0.0 (clang…

Azirine updated 3 months ago
2
NVIDIA/TensorRT-LLM #1830

Mistral with dtype=bf16 produces garbage for large prompt le…

### System Info - GPU Name: EC2 g5.12xl w/ 4 NVIDIA A10G - TensorRT-LLM: 0.8.0 - Nvidia Driver: 535.161.08 - Container: nvidia/cuda:12.1.0-devel-ubuntu22.04 - OS: Ubuntu 22.04 ### Who can he…

maaquib updated 4 months ago
4
Tencent/HunyuanDiT #28

魔搭上下载的模型不行，缺文件

```text 2024-05-16 03:24:16.542 | INFO | hydit.inference:__init__:160 - Got text-to-image model root path: ckpts/t2i 2024-05-16 03:24:21.606 | INFO | hydit.inference:__init__:172 - Loading C…

zhengyangyong updated 3 weeks ago
3
tsunamayo/Starship-EVO #4140

[New build - EXPERIMENTAL] 21w34a: The Fuel Saga #1: the Fue…

This is the first of several updates introducing a key component to the gameplay: Fuel. New Features: - Power balance is now computed at the level of each reactor. - Reactors will consume fuel wh…

tsunamayo updated 3 years ago
27
ggerganov/llama.cpp #5323

Vulkan backend performance is relatively slower at certain q…

Radeon RX 6700 XT, Ryzen 5700X ECO, model mistral-7b-instruct-v0.2 fully oflloaded to the GPU. EDIT: ROCm 6.0 Some quantization methods implementations in the Vulkan backend provide relatively slow…

Nindaleth updated 2 months ago
4
PygmalionAI/aphrodite-engine #494

[Usage]: OOM crash following Offline Inference setup

### Your current environment ```text Collecting environment information... PyTorch version: 2.3.0+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch: N/A OS…

eedmond updated 2 months ago
4
langflow-ai/langflow #2229

Question: Hugging Face API

Hi, I'm using Langflow to create a ChatBot based on Mistral 7B, but i can't find any documentation or example of the module "Hugging Face API" on Langflow, and what are the exact values to put in End…

dyomed93 updated 3 months ago
2

上一页 1...82 83 84 85 86 87 88...100 下一页

1000+ results for mistral-large

1000+ results
for mistral-large