-
### Your current environment
Collecting environment information...
INFO 08-28 14:32:56 importing.py:10] Triton not installed; certain GPU-related functions will not be available.
WARNING 08-28 14:3…
-
### What happened?
llama-server's generation speed has dropped significantly since b3681, and the regression persists in the latest build, b3779.
For the same task and parameters "-n…
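A regression like this is easiest to track across builds with a tokens-per-second measurement. A minimal sketch, assuming llama-server's default port and its `/completion` endpoint with a `timings` field in the response (the `post` argument is a hypothetical injection point so the helper can be exercised without a running server):

```python
import json
import time
import urllib.request

def measure_gen_speed(prompt, n_predict=128,
                      url="http://localhost:8080/completion", post=None):
    """Time one generation request and return tokens per second.

    `url` assumes llama-server's default port; `post` can be swapped
    out (e.g. with a stub) so the helper is testable offline.
    """
    if post is None:
        def post(url, payload):
            req = urllib.request.Request(
                url, data=json.dumps(payload).encode(),
                headers={"Content-Type": "application/json"})
            with urllib.request.urlopen(req) as resp:
                return json.load(resp)

    start = time.perf_counter()
    result = post(url, {"prompt": prompt, "n_predict": n_predict})
    elapsed = time.perf_counter() - start
    # Prefer the server's own measurement; fall back to wall-clock tokens/sec.
    timings = result.get("timings", {})
    return timings.get("predicted_per_second", n_predict / elapsed)
```

Running this against both b3681 and b3779 with identical parameters would quantify the slowdown.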
-
### 🐛 Describe the bug
When a module has a parameter that is a tensor of size 1 and you try to save its FSDP-wrapped state with torch.distributed.checkpoint, you get the following exception:
```
NotImplemente…
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
-
### What happened?
imatrix creation and subsequent quantization to IQ3_XXS of [mixtral 8x7b instruct](https://huggingface.co/TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF/blob/main/mixtral-8x7b-instruct-v…
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
-
## Code:
The code comes from the blis step-by-step optimization guide:
> https://github.com/flame/how-to-optimize-gemm/wiki#step-by-step-optimizations

specifically Optimization 07:
> https://github.com/flame/how-to-optimize-gemm/wiki/Optimization_1x4_7

You can cd to …
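The 1x4 optimization in that wiki step computes four columns of C per pass over a row of A, so each loaded element of A feeds four running dot products. A Python sketch of that loop structure (illustration of the blocking only; the wiki's actual implementation is C):

```python
def gemm_1x4(A, B):
    """C = A @ B with the 1x4 blocking from Optimization_1x4_7:
    each traversal of a row of A updates four accumulators, so four
    columns of C are produced per outer-loop step."""
    m, k = len(A), len(A[0])
    n = len(B[0])
    assert n % 4 == 0, "sketch assumes the column count is a multiple of 4"
    C = [[0.0] * n for _ in range(m)]
    for j in range(0, n, 4):           # step over C four columns at a time
        for i in range(m):
            c0 = c1 = c2 = c3 = 0.0
            for p in range(k):         # one pass over A[i][:] feeds all four
                a = A[i][p]
                c0 += a * B[p][j]
                c1 += a * B[p][j + 1]
                c2 += a * B[p][j + 2]
                c3 += a * B[p][j + 3]
            C[i][j], C[i][j+1], C[i][j+2], C[i][j+3] = c0, c1, c2, c3
    return C
```

In the C version this structure lets the compiler keep the four accumulators in registers and reuse each loaded element of A four times.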
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
-
### Your current environment
```text
The output of `python collect_env.py`:

PyTorch version: 2.1.2+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: …
-
### What is the issue?
Using `ollama:latest` with nvidia-docker and 2x4090.
Tried sending a large batch of 256-word text snippets to ollama for embedding generation using `all-minilm:l6-v2`.
…
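A sketch of that kind of fan-out, assuming ollama's `/api/embeddings` endpoint on its default port with `model`/`prompt` request fields (the `post` argument is a hypothetical injection point so the code can be exercised against a stub instead of a live server):

```python
import concurrent.futures
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/embeddings"  # ollama's default port

def embed(prompt, model="all-minilm:l6-v2", post=None):
    """Request one embedding; `post` is injectable for offline testing."""
    payload = json.dumps({"model": model, "prompt": prompt}).encode()
    if post is None:
        def post(url, data):
            req = urllib.request.Request(
                url, data=data,
                headers={"Content-Type": "application/json"})
            with urllib.request.urlopen(req) as resp:
                return json.load(resp)
    return post(OLLAMA_URL, payload)["embedding"]

def embed_all(prompts, max_workers=8, **kw):
    """Fan the snippets out over a thread pool, preserving input order."""
    with concurrent.futures.ThreadPoolExecutor(max_workers=max_workers) as ex:
        return list(ex.map(lambda p: embed(p, **kw), prompts))
```

Each worker thread issues one request at a time, so `max_workers` bounds how many concurrent requests hit the server.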