avx512 Search Results - Githubissues

1000+ results
for avx512

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

JuliaSIMD/CPUSummary.jl #6

CPUSummary.jl v0.1.14 breaks CI of Trixi.jl on skylake-avx51…

We observed some specific problems when going from CPUSummary.jl v0.1.8 to v0.1.14 at [Trixi.jl](https://github.com/trixi-framework/Trixi.jl). Everything is fine with the old version of CPUSummary.jl.…

ranocha updated 2 years ago
8
larryxiao/libflatarray #6

Plan & Journal

## SIMD Wrapper for ARM NEON, Intel AVX512 & KNC - Pre GSoC ( - 26 April) - [X] [Study the HPX, libflatarray codebase](https://github.com/larryxiao/libflatarray/issues/1) - Community Bonding Period …

larryxiao updated 9 years ago
2
ggerganov/whisper.cpp #2099

CPU Performance Regression? (Older version much faster)

I compared an older version from Nov 23 with Apr 24, and the older version is much faster. total time = 6225.76 ms vs total time = 3817.54 ms Same CPU, same compiler and settings, same test: …

nanocosmos-ol updated 5 months ago
14
vllm-project/vllm #6889

[Bug]: "apply_gptq_marlin_linear" Error When TP > 1

### Your current environment ```text PyTorch version: 2.3.1+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch: N/A OS: Ubuntu 22.04.3 LTS (x86_64) GCC ve…

sitabulaixizawaluduo updated 3 weeks ago
5
pytorch/pytorch #132020

`torch.Tensor.to` ignores `memory_format` kwarg

### 🐛 Describe the bug Minimal reproducer: ```python import torch x = torch.ones(1).expand(2) print(f"{x.is_contiguous()=}") print(f"{x.to(memory_format=torch.contiguous_format).is_contiguous(…

timmoon10 updated 3 months ago
1
ggerganov/llama.cpp #10252

Bug: CANN: Inference result garbled

### What happened? llama.cpp使用QWen2.5-7b-f16.gg在310P3乱码 ### Name and Version ./build/bin/llama-cli -m Qwen2.5-7b-f16.gguf -p "who are you" -ngl 32 -fa ### What operating system are you seeing the …

feichenchina updated 2 weeks ago
8
spack/spack #33712

Installation issue: openmpi@4.1.4

### Steps to reproduce the issue ```console $ spack install openmpi@4.1.4 %gcc@7.3.0 +legacylaunchers +gpfs +pmi schedulers=slurm >> log.openmpi 2>&1 ``` ### Error message Error message ==> In…

nish-ant updated 2 years ago
1
BlueBrain/HighFive #617

Example for adding auto serialisation support for user defi…

**Is your feature request related to a problem? Please describe.** On master branch we can: ```c++ auto v = std::vector(); /* fill v */ file.createDataSet("/path", v); ``` But we will fai…

aborzunov updated 1 year ago
4
vllm-project/vllm #5246

how to compile with GLIBCXX_USE_CXX11_ABI=1

### Your current environment Collecting environment information... PyTorch version: 2.2.0 Is debug build: False CUDA used to build PyTorch: 12.2 ROCM used to build PyTorch: N/A OS: Ubuntu 22.0…

demonatic updated 1 month ago
4
dotnet/runtime #87854

AVX-512 debugger support: view registers

AVX-512 introduces new `zmm` and `k` registers. The Visual Studio Registers window can show these if you right-click in the window and choose "AVX-512". However, the values are "grayed out" when debug…

BruceForstall updated 1 month ago
3

上一页 1...87 88 89 90 91 92 93...100 下一页

1000+ results for avx512

1000+ results
for avx512