avx512 Search Results - Githubissues

1000+ results
for avx512

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

abetlen/llama-cpp-python #1645

All requests end with 'finish_reason': 'length' when the max…

All requests end with 'finish_reason': 'length' when the max_tokens=-1 parameter is set. What could be the problem? **Model**: https://huggingface.co/IlyaGusev/saiga_mistral_7b_gguf/resolve/main/…

tur0kmagalp updated 2 months ago
1
llvm/llvm-project #70259

SLP vectorization causes 1.75x slowdown with march=skylake-a…

I think the SLP cost model might be wrong for vector gathers on skylake. Consider the following code which repeatedly permutes an array: ``` void f(const float *__restrict__ src, const int *__r…

abadams updated 1 year ago
11
oven-sh/bun #12972

Bun is auto-restarting due to crash

### How can we reproduce the crash? _No response_ ### Relevant log output ```shell --- Bun is auto-restarting due to crash [time: 1722444272358] --- ===============================================…

Judu233 updated 2 months ago
1
pytorch/pytorch #131027

AoTInductor cumsum causes illegal memory access for large te…

### 🐛 Describe the bug Hello, I found that when compiling torch.cumsum with AoTInductor a CUDA illegal memory access error gets thrown when the input tensor is large. This only happens with an AoT c…

fjneumann updated 4 months ago
1
voutcn/megahit #312

Passing check 1 failed

Hi, I hope this is not redundant with another issue. It seems that there might be an issue with passing check point 1 and I am not sure about the reason. No idea if that could impact the final…

ntromas updated 2 months ago
1
jfalcou/eve #2026

[BUG] Incorrect Results Using __mmask16 Mask in if_else Func…

Here is a example: https://godbolt.org/z/nd9Kc3cqq

Panhaolin2001 updated 4 days ago
18
intel/llvm #6138

I try to use the DPC++ with the cuda support to compile the …

I try to build dnnl with gpu engine from source , [Requirements for Building from Source](https://github.com/oneapi-src/oneDNN#gpu-engine) It needs DPC++, TBB, cuDNN, cuBLAS. The others are already …

Karssadin updated 2 years ago
1
vllm-project/vllm #9738

[Bug]: offline inference with ray fails on multinode

### Your current environment The output of `python collect_env.py` ```text Collecting environment information... PyTorch version: 2.4.0+cu121 Is debug build: False CUDA used to build PyTor…

gpucce updated 1 month ago
8
vllm-project/vllm #8840

[Bug]: exceptiongroup.ExceptionGroup: unhandled errors in a …

### Your current environment PyTorch version: 2.4.0+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch: N/A OS: Ubuntu 22.04.4 LTS (x86_64) GCC version: (U…

ZHUHF123 updated 1 month ago
3
vllm-project/vllm #6861

[Bug]: Can't load BNB model

### Your current environment ```text The output of `python collect_env.py` Collecting environment information... PyTorch version: 2.3.1+cu121 Is debug build: False CUDA used to build PyTorch: 12…

eldarkurtic updated 2 weeks ago
3

上一页 1...92 93 94 95 96 97 98...100 下一页

1000+ results for avx512

1000+ results
for avx512