avx2-extensions Search Results

1000+ results
for avx2-extensions

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

WojciechMula/base64simd #5

vpermb belongs to AVX512BW?

Hi I'm running avx512bw test on my SKL which has avx512bw supported, while I got illegal instruction traps, and after some investigation, it seems vpermb/vpermi2b belongs to avx512vbmi instead, t…

fengyuleidian0615 updated 3 years ago
7
pytorch/pytorch #123082

Torch.cdist calculation problem

### 🐛 Describe the bug ```python d = torch.rand((2, 30, 2)) dis_matrix = torch.cdist(d, d) diag = torch.diag(dis_matrix[0]) print(diag) print(all(diag == 0)) ``` The diagonal of the distan…

wp133716 updated 6 months ago
1
pytorch/pytorch #105068

[linalg] test_ops.py::test_python_ref_meta__refs_linalg_svd_…

## Issue description test_ops.py failing: ``` ====================================================================== ERROR: test_python_ref_meta__refs_linalg_svd_cpu_complex128 (__main__.TestCom…

Aidyn-A updated 9 months ago
11
kiteco/issue-tracker #118

Support CPUs that don't have the AVX instruction set

This issue's URL is referenced in the message users see when they try to install Kite on a machine that doesn't support the AVX instruction set. This can be a place for discussion and so folks can …

adamsmith updated 1 year ago
131
WebAssembly/relaxed-simd #9

8bit*8bit 4-D dot-product accumulating to 32bit, similar to …

This issue is a placeholder for future discussion about supporting 4-dimensional-reducing dot-product instructions taking 8bit inputs and accumulating into 32bit, i.e. ``` int32_accumulator += int…

bjacob updated 2 years ago
14
pytorch/pytorch #110387

Change unsqueeze(0) to preserve memory layout contiguity of …

### 🐛 Describe the bug Forwarding a tensor `img` through a simple PyTorch Conv2d model produces a different result than forwarding `img + torch.zeros_like(img)`. Here is a minimal example: https…

dozed updated 9 months ago
21
pytorch/pytorch #112658

FSDP requires global device context

### 🐛 Describe the bug The only way to call an FSDP model (e.g. `fsdp_model(inputs)`) seems to be if `torch.cuda.current_device()` returns the rank/id of the current process/device, regardless of w…

nairbv updated 9 months ago
6
numpy/numpy #27274

BUG: Numpy full returns wrong value when casting from intege…

### Describe the issue: For some cases, when we use `np.full` with a fill_value that is an integer and with `dtype=np.float32`, we get a result off by 1. ### Reproduce the code example: ```python…

BrunoBelucci updated 1 month ago
2
prusa3d/PrusaSlicer #13394

Opening the given STL file hangs and then crashes PrusaSlice…

### Description of the bug PrusaSlicer crashes when opening the STL file from an archive. First, the slicer hangs witgout consuming much RAM, but it consumes around 1 core of the CPU before crashi…

Monniasza updated 4 weeks ago
4
pytorch/pytorch #106485

Increasing batch size makes network forward 1000 times slowe…

### 🐛 Describe the bug I have a two layer network. The input is a 2D array of token ids, first layer is an embedding layer that replaces each pixel the respective embedding, the second layer does a c…

grigornalbandyan updated 5 months ago
7

上一页 1...81 82 83 84 85 86 87...100 下一页

1000+ results for avx2-extensions

1000+ results
for avx2-extensions