avx2-extensions Search Results

1000+ results for avx2-extensions


pytorch/pytorch #104867

rfftn and irfftn operations in pt2 return different results …

### 🐛 Describe the bug When running inference on the [Lama inpainting model](https://github.com/advimman/lama) under PT2 using the supplied [predict.py](https://github.com/advimman/lama/blob/main/b…

davidas1 updated 1 year ago
8
microsoft/DeepSpeed #2977

[Bug] Training use CPU offload raise OOM error in wsl2 syste…

CPU/GPU memory is enough for this job. I run the model gpt-neo-125M on an NVIDIA 3090 (24 GB) with 64 GB of CPU memory. System: `win10 with a nvidia 3090-24g and cpu memory is 64g.` Docker image: ``` REP…

lpty updated 1 year ago
9
pytorch/pytorch #99652

DistributedDataParallel doesn't work with complex buffers

### 🐛 Describe the bug DistributedDataParallel doesn't work with complex buffers, even when `broadcast_buffers=False`. ```py import os import torch from torch import nn torch.distributed.i…

samuelstevens updated 1 year ago
1
pytorch/pytorch #137779

Flex attention with mask depending on queries and keys lengt…

### 🐛 Describe the bug I tried to implement the `causal_lower_right` masking in flex attention. This requires the masking function to know the difference in lengths of keys and queries: ```python …

janchorowski updated 6 days ago
2
open-mmlab/mmcv #3071

AssertionError: MMCV==1.4.0 is used but incompatible. Please…

### Prerequisite - [X] I have searched [Issues](https://github.com/open-mmlab/mmcv/issues) and [Discussions](https://github.com/open-mmlab/mmcv/discussions) but cannot get the expected help. - [X]…

DDEONSIK updated 6 months ago
1
WebAssembly/flexible-vectors #7

Hardware support and priorities

Thanks for reaching out! I like the direction, IMO it's important to have unknown-length types for two reasons: 1) to allow using all of SSE4/AVX2/AVX-512 without source code changes; 2) to enable u…

penzn updated 3 years ago
34
facebookarchive/caffe2 #1832

Build issues on OSX

If this is a build issue, please fill out the template below. ### System information * Operating system: OSX version 10.13.2 (Macbook pro 11,3) CUDA/nvcc version 9.1 CuDNN version 7.0.5 whe…

daniildavydzik updated 5 years ago
17
pytorch/pytorch #122192

extra checks for reduce-overhead inconsistent when warmup/re…

### 🐛 Describe the bug Running ``` import torch cfn = torch.compile(torch.sin, mode='reduce-overhead') cfn2 = torch.compile(torch.cos, mode='reduce-overhead') for _ in range(2): x = cfn2(to…

isuruf updated 6 months ago
3
pytorch/pytorch #138045

CI throws unexpected segmentation fault in Intel GPU unit te…

> NOTE: Remember to label this issue with "`ci: sev`" ## Current Status ongoing ## Error looks like [CI result page](https://hud.pytorch.org/pytorch/pytorch/pull/136069?sha=546634dec51…

hoshibara updated 3 days ago
2
pytorch/pytorch #123447

dist.barrier() hangs after calling async_save

### 🐛 Describe the bug Reproduced below, `dist.barrier()` fails after calls to `torch.distributed.checkpoint.async_save`. Interestingly enough, this does not happen if we first call `all_reduce…

LucasLLC updated 3 months ago
6
