-
Repro script: https://gist.github.com/davidberard98/3c746cd0c8bd79d40353bb0b263f9518
Usage: from the AWS cluster, reserve two GPUs on a compute node. From there, run:
```
$ python profiler_error.py…
-
When running tests on `main`, there is a crash.
We have had other issues with short sequence lengths and large batch sizes on T5; we are not sure why.
```log
❯ pytest test/test_torchdynamo.py -k "dynamo_optimized_cuda_grap…
-
## 🚀 Feature
To build on our channels-last support, we want to extend it to arbitrary permutations.
The challenge would be maintaining behavior coherent with eager (TensorIt…
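As a rough illustration of the idea (this is not code from the proposal): channels-last is already one fixed permutation, expressed through strides, and an arbitrary permutation is the general case.

```python
import torch

x = torch.randn(2, 3, 4, 5)                   # logical NCHW, contiguous
cl = x.to(memory_format=torch.channels_last)  # NHWC physical layout
assert cl.stride() == (60, 1, 15, 3)

# An arbitrary permutation produces the same kind of stride-encoded
# layout; the feature request is for ops to handle these coherently.
p = x.permute(0, 2, 3, 1).contiguous().permute(0, 3, 1, 2)
assert p.stride() == cl.stride()
```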
-
### 🐛 Describe the bug
Error extracted from failures encountered in https://github.com/pytorch/pytorch/pull/71299
```
PYTORCH_NVFUSER_DISABLE_FALLBACK=1 python opinfo_failure_2.py
Traceback (m…
-
## 🚀 Feature
There is a performance opportunity in fusing the bias of the projection linear layer into the LogSoftmax, which can be expensive since the hidden size out of the projection is `30258`. We just need…
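A minimal sketch of the unfused pattern described above (the batch and input sizes are assumptions for illustration; only the `30258` output size comes from the report):

```python
import torch
import torch.nn.functional as F

batch, hidden, vocab = 2, 8, 30258  # small batch/hidden; vocab size from the report
x = torch.randn(batch, hidden)
w = torch.randn(vocab, hidden)
b = torch.randn(vocab)

# Unfused: the bias add materializes a full (batch, vocab) intermediate,
# which log_softmax then reads again over the large 30258-wide dimension.
out = F.log_softmax(x @ w.t() + b, dim=-1)
assert out.shape == (batch, vocab)
```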
-
### 🐛 Describe the bug
I'm hitting an issue when size-0, rank-1 tensors go through a reduction kernel.
```
import torch …
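# An assumed minimal continuation (the original repro above is truncated):
x = torch.zeros(0)      # size-0, rank-1 tensor
y = x.sum()             # reduction kernel over an empty tensor
assert y.item() == 0.0  # eager returns 0 for an empty sum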
-
### 🐛 Describe the bug
```python
import torch
torch._C._jit_set_nvfuser_enabled(True)
torch._C._jit_set_texpr_fuser_enabled(False)
torch._C._jit_set_profiling_executor(True)
torch._C._jit_se…
-
It looks like `normalize_ir()` behaves differently than I thought.
For example, I was hoping functionalization would replace in-place ops with their out-of-place counterparts, e.g. `relu_` with `relu`.
I run a simple test pro…
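For reference, the in-place/out-of-place pair mentioned above behaves like this in eager (illustrative only, not the original test program):

```python
import torch

x = torch.tensor([-1.0, 2.0])
y = torch.relu(x)  # out-of-place: x is untouched
x.relu_()          # in-place: x is mutated
# Functionalization is expected to rewrite the in-place form into the
# out-of-place one (plus a write-back), so both produce the same values.
assert torch.equal(x, y)
```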
-
## 🚀 Feature
We need to add parser support in `parser.cpp` for the `aten::_softmax` op. LTC traces the `_softmax` variant of the op, which is different from what TorchScript produces.
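For context, `torch.nn.functional.softmax` bottoms out in the internal `aten::_softmax(Tensor self, int dim, bool half_to_float)` op, which is the variant LTC records in its trace. A quick eager check (illustrative only):

```python
import torch
import torch.nn.functional as F

x = torch.randn(2, 3)
# F.softmax decomposes to the internal aten::_softmax op under the hood.
y = F.softmax(x, dim=-1)
assert torch.allclose(y.sum(dim=-1), torch.ones(2))
```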
-
## 🐛 Bug
When I create a `jit.script` function that includes `torch.nn.functional.dropout` without a constant `is_training` parameter, the fusion does not work. This did previously work.
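A hypothetical sketch of the pattern described (the function name and shapes are assumptions, not the original repro):

```python
import torch
import torch.nn.functional as F

@torch.jit.script
def f(x: torch.Tensor, training: bool):
    # `training` is a runtime argument rather than a compile-time
    # constant, which is the case where fusion reportedly breaks.
    return F.dropout(x, p=0.5, training=training)

y = f(torch.ones(4), False)
assert torch.equal(y, torch.ones(4))  # dropout is the identity when not training
```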
## To…