nvfuser Search Results - Githubissues

1000+ results
for nvfuser


NVIDIA/Fuser #1859

Faulty `rf` flag added for non-sliced IterDomains.

``` $ NVFUSER_DUMP=fusion_ir bin/nvfuser_tests --gtest_filter=*Noncancellable_CatOnlySubsetOfSplitOutputs ``` ``` [ RUN ] MoveSplitCatTest.Noncancellable_CatOnlySubsetOfSplitOutputs %ker…

wujingyue updated 7 months ago
2
pytorch/functorch #941

CUDA assumption in the ts_compile code

Hey folks, stumbled into a CUDA assumption (on my non-CUDA machine) Here's the fix for me, but it's obviously not very general ``` diff --git a/functorch/_src/compilers.py b/functorch/_src/comp…

bwasti updated 2 years ago
2
KoboldAI/KoboldAI-Client #391

I ran install_req[..].bat, yet it still gives me winerror 12…

```"B:\python\lib\site-packages\torch\lib\nvfuser_codegen.dll" or one of its dependencies.``` I ran the mentioned bat file, as I read recent issues, yet this did not help me fix the problem. The fi…

rust-floppy updated 1 year ago
2
csarofeen/pytorch #2360

complex float tanh is inaccurate

``` a = torch.tensor((0.0011-1.5705j,), device='cuda', dtype=torch.complex64) fs = Fusion() with FusionDefinition(fs) as fd: nv_a = fd.define_tensor(sizes=a.shape, strides=a.stride(), dtype=…

mruberry updated 1 year ago
6
csarofeen/pytorch #2096

HuggingFace BertForMaskedLM - Log Softmax Fusion with Autoca…

### 🐛 Describe the bug Benchmark commandline: ``` PYTORCH_NVFUSER_DUMP=python_definition,fusion_args python -u benchmarks/huggingface.py --training -d cuda --fast --backend nvprims_nvfuser --skip-a…

kevinstephano updated 1 year ago
6
NVIDIA/Fuser #22

[FeatureRequest] codegen reshape/view on python API

# Background reshape/view in nvfuser doesn't imply memory alias, so we'll be referring to this as reshape in this issue to keep the conversation simple and accurate. nvfuser reshape is implement…

jjsjann123 updated 1 year ago
1
pytorch/pytorch #84510

[NVFuser] RuntimeError: ref_id_it != replayed_concrete_ids_.…

### 🐛 Describe the bug ```python # debug_aev_nvfuser_minimal.py import torch torch._C._jit_set_nvfuser_single_node_mode(True) torch._C._debug_set_autodiff_subgraph_inlining(False) torch.ma…

yueyericardo updated 1 year ago
6
NVIDIA/Fuser #2930

`FusionReductionWithTrivialReduction_CUDA` fails with comput…

repro (pjnl-20240910): ``` NVFUSER_ENABLE=kernel_debug PYTORCH_NO_CUDA_MEMORY_CACHING=1 compute-sanitizer bin/nvfuser_tests --gtest_filter='*FusionReductionWithTrivialReduction_CUDA*' ``` sample stac…

xwang233 updated 1 week ago
10
pytorch/pytorch #80606

[jit.script] jit.script give uncertain results using torch.h…

### 🐛 Describe the bug `torch.jit.script` give uncertain results using `torch.half`. The result of the first execution of the function is different from that of the second execution, but the result…

Achazwl updated 2 years ago
2
NVIDIA/Fuser #1871

printing real error messages with `parallel_compile`

Currently when parallel_compile is enabled (Note this is also our current default behavior), `FusionKernelRuntime` hides all compilation error messages. See here: https://github.com/NVIDIA/Fuser/blob…

jjsjann123 updated 7 months ago
3
