nvfuser Search Results - Githubissues

1000+ results
for nvfuser

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

pytorch/pytorch #85081

primTorch/nvfuser: have a way to check that refs are added t…

### 🐛 Describe the bug In primTorch, with regular PyTorch refs, it's easy to check them based on the `_refs` prefix and look them up in `__all__`. For nvfuser-specific ones, it's not yet clear what t…

nkaretnikov updated 2 years ago
1
NVIDIA/Fuser #806

Silent incorrect result with contiguity set to True but pass…

To reproduce: ```py from nvfuser import FusionDefinition, DataType import torch with FusionDefinition() as fd: t0 = fd.define_tensor(shape=[-1, -1], contiguity=[True, True], dtype=DataType.…

IvanYashchuk updated 1 year ago
1
NVIDIA/Fuser #1829

Reusable zeroed memory

We currently need zeroed global memory buffers for cross-cta communication. Our current executor calls `at::zeros` to initialize this before each launch of our nvfuser kernel, adding a handful of micr…

jacobhinkle updated 6 months ago
1
NVIDIA/Fuser #2323

Opportunistically encourage uniform data path for some integ…

Integer arithmetic can follow the uniform data path if all the values match among thread within a warp, which improves performance by reducing interruptions to floating point ops and non-uniform instr…

jacobhinkle updated 4 months ago
3
RVC-Boss/GPT-SoVITS #542

还是go-webui.bat无法运行的问题

输出如下： E:\GPT-SoVITS-beta0217>runtime\python.exe webui.py Traceback (most recent call last): File "E:\GPT-SoVITS-beta0217\webui.py", line 4, in import json,yaml,warnings,torch File "E:\…

025nju updated 5 months ago
4
csarofeen/pytorch #2090

bookend `view` should be stripped from fusion

### 🚀 The feature, motivation and pitch Currently the handling of view in scheduler is sub-optimal. For views inside the fusion group that connects fusion, it makes sense, since this usually gives…

jjsjann123 updated 1 year ago
6
NVIDIA/Fuser #2444

validation err in `tests/python/pytest_ops.py::test_correctn…

This test failed several times in CI, seems due to tolerance. ``` 00:31:45 FAILED tests/python/pytest_ops.py::test_correctness_truediv_complex64 - AssertionError: Tensor-likes are not close! 00:3…

liqiangxl updated 3 months ago
4
NVIDIA/Fuser #1470

Fusion with ConsecutiveOuterWelford failed in nvrtc compile

With current [12/07/2023] main branch, the following fusion failed. ``` TEST_F( NVFuserTest, ConsecutiveOuterWelford) { std::unique_ptr fusion_ptr = std::make_unique(); auto fusion = fusi…

liqiangxl updated 7 months ago
5
NVIDIA/TransformerEngine #872

import transformer_engine initializes CUDA

``` >>> import torch >>> torch.cuda.is_initialized() False >>> import transformer_engine >>> torch.cuda.is_initialized() True ``` Import alone shouldn't initialize CUDA. Custom subprocess la…

szmigacz updated 4 months ago
1
csarofeen/pytorch #2403

Compile error in `where(x, a, b)` with single precision `a` …

### 🐛 Describe the bug Current tests use double-precision constants passed to `where()`, which works. There is currently no `using Float = Scalar` scalar defined but we'd like to extend `where` to su…

jacobhinkle updated 1 year ago
1

上一页 1...12 13 14 15 16 17 18...100 下一页

1000+ results for nvfuser

1000+ results
for nvfuser