cuda-api-wrappers Search Results

1000+ results
for cuda-api-wrappers

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

NVIDIA/MatX #426

[QST] Consider refactoring above cuda-api-wrappers

I've noticed that MatX needs to include some CUDA-related abstractions and facilities (like error handling) which are actually not specific to matrix/tensor/numeric computations in any way. I woul…

eyalroz updated 1 year ago
2
jrhemstad/nvtx_wrappers #2

Consider overlap with cuda-api-wrappers's wrappers for NVTX …

Hello Jake, Long time no speak. Actually, do I even have your email address? I think I don't. would you mind emailing me something? Anyway... as I was looking for your address in your repos, I c…

eyalroz updated 3 years ago
2
isaac-sim/IsaacLab #1460

[Bug Report] RL env countered crash PhysX error while the cu…

This is the error log ``` 0%| …

cidxb updated 1 day ago
1
pytorch/pytorch #139628

Illegal Memory Access With `torch.compile`

### 🐛 Describe the bug We encountered an illegal memory access issue with `torch.compile` and customized torch library operator. Here's one minimal example to reproduce: ```python import torch…

alpha0422 updated 4 days ago
20
DiamondLightSource/httomo #492

Combining `|` and `xp.ndarray` in a type causes warning in s…

As part of the warnings seen in #468, with a bit of trial and error I have narrowed down the cause of the "unsupported operand `|`" errors being produced by sphinx, despite running with a python 3.12 …

yousefmoazzam updated 1 month ago
2
bsc-performance-tools/extrae #103

xml2 cflags aren't propagated correctly

With version v4.1.2, during `./configure`, Extrae build system finds correctly location of `xml2` installation, with headers under `${prefix}/include/libxml2` and it sets correctly ``` XML2_CFLAGS='…

giordano updated 4 months ago
8
martinmoene/string-view-lite #50

std::search called in search() when compiling with C++14

I'm trying to compile one of the examples of my [cuda-api-wrappers library](https://github.com/eyalroz/cuda-api-wrappers), which uses string-view-lite, using C++14 instead of C++11 like I was compilin…

eyalroz updated 11 months ago
9
pytorch/pytorch #141486

FlexAttention with compiled block mask is slow when varying …

### 🐛 Describe the bug I understand from https://github.com/pytorch/pytorch/issues/134756 that it's necessary to compile `create_block_mask` to avoid materialising the full NxM mask. However when doi…

samvanstroud updated 14 hours ago
4
pytorch/pytorch #134739

Compile fails on Flex attention + FSDP

### 🐛 Describe the bug Flex attention on FSDP works without compile, but not with compile. The key error seems to be `ValueError: Pointer argument (at 2) cannot be accessed from Triton (cpu tensor?)`…

platers updated 3 hours ago
7
conan-io/conan-center-index #22071

[package] cuda-api-wrappers/0.7: Beta version mistakenly add…

### Description I'm the author of cuda-api-wrappers. The library was added to conan-center this year (great!) - but without telling me about it (not great). I have now noticed this happened, but m…

eyalroz updated 9 months ago
12

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for cuda-api-wrappers

1000+ results
for cuda-api-wrappers