-
## [CUDA] Add channels_last_3d support for commonly used modules
The goal is to add `channels_last_3d`, aka NDHWC, support on CUDA devices, and improve performance on 3D model training and inferenc…
-
This task is generalization of #6302
## Goal
Enable MCU build of luci-interpreter along with the regular version in ONE.
## Rationale
To run NN on MCUs we need lightweights runtime, luci-i…
-
##### System information (version)
- OpenCV => 4.0 (but every version I tested so far really)
- Operating System / Platform => iOS
- Compiler => Xcode
##### Detailed description
Performance…
-
Tobias Weinzierl has given us access to ExaHype kernel benchmarks in [Peano](https://gitlab.lrz.de/hpcsoftware/Peano) (Benchmarks described around [here](https://gitlab.lrz.de/hpcsoftware/Peano/-/blob…
-
**ISIS version(s) affected**: 7.1.0
**Description**
We can convert the .qub file into a .cub file with the vims2isis.
However, the data at Saturn can be well processed by spiceinit, but most of…
-
Starter ticket for the SGA-2024.
Working outline:
* Part 1:
* Train & validate a neural net to recognize what large galaxies "look" like, using the SGA-2020 images, blob masks, etc.
* Bonu…
-
### Describe the feature request
QuantizeLinear/DequantizeLinear CUDA kernels do not support per-channel
### Describe scenario use case
In order to fit a larger model without loss of accuracy when …
-
My initial plan here is to follow the same approach as for the field solver, that is, redistribute the fields onto a different decomposition, such that global lines in x are available on a single proc…
-
### Describe the bug
https://github.com/intel/llvm/actions/runs/10952456651/job/30411378740
```
********************
Expectedly Failed Tests (1):
SYCL :: AddressSanitizer/nullpointer/private_…
-
### Required prerequisites
- [X] Search the [issue tracker](https://github.com/NVIDIA/cuda-quantum/issues) to check if your feature has already been mentioned or rejected in other issues.
### De…