-
This has to be tested. It seems that linear interpolations do not depend on the order in which we treat the axes of the prompt tensor. However, it is unclear to me whether that is still the case for c…
ljleb updated
11 months ago
-
The input code is as follows and the insertGPUAllocs cannot deal with the following case properly. `"gpu.dealloc"(%51) : (memref) -> ()` should be inserted inside the kernel code but be inserted ousid…
-
Hello, I use your cutt to do transpose, but I have encountered a problem---'Illegal instruction (core dumped)'. My code is
`int main() {
// Four dimensional tensor
// Transpose (31, 549,…
-
ValueError: setting an array element with a sequence. The requested array has an inhomogeneous shape after 1 dimensions. The detected shape was (32,) + inhomogeneous part.
`from typing import Any, …
-
### 🚀 The feature
Currently, for using `channels_last`:
- During data loading, `torchvision.transforms.ToTensor()` converts the input images into CHW.
- Then during training, the images are conve…
-
I am trying to install nvida apex on a docker image `nvidia/cuda:12.1.0-devel-ubuntu22.04` running on windows 11. wsl2 is installed and I am able to do `nvidia-smi` and `nvcc --version` in both docker…
AvisP updated
2 weeks ago
-
# Background
nvfuser codegen should make smart decision on memory format for outputs, rather than naively assume canonical contiguous tensor with descending strides.
An example as demonstrated i…
-
MWE:
```python
coord_element = VectorElement("Lagrange", interval, 1)
mesh = Mesh(coord_element)
element = FiniteElement("Lagrange", interval, 1)
V = FunctionSpace(mesh, element)
u = TrialFu…
-
According to the original paper by [Kim et al.](http://proceedings.mlr.press/v80/kim18b/kim18b.pdf), the permutation function permutes across the batch for each dimension. In the case here, if `B, D =…
-
I think there is an issue in the way the input tensor `x` is reshaped in order to extract `a_in` and `p_in`.
It seems to me that the dimensions of `a_in` and `p_in` require a permutation before apply…