Gradient through multi-dimensional arrays

ClemensSchwarke commented 11 months ago

Hi, I had some trouble figuring out that the gradient of, in my case, a 2d array is only backpropagated correctly if its elements are accessed with [a,b] indexing. Using two slicing operators [a][b] leads to adjoints being 0.

Minimal example:

import warp as wp

@wp.kernel
def test(
    a: wp.array2d(dtype=wp.vec3),
    b: wp.array2d(dtype=wp.vec3),
    c: wp.array2d(dtype=wp.vec3),
):
    tid = wp.tid()
    c[tid][0] = a[tid][0] + b[tid][0]

wp.init()
tape = wp.Tape()

a = wp.full((1,1), value=1.0, dtype=wp.vec3, requires_grad=True)
b = wp.full((1,1), value=1.0, dtype=wp.vec3, requires_grad=True)
c = wp.zeros((1,1), dtype=wp.vec3, requires_grad=True)

with tape:
    wp.launch(
        kernel=test,
        dim = 1,
        inputs = [a, b],
        outputs = [c],
    )

c.grad = wp.full((1,1), value=1.0, dtype=wp.vec3)
tape.backward()
print(c.grad)
print(a.grad)
print(b.grad)

Output: [[[1. 1. 1.]]] [[[0. 0. 0.]]] [[[0. 0. 0.]]]

Is this intended? Thanks in advance! Clemens

mmacklin commented 10 months ago

Hi Clemens, this looks like a bug - we will look at supporting gradient propagation through slicing in the next release.

Thanks! Miles

c0d1f1ed commented 8 months ago

Sorry for the delay in getting to this. I'll have a look soon.

c0d1f1ed commented 7 months ago

A fix for this will be available in the next release.

ClemensSchwarke commented 7 months ago

Thanks a lot (:

c0d1f1ed commented 5 months ago

This was addressed as part of Warp 1.1.0.

NVIDIA / warp

Gradient through multi-dimensional arrays #158