Closed Ruishenl closed 5 months ago
The function needs to work with items of different lengths (Mi different for each i). I think you need to sum over the lengths, and then this should work.
The function needs to work with items of different lengths (Mi different for each i). I think you need to sum over the lengths, and then this should work.
Updated.
@bottler has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
@Ruishenl has updated the pull request. You must reimport the pull request before landing.
@bottler has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
@bottler merged this pull request in facebookresearch/pytorch3d@ccf22911d4daa74af7fbf70b3373bc0fe46d6d7c.
@Ruishenl Thank you!
For larger N and Mi value (e.g. N=154, Mi=238) I notice list_to_packed() has become a bottleneck for my application. By removing the for loop and running on GPU, i see a 10-20 x speedup.