We now let the samplers rely on our C++ "sort and dedup" logic, instead of the less efficient Python ones. This has a few benefits:
we can avoid extra copies
samplers can now return a 5D FrameBatch instead of a list of 4D FrameBatch. The 5D FrameBatch output is a "batch" of clips. Its data is of shape (num_clips, num_frames_per_clips, C, H, W) (or HWC).
Fixes https://github.com/pytorch/torchcodec/issues/256
We now let the samplers rely on our C++ "sort and dedup" logic, instead of the less efficient Python ones. This has a few benefits:
(num_clips, num_frames_per_clips, C, H, W)
(or HWC).Running our samplers benchmark:
:rocket: