iver56 / audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
https://iver56.github.io/audiomentations/
MIT License
1.83k stars 187 forks

`Reverse` causes negative stride error when converting to tensors #284

Open aaprasad opened 1 year ago

aaprasad commented 1 year ago

When using `audiomentations.Reverse()` and then converting the result to a tensor, I get `ValueError: At least one stride in the given numpy array is negative, and tensors with negative strides are not currently supported. (You can probably work around this by making a copy of your array with array.copy().)`. Converting the array with `np.ascontiguousarray` solves this, but it might be nice to handle `Reverse` differently so that the issue does not occur in the first place.

iver56 commented 1 year ago

Can you show me the code snippet that reproduces this issue?

If I remember correctly, `Reverse` uses the fast way of reversing the waveform, which does not move the array data around in memory but only changes the indexing. This often works well, but if you want to convert the result to a torch tensor afterwards, you do indeed have to call `.copy()` on it, as the error message says.
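To illustrate the point about indexing, here is a minimal NumPy-only sketch. It assumes the reversal behaves like a slice with step -1, which is an assumption for illustration and not confirmed library code. The reversed array is a view with a negative stride, which is the condition the error message describes, and `np.ascontiguousarray` (or `.copy()`) produces a forward-ordered copy.

```python
import numpy as np

samples = np.arange(5, dtype=np.float32)

# Reversing via slicing returns a view: same buffer, walked backwards.
reversed_view = samples[::-1]
print(reversed_view.strides)                     # (-4,)
print(np.shares_memory(samples, reversed_view))  # True

# np.ascontiguousarray (or .copy()) writes the data out in forward order,
# which is the form that torch.from_numpy accepts.
fixed = np.ascontiguousarray(reversed_view)
print(fixed.strides)                             # (4,)
print(np.shares_memory(samples, fixed))          # False
```

The copy costs one extra pass over the samples, so it is only needed at the point where a contiguous array is actually required.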

The alternative would be to always use the slow approach, which moves data around instead of just changing the indexing.

Slow implementation:

Fast implementation:
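As a rough illustration of the two approaches described above (not the library's actual code; `reverse_fast` and `reverse_slow` are hypothetical names), the difference could be sketched like this:

```python
import numpy as np


def reverse_fast(samples: np.ndarray) -> np.ndarray:
    """Hypothetical: return a reversed view along the last axis (no data is copied)."""
    return samples[..., ::-1]


def reverse_slow(samples: np.ndarray) -> np.ndarray:
    """Hypothetical: return a reversed, C-contiguous copy along the last axis."""
    return np.ascontiguousarray(samples[..., ::-1])


# Two-channel dummy audio, shape (channels, num_samples).
audio = np.arange(16, dtype=np.float32).reshape(2, 8)

fast = reverse_fast(audio)
slow = reverse_slow(audio)

print(np.array_equal(fast, slow))   # True: same values either way
print(fast.flags["C_CONTIGUOUS"])   # False: negative stride on the last axis
print(slow.flags["C_CONTIGUOUS"])   # True: safe to hand to torch.from_numpy
```

Both return the same values; the trade-off is that the view avoids a copy while the contiguous version is accepted by consumers that reject negative strides.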