I am working with the AudioMAE codebase and have a question about the implementation of the _roll_mag_aug function.
I've noticed that the index for rolling the waveform is determined by np.random.randint(len(waveform)).
However, for a waveform with the shape (1, 16000), this approach seems to always return 0, leading to no actual shift in the time axis of the waveform.
Could you please clarify if this is the intended behavior? If the goal is to roll along the time axis, would it be more appropriate to use np.random.randint(waveform.shape[1]) to ensure variability in the rolling process?
Any insights would be greatly appreciated, as I aim to understand and utilize this function correctly in my project.
Thank you for your time and the amazing work on AudioMAE !
Hello,
I am working with the AudioMAE codebase and have a question about the implementation of the
_roll_mag_aug
function.I've noticed that the index for rolling the waveform is determined by
np.random.randint(len(waveform))
.However, for a waveform with the shape (1, 16000), this approach seems to always return 0, leading to no actual shift in the time axis of the waveform.
Could you please clarify if this is the intended behavior? If the goal is to roll along the time axis, would it be more appropriate to use
np.random.randint(waveform.shape[1])
to ensure variability in the rolling process?Any insights would be greatly appreciated, as I aim to understand and utilize this function correctly in my project.
Thank you for your time and the amazing work on AudioMAE !