facebookresearch / AudioMAE

This repo hosts the code and models of "Masked Autoencoders that Listen".
Other
548 stars 45 forks source link

Question Regarding _roll_mag_aug Function Implementation in AudioMAE #26

Open unoct opened 11 months ago

unoct commented 11 months ago

Hello,

I am working with the AudioMAE codebase and have a question about the implementation of the _roll_mag_aug function.

I've noticed that the index for rolling the waveform is determined by np.random.randint(len(waveform)).

However, for a waveform with the shape (1, 16000), this approach seems to always return 0, leading to no actual shift in the time axis of the waveform.

Could you please clarify if this is the intended behavior? If the goal is to roll along the time axis, would it be more appropriate to use np.random.randint(waveform.shape[1]) to ensure variability in the rolling process?

Any insights would be greatly appreciated, as I aim to understand and utilize this function correctly in my project.

Thank you for your time and the amazing work on AudioMAE !