KAIST-MACLab / PyTSMod

An open-source Python library for audio time-scale modification.
GNU General Public License v3.0
192 stars 27 forks source link

[Feature Request] Transposed output when using PyTSMod with SoundFile #12

Open SoonbeomChoi opened 3 years ago

SoonbeomChoi commented 3 years ago

When using read() from soundfile library channel places on 2nd axis. But the shape of the audio is transposed after using the implemented TSM algorithms. This might be confusing for some people. It would be grateful if you take consideration of this.

seyong92 commented 3 years ago

Nice point. The output shape of all TSM algorithms for multi-channel audio is fixed to (channel, num_samples), and also we recommend to use the audio with shape (channel, num_samples).

However, for those who want to use the shape of (num_samples, channel), I will update the function to return the output with the shape of the input.