Closed yuekaizhang closed 7 months ago
I see what you're trying to get at, but I think there may be a more elegant solution which does not need running sox commands during data preparation. You can create a
Recording
object which has multipleAudioSource
s, possibly 1 per microphone channel. Then, if you callload_audio
with the specified channels, it would load only those channels. This is better in 2 ways:
- data is not duplicated
- you can load any combination of channels from the recording
As an example, I would suggest looking at the
mdm
preparation in ICSI: https://github.com/lhotse-speech/lhotse/blob/master/lhotse/recipes/icsi.py
Done, thanks.
I see what you're trying to get at, but I think there may be a more elegant solution which does not need running sox commands during data preparation. You can create a
Recording
object which has multipleAudioSource
s, possibly 1 per microphone channel. Then, if you callload_audio
with the specified channels, it would load only those channels. This is better in 2 ways:As an example, I would suggest looking at the
mdm
preparation in ICSI: https://github.com/lhotse-speech/lhotse/blob/master/lhotse/recipes/icsi.py