lucidrains / audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
MIT License
2.36k stars 255 forks source link

mono vs. stereo #149

Closed syjunghwang closed 1 year ago

syjunghwang commented 1 year ago

When i learn the audiolm model, there is a code that converts the music input data into a mono type(channel size=1) by averaging when the channel size is 2. Is it impossible to learn by putting stereo sound like music into the model as it is?