Now that I have the training and separation finally working, I was wondering about the limits of this framework. For example, can this be modified somehow to separate speech(dialogue) from background music? or is it only built for singing vocals?
Also, training material is in stereo, but input can be stereo or mono, however why is the output mono
if input was stereo? Is there no way to force stereo output with this framework? or is that a project for the future?
Now that I have the training and separation finally working, I was wondering about the limits of this framework. For example, can this be modified somehow to separate speech(dialogue) from background music? or is it only built for singing vocals?
Also, training material is in stereo, but input can be stereo or mono, however why is the output mono if input was stereo? Is there no way to force stereo output with this framework? or is that a project for the future?
Thanks!