Use of the IDSEGAN for Music Source Separation

Hi, I´m interesting in the Music Source Separtion (MSS) field but all the SotA models like DEMUCS and ConvTastNet produce some noise in every output track.

Could be feasible to train the ISEGAN model to "denoise" the output tracks of a MSS network (bass, drums, others and vocals)?. Training four IDSEGAN networks with pairs MMSOutputBass-OriginalBass, MSSOutputDrums-Original Drums, etc.

Could be feasible to scale the 16 KHz SE task to the 44.1 Khz used in the MSS task or the need for more frecuency bins could made the network unreliable?.

pquochuy / idsegan

Use of the IDSEGAN for Music Source Separation #1