The paper is available at https://arxiv.org/abs/2409.06245
Music source separation (MSS) aims to separate mixed music into its distinct tracks, such as vocals, bass, drums, and more. MSS is considered to be a challenging audio separation task due to the complexity of music signals. Although the RNN and Transformer architecture are not perfect, they are commonly used to model the music sequence for MSS. Recently, Mamba-2 has already demonstrated high efficiency in various sequential modeling tasks, but its superiority has not been investigated in MSS. This paper applies Mamba-2 with a twostage strategy, which introduces residual mapping based on the mask method, effectively compensating for the details absent in the mask and further improving separation performance. Experiments confirm the superiority of bidirectional Mamba2 and the effectiveness of the two-stage network in MSS.
python TS-BSmamba2.py
python L-TS-BSmamba2.py
You can download them using the following link. https://mega.nz/folder/qYBzXL7S#c0qBxQss-PxPue27YqCdfA Note: {track} = 'vocals, bass, drums, other'
TS-BSMAMBA2 is licensed under the Apache License 2.0.
If you use this work in your research or project, please cite the following paper:
@misc{bai2024twostagebandsplitmamba2network,
title={A Two-Stage Band-Split Mamba-2 Network for Music Separation},
author={Jinglin Bai and Yuan Fang and Jiajie Wang and Xueliang Zhang},
year={2024},
eprint={2409.06245},
archivePrefix={arXiv},
primaryClass={cs.SD},
url={https://arxiv.org/abs/2409.06245}
}
For any questions or feedback regarding TS-BSMAMBA2, feel free to reach out to us via email:
bjlin@mail.imu.edu.cn