Use for Music? - Githubissues

yxlu-0102 / MP-SENet

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

MIT License

325 stars 45 forks source link

Thank you for your interest in our work. I share your thoughts on this matter.

In our experiments, we found that our method performs exceptionally well in restoring harmonic structures, which are very prominent in music. Therefore, I also believe that our method should be well-suited for music enhancement.

For a 48 kHz sampling rate, increasing the FFT window size and hop size is feasible, but our phase prediction method is quite sensitive to the hop size. My main concern is that a larger hop size might lead to a decline in performance. The most suitable hop size would need to be determined based on specific experimental results.

Since I am no longer working on speech enhancement, I would appreciate it if you could share any progress you make.

yxlu-0102 / MP-SENet

Use for Music? #42