boy1dr / SpleeterGui

Windows desktop front end for Spleeter - AI source separation
Apache License 2.0
2.29k stars 250 forks

Extraction of Vocals not 100% accurate #56

Closed MagicAndy85 closed 2 years ago

MagicAndy85 commented 4 years ago

Dear Developers,

I've tested Spleeter with the audio from this YouTube link: https://www.YouTube.com/watch?v=8nrdHcLM_ZA

It seems to me that the vocals aren't extracted 100% accurately.

Maybe you could improve your audio extraction with this sample. ;)

Yours, MagicAndy85

Gusher commented 4 years ago

Did you try other separation apps like RX8 or Phonicmind to compare results?

boy1dr commented 4 years ago

That track, 2U, is what we call "overcooked". The vocals are almost robotic thanks to autotune, and the compression and DSP at work there make the audio so mashed up that I would be surprised if any source separation tool worked well with it. Given a decent amount of post-processing work (like RX8 would do) you could maybe clean it up a bit, but not much. Spleeter would need to be trained and evaluated on such tracks to separate them effectively. The MusDB training set is far from that source audio, though. In any case, how would you teach an AI what the difference between a synthesizer and the vocals in that track is if they are almost indistinguishable?

boy1dr commented 4 years ago

For best results you should be working with lossless audio, such as uncompressed PCM/WAV or FLAC.

Poiuy69 commented 4 years ago

@boy1dr I get much better quality using demucs rather than spleeter. It's slower, but worth it. Maybe we need a demucs gui, hint hint lol😃 https://github.com/facebookresearch/demucs

boy1dr commented 4 years ago

@Poiuy69 I have checked demucs out and I agree there are fewer artifacts in the resulting stems. I did evaluate it for a GUI but it doesn't appear to be an "app" that can be downloaded and run like Spleeter can, so for the time being I'll just keep an eye on it and see where it goes ;)

iFUCKINGHATEcomputers commented 4 years ago

i did evaluate it for a GUI but it doesn't appear to be an "app" that can be downloaded and run like Spleeter

Interesting, does it depend on an external service or something? I wouldn't be surprised, since Facebook made it.

Thankfully someone has made an actually functional Colab for Demucs, because otherwise it's a needlessly gargantuan pain in the ass to use, like all of these things.

Gusher commented 4 years ago

I hate to say it, but Demucs doesn't give significantly better results compared to Spleeter or RX8 or even online methods such as Phonicmind. To get more accurate, clean results you need the raw source audio without any filters applied; that's the key to very good output. A lot of the music online and on YouTube is heavily filtered, making it useless for getting excellent results. However, I might rate RX8 a few points higher for adding extra cleanup tools, such as de-bleed.

Zcooger commented 4 years ago

Have you tried MultiGenreModelHP.pth from UVR GUI? https://github.com/Anjok07/ultimatevocalremovergui/releases DISCLAIMER: It's 2-stem software (so far).

Gusher commented 4 years ago

In its current state, no way will I touch "Ultimate Vocal Remover". It has a long way to go to compete against SpleeterGUI.

boy1dr commented 4 years ago

@iFUCKINGHATEcomputers I'm not sure how I'd use a Colab project in a GUI. I just had a quick look at it again, but the reason I wouldn't do a Windows GUI for demucs is that it uses Anaconda Python, which is a separate install process. I do like making things, but when a project gets as popular as SpleeterGUI you really don't want to have to provide a lot of support for failed installations. If pip can be modified to install demucs and it works with the embeddable version of Python, it might be worth a shot :)

Poiuy69 commented 3 years ago

@boy1dr I'm not sure what you mean by "app". Both Spleeter (from Deezer) and demucs are just Python libraries. Maybe I'm missing something.

boy1dr commented 3 years ago

@Poiuy69 Apologies, I hadn't looked back at demucs when I wrote that comment. It is just a Python program, but it uses Anaconda to install. If there is a pip method I might look into it, but with Anaconda I'm not going to support a third-party installation; given the volume of people downloading SpleeterGUI, I just wouldn't be able to manage it.

AnshulJ999 commented 3 years ago

I just tried Demucs and came across this thread. I've been using SpleeterGUI, and it's a fantastic application. But after doing some very initial tests on Demucs, the results are definitely better. Especially for drums and bass. Much clearer. It seems like the 'demucs extra' library works well for less popular genres like progressive metal which usually have a lot of layers in different frequencies. But, demucs lacks clarity in the 'other' track, where it seems to be muddy and does not seem to have all the higher frequencies.

A request to the developer of SpleeterGUI @boy1dr: is it possible to include demucs in the application? Regarding your comments, there is a pip method that I used to install demucs (I have absolutely no experience with coding whatsoever), and it was pretty easy to run. Using this with the GUI that you've built would make it accessible to everyone very easily and quickly too.

I think it may be worth combining stems from demucs and spleeter together to see what combinations work well.
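For anyone who wants to try that, here is a rough sketch of the idea in plain Python, not something taken from either tool. It assumes each tool's stems have already been decoded into equal-length lists of float samples in the range -1.0 to 1.0. The names `pick_stems` and `mix` and the toy sample values are made up for illustration.

```python
from typing import Dict, List, Sequence

# Hypothetical representation: stem name -> mono samples in [-1.0, 1.0].
Stems = Dict[str, Sequence[float]]


def pick_stems(spleeter: Stems, demucs: Stems, choice: Dict[str, str]) -> Stems:
    """For each stem name, take the version from the tool named in `choice`."""
    sources = {"spleeter": spleeter, "demucs": demucs}
    return {name: sources[tool][name] for name, tool in choice.items()}


def mix(stems: Stems) -> List[float]:
    """Sum the stems sample by sample and clip the result to [-1.0, 1.0]."""
    tracks = list(stems.values())
    if not tracks:
        return []
    length = len(tracks[0])
    if any(len(track) != length for track in tracks):
        raise ValueError("all stems must have the same number of samples")
    return [max(-1.0, min(1.0, sum(samples))) for samples in zip(*tracks)]


if __name__ == "__main__":
    # Toy data standing in for decoded audio; real stems would be much longer.
    spleeter_out = {"vocals": [0.25, 0.5], "drums": [0.0, 0.25]}
    demucs_out = {"vocals": [0.0, 0.25], "drums": [0.5, 0.75]}
    # Example choice only: vocals from one tool, drums from the other.
    picked = pick_stems(spleeter_out, demucs_out,
                        {"vocals": "spleeter", "drums": "demucs"})
    print(mix(picked))
```

Swapping the entries in the choice mapping and listening to each remix is one way to compare combinations. Reading and writing the actual audio files is left out here.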