Closed MohammedMehdiTBER closed 3 years ago
Have you increased the "Maximum song length" value?
Have you increased the "Maximum song length" value?
Yes, I did; I have increased it to 7200.
Have you increased the "Maximum song length" value?
Another Error
Starting processing of all songs
Processing C:\Users\Windo\Downloads\Racing Extinction (2015) [YTS.AG]\Racing.Extinction.2015.720p.BluRay.x264-[YTS.AG].mp4
Traceback (most recent call last):
File "D:\obj\windows-release\37amd64_Release\msi_python\zip_amd64\runpy.py", line 193, in _run_module_as_main
File "D:\obj\windows-release\37amd64_Release\msi_python\zip_amd64\runpy.py", line 85, in _run_code
File "C:\Users\Windo\AppData\Roaming\SpleeterGUI\python\Lib\site-packages\spleeter__main.py", line 256, in
Run complete
error clearly says MemoryError: Unable to allocate you have set the maximum song length beyond your computer's capability
error clearly says MemoryError: Unable to allocate you have set the maximum song length beyond your computer's capability
Well, you're right it needs to allocate about 8GiB only for separation that what it said to me when I used the --verbose
.
So there is no way out for me but to tear the audio into pieces.
it's a complicated piece of software. Deezer are still releasing Spleeter updates so i don't think they are finished with it yet
it's a complicated piece of software. Deezer are still releasing Spleeter updates so i don't think they are finished with it yet
I think the updates are not going to end as the sound models are quite infinite except if they released an option that analyzes the song itself and separate it according to the difference of human hearing and to inject human hearing in a python package, I think this is way hard for an AI to handle as long as there is no fundamentalist rule discovered by which to distinguish one sound from another except to use previously trained models which are not perfect in according to the attended human audio separation.
There are a limited number of different type Piano's, a few different types of Trumpet's, etc etc etc But there is infinite variations possible when mixed down with Analog FX/DSP applied. Teaching AI to separate Trumpet from Piano in a song isn't all that difficult a concept in the spectral domain but removing the FX/DSP from the mixed down audio would be darn near impossible. It could however recreate the source audio with new instruments after source separation...but without the effects and original instruments you will end up with something that sounds more like MIDI than the original song.
There are a limited number of different type Piano's, a few different types of Trumpet's, etc etc etc But there is infinite variations possible when mixed down with Analog FX/DSP applied. Teaching AI to separate Trumpet from Piano in a song isn't all that difficult a concept in the spectral domain but removing the FX/DSP from the mixed down audio would be darn near impossible. It could however recreate the source audio with new instruments after source separation...but without the effects and original instruments you will end up with something that sounds more like MIDI than the original song.
Well according to my experiences with Spleeter, I found difficult for the AI to detect a song especially if it had some reverb and mixing like you said but also It doesn't distinguish between a human vocal and an instrument solo. May be the project needs a way to distinguish between similar sounds in the future than to train more modules.
putting it simply...if 2 instruments occupy the same frequency with a similar wave form....which of the two instrument stems get that piece of audio?. Perhaps when computers get faster and can process the full audio spectrum we will have better success.
putting it simply...if 2 instruments occupy the same frequency with a similar wave form....which of the two instrument stems get that piece of audio?. Perhaps when computers get faster and can process the full audio spectrum we will have better success.
Maybe if they created an algorithm that separates different waveforms and tries to encode a separated known sound without clipping from those waves like redrawing the phoneme without effects.
I'm using the latest version and it doesn't want to process long files as it seems.