Closed th3geek closed 3 years ago
Hi @th3geek, From the comment it seems like there is a mismatch between the sizes of the estimate and target. How many sources do you want to separate to? The default is 2.
Hi,
Yes, I'm trying to separate 4 sources. I've looked through the config and don't see the option for specifying the number of sources?
You are right, I should have been clearer about that.
You should set swave.C=4
. See here: https://github.com/facebookresearch/svoice/blob/master/conf/config.yaml#L66
That worked! Thank you.
Hi, @adiyoss @th3geek as mentioned in the paper this model can separate an unknown number of sources in overlapped speech. Can you tell how we can do this example:- mix files have a different number of source's mixed audio than what will be the value of swave.C or there is a different approach to solve such problem
Hi,
I overcame my OOM problem (from #24) while trying to train the included debug set. This was accomplished by setting R=2 and segment=2. I'm now trying to train using the librimix dataset but have encountered the following error:
I've generated the relevant json files for the wavs, created the corresponding config file in the dset\ directory. The only variables I've changed was to set R=2 sample_rate=16000 dset=libri4mix segment=2. I'm considering renting a cloud instance with a GPU that has enough memory to train the model with the proper R and segment values but I'd like to know there isn't going to be any errors like this beforehand.