About microph num question

Thanks for question. I have similar observations. I suppose this is reasonable. When you learning the density prior on only mixtures of two sources, density of the mixture itself does not have the diversity to learn good enough speech models, i.e., the problem is not challenging enough. Ideally, we want the number of mics are large enough so that the initial mixtures sounds like bubble noises. This will force the model to learn some real stuff.

Let's consider the extreme case, the number of mic is 1. Then, it is clear that no meaningful information can be learned.

lixilinx / IVA4Cocktail

About microph num question #2