aim-qmul / sdx23-aimless

Source Separation training codebase for the Sound Demixing Challenge 2023.
35 stars 0 forks source link

permutation invariant training #6

Open yoyololicon opened 1 year ago

yoyololicon commented 1 year ago

For bleeding/label noise tracks

ben-hayes commented 1 year ago
ben-hayes commented 1 year ago

to keep memory requirement in O(n^2) rather than O(n!), better to adapt FLoss and TLoss to accommodate dim parameters to allow permutations to be evaluated in parallel

ben-hayes commented 1 year ago

the way some loss fns are currently implemented (e.g. the sdr-combination loss in sdr_loss) are bound to specific shapes and numbers of targets, so either I refactor all these to accommodate the general PIT wrapper, or I scrap the wrapper approach and just hard-code PIT variants of loss functions we're interested in

yoyololicon commented 1 year ago

I prefer the latter approach since the loss fns are meant to be replaceable (using configs), and we can make as many as we want. :)

ben-hayes commented 1 year ago

yeah fair point! will switch it over.