Open dgcnz opened 8 months ago
Using L1 regularization instead L2 to induce more sparseness/interpretability Measure causality of final linear layer Test for audio classification
l1 already used because we have l1_ratio of 0.99