Closed daryadedik closed 6 years ago
@daryadedik I think I need a meeting in person to understand this PR ;)
OK, as discussed, I restored the previous chi-square implementation (nothing changed there) and made some small code cleanups. I also removed category dropping (we no longer drop any categories). min_counts is now an input parameter (default 5); we will decide later in ADS which threshold to use.
Sorry for mixing things up in this PR. I also added the number of filtered entities per variant to exp.metadata. We will need that information instead of computing the filtered number on the ADS side.
As discussed, I removed category dropping (we no longer drop any categories). min_counts is now an input parameter (default 5); we will decide later in ADS which threshold to use. I also added the number of filtered entities per variant to exp.metadata.
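
For anyone reading along, here is a rough sketch of what "keep every category, take min_counts as a parameter" could look like. It is not the code in this PR: the function name `chi_square_statistic`, its return shape, and the idea of only flagging categories whose observed count is below min_counts are all assumptions made for illustration. It also stops at the statistic and degrees of freedom and does not compute a p-value.

```python
from collections import Counter


def chi_square_statistic(x, y, min_counts=5):
    """Chi-square homogeneity statistic for two categorical samples.

    Hypothetical helper, not the PR's implementation. No category is
    dropped; categories whose observed count is below min_counts in
    either sample are only reported back to the caller.
    """
    if not x or not y:
        raise ValueError("both samples must be non-empty")

    counts_x, counts_y = Counter(x), Counter(y)
    # Union of categories, so one missing from a sample counts as 0 there.
    # Sorting assumes the labels are mutually comparable (e.g. all strings).
    categories = sorted(set(counts_x) | set(counts_y))

    n_x, n_y = len(x), len(y)
    total = n_x + n_y

    statistic = 0.0
    low_count = []
    for cat in categories:
        obs_x, obs_y = counts_x[cat], counts_y[cat]
        row_total = obs_x + obs_y
        # Expected counts under the hypothesis of identical distributions.
        exp_x = row_total * n_x / total
        exp_y = row_total * n_y / total
        statistic += (obs_x - exp_x) ** 2 / exp_x
        statistic += (obs_y - exp_y) ** 2 / exp_y
        # Assumption: min_counts flags sparse categories, it does not remove them.
        if min(obs_x, obs_y) < min_counts:
            low_count.append(cat)

    dof = len(categories) - 1
    return statistic, dof, low_count
```

With something like this, the caller (ADS in our case) can see which categories are sparse and apply whatever threshold it decides on, without the test itself silently changing the data.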