felixbur / nkululeko

Machine learning speaker characteristics
MIT License

Add statistic checks #85

Open felixbur opened 1 year ago

felixbur commented 1 year ago

The explore module computes statistical measures of correlation, but some of them rest on assumptions, e.g. the Pearson correlation coefficient (PCC) assumes that both variables are normally distributed. Add a check for each such assumption and emit a warning if the requirements are not met.
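A minimal sketch of what such a check could look like, assuming SciPy is available. The function name `pearson_with_normality_check` and the `alpha` threshold are hypothetical and not part of nkululeko. A Shapiro-Wilk test stands in here for the normality check; other tests would work as well.

```python
import warnings

import numpy as np
from scipy import stats


def pearson_with_normality_check(x, y, alpha=0.05):
    """Return Pearson's r and its p-value, warning if an input looks non-normal.

    Hypothetical helper for illustration only; not an existing nkululeko function.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    for name, values in (("x", x), ("y", y)):
        # Shapiro-Wilk tests the null hypothesis that the sample is normally distributed.
        _, p_normal = stats.shapiro(values)
        if p_normal < alpha:
            warnings.warn(
                f"Variable '{name}' does not look normally distributed "
                f"(Shapiro-Wilk p={p_normal:.3g}); interpret the PCC with care.",
                UserWarning,
            )
    r, p_value = stats.pearsonr(x, y)
    return float(r), float(p_value)
```

The coefficient is still computed when the check fails, so the caller decides whether to switch to a rank-based measure.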

bagustris commented 1 year ago

Besides a warning, adding the distribution/statistical checks to the reports/results would be useful too. It would also support bias checking (e.g. when the data is not normally distributed or is skewed for particular characteristics).
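As a rough illustration of the report idea, the sketch below collects skewness and a normality test result per variable into a plain dictionary that could be written into a report. The function name `distribution_report`, the dictionary keys, and the `alpha` threshold are assumptions made for this example, not existing nkululeko code.

```python
import numpy as np
from scipy import stats


def distribution_report(columns, alpha=0.05):
    """Summarize the distribution of each variable in a name -> values mapping.

    Hypothetical helper for illustration only.
    """
    report = {}
    for name, values in columns.items():
        values = np.asarray(values, dtype=float)
        # Shapiro-Wilk: a small p-value suggests the sample is not normally distributed.
        _, p_normal = stats.shapiro(values)
        report[name] = {
            "n": int(values.size),
            "skewness": float(stats.skew(values)),
            "shapiro_p": float(p_normal),
            "looks_normal": bool(p_normal >= alpha),
        }
    return report
```

Calling it on a mapping such as `{"age": ages, "duration": durations}` (hypothetical variable names) returns one entry per variable, which makes skewed characteristics visible without interrupting the run.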