nestauk / dap_aria_mapping

Mapping technology innovation to support The Advanced Research and Innovation Agency (ARIA)
MIT License
1 stars 0 forks source link

41 skewness #49

Closed emily-bicks closed 1 year ago

emily-bicks commented 1 year ago

Description

calculates the entropy of the frequency distribution of the entities/taxonomy category and calculates a chi square test statistic that compares the frequency to a uniform distribution. I think these metrics are more relevant than skewness because we aren't expecting the frequency distributions to be normal? In theory I think we want to compare to uniform distribution (i.e. all categories have the same number of entities)

Fixes #41

Checklist: