brezniczky / ep_elections_2019_hun

Examining the Hungarian results of the 2019 European Parliamentary elections data
GNU General Public License v2.0

Feature selection #8

Open brezniczky opened 5 years ago

brezniczky commented 5 years ago

Using a dissimilarity statistic, it may be easy to verify which features are relevant and worth keeping at all, i.e. which ones yield the most disjoint vote fingerprints. Raising the overhearing probability to some power greater than 1 could also result in an improvement.
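A rough sketch of how such a ranking could look, in case it helps the discussion. Everything here is an assumption for illustration: the fingerprints are taken to be plain histograms per group, the dissimilarity statistic is total variation distance (any other distance could be swapped in), and the names `rank_features`, `feature_a`, `group_1` and so on are made up and do not come from the repo.

```python
from itertools import combinations


def normalize(counts):
    """Turn raw histogram counts into a probability distribution."""
    total = sum(counts)
    return [c / total for c in counts]


def total_variation(p, q):
    """Total variation distance: 0 for identical, 1 for fully disjoint distributions."""
    return 0.5 * sum(abs(a - b) for a, b in zip(p, q))


def rank_features(fingerprints):
    """Rank features by the mean pairwise distance between group fingerprints.

    fingerprints: {feature_name: {group_name: histogram counts}} (hypothetical layout).
    Returns a list of (feature_name, score), most dissimilar first.
    """
    scores = {}
    for feature, groups in fingerprints.items():
        dists = [normalize(h) for h in groups.values()]
        pairs = list(combinations(dists, 2))
        scores[feature] = sum(total_variation(p, q) for p, q in pairs) / len(pairs)
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Made-up histograms, only to show the call shape.
example = {
    "feature_a": {"group_1": [8, 2, 0, 0], "group_2": [0, 0, 3, 7]},
    "feature_b": {"group_1": [5, 5, 5, 5], "group_2": [6, 4, 5, 5]},
}
ranking = rank_features(example)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Features whose score stays near 0 would be candidates for dropping, since their fingerprints barely differ between groups.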

brezniczky commented 5 years ago

Additionally, note that the entropy drops and the digit repetitions are not at all independent. Maybe these could be examined by simulating their joint probability distribution? I guess it wouldn't be easy computationally, although it would be next to nothing compared to today's workloads :) (and perhaps not easy analytically either).
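A minimal Monte Carlo sketch of the joint simulation idea, under assumptions that may not match the actual analysis: digits are drawn uniformly from 0 to 9 as the null model, "entropy" is the Shannon entropy of the digit frequencies, and a "repetition" is counted whenever two neighbouring digits in the sequence are equal. The function names and parameter values are hypothetical.

```python
import math
import random
from collections import Counter


def entropy(digits):
    """Shannon entropy (in nats) of the empirical digit distribution."""
    n = len(digits)
    return -sum((c / n) * math.log(c / n) for c in Counter(digits).values())


def repeats(digits):
    """Number of positions where a digit equals its predecessor (assumed definition)."""
    return sum(a == b for a, b in zip(digits, digits[1:]))


def simulate(n_digits, n_sims, seed=0):
    """Draw n_sims uniform digit sequences and return (entropy, repeats) pairs."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n_sims):
        d = [rng.randrange(10) for _ in range(n_digits)]
        samples.append((entropy(d), repeats(d)))
    return samples


def joint_tail(samples, max_entropy, min_repeats):
    """Estimate P(entropy <= max_entropy and repeats >= min_repeats)."""
    hits = sum(1 for e, r in samples if e <= max_entropy and r >= min_repeats)
    return hits / len(samples)


samples = simulate(n_digits=20, n_sims=2000)
# Illustrative thresholds, not observed values from the election data.
p_joint = joint_tail(samples, max_entropy=1.9, min_repeats=4)
p_entropy = joint_tail(samples, max_entropy=1.9, min_repeats=0)
p_repeats = joint_tail(samples, max_entropy=math.log(10), min_repeats=4)
print(p_joint, p_entropy * p_repeats)
```

Comparing the joint estimate with the product of the two marginal estimates gives a quick impression of how far the two statistics are from independent at the chosen thresholds.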