Before submission, could slightly improve generalization performance by mixing in a weighted average of the prior to regularise. Should be easy to do this to submission csv files, but should be able to specify in run settings so it plugs into the full pipeline; just runs at test time.
May have to store class priors from training set in settings.json. They are calculated in the notebook plotting the data from the start of the competition.
Before submission, could slightly improve generalization performance by mixing in a weighted average of the prior to regularise. Should be easy to do this to submission csv files, but should be able to specify in run settings so it plugs into the full pipeline; just runs at test time.
May have to store class priors from training set in
settings.json
. They are calculated in the notebook plotting the data from the start of the competition.