WarrierRajeev / UFC-Predictions

A web app to predict UFC fights
https://ufc-predictions.rajeevwarrier.com
116 stars 49 forks source link

Slight improvement to accuracy score #19

Open apavlo89 opened 3 years ago

apavlo89 commented 3 years ago

Hello there,

I think I increased prediction accuracy (using 80%-20% split) ever so slightly using TPOT (no oversampling applied yet).

try this:

# Average CV score on the training set was: 0.6958245897228948
exported_pipeline = make_pipeline(
    make_union(
        make_pipeline(
            StackingEstimator(estimator=BernoulliNB(alpha=0.001, fit_prior=False)),
            ZeroCount()
        ),
        FunctionTransformer(copy)
    ),
    StackingEstimator(estimator=SGDClassifier(alpha=0.01, eta0=0.1, fit_intercept=False, l1_ratio=0.0, learning_rate="constant", loss="perceptron", penalty="elasticnet", power_t=0.1)),
    MaxAbsScaler(),
    XGBClassifier(learning_rate=0.1, max_depth=2, min_child_weight=19, n_estimators=100, n_jobs=1, subsample=0.4, verbosity=0)
)
# Fix random state for all the steps in exported pipeline
set_param_recursive(exported_pipeline.steps, 'random_state', 2)

exported_pipeline.fit(training_features, training_target)
results = exported_pipeline.predict(testing_features)
apavlo89 commented 3 years ago

The accuracy score might increase even more as the optimisation code is still running on my unis cluster computer :P . I'll be updating main post with updated best

apavlo89 commented 3 years ago

I also created a new variable called B_finish_decisionratio (also for R) where I sum all the decision finish columns / finish columns. This improves the score. Keep in mind when its 0/0 i just turn it into a 0 value

MathemAddict commented 2 years ago

The accuracy score might increase even more as the optimisation code is still running on my unis cluster computer :P . I'll be updating main post with updated best

What accuracy did you end up with?