Open vicmacbec opened 1 year ago
First feature importance.
Best half most importance pairs
Looking the bias vs variance curves, we can see that there is overfitting since the test curve never down. To reduce the overfitting, we proceed to reduce the features using the most important half of features. We can observe that the the test curve does not down, so know we are going to proceed to use regularization.
Changing the target of the increase of the 5 % of the price in the next week, the bias variance plot gives better solutions. The overfitting still remain but is more acceptable.
A view of a sample of the test orders is: where the red diamonds are false positive orders (maximice specificity), the black squares are false negative orders and the black squares with red diamonds are the true positive orders (maximice them)
Notes
To do:
Bias Variance plot using cross validation with 5 folds and parameters:
Grid search was done and the best parameters are:
Execution of the model using the best parameters (the best test iteration is marked with a black point)
Confussion matrix analysis
AUC and ROC analysis
Youden index computation
Confussion Matrix with Youden index:
Test a machine learning algorithm using trading rules as features.