Data Augmentation - Githubissues

teejlab / API-Risk-Assessment-Framework

A framework for quantifying API risks.

MIT License

5 stars 9 forks source link

We currently have less data for training Machine Learning models. I suggest that we try these approaches and compare the performance:

Bootstrapping to augment the data, and training models on this higher data volume
Use Bagging Classifier as the ensemble model,, which fits base model on the bootstrapped samples and combines the results: https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.BaggingClassifier.html

This thread has been opened for discussing on data augmentation techniques.

teejlab / API-Risk-Assessment-Framework