lelaboratoire / tpot-fss

Application of TPOT's new operators DatasetSelector
7 stars 5 forks source link

Application of TPOT's new Feature Set Selector (FSS) and Template

This repository contains detailed simulation and analysis code needed to reproduce the results in this study:

Scaling tree-based automated machine learning to biomedical big data with a feature set selector
Trang T Le, Weixuan Fu, Jason H Moore
Bioinformatics (2019-06-04) https://doi.org/gf3tds
DOI: 10.1093/bioinformatics/btz470 · PMID: 31165141

Simulation analyses and visualizations are in simulation, and real-world application to RNA-Seq data are in RNASeq. Each folder contains TPOT exported pipelines (with FSS in pipelines_ds and without FSS in pipelines_reg) as well as the cross validated accuracy of each pipeline (in accuracies_*).

Questions/PRs are welcomed.