GenoML / genoml2

GenoML (genoml2) is an open source Python package. It is an automated machine learning (autoML) platform for genomics data
Apache License 2.0
27 stars 17 forks source link

Sorting features by largest contribution when outputting *_approx_feature_importance.txt #7

Closed m-makarious closed 4 years ago

m-makarious commented 4 years ago

Please make sure that this is a feature request.

System information

Describe the feature and the current behavior/state. Right now, when the --feature_selection flag during munging, the resulting *_approx_feature_importance.txt file does not sort by largest contributor

Will this change the current api? How? Nope!

Who will benefit with this feature? Anyone who would be trying to just grab the head of the *_approx_feature_importance.txt file to see the top 10 contributing features to the predictive model

Any Other info. N/A

m-makarious commented 4 years ago

This was just addressed in the most recent commit to now output a sorted, tab delimited *_approx_feature_importance.txt file when the --feature_selection flag is used during munging

EDIT (Jul 28 2020): It is now *.approx_feature_importance.txt

m-makarious commented 4 years ago

Moved issue to new repo for completeness and consistency