greenelab / tybalt

Training and evaluating a variational autoencoder for pan-cancer gene expression data
BSD 3-Clause "New" or "Revised" License
162 stars 61 forks source link

High Weigh Genes Outliers #96

Closed gwaybio closed 6 years ago

gwaybio commented 6 years ago

In #95 I removed outliers from visualizations to focus on areas of highest density. I defined outliers by > 3 z score for skew or kurtosis. The features appear to be identifiying similar biological processes across algorithms (e.g. patient sex). I need to decide if focusing on biological assignment for these features should be handled separately than other features.

stale[bot] commented 6 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.