broadinstitute / gdctools

Python and UNIX CLI utilities to simplify interaction with the NIH/NCI Genomics Data Commons
Other
31 stars 4 forks source link

BUG: multiple versions of clustering algorithms showing up in reports #64

Closed noblem closed 6 years ago

noblem commented 6 years ago

See http://gdac.broadinstitute.org/runs/tmp/reports_20170919/cancer/TCGA-COADREAD-TP/index.html

screen shot 2017-10-05 at 9 46 08 pm

Similar issue in http://gdac.broadinstitute.org/runs/tmp/reports_20170919/cancer/TCGA-OV-TP/index.html (and probably others)

This might be result of the zipfile extraction, which was done multiple times to

/broad/hptmp/mnoble/analyses__2017_09_19_extraction.2

but is more likely to be from lincRNA and mRNA reports not having unique titles. Regardless, scrutiny is clearly needed.