broadinstitute / gdctools

Python and UNIX CLI utilities to simplify interaction with the NIH/NCI Genomics Data Commons
Other
31 stars 4 forks source link

simplify & rationalize dicing of [new] data types #81

Open noblem opened 6 years ago

noblem commented 6 years ago

In the process of supporting new kinds of data for CPTAC, and new-format SEG files in GDC data release 11, it's become clear that the current approach for dicing, and in particular the way the "convert" function is identified & called during dicing needs improvement:

This is a roundup of concerns, and as we work them the above list will likely need to be corrected or extended, but for now it's sufficient to have written most of them down in summary form so that they don't fall through any cracks