mskcc / pluto-cwl

CWL workflows for helix filter scripts
1 stars 6 forks source link

need to scrub sample IDs from demo dataset #51

Closed stevekm closed 2 years ago

stevekm commented 3 years ago

related to #50 , I noticed that the maf files used for demo dataset still have original sample ID's inside them despite the filenames being changed, need to use the sanitize module on their files contents https://github.com/stevekm/sanitize This will probably change the checksum on the outputs in test cases where the demo dataset is in use, MSI CWL in particular reports the Tumor Barcode column from the maf and not the filename sample IDs

stevekm commented 3 years ago

if we can get the demo dataset scrubbed enough and small enough maybe we can host it on GitHub to aid testing as well

stevekm commented 2 years ago

scrubbing is implemented but not all test cases have been updated to use the new scrubbed Proj_1 dataset. Will migrate old test cases as they get encountered for future dev