mskcc / pluto-cwl

CWL workflows for helix filter scripts

update_cBioPortal_data.py merge_mafs uses too much memory #56

Closed stevekm closed 3 years ago

stevekm commented 3 years ago

The script update_cBioPortal_data.py is using large amounts of memory when merging Facets maf information into the data_mutations_extended.txt file.

Some ideas;

stevekm commented 3 years ago

there are some log and job files here to use for debugging this: /juno/work/ci/helix_filters_01/test_data/11089_G

stevekm commented 3 years ago

this is fixed for the moment, using the reduced Facets python dict method; if it becomes a problem again, try some of these other ideas
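For anyone reading later, a rough sketch of what a "reduced dict" merge could look like. This is not the code from update_cBioPortal_data.py. It assumes the idea is to keep only the match-key columns and the few Facets columns being copied in a plain dict, then stream the mutations file one row at a time instead of holding both tables in memory. The function names and the `facets_tcn` / `facets_clonality` column names are hypothetical placeholders; the key columns are standard maf fields, but the real script may match on something else.

```python
import csv

# Assumption: variants are matched on these standard maf columns.
KEY_COLS = ("Tumor_Sample_Barcode", "Chromosome", "Start_Position",
            "End_Position", "Reference_Allele", "Tumor_Seq_Allele2")
# Hypothetical placeholder names for the Facets columns to carry over.
FACETS_COLS = ("facets_tcn", "facets_clonality")


def _data_lines(handle):
    """Yield lines lazily, skipping '#' comment lines at the top of a maf."""
    for line in handle:
        if not line.startswith("#"):
            yield line


def load_facets_lookup(facets_handle):
    """Keep only the key and the needed Facets values for each row."""
    lookup = {}
    for row in csv.DictReader(_data_lines(facets_handle), delimiter="\t"):
        key = tuple(row[c] for c in KEY_COLS)
        lookup[key] = tuple(row.get(c, "") for c in FACETS_COLS)
    return lookup


def merge_facets_into_mutations(mutations_handle, facets_handle, output_handle):
    """Stream the mutations file, appending Facets values found in the lookup."""
    lookup = load_facets_lookup(facets_handle)
    reader = csv.DictReader(_data_lines(mutations_handle), delimiter="\t")
    fieldnames = list(reader.fieldnames)
    fieldnames += [c for c in FACETS_COLS if c not in fieldnames]
    writer = csv.DictWriter(output_handle, fieldnames=fieldnames,
                            delimiter="\t", lineterminator="\n")
    writer.writeheader()
    missing = ("",) * len(FACETS_COLS)
    for row in reader:
        key = tuple(row[c] for c in KEY_COLS)
        # Rows with no Facets match get empty values rather than being dropped.
        row.update(zip(FACETS_COLS, lookup.get(key, missing)))
        writer.writerow(row)
```

With this shape, memory scales with the number of Facets rows times the handful of kept columns, not with the full width of either file. Comment lines in the mutations file are skipped, not copied to the output, so they would need separate handling if they must be preserved.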

stevekm commented 3 years ago

also note that work to test mem usage was done here: /juno/work/ci/kellys5/projects/benchmarking