mskcc / tempo

CCS research pipeline to process WES and WGS TN pairs
https://cmotempo.netlify.com/
12 stars 5 forks source link

Trim MAF columns #612

Open kpjonsson opened 4 years ago

kpjonsson commented 4 years ago

Per discussion between @md and me and feedback from @cband:

Currently, the final somatic MAF contains 279 columns. These are not all necessary, and a few could be omitted or collapsed into single columns in order to minimize file size and make it easier to navigate what's important. These can be done inside the pipeline (https://github.com/mskcc/vaporware/blob/develop/containers/vcf2maf/filter-somatic-maf.R and the corresponding germline filter script). Some columns that are output by default in VEP/vcf2maf are pretty much useless.

Here are some suggested changes:

Keep in mind:

evanbiederstedt commented 4 years ago

I view this as something CCS will always be fiddling with, which is great and good for science.

We simply version the pipeline, and run it based on improved versions.

gongyixiao commented 1 year ago

https://github.com/mskcc/tempo/issues/928