skandlab / SMuRF

MIT License
20 stars 7 forks source link

SMuRF input data type #38

Closed caokai001 closed 4 years ago

caokai001 commented 4 years ago

Hi ! I would like to ask, the data types downloaded by TCGA are different from the four types recommended by this software package MuTect2, FreeBayes, VarDict and VarScan. so can I use use 2 or 3 type somatics data to train model and improve the accuracy of somatic mutation.

Thanks! by caokai image

tyler5huang commented 4 years ago

Hi caokai,

Unfortunately no for now as the software package uses exactly 4 callers which are important for the overall accuracy of the training model.

If you are using TCGA data, you can try to use the simple consensus method to find mutations called by at least two callers rather than this software. Our evaluation shows MuTect2 is a fairly accurate caller on its own as well.

Regards, Tyler

caokai001 commented 4 years ago

ok, thanks for your reply!