Closed robinjugas closed 3 years ago
Try Ensemble.sSNV.tsv.xgb.v3.6.2.classifier
and Ensemble.sINDEL.tsv.xgb.v3.6.2.classifier
instead of Ensemble.sSNV.tsv.xgb.v3.6.2.classifier.txt
and Ensemble.sINDEL.tsv.xgb.v3.6.2.classifier.txt
first. Hopefully, that's the only problem. There should be two files that end with .classifier
.
Thank you very much. It worked. Best regards, RJ
Hello Mr. Fang, I am trying to run somaticseq train and prediction, based on conda version 3.6.2 using XGBOOST Using commands:
somaticseq_parallel.py --threads 20 --output-directory somatic_seq_results/TRAINING --genome-reference /mnt/ssd/ssd_3/references/homsap/GRCh37-p13/seq/GRCh37-p13.fa --inclusion-region /mnt/ssd/ssd_3/references/homsap/GRCh37-p13/intervals/TruSeq_Exome/TruSeq_Exome.bed --truth-snv bamsurgeon/EF1059_truepositive_SNV.vcf --truth-indel bamsurgeon/EF1059_truepositive_INDELS.vcf -algo xgboost -train paired --tumor-bam-file bamsurgeon/EF1059_tumor.bam --normal-bam-file ../input_files/mapped/EF1059_normal.bam --vardict-vcf variant_calls/EF1059/vardict/VarDict.vcf --varscan-snv variant_calls/EF1059/varscan/VarScan2.snp.vcf --varscan-indel variant_calls/EF1059/varscan/VarScan2.indel.vcf
somaticseq_parallel.py --threads 20 --output-directory somatic_seq_results/EF1059 --genome-reference /mnt/ssd/ssd_3/references/homsap/GRCh37-p13/seq/GRCh37-p13.fa --inclusion-region /mnt/ssd/ssd_3/references/homsap/GRCh37-p13/intervals/TruSeq_Exome/TruSeq_Exome.bed --minimum-num-callers 0.4 --dbsnp-vcf /mnt/ssd/ssd_3/references/homsap/GRCh37-p13/annot/dbSNP/common_all.vcf.gz --classifier-snv somatic_seq_results/TRAINING/Ensemble.sSNV.tsv.xgb.v3.6.2.classifier.txt --classifier-indel somatic_seq_results/TRAINING/Ensemble.sINDEL.tsv.xgb.v3.6.2.classifier.txt -algo xgboost paired --tumor-bam-file ../input_files/mapped/EF1059_tumor.bam --normal-bam-file ../input_files/mapped/EF1059_normal.bam --vardict-vcf variant_calls/EF1059/vardict/VarDict.vcf --varscan-snv variant_calls/EF1059/varscan/VarScan2.snp.vcf --varscan-indel variant_calls/EF1059/varscan/VarScan2.indel.vcf
I am running training on artifical tumor bam created by BAMsurgeon. However, I am getting a warning in somaticseq train :
and an error in somaticseq predict:
Attaching both two logs as with bamsurgeon log, as it might be useful. somaticseq_train.log somaticseq_predict.log bamsurgeon.log
Thank you for any help, or any hints where should I look for any mistake at my side. Regards, RJ