Open HenrikBengtsson opened 4 years ago
Similarly to https://github.com/UCSF-Costello-Lab/LG3_Pipeline/issues/141#issuecomment-633708976, I've rerun lg3 2019-07-22 a second time. Validating the results of this using the develop version of lg3 test validate
:
[cbctest2@n17 lg3-demo-2019-07-22-002]$ lg3 --version
2019-07-22 (commit 218d7fb)
gives:
[cbctest2@n17 lg3-demo-2019-07-22-002]$ LG3_TEST_TRUTH=../lg3-demo-2019-07-22/truth PATIENT=Patient157t10 $LG3_HOME/bin/lg3 test validate
Sourced: /home/henrik/repositories/UCSF-CostelloLab/LG3_Pipeline-develop/lg3.conf
*** Configuration
[OK] PROJECT=LG3
[OK] PATIENT=Patient157t10
[OK] CONV=patient_ID_conversions.tsv
[OK] LG3_TEST_TRUTH=../lg3-demo-2019-07-22/truth
*** Trimming of FASTQ Files
[OK] file tree ('output/LG3/trim/Z00*-trim')
[OK] file sizes ('output/LG3/trim/Z00*-trim/*')
[OK] file md5 checksums (after gunzip) ('output/LG3/trim/Z00*-trim/*.fastq.gz')
*** BWA Alignment of FASTQ Files
[OK] file tree ('output/LG3/exomes')
[OK] file sizes ('output/LG3/exomes/Z00*/*')
[OK] file md5 checksums ('output/LG3/exomes/Z00*/*.bai')
[OK] file md5 checksums ('output/LG3/exomes/Z00*/*.bam')
[OK] file md5 checksums ('output/LG3/exomes/Z00*/*.flagstat')
*** Recalibration of BAM Files
[OK] file tree ('output/LG3/exomes_recal/Patient157t10')
[OK] file sizes ('output/LG3/exomes_recal/Patient157t10/*')
[OK] file sizes ('output/LG3/exomes_recal/Patient157t10/germline/*')
[OK] file md5 checksums ('output/LG3/exomes_recal/Patient157t10/germline/*.germline')
[OK] file sizes ('output/LG3/exomes_recal/Patient157t10/*.bai')
[OK] file md5 checksums ('output/LG3/exomes_recal/Patient157t10/*.flagstat')
[OK] file md5 checksums ('output/LG3/exomes_recal/Patient157t10/*.bai')
[OK] file md5 checksums ('output/LG3/exomes_recal/Patient157t10/*.bam')
*** Pindel Processing
[OK] file tree ('output/LG3/pindel')
[OK] file rows ('output/LG3/pindel/Patient157t10.pindel.cfg')
[OK] file sizes ('output/LG3/pindel/Patient157t10_pindel/*')
[OK] file md5 checksums ('output/LG3/pindel/Patient157t10_pindel/*')
*** MutDet Processing
[OK] file tree ('output/LG3/mutations/Patient157t10_mutect')
[WARN] unexpected file sizes ('../lg3-demo-2019-07-22/truth/Patient157t10/output/LG3/mutations/Patient157t10_mutect/*' != 'output/LG3/mutations/Patient157t10_mutect/*')
@@ -1 +1 @@
-211K output/LG3/mutations/Patient157t10_mutect/NOR-Z00599t10__REC1-Z00601t10.indels.annotated.vcf
+212K output/LG3/mutations/Patient157t10_mutect/NOR-Z00599t10__REC1-Z00601t10.indels.annotated.vcf
@@ -6,2 +6,2 @@
-70M output/LG3/mutations/Patient157t10_mutect/NOR-Z00599t10__REC1-Z00601t10.snvs.coverage.mutect.bed
-623M output/LG3/mutations/Patient157t10_mutect/NOR-Z00599t10__REC1-Z00601t10.snvs.coverage.mutect.wig
+63M output/LG3/mutations/Patient157t10_mutect/NOR-Z00599t10__REC1-Z00601t10.snvs.coverage.mutect.bed
+528M output/LG3/mutations/Patient157t10_mutect/NOR-Z00599t10__REC1-Z00601t10.snvs.coverage.mutect.wig
@@ -14,2 +14,2 @@
-88M output/LG3/mutations/Patient157t10_mutect/NOR-Z00599t10__TUM-Z00600t10.snvs.coverage.mutect.bed
-628M output/LG3/mutations/Patient157t10_mutect/NOR-Z00599t10__TUM-Z00600t10.snvs.coverage.mutect.wig
+84M output/LG3/mutations/Patient157t10_mutect/NOR-Z00599t10__TUM-Z00600t10.snvs.coverage.mutect.bed
+597M output/LG3/mutations/Patient157t10_mutect/NOR-Z00599t10__TUM-Z00600t10.snvs.coverage.mutect.wig
[OK] file md5 checksums ('output/LG3/mutations/Patient157t10_mutect/*.mutations')
[OK] file md5 checksums ('output/LG3/mutations/Patient157t10_mutect/*.txt')
[OK] file md5 checksums ('output/LG3/mutations/Patient157t10_mutect/*.intersect.bed')
*** Post-MutDet Processing
[OK] file tree ('output/LG3/MAF')
[OK] file sizes ('output/LG3/MAF/Patient157t10_MAF/*')
[OK] file sizes ('output/LG3/MAF/Patient157t10_plots/*')
[OK] file tree ('output/LG3/MutInDel')
[OK] file sizes ('output/LG3/MutInDel/*')
[OK] file content ('output/LG3/MutInDel/Patient157t10.R.mutations')
As in https://github.com/UCSF-Costello-Lab/LG3_Pipeline/issues/141#issuecomment-633708976, this suggests "... that (i) there is random component to the ./_run_MutDet step, and (ii) the validation of the 'MutDet Processing' step does not take this into account ..."
Comparing lg3 2019-07-22 Patient157t10 results to those of lg3 2019-03-23 gives: