Closed ypradat closed 1 day ago
Running with SAGE v4.0 (beta) returned the exact same error as did running with the exact same BAM files (GATK-recalibrated) as I provided to Mutect2 and Strelka.
Best regards, Y. Pradat
Going through the java code, I understood that the error comes from the loading of transcripts data in the Ensembl cache. As outlined in my first command, I used homo_sapiens_core_111_38
. Resorting to homo_sapiens_core_104_38
as indicated in the README runs without error. I suppose there has been some recent change in the Ensembl transcripts data that creates a loading error for some particular gene on chromosome 14.
Best regards, Y. Pradat
That's correct - the gene AL121790.1 has no transcripts and has since been removed from our Ensemble data cache as used in the next pipeline release (v6.0). I have in the meantime changed Sage to skip this gene when loading transcript info. This fix will be out in the Sage 4.0, also part of pipeline v6.0
Dear HMF team,
I am trying to run SAGE v3.3. on a pair of tumor-normal whole-exome sequenced. Below is my command
and here is the log of the job
Can you please advise on the cause of the error and, if possible, on a fix? I have already processed successfully the data from these two samples (not exactly the BAMs above but BAMs further processed to get BQSR via GATK) with Mutect2 and Strelka.
Best regards, Y. Pradat