Closed amyhouseman closed 2 years ago
Could you please paste a line from the input BAM after the header? Also, a line from the reference showing the ID of the chromosome.
Lines after header in input bam:
Chromosome ID in reference (it isn't all Ns, I checked):
I think it may have been a problem with my bed file!
I did this to see if it was tab or spaces between the fields (even though my text editor said it was tabs): tr -d " " < original.bed > checktabspaces.bed
The result was a file where all the columns combined so then I did this on the original file: awk 'OFS=" " {print $1"\t", $2"\t", $3}' orginial.bed | tr -d " " > tab.bed
And then reran, and it worked!
Not a deepvariant problem, but thought I'd share. Thanks! Amy
Hello, Operatin system: Linux HPC Version: 1.3.0 Installation: Singularity Data: WES - with Agilent SureSelect DNA Human All ExonV5_hg38 bed file
Steps to reproduce: Command
Error trace:
* Intermediate results will be written to Polyposis_Exome_Analysis_JOB27/deepvariant/intermediateresults/15M11163_L7_PE_output_intermediate in docker.
Running the command:
First lines: First 10 lines of sorted marked duplicate bam is: BAM?P@HD VN:1.6 SO:coordinate @SQ SN:NC_000001.11 LN:248956422 @SQ SN:NT_187361.1 LN:175055 @SQ SN:NT_187362.1 LN:32032 @SQ SN:NT_187363.1 LN:127682 @SQ SN:NT_187364.1 LN:66860 @SQ SN:NT_187365.1 LN:40176 @SQ SN:NT_187366.1 LN:42210 @SQ SN:NT_187367.1 LN:176043 @SQ SN:NT_187368.1 LN:40745
First line of reference hg38 is:
First line of bed file is: NC_000001.11 65509 65625
I have got deepvariant and the above code to work for another dataset with a different bed file used - but I'm not sure why the ValueError: Unknown: BED record has invalid number of fields error is occurring.
Thanks! Amy