Closed ziphra closed 8 months ago
The chromosome names in the "exclude" bed file should have the same convention as in the alignment bam file. The error may arise from there. The second error:
Error: malformed BED entry at line 787. Start Coordinate detected that is < 0. Exiting.
is also related to the patch chromosomes. So right now the best solution will be to remove the the patch alignments as you suggested.
Thanks for trying Straglr, please let me know if there is any other issues.
Thank you for your prompt response.
Instead of removing patch alignments, I changed their name in the exclude bed so they would match the bam file. For those having the same problem and who want to keep their patches alignment: They were written like that in my exclude bed: "chrY_MU273398v1_fix" while being written like this "MU273398.1" in my bam, so I did
sed 's/^[^_]*_//' hg38.exclude.bed | awk 'BEGIN{FS=OFS="\t"} {sub(/[_].*/,"",$1)} 1' | sed 's/v1/\.1/' | sed 's/v2/\.2/' > hg38_renamed.exclude.bed
A new version has been released that should also handle ALT chromosome alignments.
But the chromosome names between BAM and the bed file for --exclude
still have to agree
Hi,
I tried running straglr like this to detect TR expansions in ONT whole genome sequencing:
and I generated hg38.exclude.bed like this:
But while running straglr I get the following warnings:
And then this error:
Should I remove all patchs alignments from my bam ?
I would love some insights on that.
Many thanks in advance
ziphra