Closed nh13 closed 3 years ago
varlociraptor 3.0.1
Caused by:
unable to open SAM/BAM/CRAM index for tmp.normal.sam
well I don't have a SAM index ;P, it's a SAM file, so I was hoping it would just scan the file?
This is actually rust-htslib
behaviour when using a bam::IndexedReader
:
As it will fetch reads piling up above candidate sites and those may be apart by quite some distance (or you might have parallelized preprocessing over different parts of the genome), the default behavior is index-based skipping to those sites instead of streaming to them.
So would you want this to be a feature request to allow streaming instead of index-based skipping in some way? Or is the most recent error message (and then simply creating the index) fine as a solution for you?
Either the error message should say "file must be indexed" or "support streaming". Either is fine, but sometimes I use SAM files to produce test cases (unindexed) and debug odd variants, so that'd be great.
Since varlociraptor starts from candidate variants, I think we won't add support for just streaming through the bam. It will always have to jump, and just for testcases, I don't want specialized code. In particular, since it is pretty easy to convert into a BAM+index. Also, note that because of varlociraptors estimation of sampling bias related variables (see #154), it is anyway usually not optimal to have a handcrafted testcase. We have derived all of the ~60 included testcases from synthetic WGS, and real data.
But I agree about the error message. It should definitely be improved. Hence, leaving it open for now.
varlociraptor 2.3.0
command