Closed kpalin closed 6 years ago
sorry for the troubles. lots of people are running indexcov on hundreds of samples, so it must be something odd, likely with your 32nd sample. Can you narrow it down to a single sample and send me the problematic crai?
can you let me know what organims this is as well? or send the fai file?
looks like you have 2 million sequences in there!
Sent you a sample crai. The DNA is mostly human but the reference includes pretty much all known viral genomes.
is this one fixed by the update from yesterday?
ok. I see that it's not. I am looking into this now. Can you reindex the cram with the most recent samtools and verify you still see this? there were some bugs in the cram indexing that affected indexcov until recently.
I found the offending lines in your crai. I have posted a question to samtools mailing list to ask if it is valid, but I think not. So, I think it is either due to a bug in cram indexing or some other issue outside of indexcov. But we'll see what the response is.
See response from samtools developer here: https://sourceforge.net/p/samtools/mailman/message/36106841/
Can you make sure your cram is sorted and re-index?
OK. Resorting and re-indexing with samtools 1.6 made the error go away in this case. I still have few other problematic crai:s to check and need to figure out why wasn't the cram sorted. Will get back to you if something major arises.
31 samples work fine but with 32 the output ends in:
2