Closed SPPearce closed 2 months ago
I think it’s absolutely related. One temporary fix would be to swap in fgbio SortBam
for template coordinate merging for now. It isn’t as fast (not multithreaded) but would work when merging lanes. Perhaps even better as a stop gap would be just o use samtools sort
, which works, to re-sort after the merge?
Ok, thanks.
Currently samtools merge
is being ran non-multithreaded anyway, at least the first time (process_low
, and it uses task.cpus-1
).
I don’t think we have a merge tool in fgbio, so it’ll have to re-sort.
Ok. My surprise is that it worked on 7/8 samples. I think there is an issue with using igenomes too, but I'll dig into that on Monday.
You're right, this is relate tot #52 and https://github.com/samtools/samtools/pull/2062
Hi I'm new to nf-core workflows and I'm also encountering this bug.
[2024/07/31 08:02:03 | FgBioMain | Info] GroupReadsByUmi failed. Elapsed time: 0.05 minutes. Exception in thread "main" java.lang.IllegalStateException: A00232:194:H2LGVDSXC:1:1671:4797:18004 did not have a primary R1 record.
Could you elaborate on how to swap in 'fgbio SortBam'? Is this something I can specify in my config file?
I think it’s absolutely related. One temporary fix would be to swap in
fgbio SortBam
for template coordinate merging for now. It isn’t as fast (not multithreaded) but would work when merging lanes. Perhaps even better as a stop gap would be just o usesamtools sort
, which works, to re-sort after the merge?
Thanks!
This was fixed in the branch that @nh13 had made, but he seems to have deleted it now. The upstream fix is in samtools, but samtools haven't made a release yet. Nils, I think we should release a 1.0.1 version sooner than samtools might actually get round to releasing it.
Could you elaborate on how to swap in 'fgbio SortBam'? Is this something I can specify in my config file?
No, you can't do this with a config, it requires an edit to the workflow itself.
@SPPearce here's the closed PR: https://github.com/nf-core/fastquorum/pull/54. I was hoping that samtools would be released by now, but its volunteer so I can relate. I've asked for a release from here: https://github.com/samtools/samtools/issues/2090. perhaps we wait a few days and then do a release?
Fixed in #68
Description of the bug
This may be related to #52, but posting it separately as I'm not sure.
I’m finding this error on one of my 8 duplex samples on GroupReadsByUmi:
which is odd to me, because that bam file contain two reads with
A01659:139:HT77KDRX3:1:2160:16260:22326
:All 8 of these samples were sequenced over two lanes, so they are merged together. Curiously this is the only file that fails in this way, the other 7 samples are fine.
If I manually sort the merged bam file, then
GroupReadsByUmi
will resort the bam file itself and then work correctly.Command used and terminal output
No response
Relevant files
No response
System information
Running fastquorum v1.0.0 on Nextflow 23.10.1 with apptainer as the container engine.