Open GRGong opened 1 month ago
The next release will resolve this. Thanks!
Does the current main HEAD resolve this issue? I've now integrated integration tests of PAF correctness, which should be equivalent to the SAM correctness using https://github.com/ekg/pafcheck.
Thank you for the quick response! Unfortunately, I am working on a cluster that lacks some necessary libraries, and I am unable to compile wfmash from source. Would it be possible for you to provide a precompiled binary of wfmash?
@GRGong You could change the Dockerfile with wfmash HEAD for docker image. If you don't have access to root, singlularity remote builder would be help (https://cloud.sylabs.io/builder).
@GRGong here's a wfmash binary. I should probably make a release, but I prefer to do that once you've confirmed that this resolves the issues you're seeing. If not, we should resolve and add some automated tests to prevent future problems. Right now I'm testing SAM, PAF, and MAF conversion steps using github actions.
Just gunzip and make sure it's executable: wfmash-v0.21.0-38-gb731e41.gz
@ekg Thanks for the binary. I tested the provided binary using my own genomes, but it still has the CIGAR problem. For your reference, I’ve uploaded the query and target FASTA files, along with the command I used and the error log.
Here is the link: https://drive.google.com/file/d/18MzFalZhVnKt-hTfTxmxI2KsdZxh6Zsf/view?usp=sharing
Note: The two genomes belong to divergent insect species, but they are still in the same subfamily. The previous version, wfmash v0.15, worked without issues.
Dear wfmash developers,
I've identified an issue with CIGAR strings in PAF files generated by wfmash versions 0.16 and later. This problem appears to be related to the inversion patching feature introduced in v0.16.
Key points:
Example error (using
rustybam break-paf -m 5000
): toy.zipSteps to reproduce:
-t 96 -4 -p 60
This issue does not occur with wfmash v0.15.
Could you please investigate this CIGAR string inconsistency? It would be helpful to understand if this is a bug or if there have been changes in the CIGAR string format that need to be addressed in downstream tools.
Thank you for your attention to this matter.
Best regards, Gaorui