Closed jyw-atgithub closed 7 months ago
Thanks for reporting the error.
It happened in a step where Straglr tries to extract a clipped sequence from an alignment and align it against the reference to see whether it hits anything, in the hope of "rescuing" the alignment for further analysis.
Unfortunately, the start and end coordinates do not make sense in this instance; they are off by quite a large margin.
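For readers following the thread, here is a minimal sketch of what extracting a clipped sequence and sanity-checking its coordinates can look like. It is not Straglr's actual code: the function name `clipped_segments` and the tuple layout are hypothetical, and the only assumption carried over from real tools is the SAM convention that CIGAR operation 4 is a soft clip and 5 is a hard clip.

```python
# Illustrative sketch only; NOT taken from the Straglr source.
# CIGAR operation codes follow the SAM specification.
SOFT_CLIP = 4
HARD_CLIP = 5


def clipped_segments(query_seq, cigar):
    """Return (side, start, end, sequence) for each soft-clipped read end.

    query_seq: read sequence as stored in the alignment record
               (soft-clipped bases included, hard-clipped bases absent).
    cigar:     list of (operation, length) tuples.
    """
    # Hard-clipped bases are not present in query_seq, so ignore them.
    ops = [(op, n) for op, n in cigar if op != HARD_CLIP]

    segments = []
    if ops and ops[0][0] == SOFT_CLIP:
        n = ops[0][1]
        segments.append(("left", 0, n, query_seq[:n]))
    if len(ops) > 1 and ops[-1][0] == SOFT_CLIP:
        n = ops[-1][1]
        start = len(query_seq) - n
        segments.append(("right", start, len(query_seq), query_seq[start:]))

    # Reject coordinates that fall outside the read instead of slicing blindly.
    for side, start, end, _ in segments:
        if not (0 <= start < end <= len(query_seq)):
            raise ValueError(
                f"{side} clip coordinates {start}-{end} are outside "
                f"a read of length {len(query_seq)}"
            )
    return segments


if __name__ == "__main__":
    # 3 soft-clipped bases, 5 aligned bases, 2 soft-clipped bases
    print(clipped_segments("AAACCCCCGG", [(4, 3), (0, 5), (4, 2)]))
```

A check like the one at the end is one way a tool could skip a problematic alignment and move on rather than crash when the coordinates are inconsistent.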
It's hard to debug without doing a full analysis, but I will give it a shot if you can re-run with the following added to your command:
--debug --tmpdir {dirname}
For {dirname}, please specify a custom empty directory to store the tmp files.
Please redirect the stdout from running Straglr to a file so that you can send it to me together with a compressed archive of the tmp files.
Hi @jyw-atgithub, I've created a branch: https://github.com/bcgsc/straglr/tree/rescue Could you please check out this branch and run it on your data to see if the bug still occurs? Thanks!
Hello! Thanks for your support! I will upload the subset data after the conference (TAGC2024). In addition, I tried Python 3.8 on our cluster and no error was reported, but the same error was reproduced under Python 3.10.2.
The bugfix will kind of "escape" the problem and let the software move on. To understand what caused the problem, I will need to see the data, as I cannot think of how it happened unless there is some unanticipated alignment scenario. Is it a human genome alignment or some other species? Anyway, I wish you a very fruitful conference experience.
@jyw-atgithub, I wonder if you got a chance to try the bugfix in the "rescue" branch?
Hi @readmanchiu, I am working on it right now. Thanks for the follow-up. I will tell you the results tomorrow if everything goes fine.
Hi @readmanchiu, thank you very much for your patience! In my sandbox environment, the error message did not occur. The commands remain the same. My environment is shown below; it runs on our school's public cluster.
$module load anaconda/2022.05
$conda activate sandbox
(sandbox) $python --version
Python 3.9.12
(sandbox) $which trf
~/.conda/envs/sandbox/bin/trf
(sandbox) $which blastn
~/.conda/envs/sandbox/bin/blastn
(sandbox) $conda --version
conda 4.12.0
(sandbox) $cd straglr/
(sandbox) $git branch -a
* rescue
remotes/origin/HEAD -> origin/master
remotes/origin/master
remotes/origin/rescue
remotes/origin/v1.2.0a
Straglr was installed by running pip install . --user in the directory cloned from GitHub.
Good to know the error is gone. I will merge the branch.
Dear @readmanchiu, I am using the latest straglr.py under Python 3.10.2 on GNU/Linux x86_64. Here is the command:
python3 straglr.py ${aligned_bam}/SRR9951099_ONT.trimmed-ref.SOFT.bam ${ref_genome} SRR9951099_ONT --nprocs 16 --min_ins_size 50 --max_str_len 100
Then, it produced the following error. May I know how to fix it? Thank you! Here is how I produced the alignment: