Closed mmokrejs closed 2 years ago
Hi @mmokrejs,
Just want to make sure that the distinction between the original tigmint
and tigmint-long
is clear - tigmint
utilizes linked reads for the correction, whereas tigmint-long
uses long reads. tigmint-long
supports fasta or fastq reads from any long read technology.
For tigmint-long
we simulated 'pseudo-linked reads' from the long reads, so we make an output fasta format where the 'barcode' is in the read header, and each linked read
from the same long read is given the same barcode. It looks like the error you have here is from tigmint-long
, so we prescribe what the format is, as we write the file that is aligned with minimap2.
if you're still seeing this error, feel free to post the full command and log.
Thanks, Lauren
Hi @lcoombe , thank you for confirming that tigmint should work with any long reads (notably PacBio and Nanopore).
Here are my read names from PacBio HiFi:
@m54312U_210604_060420/36/ccs
@m54312U_210604_060420/1836835/ccs
@m54312U_210604_060420/3671337/ccs
@m54312U_210604_060420/5572759/ccs
@m54312U_210604_060420/7603272/ccs
@m54312U_210604_060420/9570710/ccs
@m54312U_210604_060420/11601384/ccs
@m54312U_210604_060420/13632255/ccs
@m54312U_210604_060420/15664129/ccs
@m54312U_210604_060420/17695940/ccs
...
And here are the reads from Promethion:
...
@3449a3f3-3517-4e6c-a4a9-556181a35c0f runid=f449620d42d064e00e812250f447e112b6cda908 read=23410 ch=2579 start_time=2020-07-03T18:43:27Z flow_cell_id=PAE36794 protocol_group_id=200630Wen sample_id=D20-2216
@02e23ba5-b6db-4a2a-afbf-ee032d2b216d runid=f449620d42d064e00e812250f447e112b6cda908 read=24449 ch=2169 start_time=2020-07-03T18:43:54Z flow_cell_id=PAE36794 protocol_group_id=200630Wen sample_id=D20-2216
My command was
tigmint-make tigmint-long draft=`basename "$f" .fa` reads=`basename $myreads .fq.gz` longmap=ont span="auto" G="6.8e9" dist="auto" t=$threads
Hi @mmokrejs,
Just to confirm - is your myreads.fq.gz
a mix of ONT (fasta) and pacbio (fq) reads?
Do you see any other messages before the python TypeError? That's usually indicative of an issue/error/warning that happened upstream.
Sorry, I was in a hurry and picked a FASTA derivative of the file. I confirm the FASTQ file complies fastq. I edited the sequence above to show the reads at the end of the huge file from Promethion.
No, I haven't seen other error messages before that but I do remember seeing this reported by some other user as well, either under tigmint
or arcs
or similar bgsc project. Just poke poke through other issues on github.
Could you please post the full log? It's helpful for me to have the full log to better aid you in troubleshooting. I took a look through the issues on tigmint and arcs, but couldn't find a similar issue - if you recall where you saw that and link it here that would be helpful! As I mentioned, I have seen this error before, but generally after an upstream failure, so that's just why I'd like to have a second glance at your full log.
OK, luckily I found it in the xterm buffer so I edited the orignal post. Yeah, will try to find the similar issue on github ..., sorry that did not keep the link to it handy.
Thanks for the full log!
This is a funny one - I'm not too sure if the issue is happening in the long-to-linked-pe
stage or at the tigmint_molecule_paf.py
stage.
Just checking - have you tried running your installation on our installation test files? (https://github.com/bcgsc/tigmint/tree/master/tests/test_installation) It's just a good sanity check for us to see if it is an installation/environment issue or something with your particular data.
I don't think I ran the tigmint
tests even after brief xterm buffer poking.
But based on this python
error speaking of BX:Z
tags I moved to ntLink
and those tests passed successfully.
You know, based on this crash I though tigmint actually requires the BX:Z
tags from 10x chromium, even in the tigmint-long
mode and that I just somehow misread the README.md
. Seemed ntLink
is the way to go for me.
After the crash something was still occupying the shell, hence the ctrl+c
to quit that. Maybe it was some parallel job started by the tigmint-make
. Seemed obvious I anyway have to overcome the python crash. Probably crash on an empty or None value, would not be surprising as I do not have 10x chromium data.
Sure, if you want to continue troubleshooting the TypeError, let me know how running the tests goes when you get around to it.
And just to clarify (also for the benefit of others), tigmint-long
does use BX tags, but only under the hood, so you wouldn't notice that without the error. While the input of tigmint-long
is normal long reads, it simulates 'pseudo-linked reads' from the long reads using long-to-linked-pe
, so it generates the reads with the BX tag in the header. So looking at the entire command of that step is helpful in understanding what's going on.
I'm glad you've found ntLink, and (as you may have noticed), Tigmint-long + ntLink is in fact the default mode of our longstitch pipeline.
This issue has been automatically marked as stale because it has not had any recent activity. It will be closed if no further activity occurs. Thank you for your interest in Tigmint!
Does
tigmint-make tigmint-long
support FASTQ reads from other platforms than 10x genomics chromium?