Open Rob-murphys opened 3 years ago
My best guess is: $denovo_assembly
doesn't have the right file name, or something like that.
Indeed your sequence format looks fine, yet Query sequences=0
: impossible!
An example command from the bash script would be
cd /home/lamma/local-blast/termitomycesBGI/Genome_v3/termitodb # to get me into the database directory as it didn't seem to work if I was not there.
lastal -P4 -E 0.5 -C 2 termitodb /home/lamma/local-blast/termitomycesBGI/redundans-output_hybrid/IC35-2/scaffolds.reduced.fa > /home/lamma/local-blast/termitomycesBGI/redundans-output_hybrid/IC35-2_termitodb.maf
The assembly filepath is indeed correct:
less /home/lamma/local-blast/termitomycesBGI/redundans-output_hybrid/IC35-2/scaffolds.reduced.fa
>scaffold09232
GGCGAGAATGTCGGTTTGCCCGACGGACAGATGGGTAACTCGGAAGTAGGTCACCTCAATATCGGTGCAGGACGCATCGTATACCAGGACTTGGTAAAGA
TTAACCGTGCTTGTGCCGACGGCAGCATCCTGAAGAATAAGGAAATCGTTTCCGCTTTCTCTTACGCGAAGGAGCACGGTAAGAACATCCATTTTATGGG
ACTCACCAGCAATGGCGGTGTGCACAGTTCTTTCGACCATCTGTTCAAGCTTTGCGATATTTCAAAGGAAT
>scaffold09237
GGTGTGCGCATGAATGATCTGGCCAATTCCACCATGACGACGACTTCTCCCCCCAAGCGCATTGACTTCAATACGCCTGAGCTGCAACGCAAGCGCCGCA
TTCGCGCGCTCAAGGATCGCCTGACCCGTTGGTACGTGCTAGTCGGCGGCCCCGCCGTCCTCGGCGCCATCACCCTGATTTTCTTCTTTCTTGCCTATGT
CGTCGCGCCGCTGTTTCAAGGTGCCTCATTGACCAAGGAAGACGCCCTGACCCCGGCCTGGATCCAGGACG
>scaffold09236
GCGTAGCCCGAGAAAACTTTGGCTGCTCGTCCGGTTGGAGCCAACAAAAACGTTTTATGCTTTAGTTCGTTAAGCGTTTTCACCAAAGCCCCGACCAGAG
AGGTTTTTCCGGTTCCTGCGTAGCCTTTCATCAGTAGAAGGCTGTCCCGTTCCTCAGCCAAGAGAAATGCGGTTAGCGTATGAAGCGCTAGAATGAGAAC
ATCCGTCGGGTTATAGAGGAAATTTCGTACGAATTGCTGGCCGAAGTAACTATTTATGATTTTGACCGCAA
Hi,
I copy-pasted your 3 sequences and lastal options. I got the expected Query sequences=3
.
Could you send some minimal sequence files and exact options (including for lastdb) that exhibit the problem?
hi, where should I upload the sequence files to?
Apparently, you can attach files to these comments: https://docs.github.com/en/free-pro-team@latest/github/managing-your-work-on-github/file-attachments-on-issues-and-pull-requests
Or you could email them to my address shown in here: https://sites.google.com/site/mcfrith/
Hi there, sorry for the slow reply.
I have attached an example sequence file. P1_19_mod.zip
All commands used are below
last-train:
cd termitodb
lastdb -uNEAR -R01 termitodb ../Termitomyces_v3.0.fa
lastal
cd $db_path
lastal -P4 -E 0.5 -C 2 $db_name $denovo_assembly > $outfile
echo $db_name
termitodb
echo $denovo_assembly
/home/lamma/local-blast/termitomycesBGI/redundans-output_ragtag_hybrid/P1_19_mod.fa
echo $outfile
/home/lamma/local-blast/termitomycesBGI/redundans-output_ragtag_hybrid/P1_19_mod.maf
Hi:
I tried your commands and sequences, with LAST version 1060, and got the expected # Query sequences=1558
.
(I don't have Termitomyces_v3.0.fa
, so I used some random file instead, but I'm 99.99% sure that can't explain it.)
So I'm not sure what to suggest!
Are you using some job queuing/submission system? What if you run it in the simplest possible way, on your own local computer, without any $
variables?
I am using a job submission system yes. This is all running in slurm. What would the job submission system be doing to cause this error?
I can try run it locally 👍
I am using the LAST aligner to align a reference to a denvo assembled genome to identify contamination. I have run lastdb on the reference to create the database but when using lastal I get no significant output:
It seems as though no query sequences are being detected but the de novo assemblies are in the correct fasta format eg: