DrKaiyangZheng / VITAP

The viral taxonomic assignment pipeline
GNU General Public License v3.0
4 stars 0 forks source link

[running the test] No blast output #1

Open Sanrrone opened 2 months ago

Sanrrone commented 2 months ago

Dears, I am running the test with the file you provide. However, it seems there is an issue by calling blast.

./scripts/VITAP.sh assignment -i test.fasta -d ./DB_20240115_IMGVR_expanded -o test_result
===== The VITAP (VIral Taxonomy Assigning Pipeline, main program) Copyright 2023 Kaiyang Zheng, Viral/microbial diversity Lab. Ocean University of China ===== 
[INFO] A number of temporary files will be generated in your result directory, please keep them or delete them followed by the guidlines.
===== ORF calling for test.fasta =====
[INFO] Ten ICTV reference genomes were randomly selected and merged to your genome set! 
[INFO] Genome and ORF files have consistent non-redundant FASTA IDs. √
[INFO] Length statitic for merged_genome.fasta
[INFO] Statistic of the number of ORF per genome
===== Aligning to ICTV reference database based on BLAST-algorithm =====
[INFO] Please delete the output file (test_result/target_ICTV_blastp.align) if this step aborted!
[INFO] Please keep the output file (test_result/target_ICTV_blastp.align) if this step finished!
===== Locating contig on taxonomic framework and generating genome community network =====
[INFO] Generating Species graph for target genomes
Traceback (most recent call last):
  File "/scratch/project_2007362/software/VITAP/scripts/VITAP_assignment_20240421.py", line 400, in <module>
    assignment()    
    ^^^^^^^^^^^^
  File "/scratch/project_2007362/software/VITAP/scripts/VITAP_assignment_20240421.py", line 382, in assignment
    taxonomy_assigning(target_blast_fp, VMR_csv_file, genome_cutoff_file, taxa, length_file, orf_count_df, taxa_annot_file, taxa_annot_file)
  File "/scratch/project_2007362/software/VITAP/scripts/VITAP_assignment_20240421.py", line 86, in taxonomy_assigning
    blast_results = pd.read_csv(blast_results_file, sep="\t", names=["qseqid", "sseqid", f"bitscore"])
                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/scratch/project_2007362/software/mambaforge/envs/VITAP/lib/python3.12/site-packages/pandas/io/parsers/readers.py", line 948, in read_csv
    return _read(filepath_or_buffer, kwds)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/scratch/project_2007362/software/mambaforge/envs/VITAP/lib/python3.12/site-packages/pandas/io/parsers/readers.py", line 611, in _read
    parser = TextFileReader(filepath_or_buffer, **kwds)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/scratch/project_2007362/software/mambaforge/envs/VITAP/lib/python3.12/site-packages/pandas/io/parsers/readers.py", line 1448, in __init__
    self._engine = self._make_engine(f, self.engine)
                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/scratch/project_2007362/software/mambaforge/envs/VITAP/lib/python3.12/site-packages/pandas/io/parsers/readers.py", line 1705, in _make_engine
    self.handles = get_handle(
                   ^^^^^^^^^^^
  File "/scratch/project_2007362/software/mambaforge/envs/VITAP/lib/python3.12/site-packages/pandas/io/common.py", line 863, in get_handle
    handle = open(
             ^^^^^
FileNotFoundError: [Errno 2] No such file or directory: 'test_result/target_ICTV_blastp.align'

How can I proceed?

thanks in advance!

DrKaiyangZheng commented 2 months ago

Hi, Sandro, This issue is usually caused by an abnormal run of Diamond, but there could be multiple reasons for it. It might be due to Prodigal failing to successfully predict protein sequences, leading to errors in the downstream Diamond step, or it could be that Diamond itself did not run successfully. You may be able to identify these issues through the VITAP_run.log. Some more complex reasons could be related to environment configuration problems. You can try reinstalling it and see if that resolves the issue. The good news is that I have merged VITAP into the Bioconda recipes, so now you can easily install and test it using conda (conda install bioconda::vitap). Thank you for your feedback, and thank you for all the suggestions to improve the VITAP manuscript. Hope this tool can be of some help to you in advance! :)