harta55 / EnTAP

Eukaryotic Non-Model Transcriptome Annotation Pipeline - Latest Release v1.2.1 - HGT Analysis Released! Revamped figures/graphics coming soon.
https://entap.readthedocs.io/en/latest/
GNU General Public License v3.0
37 stars 9 forks source link

Version1.1.1: interproscan database error #85

Open AlexGaithuma opened 1 month ago

AlexGaithuma commented 1 month ago

I cant ran Interproscan on Entap. my debug file is as below:

interproscan-exe: /data2/Alex_MulengaLab/tools/interproscan-5.67-99.0/interproscan.sh
protein: pfam
hgt-donor: 
hgt-recipient: 
hgt-gff: 
Fri May 31 10:19:21 2024: User has input a database at: /data2/Alex_MulengaLab/db/uniprot_trembl.dmnd
Fri May 31 10:19:21 2024: User has input a database at: /data2/Alex_MulengaLab/db/Refseq.protein.dmnd
Fri May 31 10:19:21 2024: User has input a database at: /data2/Alex_MulengaLab/db/uniprot_trembl.dmnd
Fri May 31 10:19:21 2024: Verifying EnTAP database...
Fri May 31 10:19:21 2024: Reading serialized database from: /data2/Alex_MulengaLab/db/bin/entap_database.bin Of type: 2
Fri May 31 10:19:40 2024: Success!
Fri May 31 10:19:40 2024: Spawn Object - QueryData
Fri May 31 10:19:40 2024: Generating transcriptome mappings...
Fri May 31 10:19:40 2024: Transcriptome Lines - START
Fri May 31 10:19:40 2024: >JF1A_TRINITY_DN10022_c0_g1_i1.p1 JF1A_TRINITY_DN10022_c0_g1~~JF1A_TRINITY_DN10022_c0_g1_i1.p1  ORF type:internal len:108 (+),score=23.54,Defensin_4|PF07936.12|6.1e+03,Defensin_4|PF07936.12|0.15,Defensin_4|PF07936.12|6.8e+03 JF1A_TRINITY_DN10022_c0_g1_i1:2-322(+)
Fri May 31 10:19:40 2024: AGGAAAAAGGGTTCTCGAGATTCGGTGGGCCTTACAGTGTCAGCTAATGGAATTCATTGG
Fri May 31 10:19:40 2024: CGAAGAGGAACTGGACACATTCAGACTGATGATGATGTTGGTATTGTCATGGAATGCAGT
Fri May 31 10:19:40 2024: GATGATTGGTGGGCCTTTGATACTGAGATTGTTCGACCATCTGATATTCTTATACTGTCT
Fri May 31 10:19:40 2024: AGTGAGAAGCTGAAAGCCTCAAGTGGGGTTTATTGGATGTTTTATTCGGGATGCAACTCG
Fri May 31 10:19:40 2024: GATGAGATAAATGTTCCCCAAGTGTTGCTTGGGAACCCTTGGAAGTTTGAGGAGACTTGC
Fri May 31 10:19:40 2024: CTTGAATCTACTCTTTTTAGG
Fri May 31 10:19:40 2024: >JF1A_TRINITY_DN10032_c0_g1_i1.p1 JF1A_TRINITY_DN10032_c0_g1~~JF1A_TRINITY_DN10032_c0_g1_i1.p1  ORF type:5prime_partial len:102 (+),score=5.52 JF1A_TRINITY_DN10032_c0_g1_i1:1-306(+)
Fri May 31 10:19:40 2024: AATGTGCCCTTGTTAATGTTTCCGTTCATGATAACGAACTGGCTTCTTACAGTTTTGTCA
Fri May 31 10:19:40 2024: Transcriptome Lines - END
Fri May 31 10:19:40 2024: Success!
Fri May 31 10:19:40 2024: Verifying input flag fpkm
Fri May 31 10:19:40 2024: Verifying input flag qcoverage
Fri May 31 10:19:40 2024: Verifying input flag tcoverage
Fri May 31 10:19:40 2024: Verifying input flag ontology
Fri May 31 10:19:40 2024: Verifying input flag level
Fri May 31 10:19:40 2024: Killing Object - EntapDatabase
Fri May 31 10:19:45 2024: Killing Object - QueryData
Fri May 31 10:19:45 2024: QuerySequence data freed
Fri May 31 10:19:45 2024: Error code (10) not recognized
Fri May 31 10:19:45 2024: Error code: 10

InterProScan selected, but invalid databases input!
Fri May 31 10:19:45 2024: End - EnTAP
AlexGaithuma commented 1 month ago

fixed issue by installing python 2.7 in my conda environment which had python 3.7 already installed

conda env update -n my_env --file python27.yml

my .yml file contains

name: python27
dependencies:
- python=2.7
- virtualenv
- ipython
- numpy
- scipy
harta55 commented 1 month ago

Hey! Did you modify the config file interproscan-db or protein (if using older version) commands in between runs? By default, there aren't databases listed for those commands which would give you an error.

chenyidai95 commented 1 month ago

Hi! I already use EnTap Version1.2.1 in docker, and I guess the problem is at the database flag "morbidblite" in the file "entap_run.params". When I put all database: interproscan-db=tigrfam interproscan-db=sfld interproscan-db=prodom interproscan-db=hamap interproscan-db=pfam interproscan-db=smart interproscan-db=cdd interproscan-db=prositeprofiles interproscan-db=prositepatterns interproscan-db=superfamily interproscan-db=prints interproscan-db=panther interproscan-db=gene3d interproscan-db=pirsf interproscan-db=coils interproscan-db=morbidblite in "entap_run.params", the error message "Error code: 10 InterProScan selected, but invalid databases input!" was shown. However, when I removed the final one "interproscan-db=morbidblite", the EnTap worked.

But another problem happens that the output file "interpro_results.gff3" only shows the result of "Coils" database when I used all interproscan database.