Closed metalichen closed 4 years ago
How did you run antiSMASH? You used the GBK file from funannotate predict (or update)? What version of antiSMASH? Typically the proper antiSMASH result GBK file is named something like first_contig.final.gbk
Sorry, I accidentally copied a wrong command. I used file named "Cryptococcus_neoformans.gbk", named the same way as the file I submitted to the web server, but containing antismash annotations. Here is a fragment:
gene complement(1670137..1671994)
/locus_tag="crynepredictedgene_002970"
CDS complement(join(1670137..1670385,1670453..1671205,
1671262..1671376,1671432..1671994))
/codon_start=1
/locus_tag="crynepredictedgene_002970"
/product="hypothetical protein"
/protein_id="ncbi_crynepredictedgene_002970-T1"
/transl_table=1
/translation="MGPGQGYRPPPYGLPTRPSLDTPASAVSQSQRQGFPQPQPYMPQT
QQGFYPGYGYGYQPNLSGGYPSGGFHPMYAAPAPSFGQSLFQSPVAVNPEGYSYSTTYL
SSQYNPQVSAPNPPMKRQRPNNSNVMTGGVLSAKPWRNCSHPGCKFVGPGDQVEIHEED
RHLIYAPGKVPQRSEEEERFAKRKGPLPPIQGTNITLNTPEDIEKWIAERKSRWPTAKR
VLEKEEERQAAIARGEVPAKQRKGKGRRNDPASRAEEWGREVKDEEADIPRVFGGERGR
GRGRGRGSVRGRGGRAGNEGRSDGVAPVHSIVQTSTQSQSRQSSEVNPTADSLIGLGGY
DTPAESASASDSSDTESSTESDVGSDSSSDSSSGSEDDHAQLKPAEASTSSPATTTTTK
PSISTLSKPICKFFAQQGRCKFNDRCRFAHIAPDGSSVDTSAQGENRKPAPQQEKKRQP
RQPSARKLNPFERPSMLGALLANPIQNTLSQISQTIRFLVANDMLQNVEIRPGQVEEEE
KARNKVVLLDGSSKDNNGATEDNLNMEGGDDIIQELKETEGE"
protocluster 1671730..1696243
/aStool="rule-based-clusters"
/contig_edge="False"
/core_location="join{[1681729:1681757](+),
[1681807:1681821](+), [1681886:1682072](+),
[1682138:1682347](+), [1682403:1682986](+),
[1683034:1683190](+), [1683251:1683590](+),
[1683935:1684044](+), [1684122:1684341](+),
[1684397:1684404](+), [1684464:1684481](+),
[1684539:1684596](+), [1684645:1684776](+),
[1684832:1685305](+), [1685358:1685416](+),
[1685464:1685654](+), [1685707:1685781](+),
[1685836:1685907](+), [1685972:1686072](+),
[1686141:1686243](+)}"
/cutoff="20000"
/detection_rule="(Terpene_synth or Terpene_synth_C or
phytoene_synt or Lycopene_cycl or terpene_cyclase or NapT7
or fung_ggpps or fung_ggpps2 or trichodiene_synth or TRI5)"
/neighbourhood="10000"
/product="terpene"
/protocluster_number="1"
/tool="antismash"
proto_core join(1681730..1681757,1681808..1681821,1681887..1682072,
1682139..1682347,1682404..1682986,1683035..1683190,
1683252..1683590,1683936..1684044,1684123..1684341,
1684398..1684404,1684465..1684481,1684540..1684596,
1684646..1684776,1684833..1685305,1685359..1685416,
1685465..1685654,1685708..1685781,1685837..1685907,
1685973..1686072,1686142..1686243)
/aStool="rule-based-clusters"
/tool="antismash"
/cutoff="20000"
/detection_rule="(Terpene_synth or Terpene_synth_C or
phytoene_synt or Lycopene_cycl or terpene_cyclase or NapT7
or fung_ggpps or fung_ggpps2 or trichodiene_synth or TRI5)"
/neighbourhood="10000"
/product="terpene"
/protocluster_number="1"
cand_cluster 1671730..1696243
/candidate_cluster_number="1"
/contig_edge="False"
/detection_rules="(Terpene_synth or Terpene_synth_C or
phytoene_synt or Lycopene_cycl or terpene_cyclase or NapT7
or fung_ggpps or fung_ggpps2 or trichodiene_synth or TRI5)"
/kind="single"
/product="terpene"
/protoclusters="1"
/tool="antismash"
I used antiSMASH version 5.0.0 on the web server (https://fungismash.secondarymetabolites.org/#!/start).
Does the logfile have anymore information than what you posted from the terminal output?
Here it is:
[10/09/19 19:52:42]: 309 secretome and 1,464 transmembane annotations added
[10/09/19 19:52:42]: Now parsing antiSMASH results, finding SM clusters
[10/09/19 19:52:46]: Found 0 clusters, 0 biosynthetic enyzmes, and 0 smCOGs predicted by antiSMASH
[10/09/19 19:52:46]: bedtools intersect -wo -a cry_ne_preds/annotate_misc/antismash/clusters.bed -b /path/cryptococcus/cry_ne_preds/predict_results/Cryptococcus_neoformans.gff3
[10/09/19 19:52:46]: Found 0 duplicated annotations, adding 52,642 valid annotations
[10/09/19 19:52:46]: Parsing tbl file: /path/cryptococcus/cry_ne_preds/annotate_misc/genome.tbl
[10/09/19 19:52:47]: Converting to final Genbank format, good luck!
[10/09/19 19:52:47]: tbl2asn -y "Annotated using funannotate v1.5.3" -N 1 -t ../bryoria_tortuosa/template.sbt.txt -M n -j "[organism=Cryptococcus neoformans]" -V b -c fx -T -a r10u -l paired-ends -Z cry_ne_preds/annotate_misc/tbl2asn/1/discrepency.report.txt -p cry_ne_preds/annotate_misc/tbl2asn/1
[10/09/19 19:53:23]: [tbl2asn] Flatfile genome
[tbl2asn] Validating genome
[10/09/19 19:53:33]: Creating AGP file and corresponding contigs file
[10/09/19 19:53:33]: perl /home/gulnara_tagirdzhanova/downloads/funannotate-1.5.3/util/fasta2agp.pl Cryptococcus_neoformans.scaffolds.fa
[10/09/19 19:53:34]: Cross referencing SM cluster hits with MIBiG database version 1.4
[10/09/19 19:53:34]: diamond blastp --sensitive --query cry_ne_preds/annotate_misc/antismash/smcluster.proteins.fasta --threads 14 --out cry_ne_preds/annotate_misc/antismash/smcluster.MIBiG.blast.txt --db /scratch/1/gulnara/funannotate_db/mibig.dmnd --max-hsps 1 --evalue 0.001 --max-target-seqs 1 --outfmt 6
[10/09/19 19:53:34]: diamond v0.9.24.125 | by Benjamin Buchfink <buchfink@gmail.com>
Licensed under the GNU GPL <https://www.gnu.org/licenses/gpl.txt>
Check http://github.com/bbuchfink/diamond for updates.
#CPU threads: 14
Scoring parameters: (Matrix=BLOSUM62 Lambda=0.267 K=0.041 Penalties=11/1)
Temporary directory: cry_ne_preds/annotate_misc/antismash
Opening the database... [7.1e-05s]
#Target sequences to report alignments for: 1
Opening the input file... [0.000102s]
Error: Error detecting input file format. First line seems to be blank.
It seems that the problem is in parsing the antiSMASH output file
Yes. Just noticed that you have an old version of funannotate. Please update to newest version and see if the problem persists. We addressed some of these v4.5 v5 of antiSMASH in the last few months.
Hello,
I got an error while running 'funannotate annotate'. Since 'fun annotate remote' did not produce any result, I ran antiSMASH on their web server. When I tried to feed the resulting gbk file back to funannotate, I got an error:
The
cry_ne_preds/annotate_misc/antismash/
folder contains three files, clusters.bed, secmet.clusters.txt, and smcluster.proteins.fasta, but all of them are empty.If I do not use --antismash flag, everything goes smoothly.