I will get the following output. The error is at the bottom.
:: Output directory ...
/var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output
:: Anvo'o version ...
Anvi'o version ...............................: esther (v6.1-master)
Profile DB version ...........................: 31
Contigs DB version ...........................: 14
Pan DB version ...............................: 13
Genome data storage version ..................: 6
Auxiliary data storage version ...............: 2
Structure DB version .........................: 1
:: Setting up the pan analysis directory ...
:: Generating contigs databases for external genomes ...
:: INPUT DIR: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test, FNAME: 01 ...
:: RENAMING CONTIGS ...
Input ........................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/01.fa
Output .......................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/01-clean.fa
Minimum length ...............................: 0
Total num contigs ............................: 1
Total num nucleotides ........................: 139,930
Contigs removed ..............................: 0 (0.00% of all)
Nucleotides removed ..........................: 0 (0.00% of all)
Deflines simplified ..........................: True
:: GENERATING THE CONTIGS DB ...
Input FASTA file .............................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/01-clean.fa
Name .........................................: 01
Description ..................................: No description is given
Split Length .................................: 20,000
K-mer size ...................................: 4
Skip gene calling? ...........................: False
External gene calls provided? ................: None
Ignoring internal stop codons? ...............: False
Splitting pays attention to gene calls? ......: False
Finding ORFs in contigs
===============================================
Genes ........................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp_0kaapnx/contigs.genes
Amino acid sequences .........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp_0kaapnx/contigs.amino_acid_sequences
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp_0kaapnx/00_log.txt
CITATION
===============================================
Anvi'o will use 'prodigal' by Hyatt et al (doi:10.1186/1471-2105-11-119) to
identify open reading frames in your data. When you publish your findings,
please do not forget to properly credit their work.
Result .......................................: Prodigal (v2.6.2) has identified 119 genes.
Contigs with at least one gene call ..........: 1 of 1 (100.0%)
Contigs database .............................: A new database, /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/01.db, has been created.
Number of contigs ............................: 1
Number of splits .............................: 7
Total number of nucleotides ..................: 139,930
Gene calling step skipped ....................: False
Splits broke genes (non-mindful mode) ........: True
Desired split length (what the user wanted) ..: 20,000
Average split length (wnat anvi'o gave back) .: 19,990
:: RUNNING HMMs ...
Target found .................................: AA:GENE
Target found .................................: RNA:CONTIG
HMM Profiling for Bacteria_71
===============================================
Reference ....................................: Lee modified, https://doi.org/10.1093/bioinformatics/btz188
Kind .........................................: singlecopy
Alphabet .....................................: AA
Context ......................................: GENE
Domain .......................................: bacteria
HMM model path ...............................: /Users/tito/github/anvio/anvio/data/hmm/Bacteria_71/genes.hmm.gz
Number of genes ..............................: 71
Noise cutoff term(s) .........................: --cut_ga
Number of CPUs will be used for search .......: 1
Temporary work dir ...........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp515v46x2
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp515v46x2/00_log.txt
Number of raw hits ...........................: 1
HMM Profiling for Archaea_76
===============================================
Reference ....................................: Lee, https://doi.org/10.1093/bioinformatics/btz188
Kind .........................................: singlecopy
Alphabet .....................................: AA
Context ......................................: GENE
Domain .......................................: archaea
HMM model path ...............................: /Users/tito/github/anvio/anvio/data/hmm/Archaea_76/genes.hmm.gz
Number of genes ..............................: 76
Noise cutoff term(s) .........................: --cut_ga
Number of CPUs will be used for search .......: 1
Temporary work dir ...........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp515v46x2
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp515v46x2/00_log.txt
Number of raw hits ...........................: 0
* The HMM source 'Archaea_76' returned 0 hits. SAD (but it's stil OK).
HMM Profiling for Protista_83
===============================================
Reference ....................................: Delmont, http://merenlab.org/delmont-euk-scgs
Kind .........................................: singlecopy
Alphabet .....................................: AA
Context ......................................: GENE
Domain .......................................: eukarya
HMM model path ...............................: /Users/tito/github/anvio/anvio/data/hmm/Protista_83/genes.hmm.gz
Number of genes ..............................: 83
Noise cutoff term(s) .........................: -E 1e-25
Number of CPUs will be used for search .......: 1
Temporary work dir ...........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp515v46x2
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp515v46x2/00_log.txt
Number of raw hits ...........................: 0
* The HMM source 'Protista_83' returned 0 hits. SAD (but it's stil OK).
HMM Profiling for Ribosomal_RNAs
===============================================
Reference ....................................: Seemann T, https://github.com/tseemann/barrnap
Kind .........................................: Ribosomal_RNAs
Alphabet .....................................: RNA
Context ......................................: CONTIG
Domain .......................................: N\A
HMM model path ...............................: /Users/tito/github/anvio/anvio/data/hmm/Ribosomal_RNAs/genes.hmm.gz
Number of genes ..............................: 12
Noise cutoff term(s) .........................: --cut_ga
Number of CPUs will be used for search .......: 1
Temporary work dir ...........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpzwkyhcft
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpzwkyhcft/00_log.txt
Number of raw hits ...........................: 0
* The HMM source 'Ribosomal_RNAs' returned 0 hits. SAD (but it's stil OK).
:: INPUT DIR: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test, FNAME: 02 ...
:: RENAMING CONTIGS ...
Input ........................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/02.fa
Output .......................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/02-clean.fa
Minimum length ...............................: 0
Total num contigs ............................: 1
Total num nucleotides ........................: 139,930
Contigs removed ..............................: 0 (0.00% of all)
Nucleotides removed ..........................: 0 (0.00% of all)
Deflines simplified ..........................: True
:: GENERATING THE CONTIGS DB ...
Input FASTA file .............................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/02-clean.fa
Name .........................................: 02
Description ..................................: No description is given
Split Length .................................: 20,000
K-mer size ...................................: 4
Skip gene calling? ...........................: False
External gene calls provided? ................: None
Ignoring internal stop codons? ...............: False
Splitting pays attention to gene calls? ......: False
Finding ORFs in contigs
===============================================
Genes ........................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp82lfcnnz/contigs.genes
Amino acid sequences .........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp82lfcnnz/contigs.amino_acid_sequences
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp82lfcnnz/00_log.txt
CITATION
===============================================
Anvi'o will use 'prodigal' by Hyatt et al (doi:10.1186/1471-2105-11-119) to
identify open reading frames in your data. When you publish your findings,
please do not forget to properly credit their work.
Result .......................................: Prodigal (v2.6.2) has identified 119 genes.
Contigs with at least one gene call ..........: 1 of 1 (100.0%)
Contigs database .............................: A new database, /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/02.db, has been created.
Number of contigs ............................: 1
Number of splits .............................: 7
Total number of nucleotides ..................: 139,930
Gene calling step skipped ....................: False
Splits broke genes (non-mindful mode) ........: True
Desired split length (what the user wanted) ..: 20,000
Average split length (wnat anvi'o gave back) .: 19,990
:: RUNNING HMMs ...
Target found .................................: AA:GENE
Target found .................................: RNA:CONTIG
HMM Profiling for Bacteria_71
===============================================
Reference ....................................: Lee modified, https://doi.org/10.1093/bioinformatics/btz188
Kind .........................................: singlecopy
Alphabet .....................................: AA
Context ......................................: GENE
Domain .......................................: bacteria
HMM model path ...............................: /Users/tito/github/anvio/anvio/data/hmm/Bacteria_71/genes.hmm.gz
Number of genes ..............................: 71
Noise cutoff term(s) .........................: --cut_ga
Number of CPUs will be used for search .......: 1
Temporary work dir ...........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp6rwohnjf
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp6rwohnjf/00_log.txt
Number of raw hits ...........................: 1
HMM Profiling for Archaea_76
===============================================
Reference ....................................: Lee, https://doi.org/10.1093/bioinformatics/btz188
Kind .........................................: singlecopy
Alphabet .....................................: AA
Context ......................................: GENE
Domain .......................................: archaea
HMM model path ...............................: /Users/tito/github/anvio/anvio/data/hmm/Archaea_76/genes.hmm.gz
Number of genes ..............................: 76
Noise cutoff term(s) .........................: --cut_ga
Number of CPUs will be used for search .......: 1
Temporary work dir ...........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp6rwohnjf
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp6rwohnjf/00_log.txt
Number of raw hits ...........................: 0
* The HMM source 'Archaea_76' returned 0 hits. SAD (but it's stil OK).
HMM Profiling for Protista_83
===============================================
Reference ....................................: Delmont, http://merenlab.org/delmont-euk-scgs
Kind .........................................: singlecopy
Alphabet .....................................: AA
Context ......................................: GENE
Domain .......................................: eukarya
HMM model path ...............................: /Users/tito/github/anvio/anvio/data/hmm/Protista_83/genes.hmm.gz
Number of genes ..............................: 83
Noise cutoff term(s) .........................: -E 1e-25
Number of CPUs will be used for search .......: 1
Temporary work dir ...........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp6rwohnjf
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp6rwohnjf/00_log.txt
Number of raw hits ...........................: 0
* The HMM source 'Protista_83' returned 0 hits. SAD (but it's stil OK).
HMM Profiling for Ribosomal_RNAs
===============================================
Reference ....................................: Seemann T, https://github.com/tseemann/barrnap
Kind .........................................: Ribosomal_RNAs
Alphabet .....................................: RNA
Context ......................................: CONTIG
Domain .......................................: N\A
HMM model path ...............................: /Users/tito/github/anvio/anvio/data/hmm/Ribosomal_RNAs/genes.hmm.gz
Number of genes ..............................: 12
Noise cutoff term(s) .........................: --cut_ga
Number of CPUs will be used for search .......: 1
Temporary work dir ...........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpa2d9mq0a
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpa2d9mq0a/00_log.txt
Number of raw hits ...........................: 0
* The HMM source 'Ribosomal_RNAs' returned 0 hits. SAD (but it's stil OK).
:: INPUT DIR: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test, FNAME: 03 ...
:: RENAMING CONTIGS ...
Input ........................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/03.fa
Output .......................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/03-clean.fa
Minimum length ...............................: 0
Total num contigs ............................: 1
Total num nucleotides ........................: 139,930
Contigs removed ..............................: 0 (0.00% of all)
Nucleotides removed ..........................: 0 (0.00% of all)
Deflines simplified ..........................: True
:: GENERATING THE CONTIGS DB ...
Input FASTA file .............................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/03-clean.fa
Name .........................................: 03
Description ..................................: No description is given
Split Length .................................: 20,000
K-mer size ...................................: 4
Skip gene calling? ...........................: False
External gene calls provided? ................: None
Ignoring internal stop codons? ...............: False
Splitting pays attention to gene calls? ......: False
Finding ORFs in contigs
===============================================
Genes ........................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpr99hfzo8/contigs.genes
Amino acid sequences .........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpr99hfzo8/contigs.amino_acid_sequences
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpr99hfzo8/00_log.txt
CITATION
===============================================
Anvi'o will use 'prodigal' by Hyatt et al (doi:10.1186/1471-2105-11-119) to
identify open reading frames in your data. When you publish your findings,
please do not forget to properly credit their work.
Result .......................................: Prodigal (v2.6.2) has identified 117 genes.
Contigs with at least one gene call ..........: 1 of 1 (100.0%)
Contigs database .............................: A new database, /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/03.db, has been created.
Number of contigs ............................: 1
Number of splits .............................: 7
Total number of nucleotides ..................: 139,930
Gene calling step skipped ....................: False
Splits broke genes (non-mindful mode) ........: True
Desired split length (what the user wanted) ..: 20,000
Average split length (wnat anvi'o gave back) .: 19,990
:: RUNNING HMMs ...
Target found .................................: RNA:CONTIG
Target found .................................: AA:GENE
HMM Profiling for Bacteria_71
===============================================
Reference ....................................: Lee modified, https://doi.org/10.1093/bioinformatics/btz188
Kind .........................................: singlecopy
Alphabet .....................................: AA
Context ......................................: GENE
Domain .......................................: bacteria
HMM model path ...............................: /Users/tito/github/anvio/anvio/data/hmm/Bacteria_71/genes.hmm.gz
Number of genes ..............................: 71
Noise cutoff term(s) .........................: --cut_ga
Number of CPUs will be used for search .......: 1
Temporary work dir ...........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp2k4mx7il
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp2k4mx7il/00_log.txt
Number of raw hits ...........................: 1
HMM Profiling for Archaea_76
===============================================
Reference ....................................: Lee, https://doi.org/10.1093/bioinformatics/btz188
Kind .........................................: singlecopy
Alphabet .....................................: AA
Context ......................................: GENE
Domain .......................................: archaea
HMM model path ...............................: /Users/tito/github/anvio/anvio/data/hmm/Archaea_76/genes.hmm.gz
Number of genes ..............................: 76
Noise cutoff term(s) .........................: --cut_ga
Number of CPUs will be used for search .......: 1
Temporary work dir ...........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp2k4mx7il
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp2k4mx7il/00_log.txt
Number of raw hits ...........................: 0
* The HMM source 'Archaea_76' returned 0 hits. SAD (but it's stil OK).
HMM Profiling for Protista_83
===============================================
Reference ....................................: Delmont, http://merenlab.org/delmont-euk-scgs
Kind .........................................: singlecopy
Alphabet .....................................: AA
Context ......................................: GENE
Domain .......................................: eukarya
HMM model path ...............................: /Users/tito/github/anvio/anvio/data/hmm/Protista_83/genes.hmm.gz
Number of genes ..............................: 83
Noise cutoff term(s) .........................: -E 1e-25
Number of CPUs will be used for search .......: 1
Temporary work dir ...........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp2k4mx7il
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp2k4mx7il/00_log.txt
Number of raw hits ...........................: 0
* The HMM source 'Protista_83' returned 0 hits. SAD (but it's stil OK).
HMM Profiling for Ribosomal_RNAs
===============================================
Reference ....................................: Seemann T, https://github.com/tseemann/barrnap
Kind .........................................: Ribosomal_RNAs
Alphabet .....................................: RNA
Context ......................................: CONTIG
Domain .......................................: N\A
HMM model path ...............................: /Users/tito/github/anvio/anvio/data/hmm/Ribosomal_RNAs/genes.hmm.gz
Number of genes ..............................: 12
Noise cutoff term(s) .........................: --cut_ga
Number of CPUs will be used for search .......: 1
Temporary work dir ...........................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp55hpqi2q
Log file .....................................: /var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmp55hpqi2q/00_log.txt
Number of raw hits ...........................: 0
* The HMM source 'Ribosomal_RNAs' returned 0 hits. SAD (but it's stil OK).
:: Importing functions into the contigs database ...
Gene functions ...............................: 494 function calls from 5 sources for 115 unique gene calls has been added to the contigs database.
Gene functions ...............................: 494 function calls from 5 sources for 115 unique gene calls has been added to the contigs database.
Gene functions ...............................: 484 function calls from 5 sources for 113 unique gene calls has been added to the contigs database.
:: Generating an anvi'o genomes storage ...
WARNING
===============================================
Good news! Anvi'o found all these functions that are common to all of your
genomes and will use them for downstream analyses and is very proud of you:
'EGGNOG_BACT, KEGG_PATHWAYS, COG_FUNCTION, COG_CATEGORY, GO_TERMS'.
Internal genomes .............................: 0 have been initialized.
External genomes .............................: 3 found.
* g01 is stored with 119 genes (0 of which were partial)
* g02 is stored with 119 genes (0 of which were partial)
* g03 is stored with 117 genes (1 of which were partial)
The new genomes storage ......................: TEST-GENOMES.db (v6, signature: hash536a8295)
Number of genomes ............................: 3 (internal: 0, external: 3)
Number of gene calls .........................: 355
Number of partial gene calls .................: 1
:: Running the pangenome anaysis with default parameters ...
WARNING
===============================================
If you publish results from this workflow, please do not forget to cite DIAMOND
(doi:10.1038/nmeth.3176), unless you use it with --use-ncbi-blast flag, and MCL
(http://micans.org/mcl/ and doi:10.1007/978-1-61779-361-5_15)
Genomes storage ..............................: Initialized (storage hash: hash536a8295)
Num genomes in storage .......................: 3
Num genomes will be used .....................: 3
Pan database .................................: A new database, /private/var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/TEST/TEST-PAN.db, has been created.
Exclude partial gene calls ...................: False
AA sequences FASTA ...........................: /private/var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/TEST/combined-aas.fa
Num AA sequences reported ....................: 355
Num excluded gene calls ......................: 0
Unique AA sequences FASTA ....................: /private/var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/TEST/combined-aas.fa.unique
WARNING
===============================================
You elected to use NCBI's blastp for amino acid sequence search. Running blastp
will be significantly slower than DIAMOND (although, anvi'o developers are
convinced that you *are* doing the right thing, so, kudos to you).
BLAST search db ..............................: /private/var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/TEST/combined-aas.fa.unique
BLAST results ................................: /private/var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/TEST/blast-search-results.txt
Min percent identity .........................: 0.0
Minbit .......................................: 0.5
Filtered search results ......................: 1,053 edges stored
MCL input ....................................: /private/var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/TEST/mcl-input.txt
MCL inflation ................................: 2.0
MCL output ...................................: /private/var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/TEST/mcl-clusters.txt
Number of gene clusters ......................: 121
CITATION
===============================================
Anvi'o will use 'muscle' by Edgar, doi:10.1093/nar/gkh340
(http://www.drive5.com/muscle) to align your sequences. If you publish your
findings, please do not forget to properly credit their work.
New data for 'items' in data group 'default'
===============================================
Data key "num_genomes_gene_cluster_has_hits" .: Predicted type: int
Data key "num_genes_in_gene_cluster" .........: Predicted type: int
Data key "max_num_paralogs" ..................: Predicted type: int
Data key "SCG" ...............................: Predicted type: int
WARNING
===============================================
You (or the programmer) asked anvi'o to NOT check the consistency of the names
of your items between your additional data and the pan database you are
attempting to update. So be it. Anvi'o will not check anything, but if things
don't look the way you expected them to look, you will not blame anvi'o for your
poorly prepared data, but choose between yourself or Obama.
NEW DATA
===============================================
Database .....................................: pan
Data group ...................................: default
Data table ...................................: items
New data keys ................................: num_genomes_gene_cluster_has_hits, num_genes_in_gene_cluster, max_num_paralogs, SCG.
gene clusters info ...........................: 121 gene_clusters stored in the database
New items order ..............................: "frequency:euclidean:ward" (type newick) has been added to the database...
WARNING
===============================================
Clustering for "frequency:euclidean:ward" is already in the database. It will be
replaced with the new content.
New items order ..............................: "frequency:euclidean:ward" (type newick) has been added to the database...
New items order ..............................: "presence-absence:euclidean:ward" (type newick) has been added to the database...
New items order ..............................: "Forced synteny <> g01:NA:NA" (type basic) has been added to the database...
New items order ..............................: "Forced synteny <> g02:NA:NA" (type basic) has been added to the database...
New items order ..............................: "Forced synteny <> g03:NA:NA" (type basic) has been added to the database...
New layer_orders data...
===============================================
Data key "gene_cluster presence absence" .....: Type: newick
Data key "gene_cluster frequencies" ..........: Type: newick
New order data added to the db for layer_orders : gene_cluster presence absence, gene_cluster frequencies.
New data for 'layers' in data group 'default'
===============================================
Data key "total_length" ......................: Predicted type: int
Data key "gc_content" ........................: Predicted type: float
Data key "percent_completion" ................: Predicted type: int
Data key "percent_redundancy" ................: Predicted type: int
Data key "num_genes" .........................: Predicted type: int
Data key "avg_gene_length" ...................: Predicted type: float
Data key "num_genes_per_kb" ..................: Predicted type: float
Data key "singleton_gene_clusters" ...........: Predicted type: int
Data key "num_gene_clusters" .................: Predicted type: int
NEW DATA
===============================================
Database .....................................: pan
Data group ...................................: default
Data table ...................................: layers
New data keys ................................: total_length, gc_content, percent_completion, percent_redundancy, num_genes, avg_gene_length, num_genes_per_kb, singleton_gene_clusters, num_gene_clusters.
Genomes storage .............................................: Initialized (storage hash: hash536a8295)
Num genomes in storage ......................................: 3
Num genomes will be used ....................................: 3
Pan DB ......................................................: Initialized: /private/var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/TEST/TEST-PAN.db (v. 13)
Gene cluster homogeneity estimates ..........................: Functional: [NO]; Geometric: [NO]; Combined: [NO]
* Gene clusters are initialized for all 121 gene clusters in the database.
New data for 'items' in data group 'default'
===============================================
Data key "functional_homogeneity_index" ......: Predicted type: float
Data key "geometric_homogeneity_index" .......: Predicted type: float
Data key "combined_homogeneity_index" ........: Predicted type: float
WARNING
===============================================
You (or the programmer) asked anvi'o to NOT check the consistency of the names
of your items between your additional data and the pan database you are
attempting to update. So be it. Anvi'o will not check anything, but if things
don't look the way you expected them to look, you will not blame anvi'o for your
poorly prepared data, but choose between yourself or Obama.
NEW DATA
===============================================
Database .....................................: pan
Data group ...................................: default
Data table ...................................: items
New data keys ................................: functional_homogeneity_index, geometric_homogeneity_index, combined_homogeneity_index.
log file ....................................................: /private/var/folders/86/v68d1x9s59128vx4pr1kls6r0000gn/T/tmpt_mes0n4/test-output/pan_test/TEST/log.txt
:: Running ANI on genomes and storing results in the PAN database ...
usage: anvi-compute-genome-similarity [-h] [-i FILE_PATH] [-e FILE_PATH]
[-f FASTA_TEXT_FILE] -o DIR_PATH
[-p PAN_DB] --program
{pyANI,fastANI,sourmash}
[--fastani-kmer-size FASTANI_KMER_SIZE]
[--fragment-length FRAGMENT_LENGTH]
[--min-num-fragments MIN_NUM_FRAGMENTS]
[--method {ANIm,ANIb,ANIblastall,TETRA}]
[--min-alignment-fraction NUM]
[--significant-alignment-length INT]
[--min-full-percent-identity FULL_PERCENT_IDENTITY]
[--kmer-size INT] [--scale INT]
[--distance DISTANCE_METRIC]
[--linkage LINKAGE_METHOD]
[-T NUM_THREADS] [--just-do-it]
[--log-file FILE_PATH]
anvi-compute-genome-similarity: error: the following arguments are required: --program
Config Error: According to the exit code ('2'), anvi'o suspects that something may have gone
wrong while running your tests :/ We hope that the reason is clear to you from
the lines above. But if you don't see anything obvious, and especially if the
test ended up running until the end with reasonable looking final results, you
shouldn't worry too much about this error. Life is short and we all can worry
just a bit less.
This test was run on a iMac (macOS Mojave Version 10.14.6)
If I try:
I will get the following output. The error is at the bottom.
This test was run on a iMac (macOS Mojave Version 10.14.6)
I followed master installation mentioned in http://merenlab.org/2016/06/26/installation-v2/#following-the-active-codebase-youre-a-wizard-arry I also installed muscle http://merenlab.org/2016/06/18/installing-third-party-software/#muscle
Thank you for your input on how to solve this issue.
Thank you.