Closed ndusek closed 7 months ago
I suspect you have to add the --gene-data-type
argument, i.e., something like --gene-data-type 'gene-count:,'
or --gene-data-type 'gene-list:\t'
depending on how your gene_presence_absence.csv
is formated.
If that doesn't solve your problem, you could mail me your dataset and I could have a look.
@MrTomRod thank you for the quick reply!
Adding --gene-data-type 'gene-list:,'
did indeed resolve the issue.
You might consider updating the usage guide for running Scoary2 on Roary output to include that flag, since I think that is the default output format for Roary. Just a suggestion...
I added changed it to this:
# Dataset from Scoary 1: genes in Roary gene count format
scoary2 \
--genes Gene_presence_absence.csv \
--gene-data-type 'gene-count:,' \
--traits Tetracycline_resistance.csv \
--outdir out \
--n-permut 1000
# If gene_presence_absence.csv is in gene-list format, use
# --gene-data-type 'gene-list:,'
# instead
Do you think that's clear enough?
Yep, looks great to me!
We are trying to process a
gene_presence_absence.csv
file from Roary with Scoary2. Previously, we were using Scoary (v1) and were able to get results (albeit with a few errors in the log file), whereas with Scoary2, the exact same command is failing.Here are the versions we are using for each of these packages:
Scoary: 1.6.16 Scoary2: 0.0.15 Roary: 3.13.0
Scoary (v1) results
Here is the command we have been using with Scoary (v1):
The process completes successfully, although it does print the following error several times:
But this does not prevent us from getting results for isolates that were not missing, so I consider this to be acceptable.
Scoary2 results
The Scoary2 usage guide suggests that we should be able to use the exact same command with the same inputs for Scoary2, so here is what we are running:
This is failing with the following trace:
The error
contains NaN
is clear enough, but I don't understand why Scoary2 would be complaining about this all of a sudden when the original Scoary had no problem with it.Any idea what's going on here?