AdmiralenOla / Scoary

Pan-genome wide association studies
GNU General Public License v3.0
147 stars 35 forks source link

KeyError when trait file contains subset of genomes #46

Closed AdmiralenOla closed 7 years ago

AdmiralenOla commented 7 years ago

If the traits file contain only a subset of genomes of the Roary file, Scoary currently exits with a KeyError.

If you want to run Scoary on just a subset of the genomes that you ran Roary on (You might be missing phenotypic data for some isolates for example), there are currently two ways of handling this:

  1. Using --restrict_to, pointing to a csv file which lists only the genomes you want to include.

  2. Editing the Roary file by column-wise deletion of the genomes you don't have in your traits file. (Scoary doesnt use the summary statistics in the first columns of the Roary file, so this will not impact analysis)

Plan: