Closed nicola-palmieri closed 2 years ago
I am afraid your sample size is simply too small for pyseer to be used, let alone return meaningful results. The original SEER paper has some comments about sample size.
Thanks for the prompt reply. This was a test dataset but I am planning to sequence more strains. I thought I could already see some associations. I will retry when I have more data.
Dear Author
I have started playing with pyseer on a small dataset of 12 strains from a bacterium called Gallibacterium anatis, from which I have computed SNPs using parsnp. In addition I have a table with binary values for resistance to different antibiotics, I am trying to fit the first antibiotic phenotypic binary vector to the SNPs, but I get the following error:
These are the first 10 lines of the VCF file - it contains 75767 SNPs:
Here is the phenotype file for the first antibiotic (CEZ = cefazoline):
This is the mash distance table for the 12 strains:
What could be the problem?
Thank you!!
Kind regards Nicola