Closed rmcolq closed 3 years ago
First discovery, cut
doesn't work with these CSV because some fields contain commas, but have quotation marks around them so are handled by other parsers.
Summary of problems found:
unix
flavoured mode. Clusterfunk was updated accordingly..txt
instead of UK??.txt
. Have replaced by python script.
UK422 for example: 12403 in the dataset, 1240 have a phylotype: