Open kapsakcj opened 4 days ago
OK, looks like SeqSero2 does not predict or output this in an obvious way. I just looked through the output files produced when running the tool in `-m a` (microassembly) mode, which is our default in TheiaProk, and didn't find the serogroup anywhere.
Likely only available in SISTR output TSV
SeqSero2 provides O-antigen predictions but not the serogroup letter directly. You would have to map the O-antigen numbers from SeqSero2 to serogroups manually, but we should be able to get the required information from SISTR.
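For reference, a rough sketch of what that manual mapping could look like. The lookup table is deliberately partial (a handful of common Kauffmann-White O groups), the function name is hypothetical, and the assumption that the O-antigen arrives as a plain string like `4` or `O:9` has not been checked against actual SeqSero2 output.

```python
# Partial, illustrative lookup only; NOT a complete Kauffmann-White table.
O_ANTIGEN_TO_SEROGROUP = {
    "2": "A",
    "4": "B",
    "7": "C1",
    "8": "C2-C3",
    "9": "D1",
    "3,10": "E1",
}


def serogroup_from_o_antigen(o_antigen):
    """Map an O-antigen string to a serogroup, or None if it is not in the table.

    Assumption: the input looks like "4", "O:4" or "O-4"; the real SeqSero2
    field format may differ.
    """
    key = o_antigen.strip()
    # Drop an optional "O:" / "O-" prefix so "O:4" and "4" behave the same.
    if key.upper().startswith(("O:", "O-")):
        key = key[2:]
    return O_ANTIGEN_TO_SEROGROUP.get(key)


print(serogroup_from_o_antigen("O:4"))
print(serogroup_from_o_antigen("9"))
```

Unmapped antigens come back as `None` rather than a guess, so anything missing from the table would show up as a gap in the output column and not as a wrong serogroup.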
:cool:
:pushpin: Explain the Request
A lab requested that the Salmonella serogroup be parsed from the SeqSero2 and/or SISTR output files and output as an additional column. Serogroups are usually one letter (A, B, C, D, E, etc.) and can be accompanied by a number (I'm not 100% sure on this, I'm not too familiar with these). I believe these are somehow related/connected to the O-antigen.
This value can be found in the SISTR output TSV in the `serogroup` column: `B` or `D1` are examples. Not sure if this info is output from SeqSero2, but I'll keep looking.
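A minimal sketch of pulling that column out of the TSV, in case it helps scope the change. Only the `serogroup` column name comes from the SISTR output described above; the function name and the other columns in the sample data are made up for illustration, and this is not how the workflow currently parses anything.

```python
import csv
import io


def extract_serogroups(tsv_handle, column="serogroup"):
    """Return the value of `column` for every row of a tab-separated file."""
    reader = csv.DictReader(tsv_handle, delimiter="\t")
    if reader.fieldnames is None or column not in reader.fieldnames:
        raise KeyError(f"column {column!r} not found in TSV header")
    # Short rows yield None for missing fields, so fall back to "".
    return [(row[column] or "").strip() for row in reader]


# Made-up two-row example; real SISTR output has many more columns.
example = io.StringIO(
    "genome\tserogroup\tserovar\n"
    "sample1\tB\tTyphimurium\n"
    "sample2\tD1\tEnteritidis\n"
)
print(extract_serogroups(example))
```

Looking the column up by header name instead of by position should keep this working if the column order in the TSV changes between SISTR versions.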