neherlab / ncov-simple

2 stars 1 forks source link

Convert to Nextalign translation #4

Closed corneliusroemer closed 2 years ago

corneliusroemer commented 2 years ago

This PR has been extensively used and tested in production.

It contains two main changes:

  1. Nextalign is used for translation
  2. Pango lineages are assigned by running pangolin, in addition to the pango lineages that come from GISAID. So now there are three pango labels: a) GISAID, b) default, c) usher mode

Because pangolin required its own conda environment, I had to change how jobs are submitted to SLURM. Instead of using a fixed submit shell script, submission is handled by a python script wrapper that assembles the submit script from a template, adding conda activate nextstrain/pangolin based on the specific rule.