Open huddlej opened 1 month ago
I'm not sure how to easily integrate this into fauna because the tdb/upload does an upsert using a generated index, not the reference strains.
Maybe this can be a new rule in the seasonal flu upload workflow. We could create a list of reference strains and their inclusion date, something like:
csvtk sort -t -k inclusion_date data/titers.tsv \
| csvtk cut -t -f serum_strain,inclusion_date \
| csvtk uniq -t -f serum_strain > reference_strains.tsv
Description
Incoming titer measurements often include new reference strains that we need to include in our reports. It is easy to miss these by manual inspection each week and would be more effective to generate a list of these new references as they appear and somehow report these as part of the ingest.