Closed aclum closed 7 months ago
When schema changes aren't additive, we also need a migration script so that the data in MongoDB matches the schema. Here is a template for one: https://github.com/microbiomedata/nmdc-schema/blob/main/nmdc_schema/migrators/migrator_from_A_B_C_to_X_Y_Z.py
@eecavanna and @brynnz22 have done some of these, so they can help. We need a migration script to migrate 'Metagenome Bins Compression File' to 'Metagenome HQMQ Bins Compression File'.
cc @hubin-keio
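For anyone picking this up, here is a rough sketch of the core transformation such a migrator would apply to each data object. It is not the actual migrator from the template: it works on plain dictionaries, the helper names are made up for illustration, and the field name data_object_type is an assumption that should be checked against the schema.

```python
# Hypothetical sketch: plain-dict version of the rename this issue asks for.
# ASSUMPTION: the file type is stored in a field named "data_object_type".
OLD_TYPE = "Metagenome Bins Compression File"
NEW_TYPE = "Metagenome HQMQ Bins Compression File"


def rename_bins_file_type(data_object: dict) -> dict:
    """Return a copy of the document with the old type value replaced."""
    migrated = dict(data_object)  # shallow copy, so the input is left untouched
    if migrated.get("data_object_type") == OLD_TYPE:
        migrated["data_object_type"] = NEW_TYPE
    return migrated


def migrate_data_objects(data_objects: list) -> list:
    """Apply the rename to every document from the data object collection."""
    return [rename_bins_file_type(doc) for doc in data_objects]
```

Documents with a different type, or with no type at all, pass through unchanged, so the function should be safe to run over the whole collection.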
Here's a link to documentation about creating a migrator.
I will mock up a migrator specific to the schema changes in https://github.com/microbiomedata/nmdc-schema/pull/1791 now, which @chienchi can use as a reference.
I drafted this migrator, which y'all can use as a starting point: https://github.com/microbiomedata/nmdc-schema/pull/1837
There are three TODO items in it:
data_object_set collection is, indeed, the only collection requiring migration for this schema change
As a reminder, I will be out Friday.
Thank you @eecavanna, I have checked and updated the three TODO items. I think this is the only data_object to migrate. @aclum or @brynnz22, could you help to confirm?
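One possible way to double-check this is to scan an exported copy of the database for the old value. The sketch below assumes the database has been dumped into a dictionary that maps each collection name to a list of documents; that layout and the function names are hypothetical, not part of the migrator tooling.

```python
# Hypothetical sketch: find every collection that still contains the old value.
# ASSUMPTION: `database` maps collection name -> list of plain-dict documents.
OLD_TYPE = "Metagenome Bins Compression File"


def contains_value(node, target) -> bool:
    """Recursively check whether `target` appears as a value anywhere in `node`."""
    if isinstance(node, dict):
        return any(contains_value(value, target) for value in node.values())
    if isinstance(node, list):
        return any(contains_value(item, target) for item in node)
    return node == target


def collections_with_value(database: dict, target: str = OLD_TYPE) -> list:
    """Return the sorted names of collections holding at least one matching document."""
    return sorted(
        name
        for name, documents in database.items()
        if any(contains_value(doc, target) for doc in documents)
    )
```

If the only name returned before the migration is data_object_set, and the list is empty afterwards, that would support the conclusion above.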
From Neha from summer 2023, specifically I'm not sure if we are saving metabat-bins.tar.gz, which would be needed to implement the eukaryotic binning.