Closed manasaV3 closed 1 month ago
We are adding two new fields to alignment metadata.
Any updates to the annotation source config will be addressed as a part of a different issue, once we have a decision on portal standard annotations being a part of the original or newer deposition.
This work is complete
blocking https://github.com/chanzuckerberg/cryoet-data-portal/issues/1091
Motivation
To be able to ingest the Alignments and Frames, and create relevant metadata, we need to update the config to support the new ingestion entities and updates to the metadata.
Definition of Done
A valid config is a config created in the dataset_configs folder that passes all the validation checks specified in the Makefile.
Tasks
The updates have to be made to the following entities:
collection_metadata
The `mdoc` files are currently ingested as a part of the `rawtilts` entity. This should no longer be the case.
Sources
The sources should follow the default structure
Config Migration
This requires an update to all the existing config files where source globs with the `mdoc` extension are listed under `rawtlt`. The `mdoc` glob entries should be moved under `collection_metadata.sources`. This would require the creation of the collection_metadata entity.
alignment
The alignment files are currently ingested as a part of the `rawtilts` entity. This should no longer be the case.
Metadata
Sources
The sources should follow the default structure
Config Migration
This requires an update for all the existing files with `xf` to be moved to `alignment`.
This requires an update to all the existing config files where source globs with the `xf`, `tlt` (not rawtlt), `aln`, or `com` extensions are listed under `rawtlt`. These entries should be moved under `alignment.sources`. This would require the creation of the alignment entity.
The affine_transformation_matrix should also be moved to the alignment metadata from tomogram.
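The two source migrations described so far (`mdoc` globs to `collection_metadata.sources`, and `xf`/`tlt`/`aln`/`com` globs to `alignment.sources`) could be scripted roughly as below. This is only a sketch: it assumes a simplified config where each entity is a dict holding a flat `sources` list of glob strings, which is not necessarily how the real configs nest their globs. The function name and the `MOVES` mapping are hypothetical, and moving the affine_transformation_matrix is not handled here.

```python
from copy import deepcopy

# Extension-to-entity mapping taken from the migration notes above.
# The flat {"entity": {"sources": [glob, ...]}} layout is an assumption.
MOVES = {
    "collection_metadata": (".mdoc",),
    "alignment": (".xf", ".tlt", ".aln", ".com"),
}


def migrate_rawtlt_sources(config, source_entity="rawtlt"):
    """Return a copy of config with matching globs moved out of source_entity."""
    migrated = deepcopy(config)
    if source_entity not in migrated:
        return migrated

    remaining = []
    for glob in migrated[source_entity].get("sources", []):
        # str.endswith accepts a tuple of suffixes; "*.rawtlt" does not
        # end with ".tlt", so raw tilt globs are left in place.
        target = next(
            (entity for entity, exts in MOVES.items() if glob.endswith(exts)),
            None,
        )
        if target is None:
            remaining.append(glob)
        else:
            # Create the target entity (and its sources list) on first use.
            migrated.setdefault(target, {}).setdefault("sources", []).append(glob)

    migrated[source_entity]["sources"] = remaining
    return migrated
```

The function returns a new dict rather than editing in place, so the original config can be diffed against the migrated one before anything is written back.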
Depending on the file extension of the source, the metadata.field should be:
frames
The frames currently don't have any metadata associated with them. They can have an optional metadata field.
Metadata
Config Migration
No config migration required. All these fields will be added manually at a later point.
tomogram
The tomograms already have metadata associated with them. We need to update the fields they currently support and validate.
Metadata
Config Migration
Add the two new bool fields to all the existing config YAMLs, with their default values.
Also, remove the affine_transformation_matrix field, as it is getting moved to the alignment entity.
We will retain the `affine_transformation_matrix` for backward compatibility.
annotations
Standardized annotations are no longer a part of this effort.
The annotations already have metadata associated with them. We need to update the fields they currently support and validate.
Sources
~~As we can have portal standard annotations now, we should allow up to 2 entries for the same shape in sources, with the caveat that at least one of them should have `is_portal_standard` set to true.~~
Config Migration
Add the new bool field to all the existing config YAMLs, with its default value.
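Adding default-valued bool fields to every existing config, as described for the tomogram and annotation metadata, might be scripted along these lines. This is a minimal sketch: the field names used in the usage example are placeholders, since the actual names and defaults are not listed here, and the `{"entity": {"metadata": {...}}}` layout and the function name are assumptions made for illustration.

```python
def add_default_fields(config, entity, defaults):
    """Add each default to the entity's metadata unless it is already set.

    Assumes (hypothetically) that config[entity]["metadata"] is a dict.
    Configs that do not define the entity are left untouched.
    """
    if entity not in config:
        return config
    metadata = config[entity].setdefault("metadata", {})
    for field, value in defaults.items():
        # setdefault keeps any value a config author has already chosen.
        metadata.setdefault(field, value)
    return config


# Usage with placeholder field names (not the real ones):
example = {"tomogram": {"metadata": {"placeholder_flag_a": True}}}
add_default_fields(
    example,
    "tomogram",
    {"placeholder_flag_a": False, "placeholder_flag_b": False},
)
```

Using `setdefault` means the script can be re-run safely: values that were set by hand are never overwritten by the default.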