chanzuckerberg / cryoet-data-portal-backend

CryoET Data Portal API server & ingestion scripts
MIT License
1 stars 2 forks source link

s3 data validation: annotations #200

Closed daniel-ji closed 2 months ago

daniel-ji commented 2 months ago

Add validation tests for annotations, some new, most of them ported over & refactored from Utz's @uermel tests. For a list of tests, see https://docs.google.com/document/d/1yMKM0DW9KRhlcYiBGPcR7oW0liGtUew6NAmBhMg5U3w/edit#heading=h.56ec9se80d6g

Also refactored data / file fetching to reflect that there is only one mrc / zarr file per segmentation mask annotation / tomogram / tiltseries. Note that in the metadata files, the mrc_files field in the path still is an list of strings and should get changed to be a single string (not done in this PR).

daniel-ji commented 2 months ago

Closed, a later s3 data validation PR affected a lot of pieces this PR was intending to merge in, so will create a new refined PR