Closed ngehlenborg closed 3 years ago
@ngehlenborg @ilan-gold xmlvalid only validates the structure of the OME.TIFF header against a schema.
This is the bare minimum. For example, the images missing the channel names passed the xmlvalid test. There should be a discussion about fields per assay type and how we can use that information at run time to trigger computations within a DAG.
I agree with @icaoberg that we should really be doing more.
tifffile
issue with multiple tags continues to be a problem (and it's not a bug), we should overwrite the incoming file (or reject it).This is what I can come up with as far as the metadata is concerned.
Another thing we could add is a way to indicate to people that their data is spatially the same, like with seqFish which has a bunch of repeat positions and hybridization cycles.
See also issue #70
Just a thought @jswelling @ngehlenborg @icaoberg but one option for this beyond running tiffcomment
or the like could be to tell people to run the bioformats2raw + raw2ometiff pipeline and check that they can "drag and drop" the output onto Avivator. I find myself doing this a lot anyway and I have started telling people to do it. We could even automate it further to do the following:
bioformats2raw
+ raw2ometiff
via conda
as mentioned here:
conda create --name bioformats python=3.8
conda activate bioformats
conda install -c ome bioformats2raw raw2ometiff
bioformats2raw
+ raw2ometiff
on one of the input files (doesn't matter which, I think this is just a sanity check since people tend to uniformly process their files so one file broken usually indicates them all being broken)This should all be doable within the python ecosystem from what I can tell. Thoughts?
I believe this is resolved by https://github.com/hubmapconsortium/ingest-validation-tests/pull/6
https://docs.openmicroscopy.org/bio-formats/6.5.1/users/comlinetools/xml-validation.html
@ilan-gold: is there anything else that you are checking for?
Current status: validation process checks that OME-TIFF is valid XML.
Next update will check OME-TIFF against the schema.