Validation errors with Slidebook ome-tiff export

tlambert03 / ome-types

native Python dataclasses for the OME data model

MIT License

51 stars 9 forks source link

I'm working with a dataset that was originally in slidebook format but was exported as an ome-tiff series. This gave the following alleged OME XML:

https://gist.github.com/jni/c4b09934715246c158397b24db7fbb3b

I tried to parse it with:

import ome_types

ome = ome_types.from_xml('ome-meta.xml', parser='lxml', validate=False)

which gives the error:

ValidationError: 624 validation errors for OME

(Full traceback at: https://gist.github.com/jni/e87f511c892475de72c880b83617e10d)

I fully expect that Slidebook is producing garbage, but I'm wondering if it's easily fixed garbage. At any rate I'm presently only after the pixel physical size, and potentially channel display colors and contrast limits, so any suggestions for grabbing that reliably from a junk xml will be appreciated. 😃

from ome_types import OME, to_dict def clean_stuff(ome_dict: dict): for image in ome_dict['images']: for channel in image['pixels']['channels']: # or try to do something more clever channel.pop('acquisition_mode', None) channel.pop('illumination_type', None) return ome_dict my_dict = to_dict('ome-meta.xml', validate=False) ome = OME(**clean_stuff(my_dict))

tlambert03 / ome-types

Validation errors with Slidebook ome-tiff export #170