Open jeanetteclark opened 2 months ago
Since the `output` field is repeatable, we just need to add `id` as an optional attribute.
this is done on the feature-hashstore-support branch, awaiting review
see src/main/resources/schemas/schema1.1.xsd
also see edu.ucsb.nceas.mdqengine.dispatch.Dispatcher line 174 for how an array coming back from Python checks is handled (warning: it's a little ugly right now)
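As a rough illustration of what a repeatable `output` with an optional `id` makes possible, here is a minimal sketch of filtering outputs by file. The dict shape, the `value` key, and the `filter_outputs` helper are all hypothetical and made up for this example; this is not how the Dispatcher or the schema actually represents results.

```python
from typing import Optional

# Hypothetical shape for what a check might return: a list of outputs,
# each with a "value" and an optional "id" naming the file it refers to.
# These keys are assumptions for illustration, not the engine's real format.
outputs = [
    # No "id": a dataset-level result.
    {"value": "8 files had congruent file types, 1 file was mistyped"},
    # With "id": a file-level result.
    {
        "id": "file.nc",
        "value": "file.nc appears to be of type application/netcdf "
                 "but is documented as application/octet-stream",
    },
]


def filter_outputs(outputs: list, file_id: Optional[str] = None) -> list:
    """Return the outputs for one file, or the dataset-level outputs
    (those without an "id") when file_id is None."""
    return [o for o in outputs if o.get("id") == file_id]


dataset_level = filter_outputs(outputs)
file_level = filter_outputs(outputs, "file.nc")
```

Because the `id` is optional, outputs without one can keep acting as dataset-level results, so existing checks would not need to change.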
after discussing with @robyngit, we determined that it would really be necessary to filter results from data quality runs by file. This isn't necessarily MVP for the data suite but I think it's close. The easiest way to do this, I think, would be to add a field to the `result` schema within `run`. This would allow us to continue to use the `output` field to give dataset-level results (e.g., 8 files had congruent file types, 1 file was mistyped), in addition to file-level results (e.g., file.nc appears to be of type application/netcdf but is documented as application/octet-stream).

A proposed new schema for result could look something like this (though I'm very much not sold on "dataOutput" as the element name; it's a placeholder).
thoughts welcome @mbjones when you return