microbiomedata / issues

public repo for issues related to NMDC work
1 stars 0 forks source link

Mongo validation toolkit: Implement LinkML-based validation routine #386

Open eecavanna opened 11 months ago

eecavanna commented 11 months ago

As part of creating a toolkit developers can use to validate the contents of the NMDC Mongo database...

Implement a function(s) that developers can use to perform LinkML-based validation of data in (or retrieved from) the NMDC Mongo database.

Notes:

This task originated as a subtask of https://github.com/microbiomedata/issues/issues/266.

eecavanna commented 11 months ago

I am under the impression both JSON Schema-based, and LinkML-based validation, are necessary in order to fully validate NMDC Mongo data; but it is not clear to me what LinkML-based validation is, given that the default behavior of the linkml-validate command performs JSON Schema-based validation under the hood.

Edit: Based on what I see in the LinkML documentation about a different command, it looks to me like it may involve using Python DataClass-based validation.

image