monarch-initiative / koza

Data transformation framework for LinkML data models
https://koza.monarchinitiative.org/
BSD 3-Clause "New" or "Revised" License

Source Config: Metadata - Revisit `DatasetDescription` fields #131

Open glass-ships opened 7 months ago

glass-ships commented 7 months ago

Currently, the `metadata.yaml` file (properties defined in `koza/model/config/source_config.py`) is used only for documentation, and some of its field names are misleading relative to how they're actually being used:


One possible option is to remove the definition and usage of metadata within Koza entirely, and just let the file live alongside the ingest files for documentation purposes.

Another would be to allow Koza to read the metadata file, and use the data contained within as default values for various fields during the transform process, possibly writing to an output metadata file, or as columns in the transform output.
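To make the second option a bit more concrete, here's a rough sketch of how metadata values could act as defaults during a transform. This is not how Koza works today: `apply_metadata_defaults` is a hypothetical helper, the `provided_by` and `license` keys are just illustrative field names, and the metadata is assumed to have already been parsed from the YAML file into a plain dict.

```python
from typing import Any, Dict, Iterable, Iterator


def apply_metadata_defaults(
    rows: Iterable[Dict[str, Any]],
    metadata: Dict[str, Any],
    fields: Iterable[str],
) -> Iterator[Dict[str, Any]]:
    """Fill missing or empty fields in each row from the metadata dict.

    Hypothetical helper, not part of Koza. Values already set on a row
    always win; metadata only supplies a fallback.
    """
    field_list = list(fields)
    for row in rows:
        merged = dict(row)  # copy so the input row is not mutated
        for field in field_list:
            if merged.get(field) in (None, "") and field in metadata:
                merged[field] = metadata[field]
        yield merged


# Illustrative metadata, as if loaded from metadata.yaml (keys are assumptions)
metadata = {"provided_by": "example_source", "license": "CC-BY-4.0"}

rows = [
    {"id": "A:1", "provided_by": ""},
    {"id": "A:2", "provided_by": "row_level_source"},
]

result = list(apply_metadata_defaults(rows, metadata, ["provided_by", "license"]))
```

Row-level values take precedence here, so the metadata file would only fill gaps rather than override what an ingest sets explicitly. Whether that's the right precedence is part of what we'd need to decide.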

I do think our move towards modularized ingests adds some importance to sorting this out.
Maybe we can add this as an agenda item for one of our data calls?

matentzn commented 7 months ago

When you meet about this, please let me know when and where, so I can join before you finalise which metadata is collected and the shape it is deployed in.