cidgoh / DataHarmonizer

A standardized browser-based spreadsheet editor and validator that can be run offline and locally, and which includes templates for SARS-CoV-2 and Monkeypox sampling data. This project, created by the Centre for Infectious Disease Genomics and One Health (CIDGOH), at Simon Fraser University, is now an open-source collaboration with contributions from the National Microbiome Data Collaborative (NMDC), the LinkML development team, and others.
MIT License
91 stars 23 forks source link

DataHarmonizer should honor LinkML unique_key assertions, for multiple columns with uniqueness constraint #369

Closed turbomam closed 1 year ago

turbomam commented 1 year ago

Happy new year!

I'm currently evaluating the degree to which LinkML itself honors unique_key assertions. See UniqueKey class.

Could be assigned to @pkalita-lbl

ddooley commented 1 year ago

Sounds fairly straightforward. Strkies me that this should have an equivalent rule pattern, i.e. be supported by rule functionality that can validate a unique_key as a combo of slots' values?

turbomam commented 1 year ago

I think the support for multiple unique keys was completed, but support for LinkML rules still needs discussion and planning.

pkalita-lbl commented 1 year ago

Yes, see https://github.com/cidgoh/DataHarmonizer/issues/370