ckan / ckan

CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data.humdata.org among many other sites.
https://ckan.org/

ckanext-validation and table designer synergies 💃🕺 #7513

Open wardi opened 1 year ago

wardi commented 1 year ago

Following dev meeting discussion: ckanext-validation and the upcoming table designer feature both define resource data schemas and validation rules.

Let's find commonalities and adopt a plan to make these systems work together as well as possible.

Comparing ckanext-validation and table designer

| | ckanext-validation | table designer |
|---|---|---|
| data source | csv/excel file | datastore table |
| data schema | Table Schema json in resource | datastore data dictionary |
| validation | Table Schema Python library, sync or bg job | datastore triggers/python error formatting |
| datastore API | read only | read/write |
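
To make the "data schema" row concrete, here is a rough sketch of how a Table Schema descriptor might be translated into a DataStore `fields` list with data dictionary `info`. The type mapping, the function name `table_schema_to_datastore_fields` and the example schema are all assumptions for illustration; neither ckanext-validation nor the table designer is confirmed to work this way.

```python
# Hypothetical mapping from Table Schema field types to DataStore column
# types. This is an assumption for discussion, not an agreed mapping.
TABLE_SCHEMA_TO_DATASTORE = {
    "string": "text",
    "integer": "int",
    "number": "numeric",
    "boolean": "bool",
    "date": "date",
    "datetime": "timestamp",
}


def table_schema_to_datastore_fields(schema):
    """Convert a Table Schema descriptor (dict) into a DataStore-style
    list of field dicts with data dictionary info (hypothetical helper)."""
    fields = []
    for field in schema.get("fields", []):
        # Table Schema treats a missing type as "string"; unknown types
        # fall back to "text" in this sketch.
        ds_type = TABLE_SCHEMA_TO_DATASTORE.get(field.get("type", "string"), "text")
        entry = {"id": field["name"], "type": ds_type}

        # Carry human-readable metadata over to the data dictionary.
        info = {}
        if "title" in field:
            info["label"] = field["title"]
        if "description" in field:
            info["notes"] = field["description"]
        if info:
            entry["info"] = info

        fields.append(entry)
    return fields


# Made-up example schema, as it might be stored as JSON on a resource.
example_schema = {
    "fields": [
        {"name": "id", "type": "integer", "title": "ID"},
        {"name": "name", "type": "string", "description": "Full name"},
        {"name": "joined", "type": "date"},
    ]
}

datastore_fields = table_schema_to_datastore_fields(example_schema)
```

A list shaped like `datastore_fields` could then be passed as the `fields` argument of `datastore_create`. Table Schema constraints (required, minimum, pattern and so on) are not handled here, and how they would map to datastore triggers is left open.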

Ideas for integration

Other ideas?

amercader commented 1 year ago

Thanks for putting this together, @wardi. Let me prepare a quick demo and write up our long-term vision for this so it's easier to discuss. I think there is a lot of overlap between the two features, and also with some of the points raised in @jqnatividad's datapusher+ discussions, so it would be good to have a wider discussion around data ingestion, description and validation. I'll try to put something together by next week (it's a holiday week here).

wardi commented 1 year ago

That sounds really interesting, @amercader. I'm looking forward to it.