frictionlessdata / forum

🗣 Frictionless Data Forum esp for "How do I" type questions
https://frictionlessdata.io/
10 stars 0 forks source link

Provide more information about keywords in a data package #26

Closed jen-thomas closed 4 years ago

jen-thomas commented 4 years ago

For some data packages, I would like to be able to use keywords that come from a controlled vocabulary.

This would help to improve the quality of the data set by ensuring that keywords are properly described and not misinterpreted (eg. "litter" -> leaf litter, rubbish, litter of animals?).

Currently the keywords field is an array of strings. I would propose adding something along the lines of:

cpina commented 4 years ago

I wonder if it should be like the contributors: https://specs.frictionlessdata.io/data-package/#metadata

It says that it must be an array of objects.

So, I'm wondering if keywords can be an array of strings in some occasions and an array of objects in other occasions or always objects so new properties (e.g. source and uuid in Jen's proposal) can be added. I think that somewhere in frictionlessdata something could be either but I can't find where right now.

rufuspollock commented 4 years ago

@jen-thomas the keywords field should just be a list of strings so i would keep it that way. If you really need to add other info what about adding a separate field e.g. keywordsVocab. Re uuids there's no immediate way I can think of. Normally i would imagine that sort of information would be stored separately - e.g. you'd store the uuids in the keywords list and have a dictionary of uuid -> tagnames somewhere else.

rufuspollock commented 4 years ago

@jen-thomas any thoughts on this? Would this work for your use case?

rufuspollock commented 4 years ago

FIXED. I think we have a resolution here. Please feel free to continuing commenting or suggest re-opening.