Closed matamadio closed 3 years ago
These make sense to simplify and consolidate contribution and resolve the issue of inconsistent ISO codes and overlap of contribution information in MOVER.
We need to make sure that if a user wants to implement just 1-2 of the schema, not all, then this structure will work -- but I think they would have to replicate cf_common in that case, so there should be no issue there. Agree?
We should clearly define the content of model_source - we tend to refer to datasets, rather than model. Please review original technical documents to ensure we describe it as intended.
Outstanding in this is the question of notes+purpose, vs using abstract (which is more common and aligns better with existing metadata standards.) The intention of notes+purpose was to constrain the information being accepted, but practically this lack of alignment with abstract field in metadata standards may be an issue that overrides this intention.
Comment on bibliography field: Should we consider a new table to give more details on the publication (as already given in MOVER) for all other data types? Should we consider potential for >1 report/paper to be associated with a dataset - in which case bibliography must contain >1 author-year references
We need to make sure that if a user wants to implement just 1-2 of the schema, not all, then this structure will work -- but I think they would have to replicate cf_common in that case, so there should be no issue there. Agree?
Yep, the schema would go: contribution (common) + schema attributes (specific). So the table is shared instead of duplicated, but in practice the result is the same.
We should clearly define the content of model_source - we tend to refer to datasets, rather than model. Please review original technical documents to ensure we describe it as intended.
I have to read trhough original docs for many fields that I can't collocate rightly in the practice, and fill the examples. For example the AFG dataset, what would be the model source? I just put disasterrisk.af, in lack of better info.
Outstanding in this is the question of notes+purpose, vs using abstract (which is more common and aligns better with existing metadata standards.) The intention of notes+purpose was to constrain the information being accepted, but practically this lack of alignment with abstract field in metadata standards may be an issue that overrides this intention.
Agreed. A more general "abstract" or "description" fits better with existing schemas and data. Notes+Purpose is more detailed but also misunderstandable. I am using notes field now in JKAN for abstract.
Comment on bibliography field: Should we consider a new table to give more details on the publication (as already given in MOVER) for all other data types? Should we consider potential for >1 report/paper to be associated with a dataset - in which case bibliography must contain >1 author-year references
I'd compromise, with two short biblio fields in contribution table, that allow multiple entries.
biblio_auth_title: [author(s)A, titleA, yearA]; [author(s)B, titleB, yearB];
biblio_url: [link to publicationA]; [link to publicationB]
Superceeded by #40
The contribution table is present in all schemas and includes some of the key information of the dataset. The previous version had 4 identical but schema-specific tables which also had few overlaps within specific schema attributes. I tried my best to have something more simple and efficient, storing all general info about a dataset, with the following changes:
Old contribution table
New contribution table