eurec4a / eurec4a-intake

Intake catalogue for EUREC4A field campaign datasets
17 stars 19 forks source link

Zenodo #147

Closed d70-t closed 1 year ago

d70-t commented 1 year ago

This is a first shot at #145. However, the more involved parts are still missing. We should probably have:

here are more options

observingClouds commented 1 year ago

Thanks @d70-t to start working on this. So far I have always changed the metadata on the Zenodo interface after the automatic upload but the deposition file seems to be a more consistent approach.

As the deposition influences the citation format, I think we should first settle on a format that we like. I suggest

EUREC4A community (2023). EUREC4A [Data set]. Zenodo. 10.5281/ZENODO

This would imply the following parameters: upload_type: dataset (because we like the intake catalog to become the EUREC4A Dataset) title: EUREC4A (this would result in a reference like ...EUREC4A [Data set]...) creators: EUREC4A community

d70-t commented 1 year ago

I like using "EUREC4A community" as author, makes things a lot easier.

I'm a bit hesitant to call the thing "EUREC4A" instead something like "EUREC4A Intake catalogue", because it's a bit unspecific. On the other hand, the goal of this repository is to provide "the EUREC4A dataset", so it's probably the right thing to do?

d70-t commented 1 year ago

Do you have any preferences license-wise? I'd go for cc-zero, because I think it's the most useful one for datasets

It's hard to make reasonable use of datasets without stating where the data is from, so people will do it :+1:. On the other hand, formalizing a requirement to name authors potentially prevents good uses, because depending on the use, it can be hard to e.g. display a predefined statement. Also if we use "EUREC4A community" as author, I guess it's even less useful to go with something like cc-by.

observingClouds commented 1 year ago

I like using "EUREC4A community" as author, makes things a lot easier.

I'm a bit hesitant to call the thing "EUREC4A" instead something like "EUREC4A Intake catalogue", because it's a bit unspecific. On the other hand, the goal of this repository is to provide "the EUREC4A dataset", so it's probably the right thing to do?

Yeah, this is exactly my thought. Currently "EUREC4A dataset" is maybe a bit vague as the catalog's entries are not immutable but that's where we wanna get to.

observingClouds commented 1 year ago

The license is a bit tricky as we reference other data sets that might have specific license. However, for this specific repository I would argue that cc_zero is fine, but we should add a note somewhere on how people should add credits to the underlying data and adhere to the EUREC4A data policy.

d70-t commented 1 year ago

I just added a note on the data policy and a (very tiny) description of the catalog. Is it still fine?

observingClouds commented 1 year ago

Yes, awesome!

d70-t commented 1 year ago

Ok, let's see where this will take us...