leap-stc / LEAP_template_feedstock

A prototype feedstock that implements independent metadata and data updates using Pangeo Forge
Apache License 2.0

Can we autogenerate the catalog yaml (and maybe a prefilled recipe)? #28

Open jbusecke opened 3 months ago

jbusecke commented 3 months ago

Looking at the template catalog.yaml file, I suspect we will run into issues where people simply do not edit the paths, and existing stores then get overwritten...

I am wondering if there is an easy way to generate these paths (and maybe some other dummy content) for the catalog.yaml. The template for that would be something like:

# All the information important to cataloging.
"ncviewjs:meta_yaml_url": "https://github.com/<owner>/<repo_name>/blob/main/feedstock/meta.yaml"
tags:
  - zarr
stores:
  - id: <recipe_id (parse_from_meta_yaml)>
    url: "gs://leap-scratch/data-library/feedstocks/latest/<repo_name>/<recipe_id>.zarr"
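
A minimal sketch of how the template above could be filled in automatically, assuming the owner, repo name, and recipe ids are already known (e.g. parsed from meta.yaml in a separate step). The names `render_catalog`, `CATALOG_TEMPLATE`, and `STORE_TEMPLATE` are hypothetical and not part of the feedstock; only the standard library is used, so no YAML parser is required:

```python
from string import Template

# Header of the catalog.yaml, mirroring the template in this issue.
CATALOG_TEMPLATE = Template(
    "# All the information important to cataloging.\n"
    '"ncviewjs:meta_yaml_url": '
    '"https://github.com/${owner}/${repo_name}/blob/main/feedstock/meta.yaml"\n'
    "tags:\n"
    "  - zarr\n"
    "stores:\n"
    "${stores}\n"
)

# One entry per recipe id; the store path is derived, never hand-edited.
STORE_TEMPLATE = Template(
    "  - id: ${recipe_id}\n"
    '    url: "gs://leap-scratch/data-library/feedstocks/latest/'
    '${repo_name}/${recipe_id}.zarr"'
)


def render_catalog(owner: str, repo_name: str, recipe_ids: list[str]) -> str:
    """Return catalog.yaml content with one store entry per recipe id."""
    if not recipe_ids:
        raise ValueError("at least one recipe id is required")
    stores = "\n".join(
        STORE_TEMPLATE.substitute(repo_name=repo_name, recipe_id=recipe_id)
        for recipe_id in recipe_ids
    )
    return CATALOG_TEMPLATE.substitute(
        owner=owner, repo_name=repo_name, stores=stores
    )


if __name__ == "__main__":
    # Hypothetical owner, repo, and recipe ids, for illustration only.
    print(render_catalog("leap-stc", "example_feedstock", ["recipe-a", "recipe-b"]))
```

Because every store URL is derived from the repo name and recipe id, two feedstocks can only collide if they share both, which is the overwrite case this issue is worried about.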

Not super important right now, but just putting this out there. Maybe @andersy005 has an idea?

jbusecke commented 3 months ago

Thinking more about this: this sort of pattern seems bad (and maybe insecure?).

I am tending more towards the following now:

jbusecke commented 3 months ago

This could also take care of avoidable errors like the one I made here (job names cannot contain underscores). EDIT: This is only relevant to Dataflow, so we should probably handle it at the action level.
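
A rough sketch of what handling the underscore problem could look like, wherever it ends up living. It assumes that Dataflow job names may only contain lowercase letters, digits, and hyphens, must start with a letter, and must end with a letter or digit; treat that rule as an assumption to check against the Dataflow docs. The function name `sanitize_job_name` and the `job-` fallback prefix are hypothetical:

```python
import re

# Assumed rule for a valid job name (see the caveat above).
VALID_JOB_NAME = re.compile(r"[a-z]([-a-z0-9]*[a-z0-9])?")


def sanitize_job_name(name: str) -> str:
    """Turn an arbitrary recipe id into a name matching VALID_JOB_NAME."""
    # Lowercase, then collapse every run of disallowed characters
    # (underscores included) into a single hyphen.
    cleaned = re.sub(r"[^a-z0-9-]+", "-", name.lower()).strip("-")
    # Names must start with a letter; prepend a prefix when they do not.
    if not cleaned or not cleaned[0].isalpha():
        cleaned = f"job-{cleaned}".rstrip("-")
    return cleaned


if __name__ == "__main__":
    print(sanitize_job_name("my_recipe_01"))
```

Note that this mapping is lossy (`a_b` and `a-b` give the same job name), so it should only be used for the job name and not for the store path.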