Open MichaelTiemannOSC opened 3 years ago
Generic documentation and examples of pipelines to refer to: https://github.com/elyra-ai/examples
We need to re-create the pipeline. Consolidate all the tools into one place, incorporating all the new changes from Eric. Document an end-to-end solution for rebuilding the pipeline.
@erikerlandson Is this something you are actively working on? Do you need any clarification?
Moved back to the backlog as we need to consider which pipeline automation tool we want to adopt (Elyra / Airflow / Kubeflow).
@caldeirav Is this a question for the TAC, or a tactical one for the DC TSC? Thanks.
There has to be a technical discussion at the level of the Data Commons stream before we can determine this. The TAC is likely required only if we are looking at a fundamental change in approach, not just for a choice of tooling.
As a data ingester, I want to ingest the WRI GPPD data and then pass metadata to a metadata update process. But I don't know the best way to pass my specific metadata (schema name for WRI GPPD, tables I create as a result of ingestion, and information about the fields of those tables) to a generic metadata upload process.
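One possible way to frame the hand-off described above is a small, tool-agnostic metadata document that the ingestion step writes and the generic metadata step reads. The sketch below is only an illustration under assumptions: the class names, the `describe_for_upload` function, and the example schema, table, and field names are all hypothetical and do not come from the Data Commons code base. It does not call any real catalog or metadata API; it only shows the shape of the payload (schema name, tables created, and field information) and a JSON round trip between two pipeline steps.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List


@dataclass
class FieldMetadata:
    """One column of an ingested table (hypothetical structure)."""
    name: str
    dtype: str
    description: str = ""


@dataclass
class TableMetadata:
    """One table created by the ingestion step (hypothetical structure)."""
    name: str
    fields: List[FieldMetadata] = field(default_factory=list)


@dataclass
class IngestionMetadata:
    """Everything the ingestion step hands to the generic metadata step."""
    schema: str
    tables: List[TableMetadata] = field(default_factory=list)

    def to_json(self) -> str:
        # asdict() recurses into nested dataclasses and lists.
        return json.dumps(asdict(self), indent=2)

    @classmethod
    def from_json(cls, text: str) -> "IngestionMetadata":
        raw = json.loads(text)
        tables = [
            TableMetadata(
                name=t["name"],
                fields=[FieldMetadata(**f) for f in t["fields"]],
            )
            for t in raw["tables"]
        ]
        return cls(schema=raw["schema"], tables=tables)


def describe_for_upload(payload: str) -> List[str]:
    """Stand-in for the generic metadata step (hypothetical).

    It knows nothing about any specific dataset: it parses the payload
    and lists the entries it would register, one per column.
    """
    meta = IngestionMetadata.from_json(payload)
    return [
        f"{meta.schema}.{table.name}.{col.name} ({col.dtype})"
        for table in meta.tables
        for col in table.fields
    ]


if __name__ == "__main__":
    # Illustrative values only; the real schema and table names may differ.
    meta = IngestionMetadata(
        schema="wri_gppd",
        tables=[
            TableMetadata(
                name="plants",
                fields=[
                    FieldMetadata("country", "varchar", "Country code"),
                    FieldMetadata("capacity_mw", "double", "Capacity in MW"),
                ],
            )
        ],
    )
    for entry in describe_for_upload(meta.to_json()):
        print(entry)
```

Because the payload is plain JSON, the same file could be passed between notebooks or tasks regardless of which automation tool (Elyra / Airflow / Kubeflow) is eventually adopted.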
Please update https://github.com/os-climate/os_c_data_commons/blob/main/docs/create-processing-pipeline.md