larsyencken / shelf

A personal ETL and data lake.
0 stars 0 forks source link

Workflow for table steps #37

Open larsyencken opened 3 months ago

larsyencken commented 3 months ago

Context

Shelf is meant to be a productive small-scale ETL framework. To be productive, it needs a nice workflow for:

Currently shelf add --edit is a productive way of adding a snapshot to the content store.

Problem

Technical notes

larsyencken commented 3 months ago

I'm wondering if shelf add should become shelf snapshot, and perhaps shelf derive <dep> [<dep> [...]] would be a good interactive helper for creating a new table step.

larsyencken commented 2 months ago

Currently we have shelf new-table <qualified_name> <deps>. But now that we have both Python and SQL steps, we don't create a step template automatically.

It is working to provide a version-independent step definition though.