Closed stephen-soltesz closed 2 years ago
@SaiedKazemi FYI
On its own, this change does not work as intended (WAI): the tracker.Job struct is used as a map key, and because etl has not been updated with the new structure, the value passed to etl and back for updates differs from the original key, so progress is not recorded.
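To illustrate the failure mode described above, here is a minimal sketch. The `Job`, `Datasets`, and `legacyJob` types are simplified, hypothetical stand-ins (the real tracker.Job has other fields), and the JSON round trip only approximates how an un-upgraded peer would drop fields it does not know about.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Datasets and Job are simplified, hypothetical stand-ins for the new layout.
type Datasets struct {
	Temp string
	Raw  string
	Join string
}

type Job struct {
	Experiment string
	Datatype   string
	Datasets   Datasets
}

// legacyJob mimics a peer that only knows the fields from before this change.
type legacyJob struct {
	Experiment string
	Datatype   string
}

// echoThroughLegacyPeer encodes a Job, decodes it into the legacy struct
// (silently dropping Datasets), and returns what the peer would send back.
func echoThroughLegacyPeer(j Job) (Job, error) {
	sent, err := json.Marshal(j)
	if err != nil {
		return Job{}, err
	}
	var seen legacyJob
	if err := json.Unmarshal(sent, &seen); err != nil {
		return Job{}, err
	}
	reply, err := json.Marshal(seen)
	if err != nil {
		return Job{}, err
	}
	var back Job
	err = json.Unmarshal(reply, &back)
	return back, err
}

// updateFound reports whether the echoed Job still matches the tracked key.
func updateFound() (bool, error) {
	job := Job{
		Experiment: "ndt",
		Datatype:   "ndt7",
		Datasets:   Datasets{Temp: "tmp_ndt", Raw: "raw_ndt", Join: "ndt"},
	}
	states := map[Job]string{job: "Parsing"}
	echoed, err := echoThroughLegacyPeer(job)
	if err != nil {
		return false, err
	}
	_, ok := states[echoed]
	return ok, nil
}

func main() {
	ok, err := updateFound()
	if err != nil {
		panic(err)
	}
	fmt.Println("update matched a tracked job:", ok) // prints false
}
```

Because Go compares struct map keys field by field, the echoed value with empty `Datasets` is a different key, so the lookup misses.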
Ideally, there would be separation between the API structure used to communicate with ETL and that used for internal tracking so the two systems could be upgraded separately.
Obsolete
This change starts to support flexible dataset configurations first noted in https://github.com/m-lab/etl-gardener/issues/349
This change would obsolete the Target field from the config.SourceConfig and tracker.Job structures. In its place we add a Datasets record with three fields: Temp for the temporary table, Raw for the raw, deduplicated table that maps 1:1 to GCS files, and Join for the joined results between the raw and other datatypes. Previously these prefixes were statically defined within the gardener templates. Now, the configuration may specify alternate output target locations.
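As a rough sketch of the shape described here, the following shows a `Datasets` record with `Temp`, `Raw`, and `Join` fields and a fallback to defaults when a field is left empty. The `SourceConfig` fields other than `Datasets`, the `resolved` helper, and the default values are assumptions made for illustration; the actual defaults live in the gardener templates and are not reproduced here.

```go
package main

import "fmt"

// Datasets names the output location for each processing stage.
type Datasets struct {
	Temp string // temporary table
	Raw  string // raw, deduplicated table, 1:1 with GCS files
	Join string // raw joined with other datatypes
}

// SourceConfig is a simplified, hypothetical version of config.SourceConfig.
type SourceConfig struct {
	Experiment string
	Datatype   string
	Datasets   Datasets
}

// defaultDatasets returns hypothetical defaults standing in for the
// prefixes that were previously fixed in the templates.
func defaultDatasets(experiment string) Datasets {
	return Datasets{
		Temp: "tmp_" + experiment,
		Raw:  "raw_" + experiment,
		Join: experiment,
	}
}

// resolved fills any empty field with its default, so a config only
// needs to name the locations it wants to override.
func (c SourceConfig) resolved() Datasets {
	d := c.Datasets
	def := defaultDatasets(c.Experiment)
	if d.Temp == "" {
		d.Temp = def.Temp
	}
	if d.Raw == "" {
		d.Raw = def.Raw
	}
	if d.Join == "" {
		d.Join = def.Join
	}
	return d
}

func main() {
	cfg := SourceConfig{
		Experiment: "ndt",
		Datatype:   "ndt7",
		Datasets:   Datasets{Raw: "custom_raw"},
	}
	fmt.Printf("%+v\n", cfg.resolved())
}
```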