ONSdigital / dp-data-pipelines

Pipeline specific python scripts and tooling for automated website data ingress.
MIT License

tidy up schema and nomenclature #65

Closed mikeAdamss closed 5 months ago

mikeAdamss commented 5 months ago

What

The word "pipeline" was becoming heavily overloaded, which made the logic hard to follow, especially in relation to the config we pass in. This change should clear that up.

There are two principal configurable inputs in the config JSON that dictate which code from this repo is applied to a given input from S3.

_Remember, we start at s3_tar_received.start() if it helps._

These were previously a little muddled, so I have tidied up the nomenclature. For a newer, clearer config example, see: https://github.com/ONSdigital/dp-data-pipelines/blob/minor-fixes/tests/fixtures/test-cases/test_pipeline_config_valid_id.json
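
To illustrate the general idea of a config value dictating which code is applied to an input, here is a minimal sketch. It is not taken from this repo: the field name `handler`, the registry `HANDLERS`, the two handler functions and the example S3 path are all hypothetical, and the real config schema is the one in the linked fixture.

```python
import json
from typing import Callable, Dict


# Hypothetical handlers, standing in for whatever code the config selects.
def handle_csv(source: str) -> str:
    return f"csv handler applied to {source}"


def handle_sdmx(source: str) -> str:
    return f"sdmx handler applied to {source}"


# Hypothetical registry mapping a config value to the code it selects.
HANDLERS: Dict[str, Callable[[str], str]] = {
    "csv": handle_csv,
    "sdmx": handle_sdmx,
}


def select_handler(config: dict) -> Callable[[str], str]:
    """Return the handler named by the (assumed) "handler" config field."""
    key = config["handler"]
    if key not in HANDLERS:
        raise ValueError(
            f"Unknown handler {key!r}; expected one of {sorted(HANDLERS)}"
        )
    return HANDLERS[key]


# Example: a config loaded from JSON picks the code path for one input.
config = json.loads('{"handler": "csv"}')
result = select_handler(config)("s3://example-bucket/input.tar")
```

Keeping the lookup in one explicit registry means each config value has exactly one meaning, which is the kind of clarity the renaming is aiming for.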

How to review

Sanity check, run tests yourself.

I had to update some tests as well, and run make fmt and make lint, so it looks like a lot more has changed than actually has.

Who can review

Anyone.