ONSdigital / dp-data-pipelines

Pipeline specific python scripts and tooling for automated website data ingress.
MIT License
1 stars 0 forks source link

s3 tar recieved #34

Closed mikeAdamss closed 7 months ago

mikeAdamss commented 8 months ago

What is this

We need to create the umbella function that will run pipelines when given an input which is the url to a tar file in an s3 bucket.

This is not developing new functionality, this is brining together of already written and unit tested components to provide the behaviour we require.

This is the function.

What to do

Start with the assuption you have an s3 url, i.e

s3_tar_recieved.start(s3_tar_file_url_as_str)

this task is to:

You can use our intiial sketch as a reference point. Just be aware we're only implementing a very small part of it here.

Every single action should be wrapped in a try catch and notifying data engineering in the event of an issue.

Accptance Criteria

mikeAdamss commented 8 months ago

5

mikeAdamss commented 7 months ago

done