ONSdigital / SDG_11.2.1

Analysis for the UN Sustainable Development Goal 11.2.1
https://onsdigital.github.io/SDG_11.2.1/
Apache License 2.0
5 stars 7 forks source link

Create an cloud space for all the data #407

Closed jwestw closed 6 months ago

jwestw commented 1 year ago

Probably a Google bucket with read-only access

Deliverables:

jwestw commented 1 year ago

Look at the code in https://github.com/ONSdigital/research-and-development/blob/develop/src/pipeline.py for inspiration on how to create conditional imports based on a config setting (which indicates choice of environment). In our case we may want to do the same re: imports but the main thing to focus on is the creation of paths (local) vs. signed-urls (cloud). Then hopefully most of our existing file read functions will take the signed-url as a "path", just like pandas' dataframe constructor.

We want to avoid lots of if/else statements throughout the code if possible.

jwestw commented 1 year ago

May want to write to bucket which if we do we need to have two things:

paigeh-fsa commented 11 months ago

Going to download files and host on OneDrive.