ONSdigital / SDG_11.2.1

Analysis for the UN Sustainable Development Goal 11.2.1
https://onsdigital.github.io/SDG_11.2.1/
Apache License 2.0
5 stars 7 forks source link

419 duck db version #433

Closed james-westwood closed 5 months ago

james-westwood commented 11 months ago

@paigeh1 this is the DuckDB code

Pull Request submission

Insert detailed bullet points about your changes here!

Insert any instructions to help the reviewer, e.g. "install new requirements from requirements.txt"

*Let the reviewer know what data files are needed (to be grabbed from sync)

Closes or fixes

Fixes #

Code

Documentation

Any new code includes all the following forms of documentation:

Data

Testing


Peer Review Section

Final approval (post-review)

The author has responded to my review and made changes to my satisfaction.


Review comments

Insert detailed comments here!

These might include, but not exclusively:

Your suggestions should be tailored to the code that you are reviewing. Be critical and clear, but not mean. Ask questions and set actions.

paigeh-fsa commented 9 months ago

Issue with loading all csvs, going to look into this. Appears to just be loading London atm

paigeh-fsa commented 7 months ago

Next steps:

jwestw commented 7 months ago

Listing some requirements for the xlsx output:

jwestw commented 7 months ago

I am stuck on running this code after having re-downloaded the archive csvs again. This is a problem I had before and was previously (if I remember correctly) to do with the ordering of columns in the column dtypes dictionary.

I have read the csvs in via the command line head -n 1 path_to_csv and this showed that the column order was the same as my dictionary. Same for reading it in Pandas.

The dtypes are correct too I believe.

image

The code which is trying to be converted to Int (which shouldn't be) is from the "LAD11CD" column.

james-westwood commented 6 months ago

This now works to rename columns and create all tabs in the excel