GSS-Cogs / family-towns-and-high-streets

0 stars 0 forks source link

BEIS-Electric-prepayment-meter-statistics #4

Open ajtucker opened 4 years ago

LPerryman commented 4 years ago

3 Spreadsheets for MSOA, LSOA and Postcode levels. Lots of data but should be straight forward to transform

LPerryman commented 3 years ago

GitHub https://github.com/GSS-Cogs/family-towns-and-high-streets/tree/master/datasets/BEIS-Electric-prepayment-meter-statistics Jenkins https://ci.floop.org.uk/job/GSS_data/job/towns-high-streets/job/BEIS-Electric-prepayment-meter-statistics/

CharlesRendle commented 3 years ago

Stage 2 transform completed (I think). Will definitely expect to go back to do some re-working as I may have introduced a lot of confusion with more dataframes than are necessary. I also have a few questions based on how I have interpretted the spec:

Thank you!

LPerryman commented 3 years ago

Answers to above questions:

Remove the columns 'Measure Type' and 'Unit'. we can currently only have one measure and Unit type so rather than have a whole column with the same value we define the values in the info.json file.

Geography Level should be local-authority, lsoa, msoa etc. This column will probably not be needed in the future but i have put it in there just to be able to keep track of things.

We only need to upload the LSOA data as all the geography codes and their relationship to each other have been uploaded to PMD4 by Swirrl so LSOA codes will be able to refer back to their MSOA code at some point in the future.

yes, year should be formatted to "year/{2017} etc.

My mistake, looks like the 'Marker' is not needed

You have done the right thing by changing the measure type in the info.json data for each dataset. I have started doing this myself and is a nice work around until we can process multiple measures

LPerryman commented 3 years ago

Data has been published as 4 datasets

  1. Electric prepayment meter statistics - Mean Consumption
  2. Electric prepayment meter statistics - Median Consumption
  3. Electric prepayment meter statistics - Sales
  4. Electric prepayment meter statistics by Post Code - Sales

The first 3 could be joined up once we can have multiple measures but maybe Mean and Median consumption should be Attributes anyway Post Codes are a different format to the LA, LSOA and MSOA so need to be kept separate. I have not published the Mean and Median Consumption for Post Codes but they could easily be added as Attributes when needed

Tracey-B commented 3 years ago

BA comments:

The postcode field is a URI The number of meters has not been included in any of the datasets due to the inability to have more than one measure type in a dataset on PMDv4 The description under each title needs to be updated to reflect the content? Incorrect contents issued date of 28 March 2019 has been used and the correct date is 08 April 2019. The usual caveats also apply.