openclimatefix / pv-solar-farm-forecasting

Forecasting for individual solar farms
MIT License

Use UKPN Dashboard Data #8

Open jacobbieker opened 1 year ago

jacobbieker commented 1 year ago

Data Link: https://dsodashboard.ukpowernetworks.co.uk/d/jnTAkg7Gz/ukpn-dso-dashboard-grid-supply-areas?orgId=1&from=now-1y&to=now&refresh=10m&var-GSP=CANTERBURY%20NORTH

Background: UKPN has a dashboard of GSP-level generation going back a few years for the GSPs that they manage. The generation is of metered sources, and can be downloaded as a CSV file. This data also has less odd behaviour than the UKPN site CSV that we have.

Goal: Determine which GSP has the most solar generation out of the ones on the dashboard. Download the data. Create a data report.

The report should include:

Second part:

Bonus:

vrym2 commented 1 year ago

Data Link: https://dsodashboard.ukpowernetworks.co.uk/d/jnTAkg7Gz/ukpn-dso-dashboard-grid-supply-areas?orgId=1&from=now-1y&to=now&refresh=10m&var-GSP=CANTERBURY%20NORTH

Background: UKPN has a dashboard of GSP-level generation going back a few years for the GSPs that they manage. The generation is of metered sources, and can be downloaded as a CSV file. This data also has less odd behaviour than the UKPN site CSV that we have.

Goal: Determine which GSP has the most solar generation out of the ones on the dashboard. Download the data. Create a data report.

The report should include:

  • [ ] Plots of the data
  • [ ] The typical day in different times of year (Winter, Fall, Summer, Spring)
  • [ ] Compare this GSP solar output with PVLive’s estimate for the same GSP (PVLive’s output should be different as it also estimates non-metered solar generation)
  • [ ] Days where the generation was significantly different than normal for that time of year
  • [ ] Any missing or odd data that seems to be in the CSV (e.g. some solar data can be negative, etc.)

Second part:

  • [ ] Create a NetCDF/Zarr file from the CSV to use in model training. This should be similar to the GSP Zarr.
  • [ ] Create an automatic download script to download the most recent data every n hours and add it to the already saved data
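A minimal sketch of the "missing or odd data" check from the report checklist above, assuming the CSV has already been parsed into a time-sorted list of `(timestamp, mw)` pairs. The function name `find_odd_readings` and the 30-minute `expected_step` default are hypothetical; the dashboard's actual export resolution is not stated in this thread.

```python
from datetime import datetime, timedelta


def find_odd_readings(readings, expected_step=timedelta(minutes=30)):
    """Return (negative, gaps) for a time-sorted list of (timestamp, mw) pairs.

    expected_step is an assumed sampling interval, not a documented one.
    """
    # Solar generation should never be below zero.
    negative = [(ts, mw) for ts, mw in readings if mw < 0]

    # A gap is any pair of neighbouring samples further apart than expected.
    gaps = []
    for (prev_ts, _), (curr_ts, _) in zip(readings, readings[1:]):
        if curr_ts - prev_ts > expected_step:
            gaps.append((prev_ts, curr_ts))
    return negative, gaps
```

The two returned lists can go straight into the report as tables of suspicious timestamps.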

@jacobbieker I have looked around the dashboard for a way to download the data as a CSV, but could not find any download option. Am I missing something? Do I need credentials to access the data?

vrym2 commented 1 year ago

@jacobbieker Never mind, I think I found it!

jacobbieker commented 1 year ago

Okay, go to the generation panel for all metered connections. Its dropdown has an Inspect option; from there hit Data, and it gives a CSV download option. Just make sure Solar is set, and that you've picked the GSP with the most solar generation of them all.

vrym2 commented 1 year ago

UKPN-Grafana Dashboard guide is available here! - link

vrym2 commented 1 year ago

@jacobbieker, regarding the task "Create an automatic download script to download the most recent data every n hours and add it to the already saved data": unlike the UKPN metadata, which can be downloaded through an API, this UKPN Grafana dashboard does not provide any tools for automatic data acquisition. I have searched their community forum, Stack Overflow and other places. I found a GitHub link with some useful tools and resources for Grafana dashboards like UKPN's, but could not find a way to automate downloading data from the dashboard. I believe UKPN feeds their live data directly into it, and the dashboard only lets us download the data manually.

jacobbieker commented 1 year ago

Ah okay, sounds good. For now, just ignore that; it would be the last thing to build at this point, so we can revisit it later. I'd recommend focusing on the data report first, and then making the file that can be read in for training. Then we can see if there is a way to get the live data.

vrym2 commented 1 year ago

image Here is a screenshot of the metered power generation in MW for the last 7 days. In the raw data, the MW value rises linearly from midnight (12 am) each day until there is a sudden spike at some timestamp in the morning (for example, 2019-12-16 11:00:00). I believe the values from this spike onwards are the true power generated by the panels, and the linear rise before it is just dummy data; the same pattern appears on every other day in the dataset. We can detect the change in trend by looking at the slope:

A continuously positive slope means a linear rise, a continuously negative slope means a linear fall, and zero slope means no change in the trend.

The timestamps with actual metered generation during the day show a mix of positive and negative slopes, as seen in the graph, so the trend is irregular. This lets us separate the true data from the dummy data.
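The slope idea above could be sketched roughly as follows. This is only an illustration under assumptions: the function names are hypothetical, `values` is one day's MW readings in time order starting at midnight, and the first change in slope sign is taken as the end of the overnight ramp.

```python
def slope_signs(values, tol=1e-9):
    """Return +1, -1 or 0 for the slope between each consecutive pair of samples."""
    signs = []
    for prev, curr in zip(values, values[1:]):
        diff = curr - prev
        if diff > tol:
            signs.append(1)
        elif diff < -tol:
            signs.append(-1)
        else:
            signs.append(0)
    return signs


def end_of_initial_trend(values, tol=1e-9):
    """Index of the last sample in the initial run of same-signed slopes.

    Returns len(values) - 1 if the slope sign never changes.
    Assumes values is non-empty.
    """
    signs = slope_signs(values, tol)
    for i in range(1, len(signs)):
        if signs[i] != signs[0]:
            return i
    return len(values) - 1
```

For a day that ramps up, spikes, and then fluctuates, the returned index points at the spike itself, since that is the last sample before the slope first turns negative. Noisy real data may need a larger `tol` or a check on the size of the jump as well as its sign.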

jacobbieker commented 1 year ago

Could you check the sunrise time for this GSP on those dates? Does the spike correspond to the sun rising? If it's much after or before, then it might be something else.

vrym2 commented 1 year ago

Here is the plot of metered power generation for the last four days (15-01-2023 to 18-01-2023) from the Canterbury North GSP, with the corresponding sunrise and sunset times taken from this website-link. I believe my observation holds, @jacobbieker. image

jacobbieker commented 1 year ago

Hmmm, interesting then, yeah, sure seems like that's the case. I guess one thing would be to see if you can clean this jump? But I'm also not sure whether the morning generation should just shoot straight up or not. Any thoughts @peterdudfield? For now, it might be worth making the plots and comparisons both with the raw data that has this odd slope, and with a version where you zero out the data before sunrise, although I'm not sure how the generation should ramp up then.
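A minimal sketch of the "zero out the data before sunrise" variant, assuming one day's readings as `(timestamp, mw)` pairs and sunrise/sunset times supplied by the caller (for example, looked up per date for the GSP's location). The function name is hypothetical, and zeroing after sunset as well is an added assumption.

```python
from datetime import datetime, time


def zero_outside_daylight(readings, sunrise, sunset):
    """Return a copy of readings with MW set to 0.0 outside [sunrise, sunset].

    readings: list of (datetime, float) for a single day.
    sunrise, sunset: datetime.time values for that day (caller-supplied).
    """
    cleaned = []
    for timestamp, mw in readings:
        in_daylight = sunrise <= timestamp.time() <= sunset
        cleaned.append((timestamp, mw if in_daylight else 0.0))
    return cleaned
```

Because it returns a new list, the raw series stays available for the side-by-side plots. It does not attempt to model how generation should ramp up after sunrise.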

peterdudfield commented 1 year ago

Yeah, I would push to not get stuck on this, so keep the focus on the data report and the data for training.

Later we can clean the night-time data that has this ramp.

vrym2 commented 1 year ago

@jacobbieker Here is the plot showing the average power generated around midday (11am-1pm) for each season (Winter, Spring, Summer, Autumn). For example, the first bar represents the average power generated on the 1st day of every month in a season. image
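A simplified sketch of a midday seasonal average, not the exact calculation behind the plot above: it returns one mean per season rather than one bar per day of the month. The season boundaries (meteorological seasons, December to February as Winter) and the function name are assumptions.

```python
from collections import defaultdict
from datetime import datetime, time

# Assumption: meteorological seasons for the northern hemisphere.
SEASON_BY_MONTH = {
    12: "Winter", 1: "Winter", 2: "Winter",
    3: "Spring", 4: "Spring", 5: "Spring",
    6: "Summer", 7: "Summer", 8: "Summer",
    9: "Autumn", 10: "Autumn", 11: "Autumn",
}


def midday_mean_by_season(readings, start=time(11, 0), end=time(13, 0)):
    """Mean MW per season over samples whose time of day is within [start, end].

    readings: iterable of (datetime, float) pairs.
    Seasons with no midday samples are left out of the result.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for timestamp, mw in readings:
        if start <= timestamp.time() <= end:
            season = SEASON_BY_MONTH[timestamp.month]
            totals[season] += mw
            counts[season] += 1
    return {season: totals[season] / counts[season] for season in totals}
```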

peterdudfield commented 1 year ago

PVlive data is here - https://console.cloud.google.com/storage/browser/solar-pv-nowcasting-data/PV/GSP/v5;tab=objects?pageState=(%22StorageObjectListTable%22:(%22f%22:%22%255B%255D%22))&project=solar-pv-nowcasting&prefix=&forceOnObjectsSortingFiltering=false

This has GSP_ids from 0 to 318; 0 is national.

peterdudfield commented 1 year ago

gsp_locations.csv

CSV of gsp ids. Has a name for each GSP

peterdudfield commented 1 year ago

Questions that should be answered in the data report

I'm sure there are more, but maybe just for a start, these would be useful

vrym2 commented 1 year ago

@jacobbieker Here is the report for the UKPN dashboard data. Please check it and let me know if you have any comments or suggestions. ukpn_dashboard_report.pdf