openclimatefix / nowcasting_dataset

Prepare batches of data for training machine learning solar electricity nowcasting data
https://nowcasting-dataset.readthedocs.io/en/stable/
MIT License

Implement `PVPhysicsPredictionDataSource` #615

Open JackKelly opened 2 years ago

JackKelly commented 2 years ago

Detailed Description

For all timesteps, and for all PV systems in the region of interest, include:

Maybe use quite long history and forecast durations. Maybe 2 days of forecast and 2 days of history?

Also include:

Before building the data source, do some experiments in a Jupyter Notebook:

Context

As discussed in https://github.com/openclimatefix/power_perceiver/issues/7, I'm now thinking of predicting PV as a chain of models, each of which predicts the residuals of the previous model.
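
The chain-of-models idea can be illustrated with a minimal sketch. This is not code from nowcasting_dataset or power_perceiver: the two linear stages, the synthetic `clear_sky` and `cloud_cover` features, and the helper `fit_linear` are all hypothetical stand-ins, chosen only to show each stage fitting the residuals left by the previous one.

```python
import numpy as np


def fit_linear(x, y):
    """Least-squares fit of y ~ a * x + b; returns a prediction function."""
    a, b = np.polyfit(x, y, deg=1)
    return lambda x_new: a * x_new + b


rng = np.random.default_rng(0)

# Hypothetical synthetic data (assumption, not real PV measurements).
clear_sky = np.linspace(0.0, 1.0, 200)          # stand-in for a physics-based estimate
cloud_cover = rng.uniform(0.0, 1.0, size=200)   # stand-in for a second input
pv_power = 0.8 * clear_sky - 0.3 * cloud_cover + rng.normal(0.0, 0.01, size=200)

# Stage 1: predict PV power from the first input alone.
stage1 = fit_linear(clear_sky, pv_power)
residual1 = pv_power - stage1(clear_sky)

# Stage 2: predict what stage 1 got wrong, using the second input.
stage2 = fit_linear(cloud_cover, residual1)
residual2 = residual1 - stage2(cloud_cover)

# The chained prediction is the sum of the stage outputs.
prediction = stage1(clear_sky) + stage2(cloud_cover)

print("MSE after stage 1:", np.mean(residual1 ** 2))
print("MSE after stage 2:", np.mean(residual2 ** 2))
```

In this framing, a physics-based PV prediction would play the role of stage 1, and a learned model would play the role of stage 2.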

JackKelly commented 2 years ago

I've made a start on this today, and found a few PV systems with pretty obvious shading issues (the plot shows power on the y axis and time-of-day on the x axis for June):

[image: PV power (y axis) against time of day (x axis) for June]
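
A rough sketch of how a time-of-day profile like this might be computed with pandas. The data is synthetic, and the afternoon shading window (15:00 to 17:00) and the 30-minute sampling are assumptions made for illustration; this is not the notebook used for the plot above.

```python
import numpy as np
import pandas as pd

# Synthetic half-hourly readings for one PV system in June (assumption).
index = pd.date_range("2021-06-01", "2021-06-30 23:30", freq="30min")
hour = index.hour.to_numpy() + index.minute.to_numpy() / 60.0

# Idealised daily curve: zero at night, peaking at midday.
power = np.clip(np.sin((hour - 4.0) / 16.0 * np.pi), 0.0, None)

# Hypothetical shading: output drops every afternoon between 15:00 and 17:00.
shaded = (hour >= 15.0) & (hour < 17.0)
power[shaded] *= 0.3

series = pd.Series(power, index=index)

# Maximum power seen at each time of day across the whole month.
# A dip that recurs at the same time every day shows up as a notch here.
profile = series.groupby(hour).max()

# Compare two times that are symmetric about midday in the idealised curve.
print("08:00 max:", profile.loc[8.0])
print("16:00 max:", profile.loc[16.0])
```

Plotting `profile` (or every day overlaid on the same time-of-day axis) would make a recurring notch of this kind visible by eye.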

The next steps:

I'm increasingly convinced that we need to model each individual PV system as accurately as possible (especially local shading and inverter clipping). For two main reasons:

JackKelly commented 2 years ago

Over the weekend, I decided that, actually, I should do some more experiments in power_perceiver before deciding whether to implement PVPhysicsPredictionDataSource in nowcasting_dataset.

Further notes will be continued in https://github.com/openclimatefix/power_perceiver/issues/10