openclimatefix / power_perceiver

Machine learning experiments using the Perceiver IO model to forecast the electricity system (starting with solar)
MIT License
7 stars 1 forks source link

Automatically detect "bad" PV data #180

Open JackKelly opened 2 years ago

JackKelly commented 2 years ago

Some automatic checks for each PV system's PV power timeseries:

bndxn commented 2 years ago

Another potential idea: what about comparing yield with other systems nearby? I guess they might have different angles/positions etc. This is what I did when looking at PV stations in Devon.

image

image

JackKelly commented 2 years ago

Another potential idea: what about comparing yield with other systems nearby

That is an interesting idea. I guess one challenge is: what happens in the worst case when all the "near neighbours" are bad? Then the PV system currently under consideration might be thrown away because it doesn't look like its bad neighbours :slightly_smiling_face:

To start with, we probably just want to throw away the "really obviously bad" data :slightly_smiling_face: Like PV systems which report zero power for an entire week. Or PV systems which always report 1 kW, even at night :slightly_smiling_face: Stuff like that :slightly_smiling_face:

bndxn commented 2 years ago

Thanks! Yeah that makes sense. I would guess that if you compare system k with n nearby systems, then as n increases, the chance that the majority of the n nearby systems are bad decreases. On the other hand if more systems are required for comparison they'd be more geographically dispersed so maybe the correlation between their yields would decrease.