Open peterdudfield opened 11 months ago
A good thing to do first would be to build a general pipeline that takes weather data and joins it with PV data.
It might be a case of writing this fresh, or using ocf_datapipes.
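As a rough illustration of the "write it fresh" option, here is a minimal standard-library sketch of joining weather data to PV data by timestamp. It is not how ocf_datapipes does it; the function name, the field names (`timestamp`, `generation_kw`, `shortwave_radiation`) and the one-hour `max_gap` are all assumptions made for the example. Each PV reading is matched to the most recent weather row at or before it (an as-of join), which suits hourly NWP data paired with more frequent PV readings.

```python
from bisect import bisect_right
from datetime import datetime, timedelta


def join_weather_to_pv(pv_rows, weather_rows, max_gap=timedelta(hours=1)):
    """As-of join: attach the latest weather row at or before each PV reading.

    pv_rows and weather_rows are lists of dicts that each carry a "timestamp"
    key (hypothetical schema). PV readings with no weather row within
    max_gap are dropped.
    """
    weather_sorted = sorted(weather_rows, key=lambda r: r["timestamp"])
    weather_times = [r["timestamp"] for r in weather_sorted]

    joined = []
    for pv in sorted(pv_rows, key=lambda r: r["timestamp"]):
        # Index of the last weather row whose timestamp is <= the PV timestamp.
        idx = bisect_right(weather_times, pv["timestamp"]) - 1
        if idx < 0:
            continue  # no weather data at or before this reading
        weather = weather_sorted[idx]
        if pv["timestamp"] - weather["timestamp"] > max_gap:
            continue  # nearest weather row is too old to be useful
        row = {k: v for k, v in weather.items() if k != "timestamp"}
        row.update(pv)  # keeps the PV timestamp and generation value
        joined.append(row)
    return joined


if __name__ == "__main__":
    t0 = datetime(2024, 1, 1, 10, 0)
    pv = [{"timestamp": t0 + timedelta(minutes=15 * i), "generation_kw": float(i)}
          for i in range(5)]
    weather = [{"timestamp": t0, "shortwave_radiation": 100.0},
               {"timestamp": t0 + timedelta(hours=1), "shortwave_radiation": 200.0}]
    for row in join_weather_to_pv(pv, weather):
        print(row)
```

A real pipeline would also need to match each PV site to the nearest NWP grid point; that step is left out here.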
Sorry to comment here, but I would like to ask about whether this paper is relevant to the related issue. https://www.sciencedirect.com/science/article/pii/S0960148123009035#tbl1
Hello @peterdudfield, I would like to work on this issue. Can you please assign me?
Hi @roshnaeem. If it's OK, I'll leave the assignees as they are so that it encourages lots of people to tackle this issue. Is that OK? Thank you so much for working on this; please write here if you have any questions.
Some general questions:
https://github.com/openclimatefix/pv-site-prediction, but I wouldn't get too stuck into this code. I think it would be better to write something fresh.
See above
Yea, but also the NWPs are going to be very important
Use the evaluation script: https://github.com/openclimatefix/Open-Source-Quartz-Solar-Forecast/blob/main/scripts/run_evaluation.py
Sure @peterdudfield, thank you. I am checking the code and will open a PR soon.
Thank you @peterdudfield for your guidance. I went through the `pv-site-prediction` and `ocf-datapipes` repositories to understand the basics. I have a couple of questions.
`ocf-datapipes` is integrating both types of data, right? Are we using it to provide training data? No live PV data means the model can run inference with only NWP data. This is what we have found lots of people want.
If you want to use it, yes. Currently it's not being used in the repo.
I'm not sure what you mean by two approaches? Could you clarify? I'm not sure I can tell you a step-by-step approach, but I can try to outline things.
A good thing to do first would be to build a general pipeline that takes weather data and joins it with PV data. It might be a case of writing this fresh, or using `ocf_datapipes`.
@peterdudfield I was talking about these two approaches you mentioned in this comment.
I'd probably try ocf_datapipes first, and if it doesn't suit, then try to write something fresh.
@peterdudfield, I have a few questions regarding the GSOC proposal.
`ocf_datapipes` as well as building new datapipes for the new model? I would leave the current model how it is, but aim to use ocf_datapipes for the new model.
Question: How can we use only NWP data to predict? We would need capacity and PV site data to get the NWP data. Does live PV data mean that we would be getting PV data in real time and predicting the generation in real time?
Yeah, it would be good to use PV metadata, like capacity, and NWP data in the model. The live PV data would also increase the accuracy of the model, but we've tried in this repo to keep that optional. So first of all, the model works with NWP and PV metadata.
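One way to picture "NWP plus PV metadata, with live PV optional" is a feature builder that always fills the same set of keys, whether or not a live reading is available. This is only a sketch under assumptions: the function name, the feature keys and the choice to normalise live PV by capacity are hypothetical and are not taken from the repo.

```python
def build_features(nwp, capacity_kwp, live_pv_kw=None):
    """Build one feature row from NWP values and site metadata.

    nwp: dict of weather variables for one timestamp (hypothetical keys).
    capacity_kwp: site capacity from the PV metadata; must be positive.
    live_pv_kw: most recent live PV reading, or None when unavailable.
    """
    if capacity_kwp <= 0:
        raise ValueError("capacity_kwp must be positive")

    features = dict(nwp)
    features["capacity_kwp"] = capacity_kwp
    # A flag lets the model distinguish "no live data" from "zero generation".
    features["has_live_pv"] = live_pv_kw is not None
    features["recent_pv_norm"] = (
        live_pv_kw / capacity_kwp if live_pv_kw is not None else 0.0
    )
    return features


if __name__ == "__main__":
    nwp = {"shortwave_radiation": 350.0, "cloud_cover": 40.0}
    print(build_features(nwp, capacity_kwp=4.0))                  # NWP + metadata only
    print(build_features(nwp, capacity_kwp=4.0, live_pv_kw=1.0))  # with live PV
```

Keeping the feature set identical in both cases means the same model can run with or without live PV data.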
Hi,
I am working on #27 and this discussion helped a lot, thanks!
I explored the project and ended up in `psp`, since it contains the code to train the models. I ran its model training and evaluation after setting up the environment, but wasn't able to use its result (`.pkl`) directly as a model (substituting the current default model in `forecast_v1` with `psp`'s `test_config1` model `.pkl`). Should using `psp`'s model directly be possible?
Hi @peterdudfield, I would like to work on this issue. I'll be available from March 28.
Can I work on this issue?
Can you make a new model and beat the current evaluation metrics?
You need to build a model to forecast PV. The PV dataset is all here, and we also want the model to run like the current model, i.e. pulling NWP data from open-meteo.
We need a model that can forecast 48 hours ahead, in 15-minute intervals. We want it to run live without live PV data, but a good optional extra would be to include PV data.
This is fairly open-ended in order not to restrict anyone.
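To make the horizon concrete: 48 hours at 15-minute resolution is 192 forecast steps. The sketch below generates the target timestamps for one forecast run. The function name is hypothetical, and whether the forecast start time itself counts as a step is an assumption; here the first step is 15 minutes after the start.

```python
from datetime import datetime, timedelta


def forecast_timestamps(start, horizon=timedelta(hours=48), step=timedelta(minutes=15)):
    """Return the timestamps a forecast starting at `start` should cover.

    The start time itself is excluded (an assumption); the last timestamp
    is exactly start + horizon.
    """
    n_steps = horizon // step  # timedelta // timedelta gives an int
    return [start + step * i for i in range(1, n_steps + 1)]


if __name__ == "__main__":
    stamps = forecast_timestamps(datetime(2024, 1, 1, 0, 0))
    print(len(stamps), stamps[0], stamps[-1])
```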