SverreNystad / power-predictor

Using Machine Learning for time series forecasting of photovoltaic measurement for solar systems based on weather features

Exploratory data analysis #2

Open Gunnar2908 opened 8 months ago

Gunnar2908 commented 8 months ago

If the data turns out to be non-normal, we must find a distribution that fits it, such as log-normal, exponential, or a more domain-specific distribution like the Weibull distribution (often used for lifespan analyses). The professors have said that the public and private datasets will not be identical, but they will follow the same distribution.
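One possible way to compare the candidate distributions is sketched below. It assumes SciPy is available, uses synthetic log-normal data in place of the real measurements, and keeps only positive values because the log-normal, exponential and Weibull candidates are fitted with their location fixed at zero. The function name `compare_distributions` is hypothetical and not part of the repository.

```python
import numpy as np
from scipy import stats


def compare_distributions(values, candidates=("norm", "lognorm", "expon", "weibull_min")):
    """Fit each candidate by maximum likelihood and rank them by KS statistic."""
    values = np.asarray(values, dtype=float)
    # Assumption: drop non-finite and non-positive values, since three of the
    # candidates only have support on x > 0 when loc is fixed at 0.
    values = values[np.isfinite(values) & (values > 0)]
    results = []
    for name in candidates:
        dist = getattr(stats, name)
        # The normal distribution keeps a free location; the others are anchored at 0.
        params = dist.fit(values) if name == "norm" else dist.fit(values, floc=0)
        # Note: p-values are optimistic because the parameters were estimated
        # from the same data, so use the statistic for ranking only.
        ks_stat, p_value = stats.kstest(values, name, args=params)
        results.append((name, ks_stat, p_value))
    return sorted(results, key=lambda r: r[1])  # smallest KS distance first


# Synthetic stand-in for a measurement column (not real project data).
rng = np.random.default_rng(0)
sample = rng.lognormal(mean=1.0, sigma=0.5, size=2000)

ranking = compare_distributions(sample)
for name, ks_stat, p_value in ranking:
    print(f"{name:12s} KS={ks_stat:.4f} p={p_value:.4f}")
```

A smaller KS statistic means the fitted CDF lies closer to the empirical one. A Q-Q plot against the best candidates would be a sensible visual follow-up.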

SverreNystad commented 8 months ago

We need to fix some parts of the EDA. We added a lot of functionality over the last day, but we need to tidy it up and fix some bugs. They are not hard; they just require some reading up on the pandas docs. :)

SverreNystad commented 8 months ago

The bug is now fixed. We now have most of the analysis points covered.

SverreNystad commented 8 months ago

Someone should read the docs for wavelets so we can have a greater understanding of wavelet analysis and experiment with different wavelets: https://pywavelets.readthedocs.io/en/latest/ref/wavelets.html
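As a starting point for that experimentation, here is a minimal sketch using PyWavelets. It decomposes a synthetic signal (a stand-in, not project data) with a few wavelets and reports how the signal energy is split across decomposition levels. The helper `energy_by_level`, the choice of wavelets and the level of 4 are all assumptions made for illustration.

```python
import numpy as np
import pywt

# Synthetic signal: a slow oscillation plus a weaker fast one.
t = np.linspace(0, 1, 512, endpoint=False)
signal = np.sin(2 * np.pi * 4 * t) + 0.3 * np.sin(2 * np.pi * 60 * t)


def energy_by_level(data, wavelet, level=4):
    """Return the share of signal energy in each band of a discrete wavelet decomposition.

    pywt.wavedec returns [cA_level, cD_level, ..., cD_1]: one approximation
    band followed by detail bands from coarse to fine.
    """
    coeffs = pywt.wavedec(data, wavelet, level=level)
    energies = np.array([np.sum(c ** 2) for c in coeffs])
    return energies / energies.sum()


for name in ("haar", "db4", "sym5"):
    shares = energy_by_level(signal, name)
    print(name, np.round(shares, 3))
```

Comparing the energy split across wavelets gives a rough feel for how each one separates slow trends from fast fluctuations. `pywt.wavelist(kind="discrete")` lists the other discrete wavelets that could be tried.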

SverreNystad commented 8 months ago

Need to implement SHAP: https://medium.com/dataman-in-ai/explain-your-model-with-the-shap-values-bc36aac4de3d
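To make clear what SHAP values represent before wiring in the library, the sketch below computes exact Shapley values by brute force for a tiny hypothetical linear model. It does not use the `shap` package: it enumerates all feature subsets and replaces absent features with rows from a background sample, which is only feasible for a handful of features. The model, weights and data are invented for illustration.

```python
from itertools import combinations
from math import factorial

import numpy as np


def exact_shapley(predict, x, background):
    """Exact Shapley values for one input row x.

    Features outside the subset keep their values from the background rows,
    and the prediction is averaged over those rows.
    """
    n = len(x)

    def value(subset):
        data = background.copy()
        idx = list(subset)
        if idx:
            data[:, idx] = x[idx]  # features in the subset take the values from x
        return predict(data).mean()

    phi = np.zeros(n)
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Shapley weight for subsets of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                phi[i] += weight * (value(subset + (i,)) - value(subset))
    return phi


# Hypothetical linear model standing in for a trained predictor.
weights = np.array([2.0, -1.0, 0.5])
predict = lambda X: X @ weights + 3.0

rng = np.random.default_rng(0)
background = rng.normal(size=(100, 3))  # synthetic background sample
x = np.array([1.0, 2.0, -1.0])

phi = exact_shapley(predict, x, background)
print("Shapley values:", np.round(phi, 4))
print("Sum of values:", round(phi.sum(), 4))
print("Prediction minus baseline:", round(predict(x[None, :])[0] - predict(background).mean(), 4))
```

The last two printed numbers should agree, since the contributions add up to the difference between the prediction and the average prediction. The `shap` library estimates the same quantity efficiently for real models, where enumerating every subset is not practical.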