eco4cast / unconf-2023

Brainstorming repo to propose and discuss unconference project ideas!
12 stars 0 forks source link

Machine learning models in automated NEON challenge forecasts #18

Open robbinscalebj opened 1 year ago

robbinscalebj commented 1 year ago

The Theory working group has been automating forecasts in a machine learning framework. Forecasts fit with linear regression, LASSO regression, random forest, and bagged multi-layer perceptron (a neural network) are actively submitting to the challenge. We developed the code as somewhat of a plug 'n' play for tidymodels, with code here: https://github.com/eco4cast/Forecast_submissions/tree/main/Generate_forecasts.

Some needs include:

robbinscalebj commented 1 year ago

Expanding a bit on my previous post:

The overall goal of this project as it was started within the Theory working group (btw, all are welcome!) is to create a variety of automated forecasts that can tell us about the 'realized' predictability of different ecological variables across time and space (others in the Theory working group are looking explicitly at 'intrinsic' predictability). Machine learning models provide a good way to forecast different ecological phenomena in ways that are easily deployable. At some point relatively soon, we want a whole suite to be up and running. As mentioned above, we have an existing forecasting framework set up that can be readily expanded.

If this project is selected, I think some brainstorming followed by coding could be fruitful in accelerating our progress. So, what are the best strategies for improving the utility, interpretability and applicability of the Theory working group models to the goals above, or more broadly? What should be prioritized for coding at Unconference?