Closed antoinecarme closed 1 year ago
The cForecastEngine.to_dict() call will produce something like this :
"Dataset": {
"Exogenous_Data": {
"Categorical_Variables": {
"Exog3": [
"AQ",
"AR",
"AS",
"AT",
"AU"
],
"Exog4": [
"P_T",
"P_R",
"P_U",
"P_S",
"P_Q"
]
},
"Continuous_Variables": {
"Exog2": {
"Mean": 6.411764705882353,
"StdDev": 3.4365094970361736
}
}
},
"Signal": "Ozone",
"Time": {
"Horizon": 12,
"TimeDelta": "<DateOffset: months=1>",
"TimeMax": "1971-12-01 00:00:00",
"TimeMin": "1955-01-01 00:00:00",
"TimeVariable": "Time"
},
"Training_Signal_Length": 204
},
Hierarchical forecasting models can have exogenous data at each node or at the whole tree level.
This task will produce a multitude of reports about each exogenous data usage.
We need more detail about exogenous data usage in ARX models in the logs. The same applies for XGBX, LGBX, LSTMX, MLPX, etc
This is just a reporting task, no need to change the underlying models,
Used categorical variables, categories used/excluded, their lags, their coefficient
Used continuous/numerical variables, their means, stddevs etc, their lags, their coefficients.
List of excluded variables/categories.
Something like :