Closed by aappling-usgs 6 years ago
Cool!
I'm using the Heidke Skill Score to assess model performance for forecasting a threshold exceedance ('yes' or 'no') that we arbitrarily set for nitrate flux. Some notes on the Heidke Skill Score (HSS):
The HSS measures the fractional improvement of the forecast over the standard forecast, which for the HSS is the accuracy expected from random chance. Like most skill scores, it is normalized by the total range of possible improvement over the standard, which means Heidke Skill Scores can safely be compared across different datasets. The HSS has an upper bound of 1; for a two-category (yes/no) forecast like this one the lower bound is -1, although some references quote it as -∞. Negative values indicate that the chance forecast is better, 0 means no skill, and a perfect forecast obtains an HSS of 1.
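For reference, here is a minimal Python sketch of the HSS for a 2x2 (yes/no) contingency table, using the usual "correct minus expected-by-chance" form. The function name and argument names are hypothetical and not taken from the project code; it only assumes the four cell counts are already available.

```python
def heidke_skill_score(hits, false_alarms, misses, correct_negatives):
    """HSS for a yes/no forecast, from the four contingency-table counts.

    Hypothetical helper for illustration; not the project's implementation.
    """
    n = hits + false_alarms + misses + correct_negatives
    if n == 0:
        raise ValueError("contingency table is empty")

    # Number of forecasts that were actually correct.
    correct = hits + correct_negatives

    # Number of correct forecasts expected by chance, given the same
    # marginal totals of forecast 'yes'/'no' and observed 'yes'/'no'.
    expected = (
        (hits + false_alarms) * (hits + misses)
        + (misses + correct_negatives) * (false_alarms + correct_negatives)
    ) / n

    # If every forecast and observation falls in one category, chance
    # already gets everything right and the score is undefined.
    if n == expected:
        return float("nan")

    return (correct - expected) / (n - expected)
```

A table with only hits and correct negatives scores 1, and a table where the forecast is right exactly as often as chance would be scores 0.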
For the first two panels, HSS improves as lead time decreases. The last panel shows that HSS is highest at fairly long lead times, which is similar to the relative flux error plots in #55. The last panel also has an HSS above 0 for all lead times, which I think indicates that for this river (Mississippi R.) the forecasts have some skill, or at least do better than chance.
I really like this updated way of displaying the model skill. It seems clearer to me vs the boxplot one.
Agreed, this is cool!
goal is to evaluate the model's ability to correctly forecast threshold exceedance events. will need to make up some arbitrary threshold (be consistent with #58)
maybe show one contingency table, many scores broken up by month or high/low flow or lead time bin
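A rough sketch of how the contingency table could be tallied, overall and per group (e.g. lead time bin), in Python. Everything here is an assumption for illustration: the function names, the record layout `(group, forecast, observed)`, and the choice to treat exceedance as strictly greater than the threshold.

```python
from collections import defaultdict


def contingency_counts(forecast, observed, threshold):
    """Tally the 2x2 table for forecast vs. observed threshold exceedance.

    Assumes 'exceedance' means value > threshold (an arbitrary choice).
    """
    counts = {"hits": 0, "false_alarms": 0, "misses": 0, "correct_negatives": 0}
    for f, o in zip(forecast, observed):
        forecast_yes = f > threshold
        observed_yes = o > threshold
        if forecast_yes and observed_yes:
            counts["hits"] += 1
        elif forecast_yes:
            counts["false_alarms"] += 1
        elif observed_yes:
            counts["misses"] += 1
        else:
            counts["correct_negatives"] += 1
    return counts


def contingency_counts_by_group(records, threshold):
    """Tally one table per group from (group, forecast, observed) records.

    The group could be a month, a high/low flow class, or a lead time bin.
    """
    forecasts = defaultdict(list)
    observations = defaultdict(list)
    for group, f, o in records:
        forecasts[group].append(f)
        observations[group].append(o)
    return {
        group: contingency_counts(forecasts[group], observations[group], threshold)
        for group in forecasts
    }
```

Each per-group table can then be passed to whichever categorical score is chosen from the links below.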
options for metrics: http://www.eumetrain.org/data/4/451/english/msg/ver_categ_forec/uos3/uos3_ko1.htm, https://en.wikipedia.org/wiki/Forecast_skill
whiteboarded notes: