fig: error vs lead time overall

aappling-usgs commented 6 years ago

img_20180510_162131431

jzwart commented 6 years ago

Are dots preferred or boxplots? Could also remove outliers for boxplots and put the model range side-by-side for each lead time Or z_scored error for comparison across sites

aappling-usgs commented 6 years ago

Nice set of options! This is going to be a really cool plot.

What would you think about reversing the x axis so that lead time = 0 is at the right, and as your eye moves left to right you're moving from long-before to shortly-before? That seems more intuitive to me at the moment.
Looks to me like we need to choose a version that emphasizes the change in the more central values rather than the spread - box plots help but are hard to see with the outliers...what does it look like with boxes only? Or how do you feel about violin plots, also with outliers removed or y axes truncated?
I'd be interested to see a version with abs(error) on the y axis - that would give us a little more vertical room to work with. It would take away the info about which direction the bias is in, but I think I'd be OK with that. It would also open up the option of logging the y axis to emphasize the stuff happening near 0 (where the majority of the errors are) if that's still needed after removing outliers.
Another option for the y values, either abs or not and logged or not, is to compute errors relative to truth on each date, 100% * (pred - truth) / truth. That would help with making the three sites have more comparable axes, in the spirit of the above z score option but with more environmentally meaningful units.

aappling-usgs commented 6 years ago

another option if box plots or violin plots aren't doing it would be to introduce jitter along the x axis. But I suspect boxes or violins will be better.

jzwart commented 6 years ago

I'll reverse the x-axis and work up a few more plot examples with the bullet points you listed. Good comments!

jzwart commented 6 years ago

Violin plots don't look great; they are too narrow to be useful. And jittered points are really busy so I think boxplots are the way to go.

boxplots are looking OK but I can't get the LeadTime 10-29 to be the same width as the 0-9 LeadTime boxplots. I have to adjust site labels too

absolute relative error is below:

aappling-usgs commented 6 years ago

Nice! Now the results are really popping out. Thanks for trying violin and scatter plots, and I'm content with your conclusion that boxplots are the way to go. I'm also OK with the blue/red pairs being narrower than the red-only days, especially if it's a headache to get ggplot to do something else.

I think if we show just one plot, it should be the relative error plot, though I could see including both absolute and relative errors. Do you agree?

You labeled that relative error plot "absolute relative error" - does it work to make it non-absolute so that we get some negatives in there? I'm interested in knowing/reporting the direction of bias, if there is one.

jzwart commented 6 years ago

Yeah, I think the relative error is the most useful, although magnitude of error can also be useful. We'll see if there's space for both. Or we could put some summary stats for stream flux and/or error that would give a sense for magnitude of flux error.

here is non-absolute relative error:

aappling-usgs commented 6 years ago

Cool! Could you add a horizontal line at 0 for reference, get the site labels not to overlap the data, and call it good?

jzwart commented 6 years ago

Yup! I'll make a PR with those changes

USGS-R / protoloads

fig: error vs lead time overall #55