Closed aappling-usgs closed 6 years ago
Are dots preferred or boxplots? Could also remove outliers for boxplots and put the model range side-by-side for each lead time Or z_scored error for comparison across sites
Nice set of options! This is going to be a really cool plot.
100% * (pred - truth) / truth
. That would help with making the three sites have more comparable axes, in the spirit of the above z score option but with more environmentally meaningful units.I'll reverse the x-axis and work up a few more plot examples with the bullet points you listed. Good comments!
Violin plots don't look great; they are too narrow to be useful. And jittered points are really busy so I think boxplots are the way to go.
boxplots are looking OK but I can't get the LeadTime 10-29 to be the same width as the 0-9 LeadTime boxplots. I have to adjust site labels too
absolute relative error is below:
Nice! Now the results are really popping out. Thanks for trying violin and scatter plots, and I'm content with your conclusion that boxplots are the way to go. I'm also OK with the blue/red pairs being narrower than the red-only days, especially if it's a headache to get ggplot to do something else.
I think if we show just one plot, it should be the relative error plot, though I could see including both absolute and relative errors. Do you agree?
You labeled that relative error plot "absolute relative error" - does it work to make it non-absolute so that we get some negatives in there? I'm interested in knowing/reporting the direction of bias, if there is one.
Yeah, I think the relative error is the most useful, although magnitude of error can also be useful. We'll see if there's space for both. Or we could put some summary stats for stream flux and/or error that would give a sense for magnitude of flux error.
here is non-absolute relative error:
Cool! Could you add a horizontal line at 0 for reference, get the site labels not to overlap the data, and call it good?
Yup! I'll make a PR with those changes