DOV-Vlaanderen / groundwater-logger-validation

Analysis on validation methods for groundwater logger data
MIT License
2 stars 2 forks source link

Drifts 12: analysis of barometers based on KNMI series as reference #57

Open DavorJ opened 4 years ago

DavorJ commented 4 years ago

This analysis is similar to #48, but here we use the #56 KNMI series as reference.

Here is a sample plot:

image

The top left is the comparison between the KNMI series (red) and some barometer (black). Each series is centered around the median for a better comparison. The black is plotted over the red, so that if there is no visible shift, the red should hardly be visible.

The bottom left curve is the difference between the two. This is a good visualization for the drift. For visual convenience I have added a black horizontal line which is the median of the first 25 % of the differences: assuming the first 25 % is correct, the rest should not deviate much from it.

The bottom right histogram is a summary of the bottom left differences. And the top right is a summary of the histogram.

This is how a good barometer looks like:

image

I have compiled all the graphs in a single pdf, going from worst case to best case (based on whisker range of the differences).

ToDo: compare this analysis with #51 and #55.

fredericpiesschaert commented 4 years ago

I like these graphs, they seem to give a good indication when things start to go wrong. Perhaps outliers should be excluded because they obscure possible drift signs, e.g. image I see something happening at the end of the series but the outliers at the start spoil the fun

fredericpiesschaert commented 4 years ago

I see at least four different things: 1) all well, 'perfect' series, difference and variance are stable and acceptable image 2) steady difference but increasing variance image this seems to happen quite a lot and often very suddenly: image 3) seasonal component image 4) what I would call true drift: image

it would be nice to understand the physical meaning of these cases, especially case 2 and 3 (on the assumption that it is not some kind of a data-artefact that we are overlooking)