lagotto / data-quality

ALM data quality code
0 stars 1 forks source link

Timing of alerts #3

Closed sckott closed 9 years ago

sckott commented 9 years ago

@mfenner @jenniferlin15 I notice that timing of alerts is interesting, in that I'm looking at alerts e.g. created on September 2, 2014 for a HtmlRatioTooHighError class alert with HTML:PDF ratio of 604, and when I look at the page for that article 10.1371/journal.pone.0010894, yes the ratio is high, but it seemed to be even higher in the past. With these alerts, they don't necessarily point to the worst offending data points for the alert class. I get that this doesn't make sense to point to historical data points, as those stay the same.

Is it the data at the time of the alert that's of interest, or the article itself, meaning, whatever may be going at through the life of that article.

Apologies if I'm not making myself clear

mfenner commented 9 years ago

Very clear. The data quality of an article itself is what is of interest. The alerts are only one way to get to the problematic articles.

sckott commented 9 years ago

Okay, thanks, just trying to get some clarity on the bigger picture