Open shane-kercheval opened 3 years ago
Also, just wanted to say, this is a fantastic book; I'm getting a lot of practical insights out of it. Thank you.
Hi, thanks for the kind words. Regarding the arsenic level: yes, I see your point. Something like the median and the 90th percentile, or the 25th and 75th percentile, would make more sense. I don't know if I have the energy to re-make all these graphs, but it would make sense to add a couple of sentences to make this point.
Figures 13.10 and 14.3 (a) show the probability of switching wells as a function of distance and a given level of arsenic.
The values of
arsenic
used are0.5
and1.0
.The minimum value of arsenic in the data is
0.51
, so perhaps 0.5 is close enough.But the value of
1
is something like the35% percentile
in the data. I'm not sure this is a good representation, and seems arbitrary, but perhaps I'm missing something.The median value of arsenic is
1.3
.Plotting the min/median/max values of arsenic (or even something like ~95% percent, which is ~3.79), I think provide better insight.
The values 0.5-1 of arsenic used in the graph is on the lower levels and so to say, in the book, "the interaction is small in the range of most of the data" seem misleading because we're only looking at lower levels of arsenic.
(graph b uses a distance of 0 to 50, which seems more reasonable because its covering the "closest" 65% of data (median dist is 36.8), and 50 meters is a nice conceptual value; although seems like a min/medium/max approach could benefit this graph as well)
Here's a screenshot at different levels.
code below, along with the non-interaction model