Closed by petrelharp 4 years ago
Might make sense to reorder this section and put \section{Tree sequences} before this so that we have the terminology and data structure in mind before talking about computing things on trees?
Well, I wanted to keep it the way it is, but Reviewer 2 says just the same thing. I'll see what I can do.
When computing statistics, the factor $\frac{1}{L}$ appears in some places but not in others. Is there a reason why you normalize by sequence length only some of the time?
Well, the answer is that we don't normalize when looking at a single object (a single site or a single tree), but we do when looking at a stretch of the genome. I'm not sure where to explain this, though. We do introduce the Site and Branch stats as averages over a region of the genome?
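To make the distinction concrete, here is a minimal sketch of the normalization described above. It is not code from the paper: the per-tree spans and values are made-up numbers, and `single_tree_stat` and `region_stat` are hypothetical helper names. It only illustrates that a value for one tree carries no $\frac{1}{L}$, while a value for a region is a span-weighted sum divided by the region length $L$.

```python
# Hypothetical data: each entry is (span of the tree in bp, statistic on that tree).
# These numbers are illustrative only and do not come from the paper.
trees = [(400.0, 2.0), (100.0, 3.0), (500.0, 1.0)]


def single_tree_stat(tree):
    """Statistic for one tree: reported as-is, with no 1/L factor."""
    _span, value = tree
    return value


def region_stat(trees):
    """Statistic for a stretch of genome: span-weighted sum divided by L."""
    total_length = sum(span for span, _ in trees)  # this is L
    weighted_sum = sum(span * value for span, value in trees)
    return weighted_sum / total_length


print(single_tree_stat(trees[0]))  # value on the first tree alone
print(region_stat(trees))          # average over the whole region
```

The same pattern would apply to Site statistics, with a sum over sites in the region in place of the span-weighted sum over trees.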
Factor of 2 in Example 2
We've got an explanatory note after Example 2 now.
Figure 5 is a bit confusing - why are the scales so different between site and branch stats?
Well, the caption explains that "The ratio of the Site statistic to the Branch statistic [...] hovers around typical per-generation estimates of the human mutation rate"
I think maybe this was talking about a previous version of the figure.
Ok - I think we've actually dealt with all these already, except as noted above. Any objections to closing this?
Nope, closing.
Great comments from @apragsdale: