Closed samreid closed 1 year ago
We can definitely discuss this with others at the design meeting. For context, here is an email exchange between me and Marilyn Hartzell, who was a part of the original pedagogy team.
My email:
On the MAD screen, students can press an "Info" button to see how the MAD is calculated. I'm curious about how you would word the definition of the MAD. In my high school experience, we word it like this, "MAD is the average distance each data point is from the mean." My definition may be confusing for 6th graders. Can you help me word the MAD definition?
Her reply:
For the wording I would use the same that you described as my initial definition. Then I would break it down into less formal language. Something along the lines of, “basically the MAD let’s us know how far away the data is from the mean, it helps us to know if the data is bunched/close together when the MAD is small or when the data is spread out when the MAD is a larger number”. The exact wording can be adjusting but I definitely talk about distance using less formal language and refer to the average as the mean. Standardized testing always asks about the mean and the not the average in the middle grades, so they are more familiar with “mean” than “average”.
After careful thought, research, and consideration...
MAD stands for
Mean (average)
Absolute (absolute value)
Deviation (difference between a single data point and the mean, in that order, for all data points).
Previously, @samreid brought up that "The MAD is the average distance each data point is from the mean" was difficult to parse because of the two "is" words. In class while teaching, the phrase, "MAD is..." is typically followed by a pause, and the phrase, "the average distance each data point is from the mean" makes sense. The definition of the deviation is this: the distance a data point is from the mean, or $x_i – \bar{x}$ ($x_i$ is the ith data point, and $\bar{x}$ is the mean of all data points). Since the deviations can be negative (indicating the data point is below the mean), we have to take the absolute value to create a distance (forcing a positive deviation).
(Fun fact: the sum of the deviations is always zero since the mean is the balancing point. The mean balances all of the positive and negative deviations - a surprise every time!)
Taking all that into consideration, let's go with: "MAD is the average distance from each data point to the mean."
Removing the meeting label, since this has already had enough time and careful consideration, with people weighing in. I agree with @catherinecarter 's final definition.
Updated, closing.
I said:
@catherinecarter said:
I replied:
Thanks, I updated it in this commit. https://github.com/phetsims/center-and-variability/commit/33b2eb0618046788176b74e20ca697c2ee5a91f9. I’m still trying to identify why it bothered me in the first place. I guess for me it’s like the difference between these 2 sentences:
To me, the latter is much easier to parse, understand and convey, even for middle schoolers. I don’t see the advantage at all of the top sentence. But I’ve committed it and if I’m the only one that feels this way, we should move on.