phetsims / center-and-variability

"Center and Variability" is an educational simulation in HTML5, by PhET Interactive Simulations.
GNU General Public License v3.0
1 stars 2 forks source link

Wording for the MAD description #163

Closed samreid closed 1 year ago

samreid commented 1 year ago

I said:

We discussed phrasing it as “MAD is the average distance between each point and the mean.” however, that seems too easy for someone to misread and think it is about distance between points. So should I leave it as “MAD is the average distance from each point to the mean.“?

@catherinecarter said:

I like putting the mean as the starting point, let’s switch the order: “MAD is the average distance from the mean to each point.” I’ll email Marilyn to confirm. I confirmed with Marilyn, the design doc wording is the way to go. “MAD is the average distance each data point is from the mean.” She said this is how it should be worded, and then teachers can unpack what it means using more informal language. The informal language varies from teacher to teachers, so this more formal language is the grounding definition.

I replied:

Thanks, I updated it in this commit. https://github.com/phetsims/center-and-variability/commit/33b2eb0618046788176b74e20ca697c2ee5a91f9. I’m still trying to identify why it bothered me in the first place. I guess for me it’s like the difference between these 2 sentences:

To me, the latter is much easier to parse, understand and convey, even for middle schoolers. I don’t see the advantage at all of the top sentence. But I’ve committed it and if I’m the only one that feels this way, we should move on.

catherinecarter commented 1 year ago

We can definitely discuss this with others at the design meeting. For context, here is an email exchange between me and Marilyn Hartzell, who was a part of the original pedagogy team.

My email:

On the MAD screen, students can press an "Info" button to see how the MAD is calculated. I'm curious about how you would word the definition of the MAD. In my high school experience, we word it like this, "MAD is the average distance each data point is from the mean." My definition may be confusing for 6th graders. Can you help me word the MAD definition?

Her reply:

For the wording I would use the same that you described as my initial definition. Then I would break it down into less formal language. Something along the lines of, “basically the MAD let’s us know how far away the data is from the mean, it helps us to know if the data is bunched/close together when the MAD is small or when the data is spread out when the MAD is a larger number”. The exact wording can be adjusting but I definitely talk about distance using less formal language and refer to the average as the mean. Standardized testing always asks about the mean and the not the average in the middle grades, so they are more familiar with “mean” than “average”.

catherinecarter commented 1 year ago

After careful thought, research, and consideration...
MAD stands for Mean (average) Absolute (absolute value) Deviation (difference between a single data point and the mean, in that order, for all data points).

Previously, @samreid brought up that "The MAD is the average distance each data point is from the mean" was difficult to parse because of the two "is" words. In class while teaching, the phrase, "MAD is..." is typically followed by a pause, and the phrase, "the average distance each data point is from the mean" makes sense. The definition of the deviation is this: the distance a data point is from the mean, or $x_i – \bar{x}$ ($x_i$ is the ith data point, and $\bar{x}$ is the mean of all data points). Since the deviations can be negative (indicating the data point is below the mean), we have to take the absolute value to create a distance (forcing a positive deviation).

(Fun fact: the sum of the deviations is always zero since the mean is the balancing point. The mean balances all of the positive and negative deviations - a surprise every time!)

Taking all that into consideration, let's go with: "MAD is the average distance from each data point to the mean."

amanda-phet commented 1 year ago

Removing the meeting label, since this has already had enough time and careful consideration, with people weighing in. I agree with @catherinecarter 's final definition.

samreid commented 1 year ago

Updated, closing.