Lesson 11: Deviation from the Mean

Let's study distances between data points and the mean and see what they tell us. 

11.1: Shooting Hoops (Part 1)

Elena, Jada, and Lin enjoy playing basketball during recess. Lately, they have been practicing free throws. They record the number of baskets they make out of 10 attempts. Here are their data sets for 12 school days.

Elena 4 5 1 6 9 7 2 8 3 3 5 7
Jada 2 4 5 4 6 6 4 7 3 4 8 7
Lin 3 6 6 4 5 5 3 5 4 6 6 7
  1. Calculate the mean number of baskets each player made, and compare the means. What do you notice?
  2. What do the means tell us in this context?

11.2: Shooting Hoops (Part 2)

Here are the dot plots showing the number of baskets Elena, Jada, and Lin each made over 12 school days.

  1. On each dot plot, mark the location of the mean with a triangle ($\Delta$). Then, contrast the dot plot distributions. Write 2–3 sentences to describe the shape and spread of each distribution.
Three dot plots. One for “number of baskets Elena made”, one for “number of baskets Jada made”, and one for “number of baskets Lin made.” On each dot plot, the numbers 0 through 10 are indicated.  On the dot plot labeled “number of baskets Elena made” the data are as follows:  1 basket, 1 dot. 2 baskets, 1 dot. 3 baskets, 2 dots. 4 baskets, 1 dot. 5 baskets, 2 dots. 6 baskets, 1 dot. 7 baskets, 2 dots. 8 baskets, 1 dot. 9 baskets, 1 dot.  On the dot plot labeled “number of baskets Jada made” the data are as follows:  2 baskets, 1 dot. 3 baskets, 1 dot. 4 baskets, 4 dots. 5 baskets, 1 dot. 6 baskets, 2 dots. 7 baskets, 2 dots. 8 baskets, 1 dot.  On the dot plot labeled “number of baskets Lin made” the data are as follows:  3 baskets, 2 dots. 4 baskets, 2 dots. 5 baskets, 3 dots. 6 baskets, 4 dots. 7 baskets, 1 dot.
 
  1. Discuss the following questions with your group. Explain your reasoning.

    1. Would you say that all three students play equally well?
    2. Would you say that all three students play equally consistently?
    3. If you could choose one player to be on your basketball team based on their records, who would you choose?

11.3: Shooting Hoops (Part 3)

The tables show Elena, Jada, and Lin’s basketball data from an earlier activity. Recall that the mean of Elena’s data, as well as that of Jada and Lin’s data, was 5. 

  1. Record the distance between each of Elena’s scores and the mean. 
    Elena 4 5 1 6 9 7 2 8 3 3 5 7
    distance from 5 1     1                

    Now find the average of the distances in the table. Show your reasoning and round your answer to the nearest tenth.

    This value is the mean absolute deviation (MAD) of Elena’s data.

    Elena’s MAD: _________

  2. Find the mean absolute deviation of Jada’s data. Round it to the nearest tenth.
    Jada 2 4 5 4 6 6 4 7 3 4 8 7
    distance from 5                        

    Jada’s MAD: _________

  3. Find the mean absolute deviation of Lin’s data. Round it to the nearest tenth.
    Lin 3 6 6 4 5 5 3 5 4 6 6 7
    distance from 5                        

    Lin’s MAD: _________

  4. Compare the MADs and dot plots of the three students’ data. Do you see a relationship between each student’s MAD and the distribution on her dot plot? Explain your reasoning.

    Three dot plots labeled “number of baskets Elena made,” “number of baskets Jada made,” and “number of baskets Lin made” are indicated. Each dot plot has the numbers 0 through 10 indicated with a triangle at the number 5.  The dot plot “number of baskets Elena made,” has the following data:  1 basket, 1 dot. 2 baskets, 1 dot. 3 baskets, 2 dots. 4 baskets, 1 dot. 5 baskets, 2 dots. 6 baskets, 1 dot. 7 baskets, 2 dots. 8 baskets, 1 dot. 9 baskets, 1 dot.  The dot plot “number of baskets Jada made,” has the following data:  2 baskets, 1 dot. 3 baskets, 1 dot. 4 baskets, 4 dots. 5 baskets, 1 dot. 6 baskets, 2 dots. 7 baskets, 2 dots. 8 baskets, 1 dot.  The dot plot “number of baskets Lin made,” has the following data:  3 baskets, 2 dots. 4 baskets, 2 dots. 5 baskets, 3 dots. 6 baskets, 4 dots. 7 baskets, 1 dot.

11.4: Game of 22

Your teacher will give your group a deck of cards. Shuffle the cards, and put the deck face down on the playing surface.

  • To play: Draw 3 cards and add up the values. An ace is a 1. A jack, queen, and king are each worth 10. Cards 2–10 are each worth their face value. If your sum is anything other than 22 (either above or below 22), say: “My sum deviated from 22 by ____ ,” or “My sum was off from 22 by ____ .”
  • To keep score: Record each sum and each distance from 22 in the table. After five rounds, calculate the average of the distances. The player with the lowest average distance from 22 wins the game.
player A round 1 round 2 round 3 round 4 round 5
sum of cards          
distance from 22          

Average distance from 22: ____________

player B round 1 round 2 round 3 round 4 round 5
sum of cards          
distance from 22          

Average distance from 22: ____________

player C round 1 round 2 round 3 round 4 round 5
sum of cards          
distance from 22          

Average distance from 22: ____________

Whose average distance from 22 is the smallest? Who won the game?

Summary

We use the mean of a data set as a measure of center of its distribution, but two data sets with the same mean could have very different distributions.

This dot plot shows the weights, in grams, of 22 cookies.

A dot plot for “cookie weights in grams.” The numbers 8 through 34, in increments of 2, are indicated. A triangle is indicated at 21 grams. The data are as follows: 18 grams, 1 dot; 19 grams, 3 dots; 20 grams, 4 dots; 21 grams, 5 dots; 22 grams, 6 dots; 23 grams, 2 dots, 24 grams, 1 dot.

The mean weight is 21 grams. All the weights are within 3 grams of the mean, and most of them are even closer. These cookies are all fairly close in weight.

This dot plot shows the weights, in grams, of a different set of 30 cookies. 

A dot plot for “cookie weights in grams.” The numbers 8 through 34, in increments of 2, are indicated. A triangle is indicated at 21 grams. The data are as follows: 9 grams, 1 dot; 10 grams, 1 dot; 11 grams, 2 dots; 12 grams, 1 dot; 14 grams, 1 dot; 16 grams, 2 dots; 17 grams, 1 dot; 18 grams, 2 dots; 19 grams, 1 dot; 20 grams, 3 dots; 21 grams, 1 dot; 22 grams, 3 dots; 23 grams, 1 dot; 24 grams, 2 dots; 26 grams, 2 dots; 28 grams, 1 dot; 30 grams, 1 dot; 32 grams, 2 dots; 33 grams, 1 dot; 34 grams, 1 dot.

The mean weight for this set of cookies is also 21 grams, but some cookies are half that weight and others are one-and-a-half times that weight. There is a lot more variability in the weight.

There is a number we can use to describe how far away, or how spread out, data points generally are from the mean. This measure of spread is called the mean absolute deviation (MAD).

Here the MAD tells us how far cookie weights typically are from 21 grams. To find the MAD, we find the distance between each data value and the mean, and then calculate the mean of those distances.

For instance, the point that represents 18 grams is 3 units away from the mean of 21 grams. We can find the distance between each point and the mean of 21 grams and organize the distances into a table, as shown.

A dot plot for “cookie weights in grams.” The numbers 8 through 34, in increments of 2, are indicated. A triangle is indicated at 21 grams. There are two perpendicular lines drawn, one at 18 grams and the other at 21 grams. A horizontal line between the two lines is labeled 3. The data are as follows: 18 grams, 1 dot; 19 grams, 3 dots; 20 grams, 4 dots; 21 grams, 5 dots; 22 grams, 6 dots; 23 grams, 2 dots, 24 grams, 1 dot.
weight in grams 18 19 19 19 20 20 20 20 21 21 21 21 21 22 22 22 22 22 22 23 23 24
distance from mean 3 2 2 2 1 1 1 1 0 0 0 0 0 1 1 1 1 1 1 2 2 3

The values in the first row of the table are the cookie weights for the first set of cookies. Their mean, 21 grams, is the mean of the cookie weights.

The values in the second row of the table are the distances between the values in the first row and 21. The mean of these distances is the MAD of the cookie weights.

What can we learn from the averages of these distances once they are calculated?

  • In the first set of cookies, the distances are all between 0 and 3. The MAD is 1.2 grams, which tells us that the cookie weights are typically within 1.2 grams of 21 grams. We could say that a typical cookie weighs between 19.8 and 22.2 grams.
  • In the second set of cookies, the distances are all between 0 and 13. The MAD is 5.6 grams, which tells us that the cookie weights are typically within 5.6 grams of 21 grams. We could say a typical cookie weighs between 15.4 and 26.6 grams.

The MAD is also called a measure of the variability of the distribution. In these examples, it is easy to see that a higher MAD suggests a distribution that is more spread out, showing more variability.

Practice Problems ▶

Glossary

mean absolute deviation (MAD)

mean absolute deviation (MAD)

The mean absolute deviation measures the spread in a distribution. It is the mean of the distances of the data points from the mean of the distribution. (It is called mean absolute deviation because the distance of a data point from the mean is the absolute value of its deviation from the mean.)