Lesson 18Using Data to Solve Problems

Learning Goal

Let’s compare data sets using visual displays.

Learning Targets

  • I can decide whether mean and MAD or median and IQR would be more appropriate for describing the center and spread of a data set.

  • I can draw an appropriate graphical representation for a set of data.

  • I can explain what the mean and MAD or the median and IQR tell us in the context of a situation and use them to answer questions.

Warm Up: Wild Bears

Problem 1

In one study on wild bears, researchers measured the head lengths and head widths, in inches, of 143 wild bears. The ages of the bears ranged from newborns (0 years) to 15 years. The box plots summarize the data from the study.

Two sets of box plots for "head length in inches" and "head width in inches". The numbers 2 through 20, in increments of 2, are indicated and there are tick marks midway between each indicated number.  For "head length in inches" the top box plot is labeled "male bears" and the five-number summary are as follows: Minimum value, 9. Maximum value, 19. Q1, 12 point 5. Q2, 13 point 5. Q3, 15 point 5. The bottom box plot is labeled "female bears" and the five-number summary are as follows: Minimum value, 10. Maximum value, 15 point 5. Q1, 12. Q2, 12 point 5. Q3, 13 point 5. For "head width in inches" the top box plot is labeled "male bears" and the five-number summary are as follows: Minimum value, 4. Maximum value, 10. Q1, 5 point 5. Q2, 6 point 5. Q3, 8. The bottom box plot is labeled "female bears" and the five-number summary are as follows: Minimum value, 4 point 5. Maximum value, 7 point 5. Q1, 5. Q2, 6. Q3, 7 point 5.
  1. Write four statistical questions that could be answered using the box plots: two questions about the head length and two questions about the head width.

  2. Trade questions with your partner.

    • Decide if each question is a statistical question.

    • Use the box plots to answer each question.

Activity 1: Math Homework (Part 1)

Over a two-week period, Mai recorded the number of math homework problems she had each school day.

  • 2

  • 15

  • 20

  • 0

  • 5

  • 25

  • 1

  • 0

  • 10

  • 12

Problem 1

Calculate the following. Show your reasoning.

  1. The mean number of math homework problems.

  2. The mean absolute deviation (MAD).

Problem 2

Interpret the mean and MAD. What do they tell you about the number of homework problems Mai had over these two weeks?

Problem 3

Find or calculate the following values and show your reasoning.

You may use the applet below to help if you choose to. Enter the values needed to calculate the IQR and the mean when prompted.

  1. The median, quartile, maximum, and minimum of the same data on Mai’s math homework problems.

  2. The interquartile range (IQR).

Print Version

Find or calculate the following values and show your reasoning.

  1. The median, quartile, maximum, and minimum of the same data on Mai’s math homework problems.

  2. The interquartile range (IQR).

Problem 4

Which pair of measures of center and variability—mean and MAD, or median and IQR—do you think summarizes the distribution of Mai’s math homework assignments better? Explain your reasoning.

Activity 2: Math Homework (Part 2)

Jada wanted to know whether a dot plot, a histogram, or a box plot would best summarize the center, variability, and other aspects of her homework data.

  • 2

  • 15

  • 20

  • 0

  • 5

  • 25

  • 1

  • 0

  • 10

  • 12

Problem 1

Use the axis to make a dot plot to represent the data. Mark the position of the mean, which you calculated earlier, on the dot plot using a triangle ().

A blank histogram labeled number of math problems with a scale of 0 to 26.

Problem 2

Use the five-number summary from the previous task and the grid to draw a box plot that represents Jada’s homework data.

A blank histogram labeled number of math problems with a scale of 0 to 26.

Problem 3

Work with your group to draw three histograms to represent Jada’s homework data. The width of the bars in each histogram should represent a different number of homework problems, as specified.

  1. The width of one bar represents 10 problems.

    A blank histogram labeled number of math problems with a scale of 0 to 30.
  2. The width of one bar represents 5 problems.

    A blank histogram labeled number of math problems with a scale of 0 to 30.
  3. The width of one bar represents 2 problems.

    A blank histogram labeled number of math problems with a scale of 0 to 30.

Problem 4

Which of the five representations should Jada use to summarize her data? Should she use a dot plot, box plot, or one of the histograms? Explain your reasoning.

You can use the applet to make each type of graph if you choose to. Begin by dragging the gray bar from the top of the applet down until you see all of the command boxes.

Print Version

Which of the five representations should Jada use to summarize her data? Should she use a dot plot, box plot, or one of the histograms? Explain your reasoning.

Activity 3: Will the Yellow Perch Survive?

Scientists studying the yellow perch, a species of fish, believe that the length of a fish is related to its age. This means that the longer the fish, the older it is. Adult yellow perch vary in size, but they are usually between 10 and 25 centimeters.

Scientists at the Great Lakes Water Institute caught, measured, and released yellow perch at several locations in Lake Michigan. The table shows a summary that is based on a sample of yellow perch from one of these locations.

length of fish in centimeters

number of fish

0 to less than 5

5 to less than 10

10 to less than 15

15 to less than 20

20 to less than 25

25 to less than 30

Problem 1

Use the data to make a histogram that shows the lengths of the captured yellow perch. Each bar should contain the lengths shown in each row in the table.

A blank grid. The horizontal axis is labeled “fish length in centimeters” and the numbers 0 through 30, in increments of 5, are indicated. The vertical axis has the numbers 0 through 40, in increments of 5, indicated. There are 4 evenly spaced horizontal gridlines between each indicated number.

Problem 2

How many fish were measured? How do you know?

Problem 3

Use the histogram to answer the following questions.

  1. How would you describe the shape of the distribution?

  2. Estimate the median length for this sample. Describe how you made this estimate.

  3. Predict whether the mean length of this sample is greater than, less than, or nearly equal to the median length for this sample of fish? Explain your prediction.

  4. Would you use the mean or the median to describe a typical length of the fish being studied? Explain your reasoning.

Problem 4

Based on your work so far:

  1. Would you describe a typical age for the yellow perch in this sample as: “young,” “adult,” or “old”? Explain your reasoning.

  2. Some researchers are concerned about the survival of the yellow perch. Do you think the lengths (or the ages) of the fish in this sample are something to worry about? Explain your reasoning.

Lesson Summary

The dot plot shows the distribution of 30 cookie weights in grams.

A box plot and a histogram for "cookie weights in grams". The numbers 8 through 34, in increments of two, are indicated. The box plot is above the dot plot.  The five-number summary for the box plot is as follows: Minimum value, 9. Maximum value, 34. Q1, 16. Q2, 20.5. Q3, 26. For the dot plot a triangle is indicated at 21 grams. A horizontal line is drawn below the triangle and begins at 15.4 and ends at 26.6. The data for the dot plot are as follows: 9 grams, 1 dot. 10 grams, 1 dot. 11 grams, 2 dots. 12 grams, 1 dot. 14 grams, 1 dot. 16 grams, 2 dots. 17 grams, 1 dot. 18 grams, 2 dots. 19 grams, 1 dot. 20 grams, 3 dots. 21 grams, 1 dot. 22 grams, 3 dots. 23 grams, 1 dot. 24 grams, 2 dots. 26 grams, 2 dots. 28 grams, 1 dot. 30 grams, 1 dot. 32 grams, 2 dots. 33 grams, 1 dot. 34 grams, 1 dot.

The mean cookie weight, marked by the triangle, is 21 grams. This tells us that if the weights of all of the cookies were redistributed so they all had the same weight, each cookie would weigh 21 grams. The MAD is 5.6 grams, which suggests that a cookie typically weighs between 15.4 grams and 26.6 grams.

The box plot for the same data set is shown above the dot plot. The median shows that half of the weights are greater than or equal to 20.5 grams, and half are less than or equal to 20.5 grams. The box shows that the IQR is 10 and that the middle half of the cookies weigh between 16 and 26 grams.

In this case, the median weight is very close to the mean weight, and the IQR is about twice the MAD. This tells us that the two pairs of measures of center and spread are very similar.

Now let’s look at another example of 30 different cookies.

A box plot and a dot plot for "cookie weights in grams". The numbers 8 through 34, in increments of two, are indicated. For the box plot: The minimum value is 9 and the maximum value is 26. Q1 has a value of 20, Q2 has a value of 23, and Q3 has a vlaue of 24. For the dot plot: A triangle is indicated at 21 grams. A line indicates the distance from 17.6 to 24.4 grams. The data is as follows: 9 grams, 1 dot. 10 grams, 1 dot. 13 grams, 1 dot. 14 grams, 1 dot. 16 grams, 1 dot. 17 grams, 1 dot. 19 grams, 1 dot. 20 grams, 2 dots. 21 grams, 2 dots. 22 grams, 3 dots. 23 grams, 6 dots. 24 grams, 5 dots. 25 grams, 4 dots. 26 grams, 1 dot.

Here the mean is 21 grams, and the MAD is 3.4 grams. This suggests that a cookie typically weighs between 17.6 and 24.4 grams. The median cookie weight is 23 grams, and the box plot shows that the middle half of the data are between 20 and 24 grams. These two pairs of measures paint very different pictures of the variability of the cookie weights.

The median (23 grams) is closer to the middle of the big cluster of values. If we were to ignore the smaller cookies, the median and IQR would give a more accurate picture of how much a cookie typically weighs.

When a distribution is not symmetrical, the median and IQR are often better measures of center and spread than the mean and MAD. However the decision on which pair of measures to use depends on what we want to know about the group we are investigating.