Lesson 14Sampling in a Fair Way

Learning Goal

Let’s explore ways to get representative samples.

Learning Targets

  • I can describe ways to get a random sample from a population.

  • I know that selecting a sample at random is usually a good way to get a representative sample.

Lesson Terms

  • mean
  • mean absolute deviation (MAD)
  • median
  • population
  • representative
  • sample

Warm Up: Ages of Moviegoers

Problem 1

A survey was taken at a movie theater to estimate the average age of moviegoers.

Here is a dot plot showing the ages of the first 20 people surveyed.

A dot plot for “age in years.” The numbers 3 through 12 are indicated. The data are as follows: 3 years old, 0 dots. 4 years old, 1 dot. 5 years old, 1 dot. 6 years old, 1 dot. 7 years old, 1 dot. 8 years old, 3 dots. 9 years old, 7 dots. 10 years old, 4 dots. 11 years old, 2 dots. 12 years old, 0 dots.
  1. What questions do you have about the data from survey?

  2. What assumptions would you make based on these results?

Activity 1: Comparing Methods for Selecting Samples

Problem 1

Take turns with your partner reading each option aloud. For each situation, discuss:

  • Would the different methods for selecting a sample lead to different conclusions about the population?

  • What are the benefits of each method?

  • What might each method overlook?

  • Which of the methods listed would be the most likely to produce samples that are representative of the population being studied?

  • Can you think of a better way to select a sample for this situation?

  1. Lin is running in an election to be president of the seventh grade. She wants to predict her chances of winning. She has the following ideas for surveying a sample of the students who will be voting:

    • Ask everyone on her basketball team who they are voting for.

    • Ask every third girl waiting in the lunch line who they are voting for.

    • Ask the first 15 students to arrive at school one morning who they are voting for.

  2. A nutritionist wants to collect data on how much caffeine the average American drinks per day. She has the following ideas for how she could obtain a sample:

    • Ask the first 20 adults who arrive at a grocery store after 10:00 a.m. about the average amount of caffeine they consume each day.

    • Every 30 minutes, ask the first adult who comes into a coffee shop about the average amount of caffeine they consume each day.

Activity 2: That’s the First Straw

Your teacher will have some students draw straws from a bag.

Problem 1

As each straw is taken out and measured, record its length (in inches) in the table.

straw 1

straw 2

straw 3

straw 4

straw 5

length (inches)

length (inches)

Problem 2

Estimate the mean length of all the straws in the bag based on:

  1. the mean of the first sample.

  2. the mean of the second sample.

Problem 3

Were your two estimates the same? Did the mean length of all the straws in the bag change in between selecting the two samples? Explain your reasoning.

Problem 4

The actual mean length of all of the straws in the bag is about 2.37 inches. How do your estimates compare to this mean length?

Problem 5

If you repeated the same process again but you selected a larger sample (such as 10 or 20 straws, instead of just 5), would your estimate be more accurate? Explain your reasoning.

Activity 3: That’s the Last Straw

Problem 1

There were a total of 35 straws in the bag. Suppose we put the straws in order from shortest to longest and then assigned each straw a number from 1 to 35. For each of these methods, decide whether it would be fair way to select a sample of 5 straws. Explain your reasoning.

  1. Select the straws numbered 1 through 5.

  2. Write the numbers 1 through 35 on pieces of paper that are all the same size. Put the papers into a bag. Without looking, select five papers from the bag. Use the straws with those numbers for your sample.

  3. Using the same bag as the previous question, select one paper from the bag. Use the number on that paper to select the first straw for your sample. Then use the next 4 numbers in order to complete your sample. (For example, if you select number 17, then you also use straws 18, 19, 20, and 21 for your sample.)

  4. Create a spinner with 35 sections that are all the same size, and number them 1 through 35. Spin the spinner 5 times and use the straws with those numbers for your sample.

Are you ready for more?

Problem 1

Computers accept inputs, follow instructions, and produce outputs, so they cannot produce truly random numbers. If you knew the input, you could predict the output by following the same instructions the computer is following. When truly random numbers are needed, scientists measure natural phenomena such as radioactive decay or temperature variations. Before such measurements were possible, statisticians used random number tables, like this:

A random number table of number between 01 and 99.

Use this table to select a sample of 5 straws. Pick a starting point at random in the table. If the number is between 01 and 35, include that number straw in your sample. If the number has already been selected, or is not between 01 and 35, ignore it, and move on to the next number.

Lesson Summary

A sample is selected at random from a population if it has an equal chance of being selected as every other sample of the same size. For example, if there are 25 students in a class, then we can write each of the students’ names on a slip of paper and select 5 papers from a bag to get a sample of 5 students selected at random from the class.

Other methods of selecting a sample from a population are likely to be biased. This means that it is less likely that the sample will be representative of the population as a whole. For example, if we select the first 5 students who walk in the door, that will not give us a random sample because students who typically come late are not likely to be selected. A sample that is selected at random may not always be a representative sample, but it is more likely to be representative than using other methods.

It is not always possible to select a sample at random. For example, if we want to know the average length of wild salmon, it is not possible to identify each one individually, select a few at random from the list, and then capture and measure those exact fish. When a sample cannot be selected at random, it is important to try to reduce bias as much as possible when selecting the sample.