Lesson 22Observing More Patterns in Scatter Plots

Learning Goal

Let’s look for other patterns in data.

Learning Targets

  • I can analyze a set of data to determine associations between two variables.

  • I can pick out clusters in data from a scatter plot.

  • I can use a scatter plot to decide if two variables have a linear association.

Lesson Terms

  • negative association
  • outlier
  • positive association

Warm Up: Notice and Wonder: Clustering

Problem 1

What do you Notice? What do you Wonder?

4 scatter plots. A has a negative slope. B has 2 clusters of data with positive slopes. C makes a sideways "v" opening to the right. D has 2 clusters both with a negative slope.

Activity 1: Scatter Plot City

Problem 1

Your teacher will give you a set of cards. Each card shows a scatter plot.

  1. Sort the cards into categories and describe each category.

  2. Explain the reasoning behind your categories to your partner. Listen to your partner’s reasoning for their categories.

  3. Sort the cards into two categories: positive associations and negative associations. Compare your sorting with your partner’s and discuss any disagreements.

  4. Sort the cards into two categories: linear associations and non-linear associations. Compare your sorting with your partner’s and discuss any disagreements.

Activity 2: Animal Brains

Problem 1

Is there an association between the weight of an animal’s body and the weight of the animal’s brain?

animal

body weight (kg)

brain weight (g)

cow

grey wolf

goat

donkey

horse

potar monkey

cat

giraffe

animal

body weight (kg)

brain weight (g)

gorilla

human

rhesus monkey

kangaroo

sheep

jaguar

chimpanzee

pig

  1. What do you notice in the table of data?

    Consider the scatter plot of the data. Are there any outliers?

    Experiment with the line to fit the data. Drag the points to move the line. You can close the expressions list by clicking on the double arrow.

  2. Without including any outliers, does there appear to be an association between body weight and brain weight? Describe the association in a sentence.

  3. Adjust the line by moving the green points, fitting the line to your scatter plot, and estimate its slope. What does this slope mean in the context of brain and body weight?

  4. Does the fitted line help you identify more outliers?

Print Version

Is there an association between the weight of an animal’s body and the weight of the animal’s brain?

animal

body weight (kg)

brain weight (g)

cow

grey wolf

goat

donkey

horse

potar monkey

cat

giraffe

animal

body weight (kg)

brain weight (g)

gorilla

human

rhesus monkey

kangaroo

sheep

jaguar

chimpanzee

pig

  1. Use the data in the table to make a scatter plot. Are there any outliers?

    a blank graph of body weight (kg) vs brain weight (g).
  2. After removing the outliers, does there appear to be an association between body weight and brain weight? Describe the association in a sentence.

  3. Using a piece of pasta and a straightedge, fit a line to your scatter plot, and estimate its slope. What does this slope mean in the context of brain and body weight?

  4. Does the fitted line help you identify more outliers?

Are you ready for more?

Problem 1

Use one of the suggestions or find another set of data that interested you to look for associations between the variables.

  • Number of wins vs number of points per game for your favorite sports team in different seasons

  • Amount of money grossed vs critic rating for your favorite movies

  • Price of a ticket vs stadium capacity for popular bands on tour

After you have collected the data:

  1. Create a scatter plot for the data.

  2. Are any of the points very far away from the rest of the data?

  3. Would a linear model fit the data in your scatter plot? If so, draw it. If not, explain why a line would be a bad fit.

  4. Is there an association between the two variables? Explain your reasoning.

Lesson Summary

Sometimes a scatter plot shows an association that is not linear:

A scatter plot with data that makes somewhat of a U shape

We call such an association a non-linear association. In later grades, you will study equations that can be models for non-linear associations.

Sometimes in a scatter plot we can see separate groups of points.

A scatter plot with 2 clusters of data with a positive correlation. A second graph with the same data but a circle around each cluster.

We call these groups clusters.

People often collect data in two variables to investigate possible associations between two numerical variables and use the connections that they find to predict more values of the variables. Data analysis usually follows these steps:

  1. Collect data.

  2. Organize and represent the data, and look for an association.

  3. Identify any outliers and try to explain why these data points are exceptions to the trend that describes the association.

  4. Find an equation that fits the data well.

Although computational systems can help with data analysis by graphing the data, finding an equation that might fit the data, and using that equation to make predictions, it is important to understand the process and think about what is happening. A computational system may find an equation that does not make sense or use a line when the situation suggests that a different model would be more appropriate.