calculator
STATISTICS

A web-based unit for Integrated Algebra
Home
Introduction
Central Tendency
Histograms
Cumulative Histograms
Stem & Leaf Plots
Box & Whisker Plots
Scatter Plots & Correlation
Quiz
Lesson Plan
Additional Resources
References
Scatter Plots & Correlation

Listen to a lecture on scatter plots, or read the lecture below.



How to make a scatter plot:

A scatter plot is very similar to graphing data and drawing a line of best fit, which you are probably familiar with from science.

For our example, we are going to plot data from an experiment to see if the amount of TV a student watched the night before a test was related to the student's test score.  The data from the experiment is displayed in the table below.

A Test of Television
Hours Watched
Test Score
0
92
0
100
0.5
89
1
82
1
90
1
95
1.5
85
2
70
2
80
2.5
65
2.5
70
3
68
3.5
60
4.5
55
5
60
 
1.)  Draw and label the axes

Our first step in drawing a scatter plot of this data is to draw and label our axes.

labeled axes

On the x-axis I have placed the hours watched.  If time is one of the variables in your data, it is generally going to go on the horizontal axis.   The scale is labeled, with every two boxes representing two hours.  The x-axis is also labeled "Hours Watched."


On the y-axis I have placed the test scores.  Since the scores go up to 100, I counted by fives, labeling every other box as 10.  I also labeled the y-axis "Test Score."

The last label I had to include was a title.  I simply took the title, "A Test of Television," from the data table, and recommend you do too, whenever possible.


2.)  Plot the data points.

Once you have drawn and labeled the axes, it is time to plot the data points.  We are going to treat the data like coordinate points.  For example, one student watched 1.5 hours of TV and earned a test score of 85.  On the graph you should plot the point (1.5, 85).  Do this for each piece of data. 

scatter plots with points plotted

DO NOT CONNECT THE POINTS!


3.)  Draw a trend line.

After plotting all your data points, it is time to draw a trend line.   The trend line is the same as a line of best fit.  Its purpose is to help determine how the two sets of data are related (if they are related at all), and help predict data. 

To draw the trend line, use a ruler.  The goal is to draw a line that passes through as many points as possible, and as close to the points as possible.

scatter plot with trend line
Now that we have graphed the line of best fit, it is possible to write an equation.  Simply take the y-intercept from the graph and find the rise and run to determine the slope.  Use this information in the slope-intercept form for a line (y=mx + b) to find the equation of the trend line.

The line above has a y-intercept of 96 and a slope of -9.   So, the equation for the trend line is y = -9x + 96.  



Correlation:

View a short digital story explaining the different types of correlation.




Practice:
  1. On a sheet of graph paper, draw a scatter plot for the following data:

    Calories and Fat in Some Common Foods
    Food
    Fat (g)
    Number of
    Calories
    Milk
    8
    150
    Eggs
    6
    80
    Chicken
    4
    90
    Ham
    19
    245
    Ice Cream
    14
    270
    Corn
    1
    70
    Ground Beef
    10
    185
    Broccoli
    1
    45
    Cheese
    9
    115

    Remember to label your graph (axes, scale, title) and use a pencil and ruler.

  2. After you have created your scatter plot, draw a trend line and find give the equation for the trend line. 

  3. On a sheet of paper, or in a word processor, state the type of correlation (if any) shown by this data set.  Explain why the scatter plot is this type of correlation.