๐ 10 min read | Class 9โ12 | FBISE ยท CBSE ยท IGCSE ยท O-Levels ยท IB
Does studying more hours lead to higher exam scores? Do taller students tend to have longer foot lengths? These are questions about the relationship between two variables โ and a scatter plot is the tool designed to answer them visually. A scatter plot plots paired data as individual points on a grid. The pattern formed by those points reveals whether the two variables are related, how strongly, and in which direction. Understanding scatter plots and correlation is one of the most practically useful statistical skills you will learn.
What Is a Scatter Plot?
A scatter plot (or scatter diagram) displays pairs of values for two variables. One variable is plotted on the x-axis (typically the independent variable) and the other on the y-axis (the dependent variable). Each pair of values becomes one point on the graph. The overall pattern of points indicates the type and strength of the relationship โ known as correlation.
Unlike a line graph, the points in a scatter plot are not connected. The pattern is read as a whole, not point to point.
Types of Correlation
| Type | What the Points Show | Real-World Example |
|---|---|---|
| Strong positive | Points cluster tightly around a line going up-right | Study hours vs exam score |
| Weak positive | Points loosely trend up-right, widely scattered | Height vs shoe size |
| Strong negative | Points cluster tightly around a line going down-right | Speed vs journey time |
| Weak negative | Points loosely trend down-right | Temperature vs heating cost |
| No correlation | Points scattered randomly, no visible trend | Shoe size vs IQ score |
Step-by-Step: Drawing a Scatter Plot
The table shows the number of hours five students studied and their test scores.
| Student | Hours Studied | Test Score (%) |
|---|---|---|
| A | 2 | 45 |
| B | 4 | 58 |
| C | 5 | 65 |
| D | 7 | 78 |
| E | 9 | 90 |
Identify the variables. Hours studied is the independent variable โ x-axis. Test score is the dependent variable โ y-axis.
Draw and label axes. x-axis: 0 to 10 hours. y-axis: 0 to 100%. Mark equal intervals.
Plot each pair as a single point (ร).
(2, 45) ยท (4, 58) ยท (5, 65) ยท (7, 78) ยท (9, 90)
Describe the correlation: The points rise from left to right in a fairly tight cluster, showing strong positive correlation โ students who studied more tended to score higher.
Draw the line of best fit. Draw a straight line that passes through the middle of the data, with roughly equal numbers of points above and below. The line should pass through or near the mean point (xฬ, ศณ) = (5.4, 67.2).
The Line of Best Fit
The line of best fit (regression line) is a straight line drawn through a scatter plot to represent the trend of the data. It is used for interpolation (predicting values within the data range) and โ with caution โ extrapolation (predicting outside the range).
To draw it by eye: imagine the cloud of points as a rugby ball shape and draw a line along its longest axis so that about half the points are above and half below. Always draw through the mean point (xฬ, ศณ) when possible.
Real-Life Applications
-
๐๏ธ
Sports science: Coaches scatter-plot training load against performance metrics to find the optimal training intensity for each athlete.
-
๐
Real estate: Property analysts plot house size against sale price to identify whether larger homes command proportionally higher prices in a given area.
-
๐ฌ
Science experiments: In physics or chemistry practicals, students plot experimental data as a scatter plot and use the line of best fit to determine gradients and intercepts.
-
๐
Public health: Epidemiologists scatter-plot variables like air pollution levels and respiratory illness rates across different cities to identify associations.
Common Mistakes Students Make
Frequently Asked Questions
Try the Scatter Plot Generator
Enter your paired data and instantly generate a fully labelled scatter plot โ complete with line of best fit and correlation description. Ideal for checking assignments or exploring data sets.
๐ต Open the Scatter Plot Generator โ