Measures of Variation in Statistics: Range, Variance, and Standard Deviation

Notes Practice Video lessons

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter Outline

2.1 Frequency Distributions and Their Graphs
2.2 More Graphs and Displays
2.3 Measures of Central Tendency
2.4 Measures of Variation
2.5 Measures of Position

Measures of Variation

Section 2.4 Objectives

Find the range of a data set
Find the variance and standard deviation of a population and a sample
Use the Empirical Rule and Chebyshev's Theorem to interpret standard deviation
Approximate the sample standard deviation for grouped data
Use the coefficient of variation to compare variation in different data sets

Range

The range is the simplest measure of variation in a data set. It is defined as the difference between the maximum and minimum data entries.

Formula:
The range is a single value and is sensitive to outliers.

Example: If the starting salaries for Corporation A are 37, 39, 41, 41, 42, 44, 44, 45, 47, 52 (in thousands), then:

(thousand dollars)

Variation

Variation describes how much the data values differ from each other. Two data sets can have the same mean but different levels of variation.

Data sets with greater spread have higher variation.
Graphs can help visualize variation between data sets.

Deviation, Variance, and Standard Deviation

These are more sophisticated measures of variation that consider how each data value differs from the mean.

Deviation: The difference between a data entry and the mean (population) or (sample).
Variance: The average of the squared deviations from the mean.
Standard Deviation: The square root of the variance; measures the typical amount an entry deviates from the mean.

Population Formulas

Population Variance:
Population Standard Deviation:

Sample Formulas

Sample Variance:
Sample Standard Deviation:

Steps to Calculate Population Variance and Standard Deviation

Find the mean of the population data set.
Find the deviation of each entry: .
Square each deviation.
Add up the squared deviations: .
Divide by for variance; take the square root for standard deviation.

Steps to Calculate Sample Variance and Standard Deviation

Find the mean of the sample data set.
Find the deviation of each entry: .
Square each deviation.
Add up the squared deviations: .
Divide by for variance; take the square root for standard deviation.

Example: Calculating Population Standard Deviation

Given data: 37, 39, 41, 41, 42, 44, 44, 45, 47, 52
Mean
Sum of squared deviations
Population variance:
Population standard deviation:

Example: Calculating Sample Standard Deviation

Given sample data: 4, 7, 6, 9, 5, 9, 8, 7, 9
Mean
Sum of squared deviations
Sample variance:
Sample standard deviation:

Interpreting Standard Deviation

Standard deviation measures the typical amount an entry deviates from the mean.
The more spread out the data, the greater the standard deviation.

Empirical Rule (68-95-99.7 Rule)

For data sets with approximately normal (bell-shaped) distributions:

About 68% of the data falls within one standard deviation of the mean.
About 95% falls within two standard deviations.
About 99.7% falls within three standard deviations.

Example: If the mean height of women is 64.2 inches and the standard deviation is 2.8 inches, then about 68% of heights are between 61.4 and 67.0 inches.

Chebyshev's Theorem

Chebyshev's Theorem applies to any data set, regardless of distribution shape.

At least of the data values lie within standard deviations of the mean, for .
For , at least 75% of data within 2 standard deviations.
For , at least 88.9% of data within 3 standard deviations.

Standard Deviation for Grouped Data

When data is presented as a frequency distribution, use the following formula to estimate the sample standard deviation:

where is the frequency of each class and is the class midpoint.

Coefficient of Variation

The coefficient of variation (CV) is a standardized measure of dispersion, useful for comparing variation between data sets with different units or means.

Formula: (for sample), (for population)
Higher CV indicates greater relative variability.

Summary Table: Measures of Variation

Measure	Formula	Interpretation
Range		Difference between largest and smallest values
Population Variance		Average squared deviation from mean (population)
Sample Variance		Average squared deviation from mean (sample)
Population Standard Deviation		Typical deviation from mean (population)
Sample Standard Deviation		Typical deviation from mean (sample)
Coefficient of Variation	or	Relative variation (unitless, %)

Additional info:

Standard deviation is always non-negative.
Variance is in squared units; standard deviation is in the same units as the data.
Empirical Rule applies only to approximately normal distributions.
Chebyshev's Theorem is more general and applies to all distributions.