Skip to main content
Back

Chapter 3: Measures of Variation in Statistics – Range, Standard Deviation, and Variance

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Describing, Exploring, and Comparing Data

Introduction

This chapter focuses on the essential statistical concepts used to describe, explore, and compare data sets. The primary measures of variation discussed are range, standard deviation, and variance. Understanding these measures is crucial for interpreting the spread and consistency of data.

Measures of Variation

Round-off Rule for Measures of Variation

  • When rounding the value of a measure of variation, carry one more decimal place than is present in the original set of data.

Range

The range is the simplest measure of variation, representing the difference between the largest and smallest values in a data set.

  • Definition: The range of a set of data values is the difference between the maximum and minimum data values.

  • Formula:

  • Properties:

    • Uses only the maximum and minimum values, making it sensitive to extreme values (not resistant).

    • Does not account for all data values, so it may not accurately reflect overall variation.

  • Example: For wait times: 50, 25, 75, 35, 50, 25, 30, 50, 45, 25, 20 minutes

Standard Deviation

The standard deviation measures how much data values deviate from the mean, providing a more comprehensive view of variation than the range.

  • Notation:

    • = sample standard deviation

    • = population standard deviation

  • Sample Standard Deviation Formula:

  • Population Standard Deviation Formula:

  • Properties:

    • Measures deviation from the mean.

    • Never negative; zero only if all values are identical.

    • Larger values indicate greater variation.

    • Units are the same as the original data.

    • Can be affected by outliers (not resistant).

    • Sample standard deviation is a biased estimator of population standard deviation .

  • Example Calculation:

    1. Compute mean: min

    2. Subtract mean from each value:

    3. Square each deviation:

    4. Sum all squared deviations:

    5. Divide by :

    6. Take square root:

    7. Final answer: minutes

Shortcut Formula for Sample Standard Deviation

  • Used by calculators and software for efficiency.

  • Example: minutes

  • Additional info: The shortcut formula typically uses sums of squares and sums of data values to simplify calculations.

Range Rule of Thumb

  • Understanding Standard Deviation: Most (about 95%) sample values lie within 2 standard deviations of the mean.

  • Identifying Significant Values:

    • Significantly low: or lower

    • Significantly high: or higher

    • Not significant: between and

  • Estimating Standard Deviation:

Variance

Variance quantifies the spread of data by averaging the squared deviations from the mean. It is the square of the standard deviation.

  • Sample Variance:

  • Population Variance:

  • Properties:

    • Units are the squares of the original units.

    • Can increase dramatically with outliers (not resistant).

    • Never negative; zero only if all values are identical.

    • Sample variance is an unbiased estimator of population variance .

Why Divide by (n - 1)?

  • Only values can be freely assigned; the last value is determined by the mean.

  • Dividing by ensures sample variances center around the population variance .

  • Dividing by would underestimate the population variance.

Empirical Rule for Bell-Shaped Distributions

The Empirical Rule applies to data sets with approximately normal (bell-shaped) distributions.

  • About 68% of values fall within 1 standard deviation of the mean.

  • About 95% of values fall within 2 standard deviations of the mean.

  • About 99.7% of values fall within 3 standard deviations of the mean.

Example: IQ scores with mean 100 and standard deviation 15: 2 standard deviations: , About 95% of IQ scores are between 70 and 130.

Chebyshev’s Theorem

Chebyshev’s Theorem provides minimum proportions of data within standard deviations of the mean, applicable to any data distribution.

  • At least of values lie within standard deviations of the mean, for .

  • For : At least 75% of values within 2 standard deviations.

  • For : At least 89% of values within 3 standard deviations.

  • Example: IQ scores, mean 100, standard deviation 15: - At least 75% between 70 and 130 - At least 89% between 55 and 145

Comparing Variation in Different Samples or Populations

The coefficient of variation (CV) expresses standard deviation relative to the mean, allowing comparison across different data sets.

  • Sample:

  • Population:

  • Round CV to one decimal place (e.g., 25.3%).

Biased and Unbiased Estimators

  • Sample standard deviation is a biased estimator of population standard deviation .

  • Sample variance is an unbiased estimator of population variance .

Notation Summary

Symbol

Meaning

s

Sample standard deviation

s2

Sample variance

σ

Population standard deviation

σ2

Population variance

Pearson Logo

Study Prep