BackMeasures of Variation in Statistics: Range, Standard Deviation, and Variance
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Measures of Variation
Introduction
In statistics, measures of variation describe how spread out or dispersed the values in a data set are. Understanding variation is crucial for interpreting data, comparing distributions, and making informed decisions. The primary measures of variation include the range, standard deviation, and variance.
Range
Definition and Calculation
Range is the difference between the maximum value and the minimum value in a data set.
Formula:
Very easy to compute; quickly identifies the span of the data.
Only uses two values (the extremes), so it does not reflect the distribution of the rest of the data.
Very sensitive to outliers.
Example: For the data set 1, 3, 3, 1.5, 1, 6:
Maximum = 6, Minimum = 1
Range = 6 - 1 = 5
Standard Deviation
Definition and Purpose
Standard deviation measures how much the individual data values differ from the mean.
There are two types: sample standard deviation and population standard deviation.
Formulas
Sample standard deviation (s):
Population standard deviation (\sigma):
The standard deviation is always non-negative and is usually positive unless all values are the same (in which case it is zero).
It is greatly affected by extreme values (outliers).
The units are the same as the original data (e.g., meters, dollars).
The sample standard deviation (s) is a biased estimator of the population standard deviation \( \sigma \).
Example: For the data set 1, 3, 3, 1.5, 1, 6:
Mean (\( \bar{x} \)) = 2.25
Sample standard deviation calculation:
Note: The image shows a slightly different calculation, possibly due to rounding or a different mean value. The process above follows the standard method.
Variance
Definition and Calculation
Variance is a measure of variation that is the square of the standard deviation.
It is denoted as s2 for a sample and \( \sigma^2 \) for a population.
The units of variance are the square of the original units (e.g., m2, $2).
Variance is always non-negative and is usually positive unless all values are the same.
The sample variance s2 is an unbiased estimator of the population variance \( \sigma^2 \).
Formulas:
Sample variance:
Population variance:
Example: For the data set 1, 3, 3, 1.5, 1, 6:
Sample variance: (as shown in the image)
Range Rule of Thumb
Estimating the Standard Deviation
A quick estimate of the standard deviation can be obtained by dividing the range by 4.
Formula:
This rule is useful for rough estimates, especially when only the range is known.
Summary Table: Measures of Variation
Measure | Definition | Formula | Units | Sensitivity to Outliers |
|---|---|---|---|---|
Range | Difference between maximum and minimum values | Same as data | Very sensitive | |
Standard Deviation | Average distance of data values from the mean | Same as data | Sensitive | |
Variance | Average squared distance from the mean | Square of data units | Sensitive |
Additional info: The notes reference 'MAT 252 Siblings' as a sample data set for calculation examples. The formulas and explanations have been expanded for clarity and completeness.