BackInterpreting the Standard Deviation: Empirical Rule and Chebyshev’s Rule
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Interpreting the Standard Deviation
Empirical Rule (Rule of Thumb)
The Empirical Rule provides a quick way to estimate the spread of data in a mound-shaped (or bell-shaped, i.e., normal) distribution using the mean and standard deviation. It is especially useful for understanding the proportion of data within certain intervals around the mean.
Definition: The Empirical Rule states that for a normal distribution:
Approximately 68% of the data falls within one standard deviation of the mean:
Approximately 95% of the data falls within two standard deviations of the mean:
Approximately 99.7% of the data falls within three standard deviations of the mean:
Approximation for Standard Deviation: For mound-shaped data, a rough estimate for the standard deviation is:
Example: Battery Life
Scenario: A manufacturer claims the average life of a battery is 60 months, with a standard deviation of 10 months. The distribution is mound-shaped.
(i) Percentage lasting more than 50 months: 50 months is 1 standard deviation below the mean (). By the Empirical Rule, about 84% of batteries last more than 50 months (since 68% are within one standard deviation, so 34% are between 50 and 60, and 50% are above 60; thus, 50% + 34% = 84%).
(ii) Percentage lasting less than 40 months: 40 months is 2 standard deviations below the mean (). By the Empirical Rule, about 2.5% of batteries last less than 40 months (since 95% are within two standard deviations, so 2.5% are below ).
(iii) If a battery lasts only 37 months: 37 months is 2.3 standard deviations below the mean. This is quite rare in a normal distribution, suggesting the manufacturer's claim may be overstated.
Additional info: The percentages above are based on properties of the normal distribution. For exact probabilities, use the standard normal table.
Chebyshev’s Rule
Chebyshev’s Rule provides a minimum proportion of data within a certain number of standard deviations from the mean, regardless of the distribution's shape. It is more general than the Empirical Rule and applies to all distributions.
Definition: For any dataset (not necessarily normal), at least of the data falls within standard deviations of the mean, for any .
Key Points:
Within 2 standard deviations: at least (75%) of the data is within .
Within 3 standard deviations: at least (89%) of the data is within .
Within standard deviations: at least of the data is within for .
Note: Chebyshev’s Rule gives a lower bound; the actual percentage may be higher.
Example: Traffic at an Intersection
Scenario: The mean number of vehicles per day is 375, with a standard deviation of 25. We want to know the percentage of days with more than 425 vehicles.
(a) Unknown distribution shape: 425 is 2 standard deviations above the mean (). By Chebyshev’s Rule, at most 25% of the data can be outside two standard deviations (since at least 75% is within), so at most 12.5% can be above 425.
(b) If the distribution is mound-shaped: By the Empirical Rule, about 2.5% of the data is above two standard deviations from the mean (i.e., above 425 vehicles).
Comparison Table: Empirical Rule vs. Chebyshev’s Rule
Rule | Distribution Requirement | Within 1 SD | Within 2 SD | Within 3 SD |
|---|---|---|---|---|
Empirical Rule | Mound-shaped (normal) | 68% | 95% | 99.7% |
Chebyshev’s Rule | Any distribution | -- | ≥ 75% | ≥ 89% |
Additional info: The Empirical Rule provides more precise estimates but only applies to normal distributions. Chebyshev’s Rule is more general but less precise.