Skip to main content
Back

6.5 Assessing Normality in Normal Probability Distributions

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Normal Probability Distributions

Assessing Normality

Many statistical methods require that sample data come from a population with a normal distribution. Assessing normality is a crucial step before applying these methods. This section outlines procedures and examples for determining whether sample data are approximately normal.

Key Steps for Assessing Normality

  • Construct a Histogram: Visualize the data to check if the distribution is roughly bell-shaped.

  • Construct a Normal Quantile Plot (Normal Probability Plot): Plot each data value against its expected z-score from the standard normal distribution. Analyze the pattern of points.

Procedure for Determining Normality

  1. Histogram: If the histogram is not bell-shaped, the data likely do not follow a normal distribution.

  2. Normal Quantile Plot: If the histogram is symmetric with few outliers, generate a normal Q-Q plot. If the points are close to a straight line and show no systematic deviation, the data are likely normal.

Criteria for Normality

  • Normal Distribution: Points in the normal quantile plot are close to a straight line without systematic patterns.

  • Not Normal: Points deviate from a straight line or show systematic non-linear patterns.

Advanced Methods for Assessing Normality

  • Ryan-Joiner Test: Calculates the correlation between data and normal scores. A correlation coefficient near 1 suggests normality. If the test statistic is less than the critical value, reject normality.

  • Other Tests: Shapiro-Wilk, D’Agostino-Pearson, chi-square goodness-of-fit, Kolmogorov-Smirnov, Lilliefors, Cramer-von Mises, Jarque-Bera, and Anscombe-Glynn kurtosis tests.

Examples of Assessing Normality

Normal Distribution Example

The histogram of IQ scores is bell-shaped, and the normal quantile plot shows points close to a straight line, indicating normality.

Histogram and normal quantile plot for IQ scores (bell-shaped and linear)

Uniform Distribution Example

The histogram is flat (uniform), and the normal quantile plot, while somewhat linear, shows a systematic pattern that is not a straight line. This indicates the data are not from a normal distribution.

Histogram and normal quantile plot for uniform data (flat and non-linear)

Skewed Distribution Example

The histogram of rainfall amounts is skewed to the right, and the normal quantile plot shows points far from a straight line. The data are not normally distributed.

Histogram and normal quantile plot for skewed data (right-skewed and non-linear)

Dallas Commute Times Example

For 1000 Dallas commute times, the histogram is left-skewed and not bell-shaped. The normal Q-Q plot shows points far from a straight line, indicating the data are not from a normal distribution.

Histogram and normal quantile plot for Dallas commute times (left-skewed and non-linear)

Outliers

Outliers can significantly affect statistical results. Always check for outliers and analyze data both with and without them. Outliers should only be discarded if they are confirmed errors, as they may contain important information about the data.

Data Transformations

When data are not normal, transformations can sometimes normalize the distribution. A common transformation is taking the logarithm of each value (using natural or base 10 logarithms). If the transformed data are normal, the original data are said to follow a lognormal distribution.

  • If original values include 0, use log(x + 1).

Summary Table: Assessing Normality

Method

Purpose

Key Indicator of Normality

Histogram

Visualize data shape

Bell-shaped, symmetric

Normal Quantile Plot (Q-Q Plot)

Compare data to normal distribution

Points close to straight line

Ryan-Joiner Test

Formal test of normality

Correlation coefficient near 1

Other Tests (e.g., Shapiro-Wilk)

Formal test of normality

p-value > significance level (e.g., 0.10)

Key Formulas

  • Z-score: The z-score for a value x is given by: where is the mean and is the standard deviation.

Pearson Logo

Study Prep