Skip to main content
Back

Distribution Shapes in Statistics: Concepts and Applications

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 2: Distribution Shapes

Introduction to Distributions

In statistics, the distribution of a data set describes the values that observations can take and how frequently each value occurs. Understanding the shape of a distribution is essential for interpreting data and selecting appropriate statistical methods.

  • Frequency Distribution: Shows how often each value occurs in a data set.

  • Relative Frequency: The proportion of observations that fall within a particular category or interval.

  • Shape of the Distribution: The overall pattern or form of the data when plotted, often visualized with a smooth curve.

  • Example: The distribution of heights in inches, as shown in a histogram with a smooth curve overlay, can reveal whether the data are symmetric, skewed, or uniform.

Modality

Modality refers to the number of peaks, or modes, in a distribution. The mode is the value that appears most frequently in a data set.

  • Unimodal: Distribution with one peak.

  • Bimodal: Distribution with two distinct peaks.

  • Multimodal: Distribution with three or more peaks.

  • Example: The distribution of exam scores may be unimodal if most students score similarly, or bimodal if there are two groups with different performance levels.

Symmetry

A distribution is symmetric if it can be divided into two parts that are mirror images of each other. Symmetry is a key property in many statistical analyses.

  • Bell-shaped (Normal) Distribution: Symmetric and has a single peak at the center. The classic example is the normal distribution.

  • Uniform (Rectangular) Distribution: All values are equally likely; the distribution is flat and symmetric.

  • Example: The distribution of heights in a large population often approximates a bell-shaped curve.

Skewness

Skewness describes the degree to which a distribution deviates from symmetry. It indicates whether the data are stretched more to one side.

  • Right Skewed (Positively Skewed): The right tail (higher values) is longer than the left tail. Most data are concentrated on the left.

  • Left Skewed (Negatively Skewed): The left tail (lower values) is longer than the right tail. Most data are concentrated on the right.

  • Symmetric: Tails are of equal length; data are evenly distributed around the center.

  • Example: Income distributions are often right skewed, with most people earning moderate amounts and a few earning much more.

Identifying Skewness in Data

Skewness can be identified visually using histograms or mathematically using skewness coefficients.

  • Histogram: A graphical representation of the distribution can reveal skewness by the shape of the tails.

  • Example: The histogram of the number of people per household in the U.S. shows a right-skewed distribution, with most households having fewer people.

Population vs. Sample Distributions

Understanding the difference between population distribution and sample distribution is fundamental in statistics.

  • Population Distribution: The distribution of a variable for all individuals in the population. There is only one population distribution for a given variable.

  • Sample Distribution: The distribution of a variable for individuals in a sample. This can vary from sample to sample.

  • Approximation: Since the population distribution is often unknown, statisticians use the distribution of a simple random sample to estimate it. Larger sample sizes generally provide better approximations.

  • Example: Drawing six samples of 100 U.S. households each and plotting the household size distribution for each sample demonstrates how sample distributions can vary, but tend to approximate the population distribution as sample size increases.

Summary Table: Types of Distribution Shapes

Shape

Description

Example

Unimodal

One peak

Normal distribution of heights

Bimodal

Two peaks

Test scores with two groups

Multimodal

Three or more peaks

Survey data with several popular choices

Symmetric (Bell-shaped)

Mirror image halves

Normal distribution

Uniform

Flat, all values equally likely

Random number generator output

Right Skewed

Right tail longer

Income distribution

Left Skewed

Left tail longer

Age at retirement

Key Formulas

  • Relative Frequency:

  • Skewness Coefficient:

Applications

  • Understanding distribution shapes helps in choosing appropriate statistical tests and models.

  • Skewed data may require transformation or non-parametric methods.

  • Sample distributions are used to infer population characteristics in inferential statistics.

Additional info: Academic context and examples have been expanded for clarity and completeness.

Pearson Logo

Study Prep