Skip to main content
Back

The Central Limit Theorem and Sampling Distributions

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

The Central Limit Theorem and Sampling Distributions

Introduction to Sampling Distributions

The concept of a sampling distribution is fundamental in statistics, especially when analyzing the behavior of sample means. A sampling distribution describes the distribution of a statistic (such as the sample mean or sample proportion) when all possible samples of a given size are drawn from the same population. Understanding sampling distributions is essential for making inferences about population parameters based on sample data.

Definition of sampling distribution of a statistic

Simulation of Sampling Distributions

Statistical software, such as StatCrunch, can be used to simulate sampling distributions for various statistics and populations. This allows for the exploration of concepts such as the Central Limit Theorem and the behavior of sample means.

StatCrunch sampling distributions interface

Sampling from a Uniform Distribution

Consider a random variable X with a uniform distribution, X ~ U(0, 50). The mean and standard deviation of the original population are important reference points for understanding the sampling distribution of sample means.

Population mean and standard deviation

  • Mean: 25

  • Median: 25

  • Standard deviation: 14.4309

When calculating the mean of thousands of samples for different sample sizes (n = 4, 16, 100), observe the following:

  • Shape: As n increases, the sampling distribution of means becomes more normal in shape.

  • Center: The mean of the sampling distribution remains close to the population mean.

  • Spread: The standard deviation of the sampling distribution decreases as n increases.

Visualizing the Central Limit Theorem

The Central Limit Theorem (CLT) states that for all samples of the same size n with n > 30, the sampling distribution of the sample mean can be approximated by a normal distribution, regardless of the shape of the original population distribution. The mean of the sampling distribution is equal to the population mean, and the standard deviation is equal to the population standard deviation divided by the square root of n.

Central Limit Theorem definitionSampling procedure and distribution of sample means

  • Formula for the mean of sample means:

  • Formula for the standard deviation of sample means:

Notation for the sampling distribution of sample means

Comparison of Sampling Distributions

The shape of the sampling distribution of the sample mean depends on the sample size and the shape of the original population distribution. As sample size increases, the sampling distribution becomes more normal, even if the original population is not normal.

Comparison of sampling distributions for different population shapes and sample sizes

Population Shape

Sample Size n = 2

Sample Size n = 5

Sample Size n = 30

Uniform

Wide, flat

More peaked

Normal-like

Bimodal

Two peaks

Peaks merge

Normal-like

Skewed

Skewed

Less skewed

Normal-like

Normal

Normal

Normal

Normal

Example: If the original population is skewed, the sampling distribution of the mean will become more normal as n increases.

Applications of the Central Limit Theorem

The Central Limit Theorem is used to calculate probabilities related to sample means. For example, if body temperatures are normally distributed with mean 98.2°F and standard deviation 0.6°F, the probability that a randomly selected person has a temperature above 99°F can be calculated using the normal distribution. For a group of 16 people, the sampling distribution of the mean is used, with standard deviation .

  • Formula for z-score of sample mean:

  • Application: Use the z-score to find probabilities and percentiles for sample means.

Example: For Tylenol weights, if X ~ N(605.1 mg, 5.96 mg), the probability of observing an average weight of 600 mg in a sample of 20 pills can be calculated using the sampling distribution of the mean.

Additional info: The probability of seeing a statistic at least as extreme as x is called a p-value, which is discussed further in hypothesis testing (Chapter 8).

Pearson Logo

Study Prep