Back(Lecture 17) Sampling Distributions and the Central Limit Theorem
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Sampling Distributions of the Sample Mean
Introduction to Sampling Distributions
Sampling distributions describe how a statistic, such as the sample mean, varies from sample to sample when drawn from a population. Understanding these distributions is essential for making statistical inferences about population parameters.
Sampling Distribution: The probability distribution of a statistic (e.g., sample mean) based on all possible samples of a given size from a population.
Sample Mean (̅X): The average value calculated from a sample, used to estimate the population mean (μ).
Key Results for the Sampling Distribution of the Sample Mean
There are two main results regarding the sampling distribution of the sample mean:
Formulas for Mean and Standard Deviation: The mean and standard deviation of the sampling distribution can be calculated using population parameters.
Shape of the Distribution: The sampling distribution of the sample mean is often approximately normal, especially for large sample sizes.
Sampling Distribution When the Population is Normally Distributed
Properties of the Sampling Distribution
If a random sample of size n is drawn from a normally distributed population with mean μ and standard deviation σ, then:
The sampling distribution of the sample mean ̅X is also normally distributed.
Mean of the sampling distribution:
Standard deviation of the sampling distribution (Standard Error):
Sampling Distribution for Any Population
Central Limit Theorem (CLT)
When the population distribution is not normal (not bell-shaped), the sampling distribution of the sample mean can still become approximately normal as the sample size increases. This phenomenon is explained by the Central Limit Theorem (CLT).
Central Limit Theorem (CLT): For a random sample of size n from any population with mean μ and standard deviation σ, as n increases, the sampling distribution of the sample mean approaches a normal distribution.
Implication: The mean of the sampling distribution remains μ, and the standard deviation is .
Visualizing the CLT
Regardless of the shape of the population distribution, the sampling distribution of the sample mean becomes more bell-shaped (normal) as the sample size n increases.
Population Distribution | Sampling Distribution of ̅X (n=2) | Sampling Distribution of ̅X (n=30) |
|---|---|---|
Skewed | Skewed | Approximately Normal |
Uniform | Uniform | Approximately Normal |
Normal | Normal | Normal |
Irregular | Irregular | Approximately Normal |
Additional info: The table above summarizes how the sampling distribution of the mean becomes normal as sample size increases, regardless of the original population shape.
Example: Average Salary
Scenario Description
Peter works as a seasonal waiter. The salary for each shift follows a probability distribution with mean μ = €150 and standard deviation σ = €60. At the end of the season, Peter randomly selects pay stubs from 32 shifts and calculates the mean salary per shift.
Population Mean: μ = €150
Population Standard Deviation: σ = €60
Sample Size: n = 32
Expected Value and Variability
Expected Mean Salary per Shift: The sample mean is expected to fluctuate around the population mean, μ = €150.
Standard Deviation of the Sampling Distribution:
Interpretation: From one season to the next, Peter's mean salary per shift will vary around €150, with variability described by a standard deviation of €10.6.
Effect of Sample Size on Standard Deviation
Relationship Between Sample Size and Variability
The standard deviation of the sample mean decreases as the sample size increases, making the sample mean a more precise estimate of the population mean.
Formula:
Implication: Larger samples yield smaller standard errors, so the sample mean is more likely to be close to the population mean.
Practical Implications of the Central Limit Theorem
Making Inferences
The CLT and the formula for the standard error of the sample mean have important implications for statistical inference:
If the sampling distribution of the sample mean is approximately normal, then the sample mean falls within 2 standard deviations of the population mean with probability close to 0.95, and almost certainly within 3 standard deviations.
For large n, the sampling distribution is approximately normal even if the population distribution is not. This allows us to make inferences about population means regardless of the population's shape.
This is especially useful in practice, as the shape of the population distribution is often unknown or irregular.
Summary Table: Key Properties of the Sampling Distribution of the Sample Mean
Property | Formula | Description |
|---|---|---|
Mean | Equals the population mean | |
Standard Deviation (Standard Error) | Population standard deviation divided by square root of sample size | |
Shape | Normal (for large n) | Approximately normal for large samples, regardless of population shape |
Additional info: These properties are foundational for constructing confidence intervals and conducting hypothesis tests about population means.