BackSampling Distributions and the Central Limit Theorem: Study Notes
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Sampling Distributions and Estimators
Introduction to Sampling Distributions
Sampling distributions are foundational in inferential statistics, allowing us to understand how sample statistics behave when drawn from a population. Instead of focusing on individual data values, we analyze the distribution of statistics such as sample means or proportions.
Sampling Distribution of a Statistic: The probability distribution of a statistic (e.g., sample mean, sample proportion) when all possible samples of a given size are drawn from the same population.
Typically represented as a probability histogram, formula, or table.
Sampling Distribution of the Sample Proportion
The sampling distribution of the sample proportion describes the distribution of sample proportions (denoted as p̂) from all possible samples of the same size from a population.
Notation:
x = number of successes
n = sample size
N = population size
p̂ = x/n (sample proportion)
p = x/N (population proportion)
Behavior:
The distribution of sample proportions tends to approximate a normal distribution as sample size increases.
The mean of all sample proportions (p̂) equals the population proportion (p).
Example: Rolling a die 5 times and finding the proportion of odd numbers. Repeating this process many times, the distribution of sample proportions is approximately normal with mean 0.5 (since half the numbers on a die are odd).
Sampling Distribution of the Sample Mean
The sampling distribution of the sample mean (x̄) is the distribution of all possible sample means from samples of the same size drawn from a population.
Behavior:
The distribution of sample means tends to be normal, especially as sample size increases (Central Limit Theorem).
The mean of the sample means equals the population mean (μ).
Example: Rolling a die 5 times and finding the mean. Repeating this process many times, the distribution of sample means is approximately normal with mean 3.5 (the population mean for a fair die).
Estimators: Unbiased and Biased
An estimator is a statistic used to infer the value of a population parameter.
Unbiased Estimator: A statistic whose sampling distribution has a mean equal to the population parameter it estimates.
Biased Estimator: A statistic whose sampling distribution does not target the population parameter.
Unbiased Estimators | Biased Estimators |
|---|---|
Sample Proportion (p̂) | Median |
Sample Mean (x̄) | Range |
Sample Variance (s²) | Sample Standard Deviation (s) |
Note: The sample standard deviation (s) is a biased estimator of the population standard deviation (σ), but the bias is small for large samples, so s is often used in practice.
Sampling with Replacement
Sampling with replacement ensures independent events, simplifying probability calculations.
For large populations and small samples, the difference between sampling with and without replacement is negligible.
The Central Limit Theorem (CLT)
Statement of the Central Limit Theorem
The Central Limit Theorem is a key result in statistics, stating that for sufficiently large sample sizes (n > 30), the sampling distribution of the sample mean (x̄) will be approximately normal, regardless of the population's distribution.
Mean of sampling distribution:
Standard deviation (standard error):
Practical Rules for Applications
If the population is normal or n > 30, the sampling distribution of x̄ is approximately normal.
If the population is not normal and n ≤ 30, the sampling distribution of x̄ may not be normal, and normal-based methods do not apply.
Z-score for sample mean:
Considerations for Problem Solving
Check Requirements: Confirm that the population is normal or n > 30 before applying normal distribution methods to sample means.
Individual Value vs. Sample Mean:
For an individual value:
For a sample mean:
Summary Table: Key Formulas
Statistic | Mean of Sampling Distribution | Standard Deviation (Standard Error) |
|---|---|---|
Sample Proportion (p̂) | p | |
Sample Mean (x̄) | μ |
Additional info: The standard error is sometimes denoted as SEM (Standard Error of the Mean).