BackConfidence Intervals and Sample Size in Statistics
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Confidence Intervals for Proportions
Introduction
Confidence intervals are a fundamental concept in statistics, used to estimate the range within which a population parameter is likely to fall, based on sample data. This section focuses on constructing confidence intervals for population proportions.
Population proportion (p-hat): The fraction of the sample exhibiting a particular characteristic.
Critical z-score (Z): The value from the standard normal distribution corresponding to the desired confidence level (e.g., 1.96 for 95%).
Margin of error (E): The maximum expected difference between the true population parameter and a sample estimate.
Key Formulas
Sample proportion: , where x is the number of successes and n is the sample size.
Complement of sample proportion:
Margin of error for proportion:
Confidence interval:
Example
If a survey of 100 people finds that 55 eat breakfast every morning:
For 95% confidence,
Confidence interval: or
Interpretation: We are 95% confident that the true proportion of people who eat breakfast every morning is between 45.2% and 64.8%.
Practice Question
Given a sample of 100 people, 60 drink coffee every morning. Find the 95% confidence interval for this proportion.
Steps:
Calculate and
Find the critical z-score
Compute the margin of error
State the confidence interval
Determining Sample Size for a Desired Margin of Error
Introduction
To ensure that a sample estimate is within a specified margin of error at a given confidence level, it is necessary to calculate the required sample size before conducting a survey or experiment.
Desired margin of error (E): The maximum acceptable difference between the sample estimate and the true population value.
Estimated proportion (p): If unknown, use 0.5 for maximum variability.
Key Formula
Sample size for proportion:
Example
To be within 4 percentage points (E = 0.04) at 95% confidence, assuming :
Round up: Sample size needed is 451
Practice Question
Find the sample size needed to be within 5 percentage points (E = 0.05) at 95% confidence, assuming no prior knowledge of the proportion.
Confidence Intervals for Means (Using t-distribution)
Introduction
When estimating the mean of a population from sample data, especially with small samples or unknown population standard deviation, the t-distribution is used to construct confidence intervals.
Sample mean (x̄): The average value from the sample.
Sample standard deviation (s): Measures the spread of sample data.
Degrees of freedom (df): , where n is the sample size.
Critical t-value (tα/2): Value from the t-distribution table for the desired confidence level and degrees of freedom.
Margin of error (E):
Key Formula
Confidence interval for mean:
Example
Sample size = 40, sample mean = 30, sample standard deviation = 4, 95% confidence:
Degrees of freedom:
Critical t-value (from table):
Margin of error:
Confidence interval: or
Practice Question
Construct a confidence interval for a mean of 20, sample size 49, sample standard deviation 3.
Summary Table: Confidence Interval Formulas
Parameter | Formula | Distribution |
|---|---|---|
Proportion | Normal (z) | |
Mean (unknown population SD) | t-distribution | |
Sample size for proportion | Normal (z) |
Additional info:
When the population proportion is unknown, use for maximum sample size.
Always round up sample size calculations to ensure the margin of error is not exceeded.
For small samples (typically n < 30), use the t-distribution for means.