BackConfidence Intervals for Proportions
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Chapter 15: Confidence Intervals for Proportions
Introduction to Confidence Intervals for Proportions
Confidence intervals provide a range of plausible values for a population parameter, such as a proportion, based on sample data. This chapter focuses on constructing and interpreting confidence intervals for population proportions.
Population proportion (p): The true proportion of interest in the population.
Sample proportion (\(\hat{p}\)): The proportion observed in the sample, used to estimate \(p\).
If the sample size \(n\) is sufficiently large (\(np \geq 10\) and \(n(1-p) \geq 10\)), the sampling distribution of \(\hat{p}\) is approximately normal:
$
The standard error (SE) of \(\hat{p}\) is:
$
The 68-95-99.7 Rule and Confidence Intervals
The empirical rule (68-95-99.7 rule) helps us understand how sample proportions vary:
About 68% of sample proportions fall within \(\hat{p} \pm SE(\hat{p})\) of \(p\).
About 95% fall within \(\hat{p} \pm 2SE(\hat{p})\) of \(p\).
About 99.7% fall within \(\hat{p} \pm 3SE(\hat{p})\) of \(p\).
These intervals are called confidence intervals for the population proportion.
Confidence Intervals for the Population Proportion
General formula for a confidence interval (CI):
$
\(z^*\) is the critical value from the standard normal distribution, determined by the desired confidence level.
Confidence Level (C)
The probability (expressed as a percentage) that a confidence interval encloses the population proportion \(p\).
Common confidence levels: 90%, 95%, 99%.
The confidence level determines the critical value \(z^*\):
$
For example, for a 90% CI, \(z^* = 1.645\).
Margin of Error (ME)
Measures the width of the confidence interval and the precision of the estimate.
$
A larger margin of error means a wider interval (more likely to contain \(p\)), but less precision.
Assumptions and Conditions for Constructing a Confidence Interval
The sample is randomly drawn from the population (to avoid bias).
Sampled values are independent (sample size is a small fraction of the population size).
Sample size is sufficiently large: \(np \geq 10\) and \(nq \geq 10\), where \(q = 1-p\).
Properties of a Confidence Interval
Centered at the sample proportion \(\hat{p}\).
Width increases with higher confidence level (lower precision).
Width decreases as sample size increases (greater precision).
Interpretation of a Confidence Interval
Over many repeated samples, a proportion \(C\)% of confidence intervals constructed will contain the true population proportion \(p\).
We say we are \(C\)% confident that the true proportion falls within the interval obtained from our sample.
Important Notes about Confidence Intervals
Confidence intervals are constructed for parameters (like \(p\)), not statistics (like \(\hat{p}\)).
The population proportion \(p\) is fixed; the sample proportion \(\hat{p}\) and the confidence interval are random due to sampling variability.
For a specific interval (e.g., (0.50, 0.65)), we are 95% confident it contains \(p\), but \(p\) either is or is not in the interval—we do not assign a probability to this one interval.
Sample Size Determination for Estimating a Population Proportion
To achieve a desired margin of error (ME) at a given confidence level, the required sample size \(n\) is:
$
If prior estimates of \(\hat{p}\) are available, use them.
If no prior estimates, use \(\hat{p} = 0.5\) (maximizes the required sample size).
Example Exercise
A university dean randomly selects 200 students and finds that 92 receive financial aid.
(a) Construct a 95% confidence interval for the true proportion of students who receive financial aid.
(b) Interpret the confidence interval.
(c) If the sample size is tripled, the width of the 95% confidence interval will decrease (since standard error decreases as \(n\) increases).
(d) To estimate the proportion within 0.05 with 95% confidence, calculate the required sample size using the formula above.
(e) If no prior information is available, use \(\hat{p} = 0.5\) for the calculation.
Appendix: Maximizing Sample Size
To ensure the margin of error does not exceed a specified value, use \(\hat{p} = 0.5\) when no prior information is available, as this maximizes \(\hat{p}(1-\hat{p})\) and thus the required sample size.