Confidence Intervals and Estimation of Population Parameters

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Section 8.1 Point and Interval Estimates of Population Parameters

Statistical Inference Methods

Statistical inference methods are essential in statistics for drawing conclusions about population parameters based on sample data. These methods rely on probability calculations and sampling distributions.

Randomization Assumption: Probability calculations assume data are gathered from a random sample or randomized experiment.
Sampling Distribution: The probability calculations refer to the sampling distribution of a statistic, which is often approximately normal.

Types of Statistical Inference

Estimation: Determining population parameter values.
Hypothesis Testing: Testing hypotheses about parameter values.
The most informative estimation method constructs a confidence interval, an interval believed to contain the unknown parameter value.

Point Estimate and Interval Estimate

Estimation can be done using a single value or an interval.

Point Estimate: A single number that is the best guess for the parameter.
Interval Estimate: An interval of numbers believed to contain the actual value of the parameter.
A point estimate alone does not indicate how close it is to the parameter; an interval estimate incorporates a margin of error to gauge accuracy.

Properties of Point Estimators

Unbiasedness: An estimator is unbiased if its sampling distribution is centered at the parameter it estimates.
The sample mean () is an unbiased estimator of the population mean ().
The sample proportion () is an unbiased estimator of the population proportion ().
Small Standard Deviation: A good estimator has a small standard deviation, meaning it tends to fall closer to the parameter than other estimators.
For a normal distribution, the sample mean has a smaller standard deviation than the sample median.

Confidence Interval

A confidence interval is an interval containing the most believable values for a parameter. The probability that this method produces an interval containing the parameter is called the confidence level, commonly set at 0.95 (95%).

Section 8.2 Constructing a Confidence Interval to Estimate a Population Proportion

Logic Behind Constructing a Confidence Interval

Confidence intervals are constructed using properties of sampling distributions, especially for proportions.

The sampling distribution of the sample proportion () is approximately normal for large samples, with and .
Mean of sampling distribution: (population proportion).
Standard deviation:

Margin of Error

Approximately 95% of a normal distribution falls within 1.96 standard deviations of the mean.
Margin of error:
95% confidence interval for a proportion:

Example 1: Confidence Interval for a Proportion

Sample: 315 of 1285 respondents agreed with a statement.
Estimated standard deviation: 0.01
Margin of error:
95% CI:
Interpretation: The population proportion is predicted to be between 0.29 and 0.33.

Finding the 95% Confidence Interval for a Population Proportion

Population proportion symbol:
Sample proportion symbol: ("p-hat")
For large samples, is approximately normal.
Margin of error:
Standard deviation: (unknown is estimated by )
Standard error:
95% CI:

Example 2: Paying Higher Prices to Protect the Environment

Sample: , $637\hat{p} = 0.468$)
Standard error:
95% CI:
Interpretation: Estimated population proportion is between 44% and 49%.

Sample Size Needed for Validity

For validity of the 95% CI: and
If not met, use an adjusted formula (see Section 8.4).

Using Confidence Levels Other Than 95%

Higher confidence level (e.g., 99%) increases the chance of correct inference but widens the interval.
Lower confidence level narrows the interval but increases the risk of incorrect inference.
Common z-scores: 1.645 (90%), 1.96 (95%), 2.58 (99%)

Confidence Level	Error Probability	z-Score
0.90	0.10	1.645
0.95	0.05	1.96
0.99	0.01	2.58

Example 3: Influenza Vaccine

Sample: , $26\hat{p} = 0.0067$)
Standard error:
99% CI:

Summary: Confidence Interval for a Population Proportion

Assumptions: Data obtained by randomization; sample size large enough (, )

Effects of Confidence Level and Sample Size on Margin of Error

Margin of error increases as confidence level increases.
Margin of error decreases as sample size increases.

Section 8.3 Constructing a Confidence Interval to Estimate a Population Mean

Confidence Interval for a Population Mean

To estimate a population mean, use the sample mean as the point estimate and construct a confidence interval using the standard error.

Standard error: where is the sample standard deviation.
Confidence interval:

The t Distribution and Its Properties

When population standard deviation is unknown, use the t-distribution instead of the normal distribution.
The t-distribution is bell-shaped, symmetric about 0, and has thicker tails than the normal distribution.
Degrees of freedom:
Margin of error:

Example 4: Buying on eBay

Sample: , sample mean , sample standard deviation
Standard error:
Degrees of freedom:
For 95% CI,
95% CI:
Interpretation: With 95% confidence, the mean closing price is between $569 and $599.

Robustness of the t Method

t methods are robust to most violations of normality, but outliers can affect validity.
Randomization is required for valid inference.

Standard Normal Distribution as t Distribution with Infinite df

As , the t-distribution approaches the standard normal distribution.
For large , values approach values (e.g., for 95% CI).

Section 8.4 Choosing the Sample Size for a Study

Sample Size for Estimating a Population Proportion

Decide desired margin of error () and confidence level.
Sample size formula:
Use if no prior estimate is available.

Example 5: Sample Size for Exit Poll

Desired margin of error:
Estimated ,
Required sample size:

Sample Size for Estimating a Population Mean

Sample size formula:
If unknown, estimate from similar studies or use range/6 for bell-shaped distributions.

Example 6: Estimating Mean Education in South Africa

Range: 0 to 18 years, year
Estimate
Sample size:

Other Factors Affecting Sample Size

Desired precision (margin of error)
Confidence level
Variability in the data
Cost

Using Small Sample Sizes

t methods for means are valid for any , but caution is needed for outliers and non-normality.
For proportions, need at least 15 successes and 15 failures for normal approximation.

Confidence Interval for a Proportion with Small Samples

If sample does not meet the 15 successes/failures rule, adjust by adding 2 to both successes and failures (add 4 to ).
Formula: