Skip to main content
Back

Confidence Intervals and Sample Size in Statistics

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Confidence Intervals for Proportions

Introduction

Confidence intervals are a fundamental concept in statistics, used to estimate the range within which a population parameter is likely to fall, based on sample data. This section focuses on constructing confidence intervals for population proportions.

  • Population proportion (p-hat): The fraction of the sample exhibiting a particular characteristic.

  • Critical z-score (Z): The value from the standard normal distribution corresponding to the desired confidence level (e.g., 1.96 for 95%).

  • Margin of error (E): The maximum expected difference between the true population parameter and a sample estimate.

Key Formulas

  • Sample proportion: , where x is the number of successes and n is the sample size.

  • Complement of sample proportion:

  • Margin of error for proportion:

  • Confidence interval:

Example

If a survey of 100 people finds that 55 eat breakfast every morning:

  • For 95% confidence,

  • Confidence interval: or

Interpretation: We are 95% confident that the true proportion of people who eat breakfast every morning is between 45.2% and 64.8%.

Practice Question

  • Given a sample of 100 people, 60 drink coffee every morning. Find the 95% confidence interval for this proportion.

  • Steps:

    1. Calculate and

    2. Find the critical z-score

    3. Compute the margin of error

    4. State the confidence interval

Determining Sample Size for a Desired Margin of Error

Introduction

To ensure that a sample estimate is within a specified margin of error at a given confidence level, it is necessary to calculate the required sample size before conducting a survey or experiment.

  • Desired margin of error (E): The maximum acceptable difference between the sample estimate and the true population value.

  • Estimated proportion (p): If unknown, use 0.5 for maximum variability.

Key Formula

  • Sample size for proportion:

Example

To be within 4 percentage points (E = 0.04) at 95% confidence, assuming :

  • Round up: Sample size needed is 451

Practice Question

  • Find the sample size needed to be within 5 percentage points (E = 0.05) at 95% confidence, assuming no prior knowledge of the proportion.

Confidence Intervals for Means (Using t-distribution)

Introduction

When estimating the mean of a population from sample data, especially with small samples or unknown population standard deviation, the t-distribution is used to construct confidence intervals.

  • Sample mean (x̄): The average value from the sample.

  • Sample standard deviation (s): Measures the spread of sample data.

  • Degrees of freedom (df): , where n is the sample size.

  • Critical t-value (tα/2): Value from the t-distribution table for the desired confidence level and degrees of freedom.

  • Margin of error (E):

Key Formula

  • Confidence interval for mean:

Example

Sample size = 40, sample mean = 30, sample standard deviation = 4, 95% confidence:

  • Degrees of freedom:

  • Critical t-value (from table):

  • Margin of error:

  • Confidence interval: or

Practice Question

  • Construct a confidence interval for a mean of 20, sample size 49, sample standard deviation 3.

Summary Table: Confidence Interval Formulas

Parameter

Formula

Distribution

Proportion

Normal (z)

Mean (unknown population SD)

t-distribution

Sample size for proportion

Normal (z)

Additional info:

  • When the population proportion is unknown, use for maximum sample size.

  • Always round up sample size calculations to ensure the margin of error is not exceeded.

  • For small samples (typically n < 30), use the t-distribution for means.

Pearson Logo

Study Prep