BackStatistics Unit 3: Confidence Intervals and Hypothesis Testing (Chapters 9-11) – Study Guide
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Vocabulary and Notation
Key Terms
Point Estimate: A single value used to estimate a population parameter (e.g., sample mean \( \bar{x} \) estimates population mean \( \mu \)).
Confidence Interval: An interval estimate, calculated from the sample data, that is likely to contain the population parameter with a specified level of confidence.
Level of Confidence (\( (1-\alpha) \cdot 100\% \)): The probability that the confidence interval contains the true parameter.
Margin of Error: The maximum expected difference between the point estimate and the true parameter value.
Critical Value: The value that defines the endpoints of the confidence interval, based on the desired confidence level (e.g., z\( \alpha/2 \) or t\( \alpha/2 \)).
Student’s t-Distribution: A probability distribution used when estimating the mean of a normally distributed population in situations where the sample size is small and population standard deviation is unknown.
Bootstrapping: A resampling method used to estimate the sampling distribution of a statistic by sampling with replacement from the original data.
Percentile Method Confidence Interval: A confidence interval constructed from the percentiles of the bootstrap distribution.
Hypothesis: A statement about a population parameter. Includes the null hypothesis (\( H_0 \)) and alternative hypothesis (\( H_1 \)).
Hypothesis Testing: A statistical method for testing a claim about a population parameter using sample data.
Type I Error: Rejecting the null hypothesis when it is true (false positive).
Type II Error: Failing to reject the null hypothesis when it is false (false negative).
Level of Significance (\( \alpha \)): The probability of making a Type I error.
P-value: The probability, under the null hypothesis, of obtaining a result equal to or more extreme than what was actually observed.
Statistical Significance: When the observed effect is unlikely to have occurred by chance, as determined by the p-value.
Practical Significance: When the observed effect is large enough to be meaningful in real-world terms.
Independent Samples: Samples in which the observations in one sample are not related to those in the other.
Dependent Samples (Matched-Pairs): Samples in which each observation in one sample can be paired with an observation in the other sample.
Robust Test: A statistical test that is valid even when certain assumptions are violated.
Randomization Test: A nonparametric method for hypothesis testing using random resampling.
Notation
Population proportion: \( p \)
Sample proportion: \( \hat{p} \)
Population mean: \( \mu \)
Sample mean: \( \bar{x} \)
Sample standard deviation: \( s \)
One Sample Confidence Intervals
Confidence Interval for One Sample Proportion
Used to estimate the true population proportion based on a sample proportion.
Formula:
Assumptions:
Sample obtained by simple random sampling or randomized experiment.
\( n\hat{p}(1-\hat{p}) \geq 10 \)
Sampled values are independent (sample size < 5% of population size).
Example: If \( \hat{p} = 0.6 \), \( n = 100 \), and 95% confidence (\( z_{0.025} = 1.96 \)), the interval is:
t Confidence Interval for Mean
Used to estimate the population mean when the population standard deviation is unknown.
Formula:
Assumptions:
Sample obtained by simple random sampling or randomized experiment.
No outliers; population is normally distributed or \( n \geq 30 \).
Sampled values are independent.
Example: For \( \bar{x} = 50 \), \( s = 10 \), \( n = 25 \), 95% confidence (\( t_{0.025,24} \approx 2.064 \)):
One Sample Hypothesis Tests
z Test for One Sample Proportion
Tests whether the population proportion equals a specified value.
Test Statistic:
Hypotheses:
Two-tailed: \( H_0: p = p_0 \), \( H_1: p \neq p_0 \)
Left-tailed: \( H_0: p = p_0 \), \( H_1: p < p_0 \)
Right-tailed: \( H_0: p = p_0 \), \( H_1: p > p_0 \)
Assumptions:
Simple random sample or randomized experiment.
\( n p_0 (1-p_0) \geq 10 \)
Sampled values are independent (sample size < 5% of population).
t Test for Mean
Tests whether the population mean equals a specified value when the population standard deviation is unknown.
Test Statistic:
Degrees of Freedom: \( df = n - 1 \)
Hypotheses:
Two-tailed: \( H_0: \mu = \mu_0 \), \( H_1: \mu \neq \mu_0 \)
Left-tailed: \( H_0: \mu = \mu_0 \), \( H_1: \mu < \mu_0 \)
Right-tailed: \( H_0: \mu = \mu_0 \), \( H_1: \mu > \mu_0 \)
Assumptions:
Simple random sample or randomized experiment.
No outliers; population is normal or \( n \geq 30 \).
Sampled values are independent.
Two Sample Hypothesis Tests
Two Sample z Test for Proportions
Compares the proportions of two independent groups.
Test Statistic:
where \( \hat{p} = \dfrac{x_1 + x_2}{n_1 + n_2} \)
Hypotheses:
Two-tailed: \( H_0: p_1 = p_2 \), \( H_1: p_1 \neq p_2 \)
Left-tailed: \( H_0: p_1 = p_2 \), \( H_1: p_1 < p_2 \)
Right-tailed: \( H_0: p_1 = p_2 \), \( H_1: p_1 > p_2 \)
Assumptions:
Independent samples from simple random sampling or randomized experiment.
\( n\hat{p}(1-\hat{p}) \geq 10 \) for both samples.
Sample sizes < 5% of respective populations.
Two Sample t Test for Dependent Means (Matched Pairs)
Compares means from two related groups (e.g., before and after measurements).
Test Statistic:
Degrees of Freedom: \( df = n - 1 \)
Hypotheses:
Two-tailed: \( H_0: \mu_d = 0 \), \( H_1: \mu_d \neq 0 \)
Left-tailed: \( H_0: \mu_d = 0 \), \( H_1: \mu_d < 0 \)
Right-tailed: \( H_0: \mu_d = 0 \), \( H_1: \mu_d > 0 \)
Assumptions: Same as for one sample mean, but applied to the differences.
Two Sample t Test for Independent Means (Unequal Variances)
Compares means from two independent groups, not assuming equal variances.
Test Statistic:
Degrees of Freedom (Welch-Satterthwaite):
Hypotheses:
Two-tailed: \( H_0: \mu_1 = \mu_2 \), \( H_1: \mu_1 \neq \mu_2 \)
Left-tailed: \( H_0: \mu_1 = \mu_2 \), \( H_1: \mu_1 < \mu_2 \)
Right-tailed: \( H_0: \mu_1 = \mu_2 \), \( H_1: \mu_1 > \mu_2 \)
Assumptions:
Independent samples from simple random sampling or randomized experiment.
Populations are normal or sample sizes are large (\( n_1, n_2 \geq 30 \)).
Sample sizes < 5% of respective populations.
Two Sample Confidence Intervals
Confidence Interval for Difference Between Two Proportions
Formula:
Assumptions:
Independent samples from simple random sampling or randomized experiment.
\( n_1\hat{p}_1(1-\hat{p}_1) \geq 10 \) and \( n_2\hat{p}_2(1-\hat{p}_2) \geq 10 \)
Sample sizes < 5% of respective populations.
Confidence Interval for Mean of Differences (Paired Data)
Formula:
Degrees of Freedom: \( df = n - 1 \)
Assumptions: Same as for one sample mean, but applied to the differences.
Confidence Interval for Difference Between Two Independent Means (Unequal Variances)
Formula:
Degrees of Freedom: See formula above (Welch-Satterthwaite).
Assumptions:
Independent samples from simple random sampling or randomized experiment.
Populations are normal or sample sizes are large (\( n_1, n_2 \geq 30 \)).
Sample sizes < 5% of respective populations.
Sample Size Calculations
Sample Size Needed for Proportions
With Prior Estimate \( \hat{p} \):
Without Prior Estimate:
Where E is the desired margin of error (as a decimal).
Sample Size Needed for Means
Formula:
Where E is the desired margin of error.
Summary Table: Hypothesis Tests and Confidence Intervals
Test/Interval | Parameter | Statistic | Assumptions | Formula |
|---|---|---|---|---|
One-sample z for proportion | p | \( \hat{p} \) | Random sample, \( n p_0 (1-p_0) \geq 10 \), independence | |
One-sample t for mean | \( \mu \) | \( \bar{x} \) | Random sample, normality or large n, independence | |
Two-sample z for proportions | \( p_1 - p_2 \) | \( \hat{p}_1 - \hat{p}_2 \) | Independent random samples, \( n\hat{p}(1-\hat{p}) \geq 10 \) | |
Two-sample t for means (independent, unequal variances) | \( \mu_1 - \mu_2 \) | \( \bar{x}_1 - \bar{x}_2 \) | Independent random samples, normality or large n | |
Paired t for means | \( \mu_d \) | \( \bar{d} \) | Random sample of pairs, normality or large n |
Additional Info
Statistical software (e.g., StatCrunch) can be used to perform all calculations and simulations described above.
Bootstrap and randomization methods provide alternatives to traditional parametric inference, especially when assumptions are questionable.