Hypothesis Testing and Confidence Intervals: Formula Sheet Study Guide

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Testing Means & Proportions

Overview

Statistical tests for means and proportions are fundamental tools for making inferences about populations based on sample data. These procedures allow us to test hypotheses and construct confidence intervals for population parameters.

Estimate: The sample statistic used to estimate the population parameter (e.g., sample mean \( \bar{x} \), sample proportion \( \hat{p} \)).
Standard Deviation / Standard Error: Measures variability of the estimate.
Null Hypothesis (H0): The default assumption about the population parameter.
Test Statistic: Quantifies the difference between the sample estimate and the hypothesized value, standardized by variability.

One-Proportion z-Test

Estimate: \( \hat{p} = \frac{x}{n} \), \( \hat{q} = 1 - \hat{p} \)
Standard Error:
Null Hypothesis: \( H_0: p = p_0 \)
Test Statistic:
Assumptions: Independent observations, SRS, sample size conditions.
Example: Testing if the proportion of defective items is equal to 0.05.

Two-Proportion z-Test (Unpooled and Pooled)

Estimate: \( \hat{p}_1 = \frac{x_1}{n_1} \), \( \hat{p}_2 = \frac{x_2}{n_2} \)
Standard Error (Unpooled):
Null Hypothesis: \( H_0: p_1 - p_2 = p_0 \)
Test Statistic (Unpooled):
Standard Error (Pooled): where
Test Statistic (Pooled):
Assumptions: Independent samples, SRS, sample size conditions.
Example: Comparing proportions of success in two groups.

One-Sample Mean (Known and Unknown SD)

Known SD:
Unknown SD: with df = n-1
Assumptions: Independent observations, SRS, nearly normal distribution.
Example: Testing if the average weight differs from a known value.

Two-Sample Mean Tests (Equal and Unequal Variance)

Unequal Variance:
Degrees of Freedom:
Equal Variance:
Pooled Variance:
Assumptions: Independent samples, SRS, nearly normal distribution.
Example: Comparing mean scores between two classes.

Paired Sample t-Test

Estimate: (mean of differences)
Standard Error:
Test Statistic:
Assumptions: Differences have nearly normal distribution.
Example: Before-and-after measurements on the same subjects.

Testing Categorical Data (Chi-Square Tests)

Overview

Chi-square tests are used to analyze categorical data, testing hypotheses about distributions or relationships between variables.

Goodness of Fit: Tests if observed frequencies match expected frequencies.
Independence/Homogeneity: Tests if two categorical variables are independent or if distributions are the same across groups.

Chi-Square Goodness of Fit

Degrees of Freedom: # cells - 1
Expected Frequency:
Test Statistic:
Null Hypothesis: Population proportions are as specified.
Assumptions: Count data, independence, SRS, large sample, expected counts ≥ 5.

Chi-Square Test of Independence/Homogeneity

Degrees of Freedom: (r-1)(c-1)
Expected Frequency:
Test Statistic:
Null Hypothesis: Variables are independent.

Testing for Non-Normal Distributions (Nonparametric Tests)

Overview

Nonparametric tests are used when data do not meet the assumptions of normality. They are based on ranks rather than actual values.

Wilcoxon Signed-Rank Test: Tests median against a hypothesized value.
Paired Wilcoxon Signed-Rank: Tests median of differences.
Mann-Whitney U (Wilcoxon Sum Rank): Compares distributions of two independent samples.

Wilcoxon Signed-Rank Test

Null Hypothesis: Median = θ0
Test Statistic: where and are sums of ranks of negative and positive differences.
Assumptions: Independent observations, SRS, reasonably symmetric distribution.

Mann-Whitney U Test

Null Hypothesis: Two populations have identical distributions.
Test Statistic: where
Assumptions: Independent observations, SRS, reasonably symmetric distributions.

Hypothesis Testing Decision Rules

Overview

Decision rules specify how to interpret test statistics and p-values to accept or reject hypotheses.

One-sided tests: For Ha: parameter > value, reject if test statistic > critical value.
Two-sided tests: For Ha: parameter ≠ value, reject if |test statistic| > critical value.
Chi-square tests: Reject if
Nonparametric tests: Reject if test statistic falls in critical region.

Confidence Interval Formulas

Overview

Confidence intervals provide a range of plausible values for population parameters based on sample statistics.

Two-sided CI: or
One-sided upper bound CI for means: or
One-sided upper bound CI for proportions:
One-sided lower bound CI for means: or
One-sided lower bound CI for proportions:

Margin of Error and Sample Size Calculations

Overview

Margin of error quantifies the uncertainty in an estimate. Sample size formulas help determine how large a sample is needed to achieve a desired margin of error.

Margin of Error (ME): The maximum expected difference between the estimate and the true parameter.
Sample Size for Proportions:
Sample Size for Means:

Summary Table: Hypothesis Test Procedures

Test	Estimate	Null Hypothesis	Assumptions
1-proportion z-test	\( \hat{p} \)	\( p = p_0 \)	Independence, SRS, sample size
2-proportion z-test (unpooled)	\( \hat{p}_1 - \hat{p}_2 \)	\( p_1 - p_2 = p_0 \)	Independence, SRS, sample size
2-proportion z-test (pooled)	\( \hat{p}_1 - \hat{p}_2 \)	\( p_1 - p_2 = 0 \)	Independence, SRS, sample size
1-sample mean (known SD)	\( \bar{x} \)	\( \mu = \mu_0 \)	Independence, SRS, normality
1-sample mean (unknown SD)	\( \bar{x} \)	\( \mu = \mu_0 \)	Independence, SRS, normality
2-sample mean (unequal variance)	\( \bar{x}_1 - \bar{x}_2 \)	\( \mu_1 - \mu_2 = \mu_0 \)	Independence, SRS, normality
2-sample mean (equal variance)	\( \bar{x}_1 - \bar{x}_2 \)	\( \mu_1 - \mu_2 = \mu_0 \)	Independence, SRS, normality
Paired t-test	\( \bar{d} \)	\( \mu_d = \mu_0 \)	Independence, SRS, normality of differences

Summary Table: Chi-Square and Nonparametric Tests

Test	Null Hypothesis	Assumptions
Chi-square Goodness of Fit	Population proportions as specified	Count data, independence, SRS, expected counts ≥ 5
Chi-square Independence/Homogeneity	Variables are independent	Count data, independence, SRS, expected counts ≥ 5
Wilcoxon Signed-Rank	Median = θ0	Independence, SRS, symmetric distribution
Mann-Whitney U	Identical distributions	Independence, SRS, symmetric distributions

Additional info:

"SRS" stands for Simple Random Sample.
"n < 0.1N" means the sample size is less than 10% of the population, ensuring independence.
Critical values (zα, tα,df, χ2α,df) are determined by the significance level (α) and degrees of freedom.
For hypothesis tests, p-values are calculated based on the test statistic and compared to α to make decisions.