Statistical Hypothesis Testing and Confidence Intervals: Key Concepts and Applications

Notes Practice Video lessons

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 16: Hypothesis Testing Fundamentals

Null and Alternative Hypotheses

In statistical hypothesis testing, we begin by stating two competing hypotheses:

Null Hypothesis (H0): The default assumption that there is no effect or no difference.
Alternative Hypothesis (Ha): The hypothesis that there is an effect or a difference.

For example, in testing whether a coin is fair:

H0: The coin is fair (p = 0.5).
Ha: The coin is not fair (p ≠ 0.5).

P-Value and Significance Level

The p-value measures the probability of observing data as extreme as the sample, assuming the null hypothesis is true. The significance level () is the threshold for rejecting H0.

If p-value < , reject H0.
If p-value > , fail to reject H0.

One-Sided vs. Two-Sided Tests

A one-sided test checks for deviation in one direction (e.g., greater than), while a two-sided test checks for deviation in both directions.

One-sided p-value: Probability of observing a value as extreme in one direction.
Two-sided p-value: Probability of observing a value as extreme in either direction (often double the one-sided p-value).

Calculating P-Values for Z-Scores

For a z-score, the p-value is found using the standard normal distribution:

One-sided: or
Two-sided:

Assumptions for One-Proportion Z-Test

Random sample
Sample size large enough for normal approximation: and

Performing a One-Proportion Z-Test

Calculate test statistic:
Find p-value and compare to

Stating Conclusions

Clearly state whether H0 is rejected or not, and interpret in context.

Chapter 18: Confidence Intervals and One-Sample T-Test

Constructing a Confidence Interval for a Mean

A confidence interval estimates the range in which a population mean likely falls.

Formula:
= sample mean, = sample standard deviation, = sample size, = critical value from t-distribution

Interpreting a Confidence Interval

A 95% confidence interval means that, in repeated sampling, 95% of intervals will contain the true mean.

Assumptions for T-Test and Confidence Interval

Random sample
Approximately normal population or large sample size ()

Performing a One-Sample T-Test

Test statistic:
Compare p-value to

Interpreting T-Test Output

State whether the sample mean is significantly different from the hypothesized mean.

Chapter 19: Two-Sample T-Test and Confidence Intervals for Differences

Assumptions for Two-Sample T-Test

Independent random samples
Approximately normal populations or large sample sizes

Confidence Interval for Difference Between Means

Formula:

Using Confidence Interval to Approximate Hypothesis Test

If the interval does not contain 0, there is evidence of a difference.

Performing a Two-Sample T-Test

Test statistic:

Interpreting Output

State whether there is a significant difference between the two means.

Chapter 20: Paired T-Test, Sign Test, and Choosing Tests

Graphs of Distributions

Visualize data using histograms, boxplots, or density plots to assess normality and identify outliers.

Paired T-Test

Used when samples are paired (e.g., before-and-after measurements).
Test statistic: where is mean difference, is standard deviation of differences.

Interpreting Paired T-Test Output

Determine if the mean difference is significant.

Choosing Appropriate Test

Assess data type (mean or proportion), sample structure (paired or independent), and assumptions.

Confidence Interval for a Difference

Construct as for two-sample or paired data, depending on design.

Sign Test for Paired Samples

Non-parametric test for paired data; counts number of positive and negative differences.

Z-Test for Proportion in Sign Test

Used to find p-value for sign test when sample size is large.

Interpreting Sign Test Output

State whether there is a significant difference in paired samples.

Chapter 28 (Sections 28.1 and 28.3): Errors and Reporting

Type I and Type II Errors

Type I Error: Rejecting H0 when it is true (false positive).
Type II Error: Failing to reject H0 when it is false (false negative).

Reporting Results

Clearly state test used, assumptions checked, p-value, and conclusion.
Provide biological or practical interpretation if relevant.

Study Design for Testing Means

Ensure randomization, appropriate sample size, and correct test selection.

Chapter 29 (Second Edition, Sections 29.1 and 29.3): Non-Parametric Tests

Wilcoxon Rank Sum Test

Non-parametric test for comparing two independent samples.
Assumptions: independent samples, ordinal or continuous data.

Interpreting Wilcoxon Rank Sum Test Output

State whether there is a significant difference in distributions.

Wilcoxon Signed Rank Test

Non-parametric test for paired samples.
Assumptions: paired samples, differences are symmetric.

Interpreting Wilcoxon Signed Rank Test Output

State whether there is a significant difference in paired data.

When to Use Non-Parametric Tests

When data do not meet parametric test assumptions (e.g., non-normality, ordinal data).

Summary Table: Choosing the Right Statistical Test

Test	Data Type	Sample Structure	Assumptions
One-Proportion Z-Test	Proportion	Single sample	Large sample, random
One-Sample T-Test	Mean	Single sample	Normality or large n
Two-Sample T-Test	Mean	Two independent samples	Normality or large n
Paired T-Test	Mean	Paired samples	Normality of differences
Sign Test	Paired differences	Paired samples	None (non-parametric)
Wilcoxon Rank Sum	Ordinal/continuous	Two independent samples	Independent samples
Wilcoxon Signed Rank	Ordinal/continuous	Paired samples	Symmetric differences

Additional info: Some details on formulas and assumptions were expanded for clarity and completeness.