BackStatistical Hypothesis Testing and Confidence Intervals: Key Concepts and Applications
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Chapter 16: Hypothesis Testing Fundamentals
Null and Alternative Hypotheses
In statistical hypothesis testing, we begin by stating two competing hypotheses:
Null Hypothesis (H0): The default assumption that there is no effect or no difference.
Alternative Hypothesis (Ha): The hypothesis that there is an effect or a difference.
For example, in testing whether a coin is fair:
H0: The coin is fair (p = 0.5).
Ha: The coin is not fair (p ≠ 0.5).
P-Value and Significance Level
The p-value measures the probability of observing data as extreme as the sample, assuming the null hypothesis is true. The significance level () is the threshold for rejecting H0.
If p-value < , reject H0.
If p-value > , fail to reject H0.
One-Sided vs. Two-Sided Tests
A one-sided test checks for deviation in one direction (e.g., greater than), while a two-sided test checks for deviation in both directions.
One-sided p-value: Probability of observing a value as extreme in one direction.
Two-sided p-value: Probability of observing a value as extreme in either direction (often double the one-sided p-value).
Calculating P-Values for Z-Scores
For a z-score, the p-value is found using the standard normal distribution:
One-sided: or
Two-sided:
Assumptions for One-Proportion Z-Test
Random sample
Sample size large enough for normal approximation: and
Performing a One-Proportion Z-Test
Calculate test statistic:
Find p-value and compare to
Stating Conclusions
Clearly state whether H0 is rejected or not, and interpret in context.
Chapter 18: Confidence Intervals and One-Sample T-Test
Constructing a Confidence Interval for a Mean
A confidence interval estimates the range in which a population mean likely falls.
Formula:
= sample mean, = sample standard deviation, = sample size, = critical value from t-distribution
Interpreting a Confidence Interval
A 95% confidence interval means that, in repeated sampling, 95% of intervals will contain the true mean.
Assumptions for T-Test and Confidence Interval
Random sample
Approximately normal population or large sample size ()
Performing a One-Sample T-Test
Test statistic:
Compare p-value to
Interpreting T-Test Output
State whether the sample mean is significantly different from the hypothesized mean.
Chapter 19: Two-Sample T-Test and Confidence Intervals for Differences
Assumptions for Two-Sample T-Test
Independent random samples
Approximately normal populations or large sample sizes
Confidence Interval for Difference Between Means
Formula:
Using Confidence Interval to Approximate Hypothesis Test
If the interval does not contain 0, there is evidence of a difference.
Performing a Two-Sample T-Test
Test statistic:
Interpreting Output
State whether there is a significant difference between the two means.
Chapter 20: Paired T-Test, Sign Test, and Choosing Tests
Graphs of Distributions
Visualize data using histograms, boxplots, or density plots to assess normality and identify outliers.
Paired T-Test
Used when samples are paired (e.g., before-and-after measurements).
Test statistic: where is mean difference, is standard deviation of differences.
Interpreting Paired T-Test Output
Determine if the mean difference is significant.
Choosing Appropriate Test
Assess data type (mean or proportion), sample structure (paired or independent), and assumptions.
Confidence Interval for a Difference
Construct as for two-sample or paired data, depending on design.
Sign Test for Paired Samples
Non-parametric test for paired data; counts number of positive and negative differences.
Z-Test for Proportion in Sign Test
Used to find p-value for sign test when sample size is large.
Interpreting Sign Test Output
State whether there is a significant difference in paired samples.
Chapter 28 (Sections 28.1 and 28.3): Errors and Reporting
Type I and Type II Errors
Type I Error: Rejecting H0 when it is true (false positive).
Type II Error: Failing to reject H0 when it is false (false negative).
Reporting Results
Clearly state test used, assumptions checked, p-value, and conclusion.
Provide biological or practical interpretation if relevant.
Study Design for Testing Means
Ensure randomization, appropriate sample size, and correct test selection.
Chapter 29 (Second Edition, Sections 29.1 and 29.3): Non-Parametric Tests
Wilcoxon Rank Sum Test
Non-parametric test for comparing two independent samples.
Assumptions: independent samples, ordinal or continuous data.
Interpreting Wilcoxon Rank Sum Test Output
State whether there is a significant difference in distributions.
Wilcoxon Signed Rank Test
Non-parametric test for paired samples.
Assumptions: paired samples, differences are symmetric.
Interpreting Wilcoxon Signed Rank Test Output
State whether there is a significant difference in paired data.
When to Use Non-Parametric Tests
When data do not meet parametric test assumptions (e.g., non-normality, ordinal data).
Summary Table: Choosing the Right Statistical Test
Test | Data Type | Sample Structure | Assumptions |
|---|---|---|---|
One-Proportion Z-Test | Proportion | Single sample | Large sample, random |
One-Sample T-Test | Mean | Single sample | Normality or large n |
Two-Sample T-Test | Mean | Two independent samples | Normality or large n |
Paired T-Test | Mean | Paired samples | Normality of differences |
Sign Test | Paired differences | Paired samples | None (non-parametric) |
Wilcoxon Rank Sum | Ordinal/continuous | Two independent samples | Independent samples |
Wilcoxon Signed Rank | Ordinal/continuous | Paired samples | Symmetric differences |
Additional info: Some details on formulas and assumptions were expanded for clarity and completeness.