BackStatistical Power and Sample Size Determination
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Statistical Power and Sample Size Determination
Introduction
Statistical power and sample size calculations are essential components of experimental design in statistics. They ensure that studies are adequately equipped to detect meaningful effects and that resources are used efficiently. This guide covers the concepts of statistical errors, power, effect size, and the practical use of power analysis in research.
Errors in Hypothesis Testing
Types of Errors
Type I Error (α): Rejecting the null hypothesis when it is actually true. The probability of a Type I error is denoted by α (significance level).
Type II Error (β): Failing to reject the null hypothesis when it is actually false. The probability of a Type II error is denoted by β.
Decision Table for Hypothesis Testing
Decision | H0 is really true (no effect) | H0 is really false (effect exists) |
|---|---|---|
Retain H0 | Correct decision: prob = 1 - α | Type II error: prob = β |
Reject H0 | Type I error: prob = α | Correct decision: prob = 1 - β |
Statistical Power
Definition and Importance
Statistical Power (1 - β): The probability of correctly rejecting a false null hypothesis (i.e., detecting a true effect).
Power is typically set at 80% as a standard in research.
Power depends on:
Significance level (α)
Sample size (n)
Effect size
Power Analysis
Purpose and Benefits
Determines the sample size needed for an experiment to achieve a desired power.
Ensures results are conclusive and interpretable.
Minimizes unnecessary use of resources and participant burden.
Addresses ethical considerations by avoiding underpowered or overpowered studies.
Types of Power Analysis
A priori power analysis: Determines the required sample size before data collection, given a desired effect size and significance level.
Post hoc power analysis: Calculates the power of a test after data collection, useful for interpreting non-significant results.
Compromised power analysis: Balances error risks when sample size is restricted, adjusting α and β accordingly.
Examples of Power Analysis
A Priori Power Analysis Example
Clinical trial comparing Drug A and control for reducing cancer cell proliferation.
Hypotheses:
H0: No difference in proliferation rate between Drug A and control.
H1: Drug A reduces proliferation rate by 15% compared to control.
Power analysis determines the sample size needed to detect a 15% difference.
Post Hoc Power Analysis Example
Given a fixed sample (e.g., 40 patients per group), power analysis can assess whether a non-significant result is due to insufficient power (high β).
Numerical Example: Fasting Blood Glucose
Population mean: 85 mg/dL, standard deviation: 7.2, sample size: 30, α = 0.05.
Critical values for hypothesis testing are calculated using:
If the true mean is 90 mg/dL, power analysis estimates the probability of detecting this difference.
How to Increase Statistical Power
1. Increase Alpha (α)
Higher α increases power but also increases the risk of Type I error.
Common values: α = 0.05 (5%), α = 0.01 (1%).
2. Increase Sample Size (n)
Larger sample sizes reduce β, increasing power.
Resource and ethical considerations must be balanced.
3. Increase Effect Size
Effect size is the magnitude of the difference between groups, relative to the standard deviation.
Larger effect sizes are easier to detect, requiring smaller sample sizes for the same power.
Effect size can be quantified using Cohen's d:
Effect Size Classification
Effect size | Difference between means |
|---|---|
Small effect | d = 0.2 standard deviations |
Medium effect | d = 0.5 standard deviations |
Large effect | d = 0.8 standard deviations |
Example: Calculating Cohen's d
Subject | Blood Glucose (mmol/L) after Chinese Tea | Blood Glucose (mmol/L) after Milk Tea | Difference |
|---|---|---|---|
Peter | 7 | 10 | 3 |
Simon | 8 | 11 | 3 |
David | 9 | 11 | 2 |
John | 11 | 13 | 2 |
Mike | 11 | 13 | 2 |
Mean | 8.6 | 10.8 | 2.2 |
SD | 1.51 | 1.48 | 1.49 |
Cohen's d = 1.472 (Large effect)
Using G*Power Software
Overview
G*Power is a statistical software tool for conducting power analyses.
Allows users to specify test type, effect size, α, β, and sample size to calculate the required parameters for adequate power.
Example Output from G*Power
For a given dataset, G*Power may suggest that 7 samples per group are needed to achieve power > 80%.
If only 5 subjects are available, the program may indicate that 2 more are needed.
Summary Table: Factors Affecting Power
Factor | Effect on Power |
|---|---|
Increase α | Increases power, but increases Type I error risk |
Increase sample size | Increases power, reduces Type II error |
Increase effect size | Increases power, easier to detect differences |
Conclusion
Understanding statistical power and sample size is crucial for designing robust experiments and interpreting results. Proper power analysis ensures that studies are neither underpowered (risking false negatives) nor wastefully overpowered, balancing scientific rigor with ethical and practical considerations.