Skip to main content
Back

Statistical Power and Sample Size Determination

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Statistical Power and Sample Size Determination

Introduction

Statistical power and sample size calculations are essential components of experimental design in statistics. They ensure that studies are adequately equipped to detect meaningful effects and that resources are used efficiently. This guide covers the concepts of statistical errors, power, effect size, and the practical use of power analysis in research.

Errors in Hypothesis Testing

Types of Errors

  • Type I Error (α): Rejecting the null hypothesis when it is actually true. The probability of a Type I error is denoted by α (significance level).

  • Type II Error (β): Failing to reject the null hypothesis when it is actually false. The probability of a Type II error is denoted by β.

Decision Table for Hypothesis Testing

Decision

H0 is really true (no effect)

H0 is really false (effect exists)

Retain H0

Correct decision: prob = 1 - α

Type II error: prob = β

Reject H0

Type I error: prob = α

Correct decision: prob = 1 - β

Statistical Power

Definition and Importance

  • Statistical Power (1 - β): The probability of correctly rejecting a false null hypothesis (i.e., detecting a true effect).

  • Power is typically set at 80% as a standard in research.

  • Power depends on:

    • Significance level (α)

    • Sample size (n)

    • Effect size

Power Analysis

Purpose and Benefits

  • Determines the sample size needed for an experiment to achieve a desired power.

  • Ensures results are conclusive and interpretable.

  • Minimizes unnecessary use of resources and participant burden.

  • Addresses ethical considerations by avoiding underpowered or overpowered studies.

Types of Power Analysis

  • A priori power analysis: Determines the required sample size before data collection, given a desired effect size and significance level.

  • Post hoc power analysis: Calculates the power of a test after data collection, useful for interpreting non-significant results.

  • Compromised power analysis: Balances error risks when sample size is restricted, adjusting α and β accordingly.

Examples of Power Analysis

A Priori Power Analysis Example

  • Clinical trial comparing Drug A and control for reducing cancer cell proliferation.

  • Hypotheses:

    • H0: No difference in proliferation rate between Drug A and control.

    • H1: Drug A reduces proliferation rate by 15% compared to control.

  • Power analysis determines the sample size needed to detect a 15% difference.

Post Hoc Power Analysis Example

  • Given a fixed sample (e.g., 40 patients per group), power analysis can assess whether a non-significant result is due to insufficient power (high β).

Numerical Example: Fasting Blood Glucose

  • Population mean: 85 mg/dL, standard deviation: 7.2, sample size: 30, α = 0.05.

  • Critical values for hypothesis testing are calculated using:

  • If the true mean is 90 mg/dL, power analysis estimates the probability of detecting this difference.

How to Increase Statistical Power

1. Increase Alpha (α)

  • Higher α increases power but also increases the risk of Type I error.

  • Common values: α = 0.05 (5%), α = 0.01 (1%).

2. Increase Sample Size (n)

  • Larger sample sizes reduce β, increasing power.

  • Resource and ethical considerations must be balanced.

3. Increase Effect Size

  • Effect size is the magnitude of the difference between groups, relative to the standard deviation.

  • Larger effect sizes are easier to detect, requiring smaller sample sizes for the same power.

  • Effect size can be quantified using Cohen's d:

Effect Size Classification

Effect size

Difference between means

Small effect

d = 0.2 standard deviations

Medium effect

d = 0.5 standard deviations

Large effect

d = 0.8 standard deviations

Example: Calculating Cohen's d

Subject

Blood Glucose (mmol/L) after Chinese Tea

Blood Glucose (mmol/L) after Milk Tea

Difference

Peter

7

10

3

Simon

8

11

3

David

9

11

2

John

11

13

2

Mike

11

13

2

Mean

8.6

10.8

2.2

SD

1.51

1.48

1.49

Cohen's d = 1.472 (Large effect)

Using G*Power Software

Overview

  • G*Power is a statistical software tool for conducting power analyses.

  • Allows users to specify test type, effect size, α, β, and sample size to calculate the required parameters for adequate power.

Example Output from G*Power

  • For a given dataset, G*Power may suggest that 7 samples per group are needed to achieve power > 80%.

  • If only 5 subjects are available, the program may indicate that 2 more are needed.

Summary Table: Factors Affecting Power

Factor

Effect on Power

Increase α

Increases power, but increases Type I error risk

Increase sample size

Increases power, reduces Type II error

Increase effect size

Increases power, easier to detect differences

Conclusion

Understanding statistical power and sample size is crucial for designing robust experiments and interpreting results. Proper power analysis ensures that studies are neither underpowered (risking false negatives) nor wastefully overpowered, balancing scientific rigor with ethical and practical considerations.

Pearson Logo

Study Prep