Statistical Comparison of Two Independent Groups: t-Test and Mann-Whitney U Test

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Comparing Two Independent Groups

Introduction

In biological research, comparing two independent groups is a common method to determine if there is a significant difference between them. This is often done using statistical tests such as the two sample t-test or its non-parametric equivalent, the Mann-Whitney U test. These methods are essential for analyzing experimental data and drawing valid scientific conclusions.

Independent groups are groups where the observations in one group are not related to those in the other.
Examples include comparing male vs. female animals, treatment vs. control groups, or different environmental conditions.

Two Sample t-Test

Definition and Purpose

The two sample t-test is a parametric test used to compare the means of two independent groups. It determines whether the difference between the group means is statistically significant.

Parametric test: Assumes data are normally distributed and samples are random.
Used when both groups meet the assumptions of normality and random sampling.

Assumptions

Both samples are randomly selected.
Both samples have normal distributions (check with a histogram).

t-Test Statistic Formula

The t-test statistic quantifies the difference between the two sample means relative to the standard error of the difference:

and are the means of groups A and B.
is the standard error of the difference between means.

Interpreting Results

The test statistic indicates how closely the data match the null hypothesis.
The p-value tells us the probability of observing the data if the null hypothesis is true.
If p < 0.05, reject the null hypothesis (statistically significant difference).

Example: Horned Lizards

Research question: Do horned lizards killed by shrikes have shorter horns?

Null hypothesis (H0): Mean horn length does not differ between killed and surviving lizards.
Alternative hypothesis (HA): Mean horn length does differ between groups.

Variables Table

hornlength	condition
25.3	dead
21.7	dead
16.8	dead
24.5	dead
23.4	dead
19.9	dead
29.6	alive
25.2	alive

Additional info: The explanatory variable is 'condition' (dead/alive), and the response variable is 'hornlength'.

Checking Assumptions

Use a histogram to check for normality of the numeric variable.
Ensure samples are random and representative.

Reporting Results

Report means, t-value, p-value, and degrees of freedom.
Example: "Horned lizards killed by shrikes had significantly shorter horns than live lizards ()."

Graphical Presentation

Use bar graphs with error bars to show mean differences.
Label axes and include sample sizes.

Mann-Whitney U Test (Wilcoxon Rank Sum)

Definition and Purpose

The Mann-Whitney U test is a non-parametric alternative to the t-test. It compares the medians of two independent groups and does not require normal distribution.

Used when data are not normally distributed.
Considers medians rather than means.

Assumptions

Samples are independent and randomly selected.
No assumption of normality.

Hypotheses

Null hypothesis (H0): Median horn length does not differ between groups.
Alternative hypothesis (HA): Median horn length does differ between groups.

Reporting Results

Report medians, U-value, p-value, and sample sizes.
Example: "Horned lizards killed by shrikes had significantly shorter horns than live lizards ()."

Graphical Presentation

Use box plots to show median, interquartile range, and whiskers for range.

Summary Table: Parametric vs. Non-parametric Tests

	Parametric	Non-parametric
Unpaired	t-test	Wilcoxon rank sum

Key Points for Analysis

Always check assumptions before choosing a test.
Use histograms to assess normality.
Report results with appropriate statistics and graphical representation.
Choose non-parametric tests when data do not meet parametric assumptions.

Additional info:

These statistical methods are foundational for interpreting experimental results in biology, especially when comparing traits or responses between two groups.
Understanding the difference between paired and unpaired data is crucial for selecting the correct test.