Skip to main content
Back

Chapter 11-B

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

ANOVA: Analysis of Variance

Introduction to ANOVA

Analysis of Variance (ANOVA) is a statistical method used to test whether there are significant differences between the means of three or more independent groups. It is commonly applied in experimental and observational studies to compare group means and determine if at least one group mean is different from the others.

  • Purpose: To test the null hypothesis that all group means are equal.

  • Applications: Comparing means across different treatments, locations, or time periods.

Key Terms and Definitions

  • Factor: The categorical independent variable (e.g., plant, engine type).

  • Level: The different groups or categories within a factor.

  • Response Variable: The quantitative outcome measured (e.g., AQI, dBA, number of accidents).

  • Between-group Variation: Variation due to differences between group means.

  • Within-group Variation: Variation within each group (random error).

Constructing the ANOVA Table

ANOVA Table Structure and Formulas

The ANOVA table summarizes the sources of variation, their degrees of freedom, sum of squares, mean squares, F-statistic, and p-value.

Source

df

SS

MS

F

p-Value

Between

SSB

MSB

Computed

Within

SSW

MSW

Total

SST

  • SSB (Sum of Squares Between):

  • SSW (Sum of Squares Within):

  • SST (Total Sum of Squares):

  • MSB (Mean Square Between):

  • MSW (Mean Square Within):

  • F-statistic:

  • p-value: Probability of observing an F-statistic as extreme as, or more extreme than, the observed value under the null hypothesis.

Computing the F-statistic

  • Calculate the mean square between groups (MSB) and within groups (MSW).

  • Compute the F-statistic:

  • Compare the F-statistic to the critical value from the F-distribution or use the p-value for decision-making.

p-value Calculation:

Steps in ANOVA Hypothesis Testing

Step 1: Hypothesize – Formulation

  • Null Hypothesis (): All group means are equal.

  • Alternative Hypothesis (): At least one group mean is different. Not all are equal

Step 2: Prepare – Check Conditions

  • Set the significance level (e.g., 0.05, 0.02, 0.15).

  • Random samples are required.

  • Independence between and within groups (random assignment).

  • Normality of the response variable within each group.

Step 3: Compute (Statistics) to Compare

  • Calculate the ANOVA table values (SS, df, MS, F, p-value).

  • Use the formulas above to compute each component.

Step 4: Interpret and Conclude

  • Compare the p-value with :

    • If p-value , reject the null hypothesis ().

    • If p-value , do not reject the null hypothesis ().

  • State the conclusion in the context of the problem.

Worked Examples

Example 1: Air Quality Index (AQI) in Plants

A researcher collects AQI readings from three plants (A, B, C) to test if the mean AQI varies by plant.

AQI Readings

Plant A

Plant B

Plant C

75

43

6

39

80

11

49

33

19

28

52

22

27

47

12

  • Step 1: ; Not all means are equal.

  • Step 2: ; independence and normality assumed.

  • Step 3: ANOVA table constructed as follows:

Source

df

SS

MS

F

p-Value

Between

2

2984.3323

1492.1661

4.871

0.028

Within

12

3565.1533

297.0961

Total

14

6549.4866

  • Step 4: Since p-value (0.028) > (0.02), do not reject . Not enough evidence that the means are different at 2% significance.

  • At , since 0.028 < 0.05, reject at 5% significance.

Example 2: Sound Levels in Engines

Testing if mean decibel readings differ among three engine types (P, Q, R).

Decibel Readings (dBA)

Engine P

Engine Q

Engine R

79

90

35

65

80

20

30

40

40

55

60

25

  • Step 1: ; Not all means are equal.

  • Step 2: ; independence and normality assumed.

  • Step 3: ANOVA table constructed as follows:

Source

df

SS

MS

F

p-Value

Between

2

2767.545

1383.773

2.466

0.129

Within

9

374.111

41.568

Total

11

3141.656

  • Step 4: Since p-value (0.129) > (0.02), do not reject . Not enough evidence that the means are different at 2% significance.

  • At , since 0.129 < 0.15, reject at 15% significance.

Example 3: Annual Accidents in Plants

Testing if the mean number of annual accidents differs among four plants (A, B, C, D).

# of Annual Accidents

Plant A

Plant B

Plant C

Plant D

2

2

5

6

3

1

3

9

5

2

6

8

4

1

7

8

  • Step 1: ; Not all means are equal.

  • Step 2: ; independence and normality assumed.

  • Step 3: ANOVA table constructed as follows:

Source

df

SS

MS

F

p-Value

Between

3

54.1500

18.0500

3.8201

0.0307

Within

16

75.6000

4.7250

Total

19

129.7500

  • Step 4: Since p-value (0.0307) < (0.05), reject . There is evidence that not all means are equal at 5% significance.

  • At , since 0.0307 > 0.02, do not reject .

Summary Table: Decision Rules in ANOVA

p-value

Significance Level ()

Decision

p-value <

e.g., 0.05

Reject

p-value >

e.g., 0.05

Do not reject

Conclusion

ANOVA is a powerful tool for comparing means across multiple groups. By following the structured steps—formulating hypotheses, checking assumptions, computing the ANOVA table, and interpreting results—researchers can make informed decisions about group differences. Always ensure assumptions are met for valid results.

Additional info: The examples provided illustrate the application of ANOVA in real-world scenarios, including environmental and industrial studies. The formulas and tables are standard for one-way ANOVA.

Pearson Logo

Study Prep