Skip to main content
Back

Chapter 11: Chi-Square Tests – Study Notes for Business Statistics

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chi-Square Tests

Introduction

The chi-square test is a fundamental statistical tool used to analyze categorical data. It helps determine whether observed frequencies differ significantly from expected frequencies under a specific hypothesis. In business statistics, chi-square tests are commonly applied to contingency tables, tests of independence, and comparisons of proportions across groups.

Contingency Tables

Definition and Purpose

  • Contingency tables (also called cross-classification tables) are used to classify sample observations according to two or more categorical variables.

  • They are especially useful for comparing multiple population proportions and examining relationships between categorical variables.

Example: Hand Preference vs. Gender

Suppose we examine a sample of 300 children, classified by hand preference (Left/Right) and gender (Male/Female). This forms a 2 x 2 contingency table:

Gender

Hand Preference: Left

Hand Preference: Right

Total

Female

12

108

120

Male

24

156

180

Total

36

264

300

Chi-Square Test for the Difference Between Two Proportions

Hypotheses

  • Null hypothesis (H0): (Proportion of females who are left-handed equals proportion of males who are left-handed)

  • Alternative hypothesis (H1): (The two proportions are not the same)

Test Statistic

  • The chi-square test statistic is:

  • = observed frequency in a cell

  • = expected frequency in a cell (if H0 is true)

  • For a 2 x 2 table, degrees of freedom = 1

Decision Rule

  • If , reject H0; otherwise, do not reject H0.

  • is the critical value from the chi-square distribution for the chosen significance level .

Computing the Overall Proportion

  • The overall proportion of left-handed children is:

  • Where and are the counts of left-handed children in each group, and are the group sizes.

Finding Expected Frequencies

  • Expected frequency for left-handed females:

  • Expected frequency for left-handed males:

Observed vs. Expected Frequencies Table

Gender

Left (Observed)

Left (Expected)

Right (Observed)

Right (Expected)

Total

Female

12

14.4

108

105.6

120

Male

24

21.6

156

158.4

180

Total

36

264

300

Example Calculation

  • Test statistic:

  • Critical value at and 1 d.f.:

  • Since , do not reject H0. There is not sufficient evidence that the two proportions are different at .

Using P-value

  • P-value = 0.3481 > 0.05, so the null hypothesis is not rejected.

Chi-Square Test for Differences Among More Than Two Proportions

Hypotheses

  • Null hypothesis (H0): (All population proportions are equal)

  • Alternative hypothesis (H1): Not all are equal (for )

Test Statistic

  • The chi-square test statistic is:

  • Degrees of freedom:

  • Each cell should have expected frequency of at least 1.

Example: University Calendar Opinion

Opinion

Administrators

Students

Faculty

Total

Favor

63

20

37

120

Oppose

37

30

13

80

Total

100

50

50

200

  • Test statistic:

  • Critical value at , 2 d.f.:

  • Since , reject H0. There is evidence that at least one group has a different opinion.

  • P-value = 0.0017 < 0.05, so the null hypothesis is rejected.

The Marascuilo Procedure

Purpose and Steps

  • Used when the null hypothesis of equal proportions is rejected in a chi-square test.

  • Allows pairwise comparisons to determine which proportions differ significantly.

  • Calculate the observed differences for all pairs, and compare to a critical range.

Critical Range Formula

  • A pair of proportions is significantly different if

Example: Marascuilo Procedure

Opinion

Administrators

Students

Faculty

Total

Favor

63

20

37

120

Oppose

37

30

13

80

Total

100

50

50

200

  • At 1% significance, there is evidence of a difference in opinion between administrators & students and students & faculty.

Chi-Square Test of Independence

Definition and Hypotheses

  • Tests whether two categorical variables are independent (no relationship) or dependent (there is a relationship).

  • Null hypothesis (H0): The two categorical variables are independent.

  • Alternative hypothesis (H1): The two categorical variables are dependent.

Test Statistic

  • Degrees of freedom: , where r = number of rows, c = number of columns.

Expected Cell Frequencies

  • Where n = overall sample size.

Decision Rule

  • If , reject H0; otherwise, do not reject H0.

  • is the critical value from the chi-square distribution with degrees of freedom.

Example: Meal Plan and Class Standing

Class Standing

20/week

10/week

none

Total

Fresh.

24

32

14

70

Soph.

22

26

12

60

Junior

10

14

6

30

Senior

14

16

10

40

Total

70

88

42

200

  • Test statistic:

  • Critical value at , 6 d.f.:

  • P-value = 0.9943 > 0.05, so the null hypothesis is not rejected. There is not sufficient evidence of a relationship between meal plan and class standing.

Summary of Chapter 11

  • Application of the chi-square test for the difference between two proportions.

  • Application of the chi-square test for differences among more than two proportions.

  • Use of the Marascuilo procedure for pairwise comparisons after rejecting the null hypothesis of equal proportions.

  • Application of the chi-square test for independence between categorical variables.

Additional info: These notes expand on the original slides and text, providing full definitions, formulas, and context for each statistical procedure. All tables have been reconstructed and formulas provided in LaTeX format for clarity.

Pearson Logo

Study Prep