BackChapter 11: Chi-Square Tests – Study Notes for Business Statistics
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Chi-Square Tests
Introduction
The chi-square test is a fundamental statistical tool used to analyze categorical data. It helps determine whether observed frequencies differ significantly from expected frequencies under a specific hypothesis. In business statistics, chi-square tests are commonly applied to contingency tables, tests of independence, and comparisons of proportions across groups.
Contingency Tables
Definition and Purpose
Contingency tables (also called cross-classification tables) are used to classify sample observations according to two or more categorical variables.
They are especially useful for comparing multiple population proportions and examining relationships between categorical variables.
Example: Hand Preference vs. Gender
Suppose we examine a sample of 300 children, classified by hand preference (Left/Right) and gender (Male/Female). This forms a 2 x 2 contingency table:
Gender | Hand Preference: Left | Hand Preference: Right | Total |
|---|---|---|---|
Female | 12 | 108 | 120 |
Male | 24 | 156 | 180 |
Total | 36 | 264 | 300 |
Chi-Square Test for the Difference Between Two Proportions
Hypotheses
Null hypothesis (H0): (Proportion of females who are left-handed equals proportion of males who are left-handed)
Alternative hypothesis (H1): (The two proportions are not the same)
Test Statistic
The chi-square test statistic is:
= observed frequency in a cell
= expected frequency in a cell (if H0 is true)
For a 2 x 2 table, degrees of freedom = 1
Decision Rule
If , reject H0; otherwise, do not reject H0.
is the critical value from the chi-square distribution for the chosen significance level .
Computing the Overall Proportion
The overall proportion of left-handed children is:
Where and are the counts of left-handed children in each group, and are the group sizes.
Finding Expected Frequencies
Expected frequency for left-handed females:
Expected frequency for left-handed males:
Observed vs. Expected Frequencies Table
Gender | Left (Observed) | Left (Expected) | Right (Observed) | Right (Expected) | Total |
|---|---|---|---|---|---|
Female | 12 | 14.4 | 108 | 105.6 | 120 |
Male | 24 | 21.6 | 156 | 158.4 | 180 |
Total | 36 | 264 | 300 |
Example Calculation
Test statistic:
Critical value at and 1 d.f.:
Since , do not reject H0. There is not sufficient evidence that the two proportions are different at .
Using P-value
P-value = 0.3481 > 0.05, so the null hypothesis is not rejected.
Chi-Square Test for Differences Among More Than Two Proportions
Hypotheses
Null hypothesis (H0): (All population proportions are equal)
Alternative hypothesis (H1): Not all are equal (for )
Test Statistic
The chi-square test statistic is:
Degrees of freedom:
Each cell should have expected frequency of at least 1.
Example: University Calendar Opinion
Opinion | Administrators | Students | Faculty | Total |
|---|---|---|---|---|
Favor | 63 | 20 | 37 | 120 |
Oppose | 37 | 30 | 13 | 80 |
Total | 100 | 50 | 50 | 200 |
Test statistic:
Critical value at , 2 d.f.:
Since , reject H0. There is evidence that at least one group has a different opinion.
P-value = 0.0017 < 0.05, so the null hypothesis is rejected.
The Marascuilo Procedure
Purpose and Steps
Used when the null hypothesis of equal proportions is rejected in a chi-square test.
Allows pairwise comparisons to determine which proportions differ significantly.
Calculate the observed differences for all pairs, and compare to a critical range.
Critical Range Formula
A pair of proportions is significantly different if
Example: Marascuilo Procedure
Opinion | Administrators | Students | Faculty | Total |
|---|---|---|---|---|
Favor | 63 | 20 | 37 | 120 |
Oppose | 37 | 30 | 13 | 80 |
Total | 100 | 50 | 50 | 200 |
At 1% significance, there is evidence of a difference in opinion between administrators & students and students & faculty.
Chi-Square Test of Independence
Definition and Hypotheses
Tests whether two categorical variables are independent (no relationship) or dependent (there is a relationship).
Null hypothesis (H0): The two categorical variables are independent.
Alternative hypothesis (H1): The two categorical variables are dependent.
Test Statistic
Degrees of freedom: , where r = number of rows, c = number of columns.
Expected Cell Frequencies
Where n = overall sample size.
Decision Rule
If , reject H0; otherwise, do not reject H0.
is the critical value from the chi-square distribution with degrees of freedom.
Example: Meal Plan and Class Standing
Class Standing | 20/week | 10/week | none | Total |
|---|---|---|---|---|
Fresh. | 24 | 32 | 14 | 70 |
Soph. | 22 | 26 | 12 | 60 |
Junior | 10 | 14 | 6 | 30 |
Senior | 14 | 16 | 10 | 40 |
Total | 70 | 88 | 42 | 200 |
Test statistic:
Critical value at , 6 d.f.:
P-value = 0.9943 > 0.05, so the null hypothesis is not rejected. There is not sufficient evidence of a relationship between meal plan and class standing.
Summary of Chapter 11
Application of the chi-square test for the difference between two proportions.
Application of the chi-square test for differences among more than two proportions.
Use of the Marascuilo procedure for pairwise comparisons after rejecting the null hypothesis of equal proportions.
Application of the chi-square test for independence between categorical variables.
Additional info: These notes expand on the original slides and text, providing full definitions, formulas, and context for each statistical procedure. All tables have been reconstructed and formulas provided in LaTeX format for clarity.