Hypothesis Testing for Population Proportions

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 15 (Part I): Testing Hypotheses About Population Proportions

Introduction to Hypothesis Testing

Hypothesis testing is a fundamental statistical procedure used to make inferences about population parameters based on sample data. In statistics, a hypothesis is a conjecture or claim about a population parameter, such as a mean or proportion. Hypotheses are typically stated in terms of parameters like the population mean (μ) or population proportion (p).

Example 1: Population proportion, $p = 0.3$
Example 2: Mean number of sick leave days taken by employees, $\mu = 8.7$
Example 3: Proportion of voters favoring candidate A, $p = 0.55$

Key Definitions

Null Hypothesis ($H_0$): A statement specifying the value of a population parameter. It is the hypothesis that is initially assumed to be true and is tested against the data.
Alternative Hypothesis ($H_a$): A statement that contradicts the null hypothesis. It represents the claim or suspicion that the researcher wants to test.

Steps in Hypothesis Testing

The process of hypothesis testing involves several key steps:

State the null hypothesis ($H_0$) and alternative hypothesis ($H_a$).
Collect sample data from the population.
Calculate the test statistic based on the sample data.
Determine the p-value or observed significance level.
Compare the p-value to the significance level ($\alpha$) to make a decision.
Interpret the results in the context of the problem.

Elements of a Test of Hypothesis

The null hypothesis, $H_0$
The alternative hypothesis, $H_a$
The decision rule
The test statistic
Calculation of the observed significance level, or p-value
Interpretation of the results

Test Statistic for Population Proportion

When testing a hypothesis about a population proportion, the test statistic is calculated as follows:

$z = \frac{\hat{p} - p}{\sqrt{\frac{p(1-p)}{n}}}$

$\hat{p}$ = sample proportion
$p$ = hypothesized population proportion
$n$ = sample size

Decision Rule and Significance Level

The decision rule specifies the conditions under which the null hypothesis will be rejected. The significance level ($\alpha$) is the probability of making a Type I error (rejecting $H_0$ when it is true). Common choices for $\alpha$ are 0.05 or 0.01.

Reject $H_0$ if the p-value < $\alpha$
Fail to reject $H_0$ if the p-value ≥ $\alpha$

Understanding the p-value

The p-value is the probability, assuming the null hypothesis is true, of obtaining a test statistic at least as extreme as the one observed. It quantifies the evidence against the null hypothesis.

For $H_0: p = 0.30$ vs $H_a: p < 0.30$, the p-value is the probability of getting a sample proportion less than or equal to the observed value.
For $H_0: p = 0.20$ vs $H_a: p > 0.20$, the p-value is the probability of getting a sample proportion greater than or equal to the observed value.
For $H_0: p = 0.30$ vs $H_a: p \neq 0.30$, the p-value is twice the probability of getting a sample proportion as extreme or more extreme than the observed value (in either direction).

Types of Errors in Hypothesis Testing

Type I Error: Rejecting the null hypothesis when it is actually true. The probability of a Type I error is $\alpha$.
Type II Error: Failing to reject the null hypothesis when the alternative hypothesis is true. The probability of a Type II error is denoted by $\beta$.

Identifying a z-Test of Hypothesis About p

The population of interest contains qualitative data; the parameter of interest is the population proportion $p$.
The objective is to determine if the sample data supports the claim that $p$ is greater than, less than, or not equal to a hypothesized value.
A large, random sample is collected, and the sample proportion $\hat{p}$ is calculated.

Elements of a z-Test of Hypothesis About p

Define the parameter $p$ in the context of the test.
State the hypotheses:
- $H_0: p = p_0$
- $H_a: p > p_0$, $p < p_0$, or $p \neq p_0$
Decision Rule: Reject $H_0$ if the p-value < $\alpha$
Test Statistic:
$z = \frac{\hat{p} - p_0}{\sqrt{\frac{p_0(1-p_0)}{n}}}$
Calculate the p-value
Make a decision
Interpret the results

Worked Example: Hypothesis Test for a Proportion

Scenario: A producer claims that 20% of all orange juice drinkers prefer its product. A competitor samples 200 drinkers and finds that 34 prefer the producer's brand. Test at a 0.05 significance level whether the competitor's claim (that the 20% figure is too large) is supported.

Parameter of interest: $p$ = proportion of all orange juice drinkers who prefer the producer's brand.
Null hypothesis: $H_0: p = 0.20$
Alternative hypothesis: $H_a: p < 0.20$
Sample proportion: $\hat{p} = \frac{34}{200} = 0.17$
Test statistic:
$z = \frac{0.17 - 0.20}{\sqrt{\frac{0.20 \times 0.80}{200}}}$
Calculate p-value: Find the probability that $z$ is less than the calculated value.
Decision: If p-value < 0.05, reject $H_0$; otherwise, do not reject $H_0$.
Interpretation: State whether there is sufficient evidence to support the competitor's claim.

Types of Hypothesis Tests for Proportions

Test Type	Null Hypothesis ($H_0$)	Alternative Hypothesis ($H_a$)	When to Use
Left-tailed	$p = p_0$	$p < p_0$	Testing if the proportion is less than a specified value
Right-tailed	$p = p_0$	$p > p_0$	Testing if the proportion is greater than a specified value
Two-tailed	$p = p_0$	$p \neq p_0$	Testing if the proportion is different from a specified value

Types of Errors: Example Application

Suppose an entrepreneur is considering opening a new restaurant if more than 15% of local residents are interested. A survey of 500 people finds 90 interested.

Type I Error: Concluding that more than 15% are interested (and opening the restaurant) when in fact 15% or fewer are interested.
Type II Error: Concluding that 15% or fewer are interested (and not opening the restaurant) when in fact more than 15% are interested.

Significance Level ($\alpha$): The entrepreneur should choose $\alpha$ based on the relative consequences of these errors. For example, if opening a restaurant is costly, a smaller $\alpha$ may be preferred to reduce the risk of a Type I error.

Confidence Interval for a Proportion

A confidence interval provides a range of plausible values for the population proportion. The 95% confidence interval for $p$ is given by:

$\hat{p} \pm z^* \sqrt{\frac{\hat{p}(1-\hat{p})}{n}}$

$z^*$ is the critical value from the standard normal distribution (for 95%, $z^* \approx 1.96$)
Interpretation: We are 95% confident that the true population proportion lies within this interval.

Example: If 48 out of 80 graduate students have purchased online, $\hat{p} = 0.6$. The 95% confidence interval is:

$0.6 \pm 1.96 \sqrt{\frac{0.6 \times 0.4}{80}}$

Calculate the lower and upper limits to interpret the interval.

Additional info: These notes cover the foundational concepts and procedures for hypothesis testing about population proportions, including the formulation of hypotheses, calculation of test statistics and p-values, types of errors, and confidence intervals. The examples and tables are designed to help students apply these concepts to real-world scenarios.