BackTesting Hypotheses About Proportions: One-Proportion z-Test
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Chapters 16 & 17: Testing Hypotheses About Proportions
Motivating Example
Hypothesis testing for proportions is a fundamental statistical method used to determine whether an observed difference in sample proportions reflects a true difference in the population or is simply due to chance variation. For example, a manufacturing process with a historical defect rate of 6% is evaluated after a process change. A random sample of 300 machines produced under the new process reveals a defect rate of 5% (15 out of 300). The question is whether this reduction is statistically significant.
Population parameter of interest: The true proportion of defective items produced by the new process.
Statistical question: Is the observed reduction in defect rate real, or could it be due to random sampling variation?
Fundamentals of Hypothesis Testing
Definition of Hypothesis
In statistics, a hypothesis is a statement or claim about a population parameter (a numerical characteristic of a population). Hypothesis tests are used to assess these claims.
Null and Alternative Hypotheses
Null hypothesis (): A statement about the value of a population parameter, typically representing no effect or no difference. For a population proportion , the null hypothesis is usually , where is a specified value.
Alternative hypothesis (): A statement that contradicts the null hypothesis, suggesting that the observed difference is real. It can take three forms:
(two-sided, two-tailed test)
(one-sided, right-tailed test)
(one-sided, left-tailed test)
One-Proportion z-Test
The hypothesis test for a population proportion is called a one-proportion z-test.
Procedure for Hypothesis Testing
Identify the population and parameter of interest.
Set up the hypotheses: Null () and Alternative ().
Compute a test statistic based on the sample data.
Compute the P-value associated with the test statistic.
Draw a conclusion based on the P-value and the chosen significance level ().
Sampling Distribution and Test Statistic
For large samples, the sampling distribution of the sample proportion is approximately Normal with mean and standard deviation .
Assuming is true (), the null model for is:
Conditions for Normal approximation: and .
Calculating the Test Statistic
The test statistic for a population proportion is the z-score:
This measures how many standard deviations the observed sample proportion is from the hypothesized population proportion under .
P-Value and Statistical Significance
The P-value is the probability, assuming is true, of obtaining a test statistic as extreme or more extreme than the observed value.
For two-sided tests, the P-value is the area in both tails beyond .
For one-sided tests, the P-value is the area in the relevant tail (right or left) beyond .
A small P-value (less than ) suggests strong evidence against .
Decision and Conclusion
Significance level (): Common values are 0.01, 0.05, and 0.10. This threshold is set before examining the data.
If P-value , reject . The result is statistically significant.
If P-value , do not reject . The result is not statistically significant.
Never say "accept " or "prove is true"; we only gather evidence for or against .
Type I and Type II Errors
Type I error: Rejecting when is true (false positive).
Type II error: Failing to reject when is false (false negative).
A small probability of both errors is desirable.
Decision | true | true |
|---|---|---|
Reject | Type I error | Correct decision |
Do not reject | Correct decision | Type II error |
Example Application
Parameter of interest: Proportion of defective items with the new process ().
Null hypothesis:
Alternative hypothesis: (testing for a reduction in defect rate)
Sample data: ,
Test statistic:
P-value: Area to the left of the calculated in the standard normal distribution (for left-tailed test).
Conclusion: Compare P-value to (e.g., 0.10) to decide whether to reject .
Summary of Steps in Hypothesis Testing
Identify the population and parameter of interest.
Set up the null and alternative hypotheses.
Compute the test statistic.
Compute the P-value.
Draw a conclusion based on the P-value and significance level.
Additional info: These notes provide a comprehensive overview of the one-proportion z-test, including the logic of hypothesis testing, calculation of test statistics, interpretation of P-values, and the importance of error types in statistical inference.