When performing hypothesis tests for two means with known population standard deviations, the process closely resembles tests where these values are unknown, with two main differences. Instead of using sample standard deviations, the known population standard deviations are used, and the test statistic follows a z distribution rather than a t distribution. This allows for a two-sample z-test to compare the means.
Consider a scenario where a grocery chain wants to determine if self-checkout lanes have shorter checkout times than cashier lanes. They collect independent random samples of 35 checkout times from each lane type. The sample means are 4.5 minutes for self-checkout and 6.4 minutes for cashier lanes. Prior knowledge provides population standard deviations of σ₁ = 1.1 and σ₂ = 1.4. The goal is to test the claim that self-checkout times are shorter, using a significance level of α = 0.05.
The hypothesis test begins by stating the null hypothesis H₀: μ₁ = μ₂, indicating no difference in mean checkout times, and the alternative hypothesis Hₐ: μ₁ < μ₂, suggesting self-checkout lanes are faster.
Next, the test statistic z is calculated using the formula:
\[ z = \frac{\bar{x}_1 - \bar{x}_2}{\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}} \]
Here, 𝑥̄₁ and 𝑥̄₂ are the sample means, σ₁ and σ₂ are the known population standard deviations, and n₁ and n₂ are the sample sizes. Substituting the values:
\[ z = \frac{4.5 - 6.4}{\sqrt{\frac{1.1^2}{35} + \frac{1.4^2}{35}}} \approx -6.31 \]
The resulting z-score indicates how many standard errors the observed difference in means is from zero under the null hypothesis.
To determine the p-value, which represents the probability of observing a z-score as extreme as -6.31 or more under the null hypothesis, one looks up the cumulative probability for z < -6.31. This p-value is approximately 1.37 × 10-10, an extremely small value.
Since the p-value is much less than the significance level α = 0.05, the null hypothesis is rejected. This provides strong evidence that the mean checkout time for self-checkout lanes is indeed shorter than for cashier lanes.
Before finalizing conclusions, it is essential to verify the assumptions of the test: the samples must be independent and randomly selected, and the sampling distribution of the difference in means should be approximately normal. With sample sizes greater than 30, the Central Limit Theorem ensures normality, satisfying these conditions.
Understanding how to conduct a two-sample z-test with known population standard deviations is crucial for accurately comparing means when such information is available. This method provides a reliable way to test claims about differences between two population means, leveraging the z distribution for hypothesis testing.
