Inferences Comparing Two Population Means: Independent and Dependent Samples

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Inferences Comparing Two Population Means

Introduction

Statistical inference about two population means is a fundamental topic in inferential statistics. It involves comparing the means of two populations using sample data, which can be either independent or dependent. The choice of method depends on the relationship between the samples.

Types of Samples

Independent Samples

Two samples are independent if the selection of subjects from one population is unrelated to the selection from the other population.

Example: Comparing test scores from students in two different schools.

Dependent Samples (Paired Samples)

Samples are dependent if each subject in one sample is paired with a subject in the other sample, often forming similar pairs of objects.

Example: Measuring blood pressure before and after treatment in the same individuals.

Inferences About Two Population Means: Independent Samples

Notation and Parameters

Population I	Population II
= Population mean = Population Std. Dev.	= Population mean = Population Std. Dev.
Sample I	Sample II
= Sample mean = Sample Std. Dev. = Sample size	= Sample mean = Sample Std. Dev. = Sample size

Conditions for Using the t-Distribution

Two random samples must be selected, one from each population of interest.
The samples must be independent.
Either both populations must be normally distributed, or both sample sizes (, ) must be at least 30.
The population standard deviations (, ) are unknown.

Sampling Distribution of

Mean:
Standard Error: Note: In practice, use sample standard deviations and .

Test Statistic for Independent Samples

When population standard deviations are unknown:

Test Statistic:

Confidence Interval for

Formula:
is the critical value from the t-distribution.

Elements of a t-Test of Hypothesis

Null Hypothesis ():
Alternative Hypothesis (): , , or
Decision Rule: If p-value < , reject .

Example Application: Braces Study

A laboratory compares the mean wearing times for standard braces and new bands in children. Forty children are randomly assigned to each group. The number of days required for correction is recorded. The goal is to determine if the new bands require less time on average, using a significance level of 0.01.

Type I Error: Concluding the new bands are different when they are not.
Type II Error: Failing to detect a difference when one exists.
Confidence Interval: Estimate the difference in mean wearing time with a specified level of confidence.

Inferences About Two Population Means: Dependent Samples

Notation and Parameters

Population of Differences
= Population mean of differences = Population Std. Dev. of differences
Sample of Differences
= Sample mean of differences = Sample Std. Dev. of differences = Sample size

Conditions for Using the t-Statistic

A random sample of paired observations must be selected.
The population of differences must be normally distributed, or the sample size must be at least 30.
The population standard deviation is unknown.

Test Statistic for Dependent Samples

Test Statistic:

Confidence Interval for

Formula:
is the critical value from the t-distribution.

Elements of a t-Test of Hypothesis

Null Hypothesis ():
Alternative Hypothesis (): , , or
Decision Rule: If p-value < , reject .

Example Application: Biofeedback Blood Pressure Study

Researchers measure blood pressure in subjects before and after biofeedback training. The difference in measurements is analyzed to determine if training reduces blood pressure, using a paired t-test and a 95% confidence interval to estimate the mean reduction.

Statistical Analysis Using Software

Interpreting Output Tables

Statistical software (e.g., StatCrunch) provides hypothesis test results and confidence intervals. The following table summarizes typical output:

Hypothesis Test Results	90% Confidence Interval Results
Difference Sample Diff. Std. Err. DF T-Stat P-value	Difference Sample Diff. Std. Err. DF L Limit U Limit
1.3 1.2931443 13.029 1.0050316 0.166	1.3 1.2931443 13.029 -0.97875273 3.5787527

Additional info: The table above is inferred from the context and typical StatCrunch output for two-sample t-tests.

Steps in Hypothesis Testing

State the parameters of interest.
Formulate null and alternative hypotheses.
State the decision rule (based on significance level and p-value).
Calculate the test statistic and p-value.
Make the decision (reject or fail to reject ).
Interpret the results in context.

Types of Errors

Type I Error: Incorrectly rejecting a true null hypothesis.
Type II Error: Failing to reject a false null hypothesis.
Application: In the context of work time lost due to sickness, a Type I error would mean concluding a difference exists when it does not.

Summary Table: Independent vs. Dependent Samples

Feature	Independent Samples	Dependent Samples
Sample Relationship	Unrelated	Paired/Matched
Test Statistic
Confidence Interval
Typical Application	Comparing two groups	Before-and-after studies

Additional info: Table structure and content inferred for clarity and completeness.