Chapter 9

Notes Practice Video lessons

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 9: Inferring Population Means

Learning Objectives

Understand the distribution of the sample mean and why it is a good estimator of the population mean.
Apply the Central Limit Theorem (CLT) for populations with Normal or any distribution.
Compute and interpret confidence intervals for a population mean.
Develop and conduct statistical hypothesis tests about the value of a population mean, distinguishing between hypothesis tests and confidence intervals.

Section 9.1: Sample Means of Random Samples

Three Characteristics of the Sample Mean

The sample mean (\( \bar{x} \)) is a random variable used to estimate the population mean (\( \mu \)). Its properties are:

Accuracy: On average, the sample mean equals the population mean (unbiased estimator).
Precision: The variability of the sample mean decreases as sample size increases.
Probability Distribution: The sample mean has its own distribution, called the sampling distribution.

Notation Review

\( \mu \): Population mean
\( \bar{x} \): Sample mean
\( \sigma \): Population standard deviation
\( s \): Sample standard deviation

Example: Accuracy and Precision

Accuracy: Sample mean is close to the population mean.
Precision: Sample means are close to each other (low variability).
Illustration: Target diagrams show combinations of accuracy and precision (e.g., shots clustered at the center are both accurate and precise).

Sampling Distribution of the Sample Mean

The sampling distribution is the probability distribution of all possible sample means from samples of a fixed size drawn from the population.

As sample size increases, the sampling distribution becomes more concentrated around the population mean.
The standard deviation of the sampling distribution (standard error) decreases as sample size increases.

Simulation Example

Sample	Q1	Q2	Q3	Q4	Q5	Average	Deviation from Mean
Sim#1	31	32	29	27	26	29.0	-1.3
Sim#2	36	35	34	32	31	33.6	3.3
...	Additional info: Table continues for other simulations, showing how sample means vary around the population mean.

Key Formulas

Bias of the sample mean:
Standard deviation of the sample mean (Standard Error):
Variance of the sample mean:

Example: Calculating Sample Variance

Population variance = 150, sample size = 30
Sample variance =

Section 9.2: Central Limit Theorem for Sample Means

The Central Limit Theorem (CLT): Conditions

The CLT states that, under certain conditions, the sampling distribution of the sample mean is approximately Normal, regardless of the population's distribution.

Random Sample: Observations are randomly selected and independent.
Normality: Either the population is Normal, or the sample size is large (usually ).
Big Population: If sampling without replacement, the population should be at least 10 times larger than the sample size.

CLT: Conclusions

The sampling distribution of is approximately Normal.
Mean of the sampling distribution:
Standard deviation (standard error):

Example: Estimating Probabilities with the CLT

Given: Mean number of cigarettes = 5, SD = 6, sample size = 1000
Standard error:
Probability calculations use the Normal distribution with mean 5 and SD 0.19.

Visualizing the CLT

As sample size increases, the sampling distribution becomes more Normal and less spread out.
Even for non-Normal populations, the sample mean's distribution approaches Normality as increases.

Section 9.3: Answering Questions About the Mean of a Population

Two Approaches

Confidence Intervals: Estimate parameter values.
Hypothesis Tests: Decide whether a parameter's value is one thing or another.

Confidence Intervals for a Population Mean

A confidence interval gives a range of plausible values for the population mean, along with a confidence level (e.g., 95%).

Conditions:
- Random sample
- Normality (population Normal or )
Interpretation: The confidence level is the long-run proportion of intervals that will contain the population mean if the procedure is repeated many times.

Margin of Error

Margin of error = (multiplier) × standard error
For unknown , use and the -distribution:

Confidence Interval Formula

is the critical value from the -distribution with degrees of freedom for the desired confidence level.

Example: Constructing a Confidence Interval

Sample mean = 5218, = 893, = 34, 95% CI
(from -table, df = 33)
Standard error:
CI:

Finding the Multiplier

Use -tables or technology for the appropriate degrees of freedom and confidence level.

Interpretation of the Confidence Level

We are 95% confident that the interval contains the true population mean.
We cannot say there is a 95% probability that the specific interval contains the mean; the confidence level refers to the method, not a particular interval.

Examples: Additional Calculations

Standard error for means:
t-critical values: e.g., for 95% CI with 29 df
Margin of error:

Summary Table: Confidence Intervals

Source	Mean	SD	n	CI
Sample	52.11	8.32	31	48.95, 55.41
Additional info: Table shows how to compute CI for different samples.

Hypothesis Testing for a Population Mean

The Four Steps in Hypothesis Testing

Hypothesis: State null () and alternative () hypotheses about the population mean.
Prepare: Choose the test statistic, state/check conditions, select significance level ().
Compute & Compare: Calculate the test statistic and p-value; compare to .
Interpret & Conclude: State the result in context; never "accept" , only fail to reject or reject.

Types of Hypotheses

Type	Null Hypothesis	Alternative Hypothesis
Two-Tailed
One-Tailed (Left)
One-Tailed (Right)

Test Statistics

z-test: Use when population standard deviation is known and CLT conditions are met.
t-test: Use when is unknown; use sample standard deviation and -distribution.

Level of Significance ()

Common values: 0.05, 0.01
Represents the probability of rejecting when it is true (Type I error).

p-Values

The probability, under , of obtaining a result as extreme or more extreme than the observed sample statistic.
If -value < , reject .

Example: Complete Hypothesis Test

Given: , sample mean , , ,
Calculate :
Find p-value (from -table or technology); compare to .
Interpret: If -value < 0.05, conclude mean years of experience has increased.

Additional Practice and Concept Checks

As sample size increases, the standard deviation of the sample mean decreases.
For a population with , , and , the standard deviation of the sample mean is .
CLT applies for for non-Normal populations.
Choosing one-tailed or two-tailed tests depends on the research question and alternative hypothesis.