BackChapter 9
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Chapter 9: Inferring Population Means
Learning Objectives
Understand the distribution of the sample mean and why it is a good estimator of the population mean.
Apply the Central Limit Theorem (CLT) for populations with Normal or any distribution.
Compute and interpret confidence intervals for a population mean.
Develop and conduct statistical hypothesis tests about the value of a population mean, distinguishing between hypothesis tests and confidence intervals.
Section 9.1: Sample Means of Random Samples
Three Characteristics of the Sample Mean
The sample mean (\( \bar{x} \)) is a random variable used to estimate the population mean (\( \mu \)). Its properties are:
Accuracy: On average, the sample mean equals the population mean (unbiased estimator).
Precision: The variability of the sample mean decreases as sample size increases.
Probability Distribution: The sample mean has its own distribution, called the sampling distribution.
Notation Review
\( \mu \): Population mean
\( \bar{x} \): Sample mean
\( \sigma \): Population standard deviation
\( s \): Sample standard deviation
Example: Accuracy and Precision
Accuracy: Sample mean is close to the population mean.
Precision: Sample means are close to each other (low variability).
Illustration: Target diagrams show combinations of accuracy and precision (e.g., shots clustered at the center are both accurate and precise).
Sampling Distribution of the Sample Mean
The sampling distribution is the probability distribution of all possible sample means from samples of a fixed size drawn from the population.
As sample size increases, the sampling distribution becomes more concentrated around the population mean.
The standard deviation of the sampling distribution (standard error) decreases as sample size increases.
Simulation Example
Sample | Q1 | Q2 | Q3 | Q4 | Q5 | Average | Deviation from Mean |
|---|---|---|---|---|---|---|---|
Sim#1 | 31 | 32 | 29 | 27 | 26 | 29.0 | -1.3 |
Sim#2 | 36 | 35 | 34 | 32 | 31 | 33.6 | 3.3 |
... | Additional info: Table continues for other simulations, showing how sample means vary around the population mean. | ||||||
Key Formulas
Bias of the sample mean:
Standard deviation of the sample mean (Standard Error):
Variance of the sample mean:
Example: Calculating Sample Variance
Population variance = 150, sample size = 30
Sample variance =
Section 9.2: Central Limit Theorem for Sample Means
The Central Limit Theorem (CLT): Conditions
The CLT states that, under certain conditions, the sampling distribution of the sample mean is approximately Normal, regardless of the population's distribution.
Random Sample: Observations are randomly selected and independent.
Normality: Either the population is Normal, or the sample size is large (usually ).
Big Population: If sampling without replacement, the population should be at least 10 times larger than the sample size.
CLT: Conclusions
The sampling distribution of is approximately Normal.
Mean of the sampling distribution:
Standard deviation (standard error):
Example: Estimating Probabilities with the CLT
Given: Mean number of cigarettes = 5, SD = 6, sample size = 1000
Standard error:
Probability calculations use the Normal distribution with mean 5 and SD 0.19.
Visualizing the CLT
As sample size increases, the sampling distribution becomes more Normal and less spread out.
Even for non-Normal populations, the sample mean's distribution approaches Normality as increases.
Section 9.3: Answering Questions About the Mean of a Population
Two Approaches
Confidence Intervals: Estimate parameter values.
Hypothesis Tests: Decide whether a parameter's value is one thing or another.
Confidence Intervals for a Population Mean
A confidence interval gives a range of plausible values for the population mean, along with a confidence level (e.g., 95%).
Conditions:
Random sample
Normality (population Normal or )
Interpretation: The confidence level is the long-run proportion of intervals that will contain the population mean if the procedure is repeated many times.
Margin of Error
Margin of error = (multiplier) × standard error
For unknown , use and the -distribution:
Confidence Interval Formula
is the critical value from the -distribution with degrees of freedom for the desired confidence level.
Example: Constructing a Confidence Interval
Sample mean = 5218, = 893, = 34, 95% CI
(from -table, df = 33)
Standard error:
CI:
Finding the Multiplier
Use -tables or technology for the appropriate degrees of freedom and confidence level.
Interpretation of the Confidence Level
We are 95% confident that the interval contains the true population mean.
We cannot say there is a 95% probability that the specific interval contains the mean; the confidence level refers to the method, not a particular interval.
Examples: Additional Calculations
Standard error for means:
t-critical values: e.g., for 95% CI with 29 df
Margin of error:
Summary Table: Confidence Intervals
Source | Mean | SD | n | CI |
|---|---|---|---|---|
Sample | 52.11 | 8.32 | 31 | 48.95, 55.41 |
Additional info: Table shows how to compute CI for different samples. | ||||
Hypothesis Testing for a Population Mean
The Four Steps in Hypothesis Testing
Hypothesis: State null () and alternative () hypotheses about the population mean.
Prepare: Choose the test statistic, state/check conditions, select significance level ().
Compute & Compare: Calculate the test statistic and p-value; compare to .
Interpret & Conclude: State the result in context; never "accept" , only fail to reject or reject.
Types of Hypotheses
Type | Null Hypothesis | Alternative Hypothesis |
|---|---|---|
Two-Tailed | ||
One-Tailed (Left) | ||
One-Tailed (Right) |
Test Statistics
z-test: Use when population standard deviation is known and CLT conditions are met.
t-test: Use when is unknown; use sample standard deviation and -distribution.
Level of Significance ()
Common values: 0.05, 0.01
Represents the probability of rejecting when it is true (Type I error).
p-Values
The probability, under , of obtaining a result as extreme or more extreme than the observed sample statistic.
If -value < , reject .
Example: Complete Hypothesis Test
Given: , sample mean , , ,
Calculate :
Find p-value (from -table or technology); compare to .
Interpret: If -value < 0.05, conclude mean years of experience has increased.
Additional Practice and Concept Checks
As sample size increases, the standard deviation of the sample mean decreases.
For a population with , , and , the standard deviation of the sample mean is .
CLT applies for for non-Normal populations.
Choosing one-tailed or two-tailed tests depends on the research question and alternative hypothesis.