Sampling Distributions: Concepts and Applications in Business Statistics

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 7: Sampling Distributions

Objectives

This chapter introduces the concept of sampling distributions, their properties, and their importance in statistical inference for business applications. Students will learn to:

Understand what a sampling distribution is.
Calculate probabilities related to the sample mean and the sample proportion.
Recognize the importance of the Central Limit Theorem (CLT).

Sampling Distributions

Definition and Importance

A sampling distribution is the probability distribution of a given statistic (such as the mean or proportion) based on a random sample of a specific size drawn from a population. It is fundamental in inferential statistics, as it allows us to estimate population parameters and assess the variability of sample statistics.

Sample Statistic: A numerical value calculated from a sample (e.g., sample mean \( \bar{X} \)).
Population Parameter: A numerical value that describes the entire population (e.g., population mean \( \mu \)).
Example: If you repeatedly sample 50 students from a college and calculate the mean GPA for each sample, the distribution of all these sample means forms the sampling distribution of the mean.

Developing a Sampling Distribution

Illustrative Example

Suppose we have a small population with the following characteristics:

Population size: N = 4
Variable of interest: Age (X)
Values: 18, 20, 22, 24 years

Population Summary Measures

Population Mean:
Population Variance:

Population Distribution Table

Value (X)	Probability P(X)
18	0.25
20	0.25
22	0.25
24	0.25

This is a uniform distribution since each value is equally likely.

All Possible Samples of Size n = 2 (with Replacement)

When sampling with replacement, each sample of size 2 can be any combination of the four values. There are 16 possible samples:

1st Observation	2nd Observation
18	18
18	20
18	22
18	24
20	18
20	20
20	22
20	24
22	18
22	20
22	22
22	24
24	18
24	20
24	22
24	24

Sample Means for Each Sample

Sample	Sample Mean (\( \bar{X} \))
18, 18	18
18, 20	19
18, 22	20
18, 24	21
20, 18	19
20, 20	20
20, 22	21
20, 24	22
22, 18	20
22, 20	21
22, 22	22
22, 24	23
24, 18	21
24, 20	22
24, 22	23
24, 24	24

Distribution of Sample Means

The sample means are not uniformly distributed, even though the original population is. This illustrates how the sampling distribution of the mean can differ in shape from the population distribution.

Summary Measures of the Sampling Distribution

Mean of the Sampling Distribution:
Variance of the Sampling Distribution: For this example, and .

Properties of the Sample Mean Sampling Distribution

Standard Error of the Mean

The standard error of the mean measures the variability of sample means around the population mean. It is calculated as:

As sample size n increases, the standard error decreases, making sample means more tightly clustered around the population mean.

If the Population is Normal

If the population is normally distributed with mean and standard deviation , then the sampling distribution of the sample mean is also normal, regardless of sample size.
Mean:
Standard Error:

Z-Value for the Sampling Distribution of the Mean

To standardize a sample mean and find probabilities, use:

This converts the sample mean to a standard normal variable.

Effect of Sample Size

Larger sample sizes result in smaller standard errors, making the sampling distribution narrower.
Smaller sample sizes result in greater variability among sample means.

Determining an Interval for a Fixed Proportion of Sample Means

To find an interval that contains a certain percentage (e.g., 95%) of sample means:

Use the formula:
For 95%, (from the standard normal table).
Example: If , , :
- Lower limit:
- Upper limit:
95% of sample means will fall between 362.12 and 373.88.

Central Limit Theorem (CLT)

Statement and Importance

The Central Limit Theorem states that, for a sufficiently large sample size, the sampling distribution of the sample mean will be approximately normal, regardless of the shape of the population distribution.

For most distributions, n > 30 is considered large enough.
For symmetric distributions, n > 15 may suffice.
If the population is normal, the sampling distribution is always normal, regardless of sample size.

Properties of the Sampling Distribution (Non-Normal Population)

Mean:
Standard Error:
As n increases, the distribution becomes more normal and the standard error decreases.

Sampling Distribution of the Sample Proportion

Definition and Properties

The sample proportion (\( p \)) estimates the proportion of the population (\( \pi \)) with a certain characteristic. For large samples, the sampling distribution of p is approximately normal.

Mean:
Standard Error:
Normal approximation is valid if and .

Z-Value for Proportions

Standardize the sample proportion using:

Example: Probability for a Sample Proportion

If the true proportion of voters supporting a proposition is and , what is ?
Calculate standard error:
Standardize endpoints:
From the standard normal table:

Chapter Summary

Sampling distributions describe the variability of sample statistics.
The standard error quantifies the spread of the sampling distribution.
The Central Limit Theorem allows normal approximation for sample means and proportions, facilitating probability calculations.
Understanding these concepts is essential for making inferences about populations from samples in business statistics.

1st Observation	2nd Observation
18	18
18	20
18	22
18	24
20	18
20	20
20	22
20	24
22	18
22	20
22	22
22	24
24	18
24	20
24	22
24	24

1st Observation	2nd Observation
18	18
18	20
18	22
18	24
20	18
20	20
20	22
20	24
22	18
22	20
22	22
22	24
24	18
24	20
24	22
24	24

1st Observation	2nd Observation
18	18
18	20
18	22
18	24
20	18
20	20
20	22
20	24
22	18
22	20
22	22
22	24
24	18
24	20
24	22
24	24