Sampling Distributions, Normal Approximation, and Probability in Statistics

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Sampling Distributions and the Central Limit Theorem

Definition and Properties

The sampling distribution of a statistic (such as the mean or proportion) describes the probability distribution of that statistic over all possible samples of a fixed size from a population. The Central Limit Theorem (CLT) states that, for sufficiently large sample sizes, the sampling distribution of the sample mean will be approximately normal, regardless of the population's distribution, provided the population has a finite mean and variance.

Sample Mean (): The average value from a sample of size .
Mean of Sampling Distribution: (population mean)
Standard Deviation of Sampling Distribution (Standard Error):
Normal Approximation: If the population is normal or , the sampling distribution of is approximately normal.

Example: If the height of 20-year-old American women is normally distributed with mean 64 inches and standard deviation 3 inches, the sampling distribution for the average height of 36 randomly chosen women will be normal with mean 64 inches and standard deviation inches.

Normal Distribution and Probability Calculations

Standard Normal Distribution

The standard normal distribution is a normal distribution with mean 0 and standard deviation 1. Probabilities for normal distributions are often calculated by converting values to z-scores:

Z-score formula:
Use standard normal tables or software to find probabilities associated with z-scores.

Example: To find for , calculate and use the normal table to find .

Binomial Distribution and Normal Approximation

Conditions for Normal Approximation

The binomial distribution models the number of successes in independent Bernoulli trials with probability of success. For large $n$, the binomial can be approximated by a normal distribution if:

and
Mean:
Standard deviation:

Example: For a sample of 49 customers with , and , so normal approximation is valid.

Probability Functions and R Commands

Using R for Probability Calculations

Statistical software such as R provides functions to compute probabilities for normal and binomial distributions:

pnorm(x, mean, sd): Computes for normal distribution.
qnorm(p, mean, sd): Computes the quantile (inverse cumulative probability).
1 - pnorm(x, mean, sd): Computes .

Example: pnorm(60, mean=54, sd=4) computes for .

Sampling Distribution of Proportion

Sample Proportion and Its Distribution

The sample proportion is the fraction of successes in a sample. Its sampling distribution is approximately normal if and :

Mean:
Standard deviation:

Example: For a sample of 620 customers with , , .

Application: Real-World Scenarios

Interpreting Statistical Results

Statistical methods are used to answer questions about populations based on sample data, such as estimating average blood pressure, average time spent on the internet, or the proportion of customers outside a city.

Always check conditions for normal approximation before applying it.
Use sample statistics to estimate population parameters and compute probabilities.

Tables: Summary of Key Formulas

Statistic	Mean	Standard Deviation	Distribution (if normal approx. valid)
Sample Mean ()			Normal
Sample Proportion ()			Normal
Binomial ()			Normal (if )

Additional info:

Some questions reference R code for probability calculations, which is common in statistics courses.
All examples and scenarios are typical applications of sampling distributions, normal approximation, and probability in introductory statistics.