BackSampling Distributions, Normal Approximation, and Probability in Statistics
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Sampling Distributions and the Central Limit Theorem
Definition and Properties
The sampling distribution of a statistic (such as the mean or proportion) describes the probability distribution of that statistic over all possible samples of a fixed size from a population. The Central Limit Theorem (CLT) states that, for sufficiently large sample sizes, the sampling distribution of the sample mean will be approximately normal, regardless of the population's distribution, provided the population has a finite mean and variance.
Sample Mean (): The average value from a sample of size .
Mean of Sampling Distribution: (population mean)
Standard Deviation of Sampling Distribution (Standard Error):
Normal Approximation: If the population is normal or , the sampling distribution of is approximately normal.
Example: If the height of 20-year-old American women is normally distributed with mean 64 inches and standard deviation 3 inches, the sampling distribution for the average height of 36 randomly chosen women will be normal with mean 64 inches and standard deviation inches.
Normal Distribution and Probability Calculations
Standard Normal Distribution
The standard normal distribution is a normal distribution with mean 0 and standard deviation 1. Probabilities for normal distributions are often calculated by converting values to z-scores:
Z-score formula:
Use standard normal tables or software to find probabilities associated with z-scores.
Example: To find for , calculate and use the normal table to find .
Binomial Distribution and Normal Approximation
Conditions for Normal Approximation
The binomial distribution models the number of successes in independent Bernoulli trials with probability of success. For large $n$, the binomial can be approximated by a normal distribution if:
and
Mean:
Standard deviation:
Example: For a sample of 49 customers with , and , so normal approximation is valid.
Probability Functions and R Commands
Using R for Probability Calculations
Statistical software such as R provides functions to compute probabilities for normal and binomial distributions:
pnorm(x, mean, sd): Computes for normal distribution.
qnorm(p, mean, sd): Computes the quantile (inverse cumulative probability).
1 - pnorm(x, mean, sd): Computes .
Example: pnorm(60, mean=54, sd=4) computes for .
Sampling Distribution of Proportion
Sample Proportion and Its Distribution
The sample proportion is the fraction of successes in a sample. Its sampling distribution is approximately normal if and :
Mean:
Standard deviation:
Example: For a sample of 620 customers with , , .
Application: Real-World Scenarios
Interpreting Statistical Results
Statistical methods are used to answer questions about populations based on sample data, such as estimating average blood pressure, average time spent on the internet, or the proportion of customers outside a city.
Always check conditions for normal approximation before applying it.
Use sample statistics to estimate population parameters and compute probabilities.
Tables: Summary of Key Formulas
Statistic | Mean | Standard Deviation | Distribution (if normal approx. valid) |
|---|---|---|---|
Sample Mean () | Normal | ||
Sample Proportion () | Normal | ||
Binomial () | Normal (if ) |
Additional info:
Some questions reference R code for probability calculations, which is common in statistics courses.
All examples and scenarios are typical applications of sampling distributions, normal approximation, and probability in introductory statistics.