BackLecture 23: Normal Distributions, Binomial Approximations, and Parameter Estimation
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Percentiles of the Normal Distribution
Understanding Percentiles
Percentiles are values below which a certain percentage of data in a normal distribution falls. They are commonly used to interpret scores and probabilities in statistics.
Percentile Definition: The kth percentile is the value below which k% of the data falls.
Finding Percentiles: To find the kth percentile of a normal distribution , use the formula:
Where is the z-score corresponding to the desired percentile (from the standard normal table).
Example: Calculating Percentiles
Given (so , , ):
95th percentile:
5th percentile:
Value such that :
Binomial Experiments and Normal Approximation
Characteristics of a Binomial Experiment
The experiment is performed n independent times (trials).
Each trial has two mutually exclusive outcomes: success or failure.
The probability of success, p, is the same for each trial.
Normal Approximation to the Binomial
When the number of trials n is large, the binomial distribution can be approximated by a normal distribution.
Rule of Thumb: If , the binomial distribution is approximately normal.
Parameters for Approximation:
This approximation simplifies calculations for large n.
Continuity Correction
Since the binomial is discrete and the normal is continuous, a continuity correction is applied when approximating binomial probabilities with the normal distribution.
To approximate , compute under the normal curve.
To approximate , compute .
To approximate , compute .
Exact Probability Using Binomial | Approximate Probability Using Normal | Graphical Depiction |
|---|---|---|
Area between and under normal curve | ||
Area left of | ||
Area right of |
Example: Normal Approximation
Suppose 7% of people have blood type O-negative. In a sample of 500, what is ?
Check (approximation valid).
,
Apply continuity correction:
Convert to z-score:
From z-table:
Automotive Example
35% of households have 3+ cars. In a sample of 400, what is ?
,
Apply continuity correction:
For : , ,
Parameter Estimation
Parameters and Statistics
Parameter: A number that describes a property of a population (usually unknown and constant).
Statistic: A number that describes a property of a sample (known, but varies from sample to sample).
Point Estimation and Margin of Error
Sample statistics are used as point estimates for unknown population parameters.
Sample statistics vary from sample to sample; this variability is quantified by the margin of error.
Margin of error provides a range of plausible values for the parameter, often associated with a confidence level (e.g., 95%).
Example: Margin of Error in Surveys
If a survey reports 41% ± 2.9%, we are 95% confident that the true proportion is between 38.1% and 43.9%.
Similarly, 49% ± 4.4% means the true proportion is between 44.6% and 53.4% with 95% confidence.
Sampling Distributions
Sampling Statistics as Random Variables
Statistics such as the sample mean are random variables because their values vary from sample to sample. The probability distribution of a statistic is called its sampling distribution.
Sampling Distribution of the Sample Mean
The sampling distribution of the sample mean is the probability distribution of all possible values of computed from samples of size from a population with mean and standard deviation .
The shape, center, and spread of the sampling distribution depend on the sample size and sampling design.
Procedure for Constructing a Sampling Distribution (for small N and n)
Specify the sample size and the sampling design (SRS with or without replacement, order matters or not).
List all possible random samples of size and their probabilities.
Compute the corresponding values of the sample statistic for each sample.
Compute the probabilities for the obtained values of the statistic.
Example: Sampling Distribution of the Sample Mean
Population: , ; sample size ; SRS with replacement, ordered.
There are possible samples, each with probability .
Sample | Value of | Probability |
|---|---|---|
(0,0) | 0 | 1/9 |
(0,6), (6,0) | 3 | 2/9 |
(0,9), (9,0) | 4.5 | 2/9 |
(6,6) | 6 | 1/9 |
(6,9), (9,6) | 7.5 | 2/9 |
(9,9) | 9 | 1/9 |
Compare with the population distribution:
0 | 1/3 |
6 | 1/3 |
9 | 1/3 |
Additional info: The sampling distribution of the sample mean is generally less variable than the population distribution, and its mean equals the population mean.