Skip to main content
Back

Lecture 23: Normal Distributions, Binomial Approximations, and Parameter Estimation

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Percentiles of the Normal Distribution

Understanding Percentiles

Percentiles are values below which a certain percentage of data in a normal distribution falls. They are commonly used to interpret scores and probabilities in statistics.

  • Percentile Definition: The kth percentile is the value below which k% of the data falls.

  • Finding Percentiles: To find the kth percentile of a normal distribution , use the formula:

  • Where is the z-score corresponding to the desired percentile (from the standard normal table).

Example: Calculating Percentiles

  • Given (so , , ):

  • 95th percentile:

  • 5th percentile:

  • Value such that :

Binomial Experiments and Normal Approximation

Characteristics of a Binomial Experiment

  • The experiment is performed n independent times (trials).

  • Each trial has two mutually exclusive outcomes: success or failure.

  • The probability of success, p, is the same for each trial.

Normal Approximation to the Binomial

When the number of trials n is large, the binomial distribution can be approximated by a normal distribution.

  • Rule of Thumb: If , the binomial distribution is approximately normal.

  • Parameters for Approximation:

  • This approximation simplifies calculations for large n.

Continuity Correction

Since the binomial is discrete and the normal is continuous, a continuity correction is applied when approximating binomial probabilities with the normal distribution.

  • To approximate , compute under the normal curve.

  • To approximate , compute .

  • To approximate , compute .

Exact Probability Using Binomial

Approximate Probability Using Normal

Graphical Depiction

Area between and under normal curve

Area left of

Area right of

Example: Normal Approximation

  • Suppose 7% of people have blood type O-negative. In a sample of 500, what is ?

  • Check (approximation valid).

  • ,

  • Apply continuity correction:

  • Convert to z-score:

  • From z-table:

Automotive Example

  • 35% of households have 3+ cars. In a sample of 400, what is ?

  • ,

  • Apply continuity correction:

  • For : , ,

Parameter Estimation

Parameters and Statistics

  • Parameter: A number that describes a property of a population (usually unknown and constant).

  • Statistic: A number that describes a property of a sample (known, but varies from sample to sample).

Point Estimation and Margin of Error

  • Sample statistics are used as point estimates for unknown population parameters.

  • Sample statistics vary from sample to sample; this variability is quantified by the margin of error.

  • Margin of error provides a range of plausible values for the parameter, often associated with a confidence level (e.g., 95%).

Example: Margin of Error in Surveys

  • If a survey reports 41% ± 2.9%, we are 95% confident that the true proportion is between 38.1% and 43.9%.

  • Similarly, 49% ± 4.4% means the true proportion is between 44.6% and 53.4% with 95% confidence.

Sampling Distributions

Sampling Statistics as Random Variables

Statistics such as the sample mean are random variables because their values vary from sample to sample. The probability distribution of a statistic is called its sampling distribution.

Sampling Distribution of the Sample Mean

  • The sampling distribution of the sample mean is the probability distribution of all possible values of computed from samples of size from a population with mean and standard deviation .

  • The shape, center, and spread of the sampling distribution depend on the sample size and sampling design.

Procedure for Constructing a Sampling Distribution (for small N and n)

  1. Specify the sample size and the sampling design (SRS with or without replacement, order matters or not).

  2. List all possible random samples of size and their probabilities.

  3. Compute the corresponding values of the sample statistic for each sample.

  4. Compute the probabilities for the obtained values of the statistic.

Example: Sampling Distribution of the Sample Mean

  • Population: , ; sample size ; SRS with replacement, ordered.

  • There are possible samples, each with probability .

Sample

Value of

Probability

(0,0)

0

1/9

(0,6), (6,0)

3

2/9

(0,9), (9,0)

4.5

2/9

(6,6)

6

1/9

(6,9), (9,6)

7.5

2/9

(9,9)

9

1/9

Compare with the population distribution:

0

1/3

6

1/3

9

1/3

Additional info: The sampling distribution of the sample mean is generally less variable than the population distribution, and its mean equals the population mean.

Pearson Logo

Study Prep