Normal Distribution and Z-Scores: Study Notes for Statistics

Normal Distribution

Introduction to the Normal Distribution

The normal distribution is a fundamental probability distribution in statistics, characterized by its symmetric, bell-shaped curve. It is widely used to model real-world phenomena such as heights, test scores, and measurement errors.

Definition: A continuous probability distribution defined by its mean ($\mu$) and standard deviation ($\sigma$).
Properties:
- Symmetric about the mean
- Mean, median, and mode are equal
- Asymptotic: tails approach but never touch the horizontal axis
- Total area under the curve is 1
Probability Density Function (PDF):

$$f(x) = \frac{1}{\sigma\sqrt{2\pi}} \, e^{-\frac{(x - \mu)^2}{2\sigma^2}}$$

Parameters:
- $\mu$ = mean (center of the distribution)
- $\sigma$ = standard deviation (spread of the distribution)

Example: Heights of adult males in a population often follow a normal distribution with a mean of 70 inches and a standard deviation of 3 inches.
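
The normal density can be evaluated directly from its mean and standard deviation. The following is a minimal sketch using the heights example above; the function name `normal_pdf` is illustrative and does not come from the notes.

```python
import math

def normal_pdf(x, mu, sigma):
    """Density of a normal distribution with mean mu and standard deviation sigma."""
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    coeff = 1.0 / (sigma * math.sqrt(2.0 * math.pi))
    exponent = -((x - mu) ** 2) / (2.0 * sigma ** 2)
    return coeff * math.exp(exponent)

# Heights example from the notes: mean 70 inches, standard deviation 3 inches.
peak = normal_pdf(70, 70, 3)    # the density is highest at the mean
left = normal_pdf(67, 70, 3)    # one standard deviation below the mean
right = normal_pdf(73, 70, 3)   # one standard deviation above the mean
print(peak, left, right)
```

Because the curve is symmetric about the mean, `left` and `right` are equal, and both are smaller than `peak`.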

Standard Normal Distribution

The standard normal distribution is a special case of the normal distribution with a mean of 0 and a standard deviation of 1.

Standardization: Any normal variable $X$ can be transformed to the standard normal variable $Z$ using:

$$Z = \frac{X - \mu}{\sigma}$$

Purpose: Allows comparison across different normal distributions and use of standard normal tables.
Notation: $Z \sim N(0, 1)$

Example: If , , , then .

Z-Scores and Probability Calculations

Understanding Z-Scores

A Z-score indicates how many standard deviations a value lies above or below the mean. It is used to determine the relative position of a value within a normal distribution.

Formula: $Z = \frac{X - \mu}{\sigma}$
Interpretation:
- Z > 0: Value is above the mean
- Z < 0: Value is below the mean
- Z = 0: Value equals the mean

Example: A test score of 85 with a mean of 80 and standard deviation of 5 has $Z = \frac{85 - 80}{5} = 1$.
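
The Z-score calculation and its sign interpretation can be sketched in a few lines of Python. The helper name `z_score` and the extra scores 75 and 80 are illustrative additions; only the score of 85 comes from the example above.

```python
def z_score(x, mu, sigma):
    """Number of standard deviations that x lies above (+) or below (-) the mean."""
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    return (x - mu) / sigma

z_above = z_score(85, 80, 5)   # test-score example from the notes
z_below = z_score(75, 80, 5)   # a score below the mean gives a negative Z
z_mean = z_score(80, 80, 5)    # a score equal to the mean gives Z = 0
print(z_above, z_below, z_mean)  # 1.0 -1.0 0.0
```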

Using Z-Tables

Z-tables (standard normal tables) provide the area (probability) to the left of a given Z-score under the standard normal curve.

Steps to Use Z-Tables:
1. Calculate the Z-score for your value.
2. Locate the Z-score in the table to find the corresponding probability.
3. For probabilities to the right, subtract the table value from 1.
Applications:
- Finding probabilities for ranges of values
- Determining percentiles
- Comparing scores from different distributions

Example: For $Z = 1.25$, the table value is approximately 0.8944, meaning about 89.44% of values are below this Z-score.
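
A Z-table lookup can be reproduced in code, since the area to the left of $z$ equals $\tfrac{1}{2}\left(1 + \operatorname{erf}(z/\sqrt{2})\right)$. The sketch below assumes a Z-score of 1.25, whose left-tail area is about 0.8944; the function name `phi` is illustrative.

```python
import math

def phi(z):
    """Area to the left of z under the standard normal curve."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

left_area = phi(1.25)               # what a Z-table reports for Z = 1.25
right_area = 1.0 - phi(1.25)        # area to the right: subtract from 1
between = phi(1.25) - phi(-1.25)    # area between -1.25 and 1.25
print(round(left_area, 4), round(right_area, 4), round(between, 4))
```

The same subtraction pattern covers any range: take the left-tail area at the upper bound and subtract the left-tail area at the lower bound.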

Empirical Rule (68-95-99.7 Rule)

Overview of the Empirical Rule

The Empirical Rule describes the percentage of data within certain standard deviations of the mean in a normal distribution.

Approximately 68% of data falls within 1 standard deviation ($\mu \pm \sigma$)
Approximately 95% within 2 standard deviations ($\mu \pm 2\sigma$)
Approximately 99.7% within 3 standard deviations ($\mu \pm 3\sigma$)

Example: If the mean test score is 80 and the standard deviation is 5, about 95% of scores are between 70 and 90.
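
The three percentages can be checked numerically. This sketch relies on the identity that the probability of falling within $k$ standard deviations of the mean is $\operatorname{erf}(k/\sqrt{2})$; the function name `area_within` is illustrative.

```python
import math

def area_within(k):
    """Probability that a normal value lies within k standard deviations of its mean."""
    return math.erf(k / math.sqrt(2.0))

for k in (1, 2, 3):
    print(k, round(area_within(k), 4))

# Test-score example from the notes: mean 80, standard deviation 5.
mu, sigma = 80, 5
low, high = mu - 2 * sigma, mu + 2 * sigma
print(low, high)  # 70 90
```

The exact areas are slightly different from the rounded 68%, 95%, and 99.7% figures, which is why the rule is stated as an approximation.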

Comparing Normal Distributions

Shape, Center, and Spread

Normal distributions can differ in their mean (center) and standard deviation (spread), but all share the same general bell-shaped curve.

Mean ($\mu$): Determines the center of the distribution.
Standard Deviation ($\sigma$): Determines the width of the curve; a larger $\sigma$ means a more spread-out curve.
Comparisons:
- Distributions with the same $\mu$ but different $\sigma$ have the same center but different spreads.
- Distributions with different $\mu$ are centered at different locations.

Example: SAT and ACT scores can be compared using Z-scores, even though they have different means and standard deviations.
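
A cross-test comparison might look like the sketch below. All means, standard deviations, and scores here are hypothetical values invented for illustration; they are not real SAT or ACT statistics, and the helper `z_score` is likewise an assumed name.

```python
def z_score(x, mu, sigma):
    """Standardize x against a distribution with mean mu and standard deviation sigma."""
    return (x - mu) / sigma

# HYPOTHETICAL parameters and scores, invented for illustration only;
# they are not real SAT or ACT statistics.
sat_mean, sat_sd = 1000, 200
act_mean, act_sd = 20, 5

sat_z = z_score(1300, sat_mean, sat_sd)  # 300 / 200
act_z = z_score(29, act_mean, act_sd)    # 9 / 5

# The larger Z-score is the stronger result relative to its own distribution.
stronger = "ACT" if act_z > sat_z else "SAT"
print(sat_z, act_z, stronger)  # 1.5 1.8 ACT
```

Under these assumed numbers the ACT score sits further above its mean in standard-deviation units, even though the raw scores are on very different scales.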

Empirical Rule Summary

Interval	Percentage of Data
$\mu \pm \sigma$	68%
$\mu \pm 2\sigma$	95%
$\mu \pm 3\sigma$	99.7%

Example Z-Table Entries

Z-Score	Area to the Left
0.00	0.5000
1.00	0.8413
1.96	0.9750
2.00	0.9772
-1.00	0.1587

Applications of the Normal Distribution

Real-World Uses

The normal distribution is used in various fields, including psychology, education, and natural sciences, to model continuous data and make statistical inferences.

Standardized testing (SAT, ACT)
Quality control in manufacturing
Measurement errors in experiments

Example: Determining the probability that a randomly selected person is taller than 6 feet, given the population mean and standard deviation.
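
This kind of right-tail probability can be sketched with `statistics.NormalDist` from the Python standard library. The sketch assumes the population is the adult-male heights example from earlier in these notes (mean 70 inches, standard deviation 3 inches); a different population would need its own parameters.

```python
from statistics import NormalDist

# ASSUMPTION: reuse the adult-male heights example (mean 70 in, SD 3 in).
heights = NormalDist(mu=70, sigma=3)
six_feet = 72  # 6 feet expressed in inches

z = (six_feet - heights.mean) / heights.stdev   # standardize the cutoff
p_shorter = heights.cdf(six_feet)               # area to the left of 72 inches
p_taller = 1 - p_shorter                        # area to the right
print(round(z, 3), round(p_taller, 4))
```

Under these assumptions, roughly a quarter of the population is taller than 6 feet.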

Summary

The normal distribution is a cornerstone of statistical analysis, enabling probability calculations, data comparisons, and inferential statistics through the use of Z-scores and standard normal tables. Mastery of these concepts is essential for success in college-level statistics.
