BackNormal Distribution and Z-Scores: Study Notes for Statistics
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Normal Distribution
Introduction to the Normal Distribution
The normal distribution is a fundamental probability distribution in statistics, characterized by its symmetric, bell-shaped curve. It is widely used to model real-world phenomena such as heights, test scores, and measurement errors.
Definition: A continuous probability distribution defined by its mean () and standard deviation ().
Properties:
Symmetric about the mean
Mean, median, and mode are equal
Asymptotic: tails approach but never touch the horizontal axis
Total area under the curve is 1
Probability Density Function (PDF):
Parameters:
= mean (center of the distribution)
= standard deviation (spread of the distribution)
Example: Heights of adult males in a population often follow a normal distribution with a mean of 70 inches and a standard deviation of 3 inches.
Standard Normal Distribution
The standard normal distribution is a special case of the normal distribution with a mean of 0 and a standard deviation of 1.
Standardization: Any normal variable can be transformed to the standard normal variable using:
Purpose: Allows comparison across different normal distributions and use of standard normal tables.
Notation:
Example: If , , , then .
Z-Scores and Probability Calculations
Understanding Z-Scores
A Z-score indicates how many standard deviations an element is from the mean. It is used to determine the relative position of a value within a normal distribution.
Formula:
Interpretation:
Z > 0: Value is above the mean
Z < 0: Value is below the mean
Z = 0: Value equals the mean
Example: A test score of 85 with a mean of 80 and standard deviation of 5 has .
Using Z-Tables
Z-tables (standard normal tables) provide the area (probability) to the left of a given Z-score under the standard normal curve.
Steps to Use Z-Tables:
Calculate the Z-score for your value.
Locate the Z-score in the table to find the corresponding probability.
For probabilities to the right, subtract the table value from 1.
Applications:
Finding probabilities for ranges of values
Determining percentiles
Comparing scores from different distributions
Example: For , the table value is approximately 0.8944, meaning 89.44% of values are below this Z-score.
Empirical Rule (68-95-99.7 Rule)
Overview of the Empirical Rule
The Empirical Rule describes the percentage of data within certain standard deviations of the mean in a normal distribution.
Approximately 68% of data falls within 1 standard deviation ()
Approximately 95% within 2 standard deviations ()
Approximately 99.7% within 3 standard deviations ()
Example: If the mean test score is 80 and the standard deviation is 5, about 95% of scores are between 70 and 90.
Comparing Normal Distributions
Shape, Center, and Spread
Normal distributions can differ in their mean (center) and standard deviation (spread), but all share the same general bell-shaped curve.
Mean (): Determines the center of the distribution.
Standard Deviation (): Determines the width of the curve; larger means more spread out.
Comparisons:
Distributions with the same but different have the same center but different spreads.
Distributions with different are centered at different locations.
Example: SAT and ACT scores can be compared using Z-scores, even though they have different means and standard deviations.
HTML Table: Empirical Rule Summary
Interval | Percentage of Data |
|---|---|
68% | |
95% | |
99.7% |
HTML Table: Example Z-Table Entries
Z-Score | Area to the Left |
|---|---|
0.00 | 0.5000 |
1.00 | 0.8413 |
1.96 | 0.9750 |
2.00 | 0.9772 |
-1.00 | 0.1587 |
Applications of the Normal Distribution
Real-World Uses
The normal distribution is used in various fields, including psychology, education, and natural sciences, to model continuous data and make statistical inferences.
Standardized testing (SAT, ACT)
Quality control in manufacturing
Measurement errors in experiments
Example: Determining the probability that a randomly selected person is taller than 6 feet, given the population mean and standard deviation.
Summary
The normal distribution is a cornerstone of statistical analysis, enabling probability calculations, data comparisons, and inferential statistics through the use of Z-scores and standard normal tables. Mastery of these concepts is essential for success in college-level statistics.
Additional info: Some explanations and table entries were inferred and expanded for completeness and clarity.