BackStatistics Study Guide: Normal Distribution, Sampling, and Standard Error
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Normal Distribution and Related Concepts
Standard Normal Distribution
The standard normal distribution is a special case of the normal distribution with a mean of 0 and a standard deviation of 1. It is used to calculate probabilities and percentiles for normally distributed data using the Z-table.
Definition: A normal distribution with mean and standard deviation .
Z-score: The number of standard deviations a value is from the mean. Calculated as .
Empirical Rule: Approximately 68% of data falls within 1 standard deviation, 95% within 2, and 99.7% within 3.
Probability Calculation: Use the Z-table to find the probability that a value falls within a certain range.
Example: For , , the probability that falls between and is found using the Z-table.
Z-scores and Percentiles
Z-scores standardize values from any normal distribution, allowing comparison and probability calculation.
Formula:
Percentile: The percentage of data below a given value. Find by looking up the Z-score in the Z-table.
Example: If the mean height of women is 65 inches () and the standard deviation is 3.5 inches (), the Z-score for 62 inches is . The percentile is found by looking up in the Z-table, which gives approximately 19.4%.
Calculating Probabilities Using the Z-table
To find the probability that an observation falls within a certain region, use the Z-table to look up the area under the curve for the relevant Z-score(s).
Single Value: Probability to the left of is the cumulative probability.
Between Two Values: Subtract the cumulative probability of the lower Z-score from the higher Z-score.
Example: Probability that falls between and is .
Sampling Distributions and the Central Limit Theorem
Sampling Distribution of the Mean
The sampling distribution of the mean describes the distribution of sample means from repeated samples of the same size from a population.
Mean of Sampling Distribution: Equal to the population mean .
Standard Error (SE): The standard deviation of the sampling distribution, calculated as .
Effect of Sample Size: Increasing sample size decreases the standard error, making the sample mean more precise.
Example: If and , .
Central Limit Theorem (CLT)
The Central Limit Theorem states that the sampling distribution of the sample mean approaches a normal distribution as the sample size increases, regardless of the population's distribution, provided the sample size is sufficiently large (usually ).
Application: Allows use of normal probability methods for sample means.
Implication: Even if the population is not normal, the distribution of sample means will be approximately normal for large .
Standard Error and Law of Large Numbers
Standard Error
Standard error quantifies the variability of sample means around the population mean. It decreases as sample size increases.
Formula:
Interpretation: Smaller SE means sample means are closer to the population mean.
Example: Increasing sample size from 100 to 400 reduces SE from to .
Law of Large Numbers
The Law of Large Numbers states that as the number of trials or sample size increases, the sample mean will get closer to the population mean.
Application: Justifies using large samples for more accurate estimates.
Probability Distribution Vocabulary
Key Terms
Random Variable: A variable whose value is subject to randomness.
Probability Distribution: Describes the likelihood of all possible outcomes for a random variable.
Discrete Random Variable: Takes on countable values.
Weighted Average: Average where each value has a specific weight.
Cumulative Probability: Probability that a variable is less than or equal to a value.
Empirical Rule: See above under Standard Normal Distribution.
Population Distribution: Distribution of a variable in the entire population.
Population Parameter: Numerical summary of a population (e.g., mean, standard deviation).
Sampling Distribution: Distribution of a statistic (e.g., mean) from repeated samples.
Variability: Measure of spread in data.
Sample Problems and Solutions
Probability in Graphs
Use the Z-table to find probabilities for shaded regions under the normal curve.
Example: For and , use Z-table to find area.
Solution: , .
Central Probabilities
Find probability that a value falls within a certain number of standard deviations from the mean.
Example: Probability within 1.44 standard deviations: .
Probability within 2.58 standard deviations: .
Sample Multiple Choice Problems
Problem | Calculation | Answer |
|---|---|---|
Probability | 0.2005 | |
Height less than 62 inches (, ) | , | 0.1949 |
Standard error for , | 0.3 | |
Standard error for , | 0.15 | |
Empirical Rule for , , | ; 99% interval: | 495.2 to 504.8 |
Summary Table: Key Formulas
Concept | Formula (LaTeX) | Description |
|---|---|---|
Z-score | Standardizes a value | |
Standard Error | Standard deviation of sample mean | |
Empirical Rule | 68%: 95%: 99.7%: | Approximate coverage of normal distribution |
Additional info:
Some vocabulary and textbook sections were inferred from the syllabus and review lists.
Examples and explanations were expanded for clarity and completeness.