BackProbability, Counting, and Normal Distributions: Structured Study Notes for Statistics Students
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Probability and Its Applications
Basic Concepts of Probability
Probability is a fundamental concept in statistics, quantifying the likelihood of events. Values range from 0 (impossible event) to 1 (certain event). Understanding probability is essential for hypothesis testing and inferential statistics.
Probability Value: Expressed as a decimal between 0 and 1.
Rare Event Rule: Events with very low probability (≤0.05) are considered rare and may indicate significant results.
Notation: P(A) denotes the probability of event A; P(A—) denotes the probability that event A does not occur.
Three Approaches to Probability:
Relative Frequency Approximation:
Classical Approach (Equally Likely Outcomes):
Subjective Probability: Estimated using knowledge and judgment.
Law of Large Numbers: As the number of trials increases, the relative frequency probability approaches the true probability.
Complementary Events
The complement of event A is the event that A does not occur. The probability of the complement is:
Example: If (adult uses Internet), then (adult does not use Internet).
Odds
Odds are another way to express likelihood, commonly used in gambling.
Actual Odds Against:
Payoff Odds: Set by gambling operators, often differ from actual odds to ensure profit.
Advantages: Probabilities are preferred for calculations; odds are used for money transfers.
Probability Examples
Probability of rolling a 5 on a six-sided die:
Probability of getting heads on a coin:
Probability of winning California Daily 4 lottery:
Probability of a car rollover in a crash:
Addition and Multiplication Rules
Addition Rule
The addition rule is used to find the probability that either event A or event B occurs (or both).
General Addition Rule:
Disjoint Events: If A and B cannot occur together,
Complementary Events:
Multiplication Rule
The multiplication rule is used to find the probability that both event A and event B occur.
General Multiplication Rule:
Independent Events:
Dependent Events: Adjust based on the outcome of A.
Sampling: With replacement = independent; without replacement = dependent.
5% Guideline: For small samples from large populations, treat events as independent.
Redundancy
Redundancy increases reliability by using multiple components. The probability that all components fail is the product of their individual failure probabilities.
Example: Probability both hard drives fail:
Probability at least one works:
Complements, Conditional Probability, and Bayes' Theorem
Probability of "At Least One"
To find the probability of at least one occurrence of an event, use the complement rule:
Example: Probability of at least one malfunctioning iPhone in 3 selected:
Conditional Probability
Conditional probability is the probability of event B occurring given that event A has already occurred.
Notation:
Example: Probability of a professional golfer making a hole in one: (given professional status)
Confusion of the Inverse: in general.
Bayes' Theorem
Bayes' theorem updates the probability of an event based on new information.
Formula:
Example: Probability of cancer given a positive test result:
Counting Principles
Multiplication Counting Rule
Used to find the total number of possibilities from a sequence of events.
Example: Number of possible 3-digit codes:
Factorial Rule
The number of ways n different items can be arranged (order matters):
Example:
Permutations and Combinations
Permutations (Order Matters):
Permutations with Identical Items:
Combinations (Order Does Not Matter):
Examples:
Number of ways to select 5 players from 11:
Number of ways to arrange the word "statistics": Use permutation formula for multiset.
Probability of winning Powerball: Use combinations for main numbers and separate probability for Powerball number.
Binomial Probability Distributions
Binomial Distribution
A binomial distribution describes the probability of a fixed number of successes in a fixed number of independent trials, each with the same probability of success.
Criteria: Fixed number of trials, independent trials, only two possible outcomes, constant probability of success.
Binomial Probability Formula: , where
Mean:
Standard Deviation:
Range Rule of Thumb
Significantly Low:
Significantly High:
Values between these are not significant.
Other Discrete Distributions
Geometric Distribution: Probability of first success on the x-th trial:
Multinomial Distribution: For more than two categories:
Hypergeometric Distribution: Sampling without replacement:
Normal Probability Distributions
Standard Normal Distribution
The standard normal distribution is a bell-shaped curve with mean 0 and standard deviation 1.
Properties: Bell-shaped, mean = 0, standard deviation = 1.
z Score: Distance from the mean in standard deviation units.
Empirical Rule:
68% within 1 standard deviation
95% within 2 standard deviations
99.7% within 3 standard deviations
Finding Probabilities and z Scores
Use technology or Table A-2 to find areas under the curve.
For a given z score, find the cumulative area from the left.
For a given area, find the corresponding z score.
Critical values: denotes the z score with area to its right.
Nonstandard Normal Distributions
Normal distributions with mean and standard deviation can be converted to the standard normal distribution using:
To find x from z:
Applications
IQ scores: ,
Pulse rates, heights, weights, and other measurements are often normally distributed.
Percentiles and quartiles can be calculated using z scores.
The Central Limit Theorem
Statement and Importance
The central limit theorem states that the distribution of sample means approaches a normal distribution as the sample size increases (n > 30), regardless of the population's distribution.
For sample means:
Allows estimation of population parameters and hypothesis testing.
Finite Population Correction
When sampling without replacement from a finite population, adjust the standard deviation:
Correction factor:
Corrected standard deviation:
Simulations for Hypothesis Tests
Purpose and Method
Simulations are used to test claims about population parameters and to find probabilities for complex scenarios.
Generate many samples with assumed parameters.
Compare observed sample statistics to simulated results.
If observed statistics are significantly low or high, the claim is likely incorrect.
Summary Table: Probability Approaches
Approach | Definition | Formula |
|---|---|---|
Relative Frequency | Based on observed outcomes | |
Classical | Equally likely outcomes | |
Subjective | Based on judgment | N/A |
Summary Table: Counting Rules
Rule | Formula | When to Use |
|---|---|---|
Multiplication | Sequence of events | |
Factorial | Arranging n items | |
Permutation | Order matters | |
Combination | Order does not matter |
Summary Table: Binomial Distribution
Parameter | Formula |
|---|---|
Mean | |
Standard Deviation | |
Probability of x successes |
Summary Table: Normal Distribution
Parameter | Formula |
|---|---|
z Score | |
Area under curve | Use Table A-2 or technology |
Sample mean z | |
Finite population correction |
Key Definitions
Probability: Measure of likelihood of an event.
Complement: Probability that an event does not occur.
Conditional Probability: Probability of one event given another.
Permutation: Arrangement of items where order matters.
Combination: Selection of items where order does not matter.
Binomial Distribution: Probability distribution for number of successes in fixed trials.
Normal Distribution: Bell-shaped probability distribution.
Central Limit Theorem: Distribution of sample means approaches normal as sample size increases.
Additional info: These notes expand on brief points with academic context, examples, and formulas to ensure completeness and clarity for exam preparation.