BackStatistics Study Guide: Probability, Distributions, and Statistical Inference
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Probability and Unusual Events
Defining Unusual Events
In statistics, an event is considered unusual if its probability is very low, typically less than 0.05 (5%). This threshold helps identify outcomes that are rare or unexpected under normal circumstances.
Unusual Event: Probability < 0.05
Example: An event with probability is not considered unusual.
Discrete Probability Distributions
Probability Distribution Table
A probability distribution lists the probabilities of all possible outcomes for a discrete random variable. The sum of all probabilities must equal 1.
y (Number of Patients) | P(Y = y) |
|---|---|
0 | 0.160 |
1 | 0.330 |
2 | 0.275 |
3 | 0.145 |
4 | 0.065 |
5 | 0.020 |
6 | 0.005 |
Finding Probability: To find the probability that at least one patient has the condition, use .
Formula:
Conditional Probability and Diagnostic Testing
False Positives and Negatives
Diagnostic tests can yield false positives (test positive, but do not have the condition) and false negatives (test negative, but do have the condition).
Conditional Probability: Probability of an event given another event has occurred.
Example: Given test results, calculate the probability that a randomly selected participant did not have the flu or tested positive.
Joint and Marginal Probability
Contingency Tables
Contingency tables summarize data for two categorical variables. Probabilities can be calculated for joint or marginal events.
Reported Poor Sleep: Yes | Reported Poor Sleep: No | |
|---|---|---|
Used Electronic Devices Before Bed | 800 | 2940 |
Did Not Use Electronic Devices Before Bed | 210 | 4030 |
Joint Probability: Probability that a student did not use electronic devices before bed and did not report poor sleep:
Probability in Categorical Data
Graduation Data Table
Science | Non-Science | |
|---|---|---|
Male | 75,000 | 105,000 |
Female | 95,000 | 125,000 |
Marginal Probability: Probability that a randomly selected science graduate is male:
Probability Without Replacement
Dependent Events
When selecting items without replacement, probabilities change after each selection.
Example: Probability all three selected students are left-handed from a group of 8 (4 left-handed):
Bayes' Theorem
Conditional Probability Formula
Bayes' Theorem allows calculation of the probability of an event based on prior knowledge of conditions related to the event.
Formula:
Simple Events
Counting Outcomes
A simple event is an event with only one outcome.
Example: Drawing marble number 7 from 10 marbles is a simple event with 1 outcome.
Probability of Random Guessing
Counting Principle
When guessing digits, the probability of a correct guess is , where n is the number of digits.
Example: Guessing 4 digits:
Linear Transformations of Mean
Adjusting Mean Values
When a mean is increased by a percentage and a fixed amount, the new mean is calculated as:
Formula:
Example:
Binomial and Unusual Values
Histogram Shape and Unusual Probabilities
For a binomial random variable, the histogram's shape depends on the probability of success.
Symmetrical: Probability near 0.5
Skewed Right: Probability < 0.5
Skewed Left: Probability > 0.5
Unusual Value: Probability < 0.05
Poisson Distribution
Probability of Events in Fixed Interval
The Poisson distribution models the number of events in a fixed interval of time or space.
Formula:
Example: For , :
Hypergeometric Distribution
Sampling Without Replacement
Used when sampling without replacement from a finite population.
Formula:
Example: Probability none of 6 selected bolts are substandard from 7,500 bolts, 18% substandard.
Normal Distribution
Calculating Probabilities
The normal distribution is a continuous probability distribution characterized by its mean () and standard deviation ().
Standardization:
Example: for ,
Probability Between Two Values
Normal Distribution Applications
Formula:
Example: Nitrogen dioxide levels between 12.0 and 25.0 parts per billion
Sampling Distribution of the Mean
Central Limit Theorem
For large samples, the sampling distribution of the mean is approximately normal.
Standard Error:
Example: Probability sample mean rainfall > 22.8 mm
Sample Size Estimation
Confidence in Estimating Standard Deviation
To estimate the population standard deviation with a specified confidence and margin of error, use sample size formulas based on the desired confidence level.
Example: Minimum sample size for 99% confidence that sample standard deviation is within 3% of population standard deviation.
Confidence Intervals
Estimating Population Mean
A confidence interval provides a range of values within which the population parameter is likely to fall.
Formula:
Example: 95% confidence interval for mean weight of dimes minted after 2000.
Interpretation: If the confidence interval includes the specification value, the mean meets the specification.
Additional info: Some formulas and context have been expanded for clarity and completeness.