BackProbability Concepts in Statistics: Foundations and Applications
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Probability in Statistics
Introduction to Probability in Daily Life
Probability is a fundamental concept in statistics, providing a structured way to understand and quantify uncertainty in various contexts. It forms the bridge between descriptive statistics (summarizing data) and inferential statistics (drawing conclusions about populations from samples).
Descriptive statistics summarize data from samples.
Inferential statistics use probability to make predictions or inferences about a population based on sample data.
Probability distributions allow us to attach probabilities to sample statistics, enabling the construction of confidence intervals and hypothesis tests.
Understanding probability helps us interpret the likelihood of events and manage uncertainty quantitatively.
Random Phenomena and Probability
Random Phenomena
Random phenomena are processes or experiments that can result in different outcomes, even under identical conditions. Each occurrence yields only one outcome, but the outcome cannot be predicted with certainty in advance.
Example: Flipping a coin, rolling dice, drawing a card.
With a small number of trials, observed outcomes may deviate from expectations (e.g., getting four heads in a row).
With a large number of trials, the relative frequencies of outcomes stabilize and approach theoretical probabilities.
Probability as Long-Run Proportion
The probability of an event is defined as the proportion of times the event would occur in a very large number of repetitions of the random process.
Probability values range from 0 (impossible event) to 1 (certain event).
For example, the probability of getting heads in a fair coin flip approaches 0.5 as the number of flips increases.
Key Definitions in Probability
Sample Space and Events
Sample space (S): The set of all possible outcomes of a random experiment.
Event (A): Any subset of the sample space; can consist of one or more outcomes.
Impossible event (∅): The empty set, representing an event that cannot occur.
Examples of Sample Spaces and Events
Flipping a coin: S = {H, T}
Flipping a coin twice: S = {HH, HT, TH, TT}
Rolling two dice: S = {(1,1), (1,2), ..., (6,6)}
Drawing a card: S = {all 52 cards in a deck}
Event: Getting at least one head in two coin flips: A = {HT, TH, HH}
Tree Diagrams
Tree diagrams visually represent all possible outcomes of a sequence of events, such as answering questions on a quiz or flipping coins. Each path through the tree corresponds to a unique outcome in the sample space.
Example: For a three-question quiz with answers Correct (C) or Incorrect (I), the sample space has 8 outcomes: {CCC, CCI, CIC, CII, ICC, ICI, IIC, III}.
Calculating Probabilities
Classical (Theoretical) Probability
When all outcomes in the sample space are equally likely, the probability of an event A is:
Formula:
Example: Probability of rolling a sum of 7 with two dice:
Example: Probability of drawing an ace from a deck:
Empirical (Objective) Probability
Empirical probability is based on observed data from repeated trials of an experiment.
Formula:
As the number of trials increases, empirical probability approaches theoretical probability (Law of Large Numbers).
Subjective Probability
Subjective probability is based on personal judgment or belief about how likely an event is to occur, often used when empirical data is unavailable.
Example: Estimating the chance of rain tomorrow based on weather reports and experience.
Bayesian statistics uses subjective probability as a foundation.
Operations with Events
Intersection and Union of Events
Intersection (A ∩ B): The set of outcomes in both A and B ("A and B").
Union (A ∪ B): The set of outcomes in A or B or both ("A or B").
Mutually exclusive (disjoint) events: Events that cannot occur together; .
Complement (Ac): All outcomes in the sample space not in A ("not A").
Probability Axioms
(the probability of the sample space is 1)
for any event A
If A and B are mutually exclusive,
Addition Rule for Probabilities
General Addition Rule:
If A and B are disjoint, , so
Multiplication Rule and Independence
Multiplication Rule for Independent Events
For independent events A and B:
For more than two independent events:
Example: Probability of getting all three questions correct by guessing (with 5 options per question):
Dependent Events and Conditional Probability
Events are independent if the occurrence of one does not affect the probability of the other:
Otherwise, they are dependent.
Conditional probability:
Multiplication rule for any events:
Examples of Conditional Probability
Medical testing: Probability of disease given a positive test result.
Sampling without replacement: Probability changes after each selection.
Contingency Tables and Probability
Using Contingency Tables
Contingency tables (cross-tabulations) summarize data for two or more categorical variables, allowing calculation of various probabilities.
Unconditional probability: Probability of a single event (row or column total divided by grand total).
Conditional probability: Probability of one event given another (cell count divided by row or column total).
Joint probability: Probability of both events occurring (cell count divided by grand total).
Coke | Pepsi | Total | |
|---|---|---|---|
Female | 19 | 8 | 27 |
Male | 14 | 13 | 27 |
Total | 33 | 21 | 54 |
Additional info: Probabilities can be calculated by dividing cell counts by the appropriate totals (row, column, or grand total).
Probability Models
Defining a Probability Model
A probability model specifies the sample space and the probability of each outcome. It is essential to clearly state all assumptions when constructing a probability model, as real-world situations may only approximate idealized models.
Example: Birthday problem – probability that at least two people in a group share a birthday.
Probability models help formalize and solve such problems.
Summary Table: Types of Probability
Type | Definition | Example |
|---|---|---|
Classical | Assumes equally likely outcomes | Rolling a fair die |
Empirical | Based on observed data | Flipping a coin 100 times |
Subjective | Based on personal belief | Chance of rain tomorrow |