BackFoundations of Probability and Categorical Associations
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Section 5.1 What is Randomness?
Understanding Randomness in Statistics
Randomness is a fundamental concept in statistics, referring to outcomes that are unpredictable in the short term but may have a predictable distribution in the long run. This section introduces key terms and ideas related to randomness and probability.
Randomness: The quality of lacking any predictable order or plan. In statistics, randomness is often achieved using computers or other randomizing devices.
Probability: A measure of how likely an event is to occur, expressed as a number between 0 and 1.
Theoretical Probability: Probability calculated based on the possible outcomes in a perfect model, rather than observed data.
Empirical Probability: Probability based on observed data from experiments or historical records.
Simulation: The use of a model or computer program to imitate a real-world process or system over time, often used to estimate probabilities.
Key Points:
Randomness is difficult to achieve without the use of a computer or other randomizing device.
Given a probability experiment, determine if it is an example of an empirical or theoretical probability.
Example: Flipping a fair coin is a random process. The theoretical probability of getting heads is .
Section 5.2 Finding Theoretical Probabilities
Calculating Probabilities Using Mathematical Models
Theoretical probability is determined by analyzing all possible outcomes in a sample space. This section covers the vocabulary and methods for finding these probabilities.
Sample Space: The set of all possible outcomes of a probability experiment.
Event: A subset of the sample space; a specific outcome or group of outcomes.
Venn Diagram: A visual representation of events and their relationships within a sample spac e.
AND (Intersection): The event that both A and B occur, denoted as .
OR (Union): The event that at least one of A or B occurs, denoted as .
Mutually Exclusive Events: Events that cannot occur at the same time (their intersection is empty).
Complement of an Event: All outcomes in the sample space that are not in the event.
Key Points:
Read and understand each of the probability rules, such as the Addition Rule and Multiplication Rule.
Try to create your own example for each of the probability rules.
Formulas:
Probability of an event:
Addition Rule (for mutually exclusive events):
Addition Rule (general):
Complement Rule:
Example: Rolling a six-sided die: The probability of rolling a 2 or a 4 (mutually exclusive events) is .
Section 5.3 Associations in Categorical Variables
Understanding Relationships Between Categorical Variables
This section explores how to identify and interpret associations between categorical variables, often using contingency tables and probability rules.
Conditional Probability: The probability of one event occurring given that another event has occurred. Denoted as .
Associated Events: Events that are not independent; the occurrence of one affects the probability of the other.
Independent Events: Events where the occurrence of one does not affect the probability of the other. .
Multiplication Rule: For independent events, .
Key Points:
Focus on understanding conditional probability.
Be able to recognize a conditional probability in context.
Study and think about the difference between a conditional probability and one requiring the use of AND.
Study the notation for conditional probability.
Try to understand the formula for conditional probability.
Study the informal meaning of independence.
Formulas:
Conditional Probability:
Multiplication Rule (for independent events):
Example: If 30% of students are left-handed and 60% are female, and handedness and gender are independent, the probability of a randomly selected student being both left-handed and female is .
Section 5.4 Finding Empirical Probabilities
Estimating Probabilities from Data
Empirical probability is based on observed data rather than theoretical models. This section discusses how to calculate empirical probabilities and introduces related concepts.
Empirical Probability: Probability calculated from observed data, also called experimental probability.
Influential Points: Data points that have a significant effect on the outcome of a statistical analysis.
Aggregate Data: Data combined from several measurements.
Regression: A statistical method for modeling the relationship between variables.
Law of Large Numbers: As the number of trials increases, the empirical probability approaches the theoretical probability.
Key Points:
Study the notions of independence and associated events from Section 5.3.
Be able to find the empirical probability for a sequence of independent events.
Be able to list out the Law of Large Numbers.
Formulas:
Empirical Probability:
Example: If a coin is flipped 100 times and lands on heads 47 times, the empirical probability of heads is .
Additional info: The Law of Large Numbers states that as the number of trials increases, the empirical probability will converge to the theoretical probability.