BackBinomial Probability Distributions – 5.2:Essentials of Statistics (Chapter 5 Study Notes)
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Discrete Probability Distributions
Introduction to Discrete Probability Distributions
Discrete probability distributions describe the probabilities of outcomes for discrete random variables, which can take on a countable number of values. These distributions are fundamental in statistics for modeling scenarios where outcomes are distinct and separate.
Discrete Probability Distribution: A listing or function that gives the probability of each possible value of a discrete random variable.
Main Types: Binomial and Poisson probability distributions are commonly studied in introductory statistics.
Applications: Used in quality control, genetics, reliability engineering, and many other fields.
Binomial Probability Distributions
Key Concept: Binomial Probability Distribution
The binomial probability distribution models the number of successes in a fixed number of independent trials, each with the same probability of success. It is widely used to answer questions about the likelihood of a certain number of successes in repeated experiments.
Definition: The probability distribution of the number of successes in a sequence of n independent experiments, each asking a yes/no question, and each with its own boolean-valued outcome.
Examples:
Flipping a fair coin 10 times and counting the number of heads.
Transmitting bits through a digital channel and counting the number of errors.
Testing air samples for a rare molecule and counting the number of positive samples.
Requirements for a Binomial Probability Distribution
For a random experiment to be modeled by a binomial distribution, it must satisfy four key requirements:
Fixed Number of Trials: The procedure has a predetermined number of trials, denoted by n.
Independence: Each trial is independent; the outcome of one trial does not affect the others.
Two Categories: Each trial results in one of exactly two possible outcomes, commonly called success and failure.
Constant Probability: The probability of success, denoted by p, remains the same for each trial.
Notation and Parameters
Standard notation is used to describe the binomial distribution:
n: Number of trials
x: Number of successes (can be any integer from 0 to n)
p: Probability of success in a single trial
q: Probability of failure in a single trial ()
Note: The term "success" is arbitrary and does not necessarily mean something positive; it simply refers to the outcome being counted.
Binomial Probability Formula
The probability of getting exactly x successes in n independent trials is given by the binomial formula:
Formula:
Where:
is the factorial of n
is the probability of success
is the probability of failure ()
is the number of successes
Example: Cashless Adults
Suppose the probability that a randomly selected adult smartphone owner is cashless is 0.05. What is the probability that, among 10 randomly selected adults, exactly 2 are cashless?
Parameters: , , ,
Calculation:
Evaluating this gives (rounded to three significant digits).
Example: Vegetarians
Based on a poll, 5% of U.S. adults are vegetarians. If five adults are randomly selected, find:
Probability that exactly two are vegetarians:
Probability that fewer than three are vegetarians:
Mean and Standard Deviation of Binomial Distributions
Formulas for Mean, Variance, and Standard Deviation
The mean, variance, and standard deviation of a binomial distribution are given by:
Mean:
Variance:
Standard Deviation:
These formulas allow us to describe the expected value and spread of the distribution.
Range Rule of Thumb for Significance
The range rule of thumb helps identify values that are significantly low or high in a binomial distribution:
Significantly low values: Less than
Significantly high values: Greater than
Not significant: Between and
Example: NFL Overtime Wins
Suppose in 460 NFL overtime games, the probability of winning is 0.5 for each team. Find the mean and standard deviation for the number of wins, and determine if 252 wins is significantly high.
Parameters: , ,
Mean:
Standard Deviation:
Significantly high threshold:
Conclusion: 252 wins is significantly high because it exceeds 251.4.
Summary Table: Binomial Distribution Properties
Parameter | Symbol | Formula | Description |
|---|---|---|---|
Number of trials | n | - | Fixed number of independent experiments |
Probability of success | p | - | Probability of success in a single trial |
Probability of failure | q | Probability of failure in a single trial | |
Mean | Expected number of successes | ||
Variance | Measure of spread | ||
Standard Deviation | Square root of variance | ||
Probability of x successes | Probability of exactly x successes in n trials |