Module 7
Study Guide - Smart Notes
Probability Distributions
Introduction
Probability distributions are foundational concepts in statistics, describing how probabilities are assigned to the outcomes of random experiments. Understanding these distributions is essential for statistical inference, data analysis, and real-world problem-solving in health sciences and beyond. This module covers both discrete and continuous probability distributions, including their properties, calculations, and applications.
Key Concepts and Learning Objectives
Define and distinguish between discrete and continuous probability distributions.
Calculate probabilities using probability mass functions (PMFs) and probability density functions (PDFs), as well as cumulative distribution functions (CDFs).
Describe the characteristics of key distributions: Uniform, Binomial, Poisson, and Normal.
Compute expected values (means) and variances for given distributions.
Apply probability distributions to solve real-world problems, especially in health sciences.
Interpret graphs and tables related to probability distributions.
Understand the importance of the Normal distribution in statistical inference.
Random Experiments, Sample Space, and Random Variables
Random Experiment and Sample Space
Random Experiment: An action or process that leads to one of several possible outcomes, e.g., tossing a coin twice.
Sample Space (S): The set of all possible outcomes. For tossing a coin twice: S = {HH, HT, TH, TT}.
Sample Point: An individual outcome in the sample space.
Random Variables (RV)
A random variable (X) assigns a real number to each outcome in the sample space.
Notation: Use uppercase letters (X, Y) for random variables; lowercase (x, y) for their values.
Example: Let X = number of tails in two coin tosses. Possible values: X = 0, 1, 2.
Probability Distributions
Definition and Representation
A probability distribution describes how probabilities are distributed over the possible values of a random variable.
Can be represented as a list, table, graph, or formula.
The sum of all probabilities for a discrete random variable is 1.
Types of Random Variables
Discrete Random Variable: Takes countable values (e.g., number of patients, number of side effects).
Continuous Random Variable: Takes any real value within an interval (e.g., body temperature, blood pressure).
Binary Variable: Only two possible outcomes (e.g., positive/negative, recovered/not recovered).
Categorical Variable: Distinct categories with no inherent order (e.g., blood type, treatment type).
Probability Mass Function (PMF) and Probability Density Function (PDF)
PMF (Discrete Random Variables)
The probability mass function (PMF) gives the probability that a discrete random variable is exactly equal to some value x: $p(x) = P(X = x)$.
Example: Tossing a fair coin twice, X = number of tails. Then $P(X = 0) = 1/4$, $P(X = 1) = 1/2$, $P(X = 2) = 1/4$.
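The PMF for this example can be obtained by enumerating the sample space and counting tails in each outcome. The following is a minimal sketch that assumes a fair coin, so that all four outcomes are equally likely; the variable names are illustrative only.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# Sample space for two tosses of a coin: HH, HT, TH, TT.
sample_space = ["".join(outcome) for outcome in product("HT", repeat=2)]

# The random variable X maps each outcome to its number of tails.
tail_counts = Counter(outcome.count("T") for outcome in sample_space)

# Assumption: the coin is fair, so each outcome has probability 1/4.
pmf = {
    x: Fraction(count, len(sample_space))
    for x, count in sorted(tail_counts.items())
}

for x, prob in pmf.items():
    print(f"P(X = {x}) = {prob}")

# The probabilities of a discrete random variable sum to 1.
print("Total:", sum(pmf.values()))
```

Using `Fraction` keeps the probabilities exact, which makes it easy to confirm that they sum to 1.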
PDF (Continuous Random Variables)
The probability density function (PDF) describes the likelihood of a continuous random variable taking a value in an interval.
Probability at a single point is zero: $P(X = x) = 0$.
Probability over an interval: $P(a \le X \le b) = \int_a^b f(x)\,dx$.
Comparison Table: PMF vs PDF
Type | Function | Probability at a Point | Computation |
|---|---|---|---|
Discrete | PMF | $P(X = x) = p(x)$ | Summation |
Continuous | PDF | $P(X = x) = 0$ | Integration (area under curve) |
Expectation (Mean) and Variance
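Expected Value (Mean): $\mu = E(X) = \sum_x x\,p(x)$ for discrete, $\mu = E(X) = \int_{-\infty}^{\infty} x\,f(x)\,dx$ for continuous.
Variance: $\sigma^2 = \mathrm{Var}(X) = \sum_x (x - \mu)^2\,p(x)$ (discrete), $\sigma^2 = \mathrm{Var}(X) = \int_{-\infty}^{\infty} (x - \mu)^2\,f(x)\,dx$ (continuous).
Standard Deviation: $\sigma = \sqrt{\mathrm{Var}(X)}$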
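As a worked illustration of the discrete formulas, the sketch below computes the mean, variance, and standard deviation for X = number of tails in two coin tosses. It assumes a fair coin; the helper names `mean` and `variance` are illustrative, not part of any library.

```python
from math import sqrt

# PMF of X = number of tails in two tosses (assumes a fair coin).
pmf = {0: 0.25, 1: 0.5, 2: 0.25}


def mean(pmf):
    """Expected value: sum of x * P(X = x) over all values x."""
    return sum(x * p for x, p in pmf.items())


def variance(pmf):
    """Variance: sum of (x - mu)^2 * P(X = x) over all values x."""
    mu = mean(pmf)
    return sum((x - mu) ** 2 * p for x, p in pmf.items())


mu = mean(pmf)
var = variance(pmf)
sd = sqrt(var)

print(f"mean = {mu}, variance = {var}, sd = {sd:.4f}")
```

For this PMF the mean is 1 tail and the variance is 0.5, so the standard deviation is about 0.71.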
Common Probability Distributions
Uniform Distribution
Discrete Uniform
All outcomes are equally likely.
PMF: $P(X = x) = \frac{1}{k}$ for $x = x_1, x_2, \dots, x_k$
Example: Drawing one ball from a box that holds equal numbers of balls in 3 colors; each color has probability 1/3.
Continuous Uniform
The density is constant over the interval [A, B], so sub-intervals of equal length are equally likely.
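PDF: $f(x) = \frac{1}{B - A}$ for $A \le x \le B$; $f(x) = 0$ otherwise.
Mean: $\mu = \frac{A + B}{2}$
Variance: $\sigma^2 = \frac{(B - A)^2}{12}$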
Example: IV infusion time uniformly distributed between 30 and 60 minutes.
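The infusion-time example can be sketched numerically as follows. The bounds of 30 and 60 minutes come from the example; the question about the 40 to 50 minute window is a hypothetical one added for illustration, and the function names are not from any library.

```python
# Bounds of the infusion time in minutes (from the example).
A, B = 30.0, 60.0


def uniform_pdf(x, a=A, b=B):
    """Constant density 1 / (b - a) inside [a, b], zero outside."""
    return 1.0 / (b - a) if a <= x <= b else 0.0


def uniform_cdf(x, a=A, b=B):
    """P(X <= x) for a continuous uniform distribution on [a, b]."""
    if x < a:
        return 0.0
    if x > b:
        return 1.0
    return (x - a) / (b - a)


mean_time = (A + B) / 2
variance_time = (B - A) ** 2 / 12

# Hypothetical question: probability the infusion takes 40 to 50 minutes.
p_40_to_50 = uniform_cdf(50.0) - uniform_cdf(40.0)

print(f"mean = {mean_time} min, variance = {variance_time} min^2")
print(f"P(40 <= X <= 50) = {p_40_to_50:.4f}")
```

Because the density is flat, the interval probability is simply the interval length divided by the total range: 10 / 30.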
Binomial Distribution
Models the number of successes in n independent Bernoulli trials (each with probability p of success).
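PMF: $P(X = x) = \binom{n}{x} p^x (1 - p)^{n - x}$ for $x = 0, 1, \dots, n$
Mean: $\mu = np$
Variance: $\sigma^2 = np(1 - p)$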
Assumptions: Fixed number of trials, independent trials, constant probability of success.
Example: Probability that exactly 8 out of 10 patients recover if treatment is 80% effective.
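The recovery example can be checked with a few lines of code. This is a minimal sketch that treats "80% effective" as a per-patient recovery probability of p = 0.8 and assumes the 10 patients recover independently; `binomial_pmf` is an illustrative helper, not a library function.

```python
from math import comb


def binomial_pmf(x, n, p):
    """P(X = x) for X ~ Binomial(n, p)."""
    return comb(n, x) * p**x * (1 - p) ** (n - x)


n, p = 10, 0.8  # 10 patients, recovery probability 0.8 (from the example)

p_exactly_8 = binomial_pmf(8, n, p)
mean_recoveries = n * p
variance_recoveries = n * p * (1 - p)

print(f"P(X = 8) = {p_exactly_8:.4f}")  # 0.3020
print(f"mean = {mean_recoveries}, variance = {variance_recoveries:.2f}")
```

So even though 8 recoveries is the expected count, it occurs only about 30% of the time.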
Poisson Distribution
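Models the number of times an event occurs in a fixed interval of time or space, given a constant mean rate $\lambda$.
PMF: $P(X = x) = \frac{e^{-\lambda} \lambda^x}{x!}$ for $x = 0, 1, 2, \dots$
Mean and Variance: $\mu = \sigma^2 = \lambda$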
Assumptions: Events occur independently, at a constant rate, and two events cannot occur at exactly the same instant.
Example: Number of patients arriving at a clinic per hour.
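The clinic-arrival example does not state a rate, so the sketch below assumes a hypothetical mean of 4 arrivals per hour purely for illustration. `poisson_pmf` is an illustrative helper, not a library function.

```python
from math import exp, factorial


def poisson_pmf(x, lam):
    """P(X = x) for X ~ Poisson(lam)."""
    return exp(-lam) * lam**x / factorial(x)


lam = 4.0  # hypothetical mean arrivals per hour (not given in the notes)

# Probability of exactly 2 arrivals in an hour.
p_exactly_2 = poisson_pmf(2, lam)

# Probability of at most 2 arrivals: sum the PMF over x = 0, 1, 2.
p_at_most_2 = sum(poisson_pmf(x, lam) for x in range(3))

print(f"P(X = 2)  = {p_exactly_2:.4f}")
print(f"P(X <= 2) = {p_at_most_2:.4f}")
```

Changing `lam` to the observed hourly rate for a particular clinic gives the corresponding probabilities.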
Normal Distribution
Continuous, symmetric, bell-shaped distribution; also called the Gaussian distribution.
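PDF: $f(x) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{(x - \mu)^2}{2\sigma^2}}$ for $-\infty < x < \infty$
Mean = median = mode = $\mu$
Standard deviation $\sigma$ determines the spread.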
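Empirical Rule: about 68.3% of values lie within $1\sigma$, 95.4% within $2\sigma$, and 99.7% within $3\sigma$ of the mean.
Standard Normal Distribution: $Z \sim N(0, 1)$, the normal distribution with mean 0 and standard deviation 1.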
Applications: Blood pressure, body temperature, BMI, test scores, etc.
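The empirical-rule percentages can be reproduced from the normal CDF. The sketch below uses the standard identity that, for any normal variable, the probability of falling within k standard deviations of the mean equals erf(k / sqrt(2)); `prob_within` is an illustrative helper name.

```python
from math import erf, sqrt


def prob_within(k):
    """P(mu - k*sigma <= X <= mu + k*sigma) for a normal variable X."""
    return erf(k / sqrt(2))


for k in (1, 2, 3):
    print(f"within {k} SD: {prob_within(k):.4f}")
# within 1 SD: 0.6827
# within 2 SD: 0.9545
# within 3 SD: 0.9973
```

The result does not depend on the particular mean or standard deviation, which is why the rule applies to every normal distribution.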
Normal Approximation
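The binomial distribution can be approximated by a normal distribution with mean $np$ and variance $np(1 - p)$ when n is large and p is not too close to 0 or 1 (a common rule of thumb is $np \ge 5$ and $n(1 - p) \ge 5$). The Poisson distribution can be approximated by a normal distribution with mean $\lambda$ and variance $\lambda$ when $\lambda$ is large.
Continuity correction: When a discrete variable X is approximated by a continuous normal variable Y, use $P(X = x) \approx P(x - 0.5 \le Y \le x + 0.5)$ and $P(X \le x) \approx P(Y \le x + 0.5)$.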
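To see what the continuity correction buys, the sketch below compares an exact binomial probability with its normal approximation, with and without the 0.5 adjustment. The values n = 100, p = 0.5, and the cutoff 55 are hypothetical choices for illustration; `NormalDist` is from Python's standard `statistics` module.

```python
from math import comb, sqrt
from statistics import NormalDist

n, p = 100, 0.5  # hypothetical binomial parameters
x = 55           # hypothetical cutoff: we want P(X <= 55)

# Exact binomial probability: sum the PMF from 0 to x.
exact = sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(x + 1))

# Normal approximation with matching mean and standard deviation.
approx_dist = NormalDist(mu=n * p, sigma=sqrt(n * p * (1 - p)))

with_correction = approx_dist.cdf(x + 0.5)  # P(Y <= 55.5)
without_correction = approx_dist.cdf(x)     # P(Y <= 55)

print(f"exact binomial       : {exact:.4f}")
print(f"normal, corrected    : {with_correction:.4f}")
print(f"normal, no correction: {without_correction:.4f}")
```

For these inputs the corrected approximation lands much closer to the exact value than the uncorrected one.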
Cumulative Distribution Function (CDF)
The CDF gives the probability that a random variable X is less than or equal to a value x: $F(x) = P(X \le x)$.
For continuous distributions, $F(x) = \int_{-\infty}^{x} f(t)\,dt$.
Used to find percentiles, probabilities over intervals, and to interpret Z-tables.
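These uses of the CDF can be sketched with the standard normal distribution. The example relies on `NormalDist` from Python's standard `statistics` module; the specific cutoffs (1.0, 1.96, and the 97.5th percentile) are illustrative choices.

```python
from statistics import NormalDist

Z = NormalDist(mu=0.0, sigma=1.0)  # standard normal distribution

# CDF value: P(Z <= 1), the number a Z-table lists for z = 1.00.
p_le_1 = Z.cdf(1.0)

# Interval probability: P(a <= Z <= b) = F(b) - F(a).
p_interval = Z.cdf(1.96) - Z.cdf(-1.96)

# Percentile: the inverse CDF returns the z below which 97.5% of values fall.
z_975 = Z.inv_cdf(0.975)

print(f"P(Z <= 1)             = {p_le_1:.4f}")
print(f"P(-1.96 <= Z <= 1.96) = {p_interval:.4f}")
print(f"97.5th percentile     = {z_975:.2f}")
```

Reading a Z-table forward corresponds to `cdf`; reading it backward, from a probability to a z value, corresponds to `inv_cdf`.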
Using Z-Tables and Calculating Probabilities
Convert X to Z-score: $Z = \frac{X - \mu}{\sigma}$
Use Z-tables to find probabilities and percentiles for normal distributions.
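Example: If SBP is normally distributed with mean $\mu$ mmHg and standard deviation $\sigma$ mmHg, the probability that SBP > 140 mmHg is $P(X > 140) = 1 - \Phi\left(\frac{140 - \mu}{\sigma}\right)$, where $\Phi$ is the standard normal CDF read from the Z-table.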
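The SBP calculation can be sketched in code. The mean and standard deviation from the original example are not available here, so the values below (120 mmHg and 10 mmHg) are hypothetical placeholders chosen only to make the steps concrete. `NormalDist` is from Python's standard `statistics` module.

```python
from statistics import NormalDist

# Hypothetical parameters for illustration only (not from the notes).
mu, sigma = 120.0, 10.0  # mean and standard deviation of SBP in mmHg
threshold = 140.0        # cutoff from the example, in mmHg

# Step 1: convert the threshold to a Z-score.
z = (threshold - mu) / sigma

# Step 2: look up P(Z <= z) and take the complement for the upper tail.
p_above = 1.0 - NormalDist().cdf(z)

# Cross-check: same probability computed directly on the original scale.
p_above_direct = 1.0 - NormalDist(mu, sigma).cdf(threshold)

print(f"z = {z:.2f}")
print(f"P(SBP > 140) = {p_above:.4f}")
```

With these placeholder values the threshold sits 2 standard deviations above the mean, so only about 2.3% of readings exceed it.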
Applications in Health Sciences
Vital signs (blood pressure, temperature, heart rate) are often modeled as approximately normally distributed.
Drug response, side effects, and dosage studies use probability distributions for analysis.
Resource allocation, scheduling, and quality control in healthcare use uniform and Poisson distributions.
Clinical trials and diagnostic test interpretation rely on binomial and normal distributions.
Summary Table: Key Distributions
Distribution | Type | Parameters | Mean | Variance | Example |
|---|---|---|---|---|---|
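Uniform (Discrete) | Discrete | k (number of outcomes, values $1, \dots, k$) | $\frac{k + 1}{2}$ | $\frac{k^2 - 1}{12}$ | Rolling a fair die |
Uniform (Continuous) | Continuous | A, B (interval) | $\frac{A + B}{2}$ | $\frac{(B - A)^2}{12}$ | IV infusion time |
Binomial | Discrete | n, p | $np$ | $np(1 - p)$ | Number of recoveries |
Poisson | Discrete | $\lambda$ | $\lambda$ | $\lambda$ | Clinic arrivals/hour |
Normal | Continuous | $\mu$, $\sigma$ | $\mu$ | $\sigma^2$ | Blood pressure |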
Conclusion
Probability distributions are essential tools in statistics, enabling the modeling and analysis of random phenomena in health sciences. Mastery of these concepts allows for effective data interpretation, decision-making, and research in medical and scientific contexts.