BackStatistics Review: Probability, Binomial Distributions, Normal Distributions, and Bayes' Rule
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Review of Key Statistics Topics
Finding Z-scores (Section 7.2)
Z-scores are standardized values that indicate how many standard deviations a data point is from the mean of a normal distribution. They are essential for calculating probabilities and percentiles in normally distributed data.
Definition: The z-score of a value is given by , where is the mean and is the standard deviation.
Application: Z-scores allow us to use the standard normal table to find probabilities for any normal distribution.
Example: If the mean incubation time for eggs is 21 days with a standard deviation of 1 day, the z-score for an egg that hatches in 19 days is .
Finding Percentiles (Section 7.2)
Percentiles indicate the relative standing of a value within a data set. In a normal distribution, percentiles can be found using z-scores and the standard normal table.
Definition: The p-th percentile is the value below which p% of the data fall.
Finding a Percentile: To find the value corresponding to a given percentile, use the z-score from the standard normal table and solve for in .
Example: To find the 17th percentile for egg incubation times (mean = 21, SD = 1), find the z-score for 0.17 (approximately -0.95), then days.
Normal Approximation to the Binomial (Section 7.4)
When the sample size is large, the binomial distribution can be approximated by the normal distribution, making calculations easier.
Conditions: The approximation is appropriate when and .
Continuity Correction: When using the normal approximation, add or subtract 0.5 to the discrete x-value (continuity correction).
Formula: For a binomial variable with parameters and , approximately.
Example: In a survey of 500 students where 31% are expected to lie, , ; both are greater than 10, so normal approximation is valid.
Binomial Random Variable (Section 6.2)
The binomial random variable counts the number of successes in a fixed number of independent trials, each with the same probability of success.
Definition: A binomial experiment has n independent trials, each with probability p of success.
Probability Formula:
Mean and Standard Deviation: ,
Example: For 15 flights with 80% on-time rate, , .
Sample Table: Binomial Probabilities
Number of On-Time Flights (k) | Probability |
|---|---|
10 | Calculated using |
Fewer than 10 | Sum for to $9$ |
At least 10 | Sum for to $15$ |
Between 8 and 10 | Sum , , |
Bayes' Rule (Section 5.8)
Bayes' Rule allows us to update probabilities based on new evidence. It is especially useful in diagnostic testing and decision-making under uncertainty.
Formula:
Application: Used to find the probability that a person is innocent or guilty given a positive or negative test result.
Example: If 95% of suspects are guilty, and the polygraph is 90% accurate for guilty and 99% accurate for innocent, Bayes' Rule can be used to find the probability a suspect is innocent given a positive test.
Additional info: These topics are foundational for understanding probability, distributions, and statistical inference in college-level statistics courses. Mastery of these concepts is essential for interpreting data and making informed decisions based on statistical evidence.