Skip to main content
Back

Statistics Exam 2 Study Guide: Correlation, Probability, and Distributions

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 4: Correlation and Regression

Understanding Correlation

Correlation measures the strength and direction of a linear relationship between two quantitative variables. It is a key concept in statistics for identifying trends and making predictions.

  • Correlation Coefficient (r): A numerical measure ranging from -1 to 1 that indicates the strength and direction of a linear relationship.

  • Positive Correlation: As one variable increases, the other also increases (r > 0).

  • Negative Correlation: As one variable increases, the other decreases (r < 0).

  • No Correlation: No discernible linear relationship (r ≈ 0).

  • Scatterplots: Graphical representations used to visualize the relationship between two variables.

Example: A scatterplot of household income vs. years of education may show a positive correlation.

Regression Analysis

Regression is used to model the relationship between a dependent variable and one or more independent variables. The most common is simple linear regression.

  • Regression Line (Least Squares Line): The line that best fits the data, minimizing the sum of squared residuals.

  • Equation of the Regression Line:

  • Interpretation: The slope (b) represents the change in the dependent variable for a one-unit increase in the independent variable.

  • Prediction: The regression line can be used to predict values of the dependent variable given values of the independent variable.

Example: Predicting household income based on years of education.

Chapter 5: Probability

Basic Probability Concepts

Probability quantifies the likelihood of events occurring. It is foundational for inferential statistics.

  • Experimental Probability: Based on observed data from experiments.

  • Theoretical Probability: Based on reasoning or known possible outcomes.

  • Probability Formula:

  • Mutually Exclusive Events: Events that cannot occur at the same time.

  • Independent Events: The occurrence of one event does not affect the probability of the other.

  • Conditional Probability: The probability of one event given that another has occurred.

Example: Drawing a red card from a standard deck of cards.

Probability Rules

  • Addition Rule (for mutually exclusive events):

  • Multiplication Rule (for independent events):

  • General Addition Rule:

Chapter 6: Discrete and Continuous Probability Distributions

Discrete Probability Distributions

Discrete probability distributions describe the probabilities of outcomes of a discrete random variable (one that can take on a countable number of values).

  • Probability Distribution Table: Lists each possible value and its probability.

  • Mean (Expected Value):

  • Variance:

  • Binomial Distribution: Models the number of successes in a fixed number of independent trials.

Example: The probability of getting 3 heads in 5 coin tosses.

Continuous Probability Distributions

Continuous distributions describe probabilities for continuous random variables (those that can take any value in an interval).

  • Normal Distribution: A symmetric, bell-shaped distribution characterized by its mean and standard deviation.

  • Standard Normal Distribution: A normal distribution with mean 0 and standard deviation 1.

  • Using Z-scores: Standardize values to compare them to the standard normal distribution.

  • Using Tables: Standard normal tables (Z-tables) are used to find probabilities and percentiles.

Example: Finding the probability that a randomly selected value is less than a given number.

Tables and Data Interpretation

Probability Distribution Table Example

The following table shows a probability distribution for household income (in thousands of dollars):

Income ($1000s)

Probability

10

0.10

20

0.20

30

0.30

40

0.25

50

0.15

Main Purpose: To calculate expected value, variance, and to interpret the distribution of household incomes.

Contingency Table Example

The following table summarizes survey responses by gender and preference:

Male

Female

Total

Likes

52

28

80

Neutral

12

8

20

Dislikes

16

4

20

Total

80

40

120

Main Purpose: To calculate joint, marginal, and conditional probabilities.

Summary of Key Concepts

  • Be able to interpret scatterplots and calculate correlation coefficients.

  • Understand the difference between experimental and theoretical probability.

  • Calculate probabilities using rules for mutually exclusive and independent events.

  • Work with discrete and continuous probability distributions, including the normal distribution.

  • Use tables to find probabilities and expected values.

Additional info: These notes synthesize information from a study guide, review questions, and handwritten solutions, providing a comprehensive overview for exam preparation in a college-level statistics course.

Pearson Logo

Study Prep