Skip to main content
Back

Key Concepts in Correlation, Regression, Probability, and the Normal Distribution

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 4: Correlation and Regression

Linear Correlation Coefficient

The linear correlation coefficient, denoted as r, measures the strength and direction of the linear relationship between two quantitative variables.

  • Range: -1 ≤ r ≤ 1

  • Interpretation:

    • r = 1: Perfect positive linear correlation

    • r = -1: Perfect negative linear correlation

    • r = 0: No linear correlation

  • Formula:

Least Squares Regression Line

The least squares regression line is the straight line that best fits the data, minimizing the sum of the squared vertical distances between the observed values and the line.

  • Equation:

  • Slope (b1):

  • Intercept (b0):

Scatter Diagrams and Correlation

Scatter diagrams are graphical representations of the relationship between two quantitative variables. They help visualize the direction, form, and strength of the relationship.

Coefficient of Determination (R2)

The coefficient of determination, , represents the proportion of the variance in the dependent variable that is predictable from the independent variable.

  • Formula:

  • Interpretation: An of 0.85 means 85% of the variation in y is explained by x.

Contingency Tables and Association

Contingency tables are used to examine the relationship between two categorical variables. They display the frequency distribution of the variables and help identify associations.

Chapter 5: Probability Rules and Counting Techniques

Basic Probability Rules

Probability rules help calculate the likelihood of events. Conditional probability is the probability of an event given that another event has occurred.

  • Conditional Probability Formula:

  • Contingency tables can be used to compute conditional probabilities.

Counting Techniques

Counting techniques are used to determine the number of ways events can occur.

  • Permutations: Arrangements where order matters.

  • Combinations: Selections where order does not matter.

  • Non-distinct items: Use the multinomial coefficient for arrangements with repeated items.

Probability Rules

  • Addition Rule: For mutually exclusive events,

  • Multiplication Rule: For independent events,

Chapter 6: Discrete Random Variables and the Binomial Distribution

Discrete Random Variables

A discrete random variable takes on a countable number of possible values. The mean (expected value) of a discrete random variable is the long-run average value of repetitions of the experiment.

  • Expected Value Formula:

Binomial Probability Distribution

The binomial distribution models the number of successes in a fixed number of independent trials, each with the same probability of success.

  • Formula:

  • Applications: Insurance, quality control, genetics, etc.

Chapter 7: The Normal Distribution

Properties of the Normal Distribution

The normal distribution is a continuous, symmetric, bell-shaped distribution characterized by its mean (μ) and standard deviation (σ).

  • Probability Density Function:

  • Empirical Rule: Approximately 68% of data falls within 1σ, 95% within 2σ, and 99.7% within 3σ of the mean.

Standard Normal Curve and Z-Scores

The standard normal distribution has a mean of 0 and a standard deviation of 1. Z-scores measure how many standard deviations a value is from the mean.

  • Z-Score Formula:

  • Table V: Used to find probabilities and percentiles for standard normal values.

Using the Standard Normal Curve for Binomial Probabilities

For large n, the binomial distribution can be approximated by the normal distribution using a continuity correction.

  • Continuity Correction: Adjust discrete x by ±0.5 when using the normal approximation.

  • Conditions: np ≥ 5 and n(1-p) ≥ 5

Distribution

Type

Key Parameters

Example

Binomial

Discrete

n (number of trials), p (probability of success)

Number of defective items in a batch

Normal

Continuous

μ (mean), σ (standard deviation)

Heights of adult males

Additional info: These notes are based on a course outline and include expanded academic context for clarity and completeness.

Pearson Logo

Study Prep