Skip to main content
Back

Statistics Homework: Correlation, Regression, and Normal Distribution Guidance

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Q1. Determine the correlation, R-value and write it below

Background

Topic: Correlation Coefficient (Pearson's r)

This question is testing your ability to calculate the correlation coefficient, which measures the strength and direction of the linear relationship between two quantitative variables (here: study hours and test grades).

Key Terms and Formulas

  • Correlation coefficient (r): A value between -1 and 1 that indicates the strength and direction of a linear relationship.

  • Formula for r:

  • = individual study hours

  • = individual test grades

  • = mean of study hours

  • = mean of test grades

Step-by-Step Guidance

  1. Calculate the mean of the study hours () and the mean of the test grades ().

  2. For each data pair, subtract the mean from each value to get and .

  3. Multiply these differences for each pair and sum them to get .

  4. Calculate and separately.

  5. Plug these sums into the formula for above, but stop before calculating the final value.

Try solving on your own before revealing the answer!

Q2. Based on the R-value is the correlation strong? Is it positive or negative?

Background

Topic: Interpreting Correlation Coefficient

This question asks you to interpret the value of r you found in Q1. The sign and magnitude of r tell you about the strength and direction of the relationship.

Key Terms

  • Positive correlation: As one variable increases, so does the other.

  • Negative correlation: As one variable increases, the other decreases.

  • Strength: |r| close to 1 is strong, close to 0 is weak.

Step-by-Step Guidance

  1. Look at the sign of your r value from Q1 to determine if the correlation is positive or negative.

  2. Compare the absolute value of r to common thresholds (e.g., 0.7–1.0 is strong, 0.3–0.7 is moderate, below 0.3 is weak).

  3. Write a sentence describing the strength and direction of the correlation based on your findings.

Try solving on your own before revealing the answer!

Q3. Create a line of best fit for the data and write it below

Background

Topic: Linear Regression (Least Squares Line)

This question is about finding the equation of the regression line (line of best fit) that models the relationship between study hours and test grades.

Key Terms and Formulas

  • Regression line:

  • Slope (m):

  • Intercept (b):

Step-by-Step Guidance

  1. Use the means and you calculated earlier.

  2. Calculate the slope using the formula above.

  3. Calculate the y-intercept using the formula above.

  4. Write the equation in the form , but do not substitute the final values yet.

Try solving on your own before revealing the answer!

Q4. Describe the slope and y-intercept in the context of the situation

Background

Topic: Interpreting Regression Coefficients

This question asks you to explain what the slope and y-intercept mean in the context of study hours and test grades.

Key Terms

  • Slope (m): The expected change in test grade for each additional hour studied.

  • Y-intercept (b): The predicted test grade for a student who studied 0 hours.

Step-by-Step Guidance

  1. Interpret the slope: For every 1 hour increase in study time, the test grade is expected to increase by m points.

  2. Interpret the y-intercept: If a student studies 0 hours, their predicted test grade is b.

  3. Write these interpretations in the context of the problem, using the variables given.

Try solving on your own before revealing the answer!

Q5. Use your equation to predict the following:

  • a) At 3.1 hours

  • b) At 4.4 hours

  • c) At 6 hours

  • d) At 5.75 hours

Background

Topic: Making Predictions with Regression

This question asks you to use your regression equation to predict test grades for given study hours.

Key Formula

Step-by-Step Guidance

  1. For each value of x (study hours), substitute it into your regression equation.

  2. Multiply the slope (m) by the given x value.

  3. Add the y-intercept (b) to the result from step 2.

  4. Set up the calculation for each value, but do not compute the final result.

Try solving on your own before revealing the answer!

Q6. Use rguroo to find the residuals and write down how many are positive and how many are negative. Use this information to determine if the line is a good fit for the data.

Background

Topic: Residuals and Model Fit

This question is about analyzing the residuals (differences between observed and predicted values) to assess the fit of your regression model.

Key Terms

  • Residual:

  • Positive residual: Actual value is above the predicted value.

  • Negative residual: Actual value is below the predicted value.

Step-by-Step Guidance

  1. For each data point, use your regression equation to calculate the predicted value.

  2. Subtract the predicted value from the actual value to get the residual for each point.

  3. Count how many residuals are positive and how many are negative.

  4. Consider whether the residuals are randomly distributed (good fit) or show a pattern (potentially poor fit).

Try solving on your own before revealing the answer!

Q7. Calculate the coefficient of determination, and interpret it

Background

Topic: Coefficient of Determination ()

This question asks you to calculate , which tells you the proportion of variance in the dependent variable explained by the independent variable.

Key Formula

Step-by-Step Guidance

  1. Take the correlation coefficient (r) you calculated earlier.

  2. Square the value of r to get .

  3. Interpret as the percentage of variation in test grades explained by study hours.

Try solving on your own before revealing the answer!

Q8. The data for the time to take the test was gathered and normalized so that the information formed a curve that is N(90, 12). Using this information, determine the percentage of students who would fall into the following criteria: x < 80

Background

Topic: Normal Distribution and Z-scores

This question is about finding the probability that a value from a normal distribution falls below a certain value.

Key Terms and Formulas

  • Normal distribution: where is the mean and is the standard deviation.

  • Z-score:

Step-by-Step Guidance

  1. Identify the mean () and standard deviation ().

  2. Calculate the z-score for using the formula above.

  3. Use a standard normal table or calculator to find the probability corresponding to this z-score.

Try solving on your own before revealing the answer!

Q9. x < 107

Background

Topic: Normal Distribution and Z-scores

This question is similar to Q8, but with a different value for x.

Key Terms and Formulas

  • Use the same formulas as in Q8.

Step-by-Step Guidance

  1. Calculate the z-score for .

  2. Look up the corresponding probability in the standard normal table.

Try solving on your own before revealing the answer!

Q10. x > 122

Background

Topic: Normal Distribution (Upper Tail Probability)

This question asks for the probability that a value is greater than a certain value.

Key Terms and Formulas

  • Calculate the z-score for .

  • Find the probability for , which is .

Step-by-Step Guidance

  1. Calculate the z-score for .

  2. Find using the standard normal table.

  3. Subtract this value from 1 to get .

Try solving on your own before revealing the answer!

Q11. x > 53

Background

Topic: Normal Distribution (Lower Bound Probability)

This question asks for the probability that a value is greater than a low value.

Key Terms and Formulas

  • Calculate the z-score for .

  • Find .

Step-by-Step Guidance

  1. Calculate the z-score for .

  2. Find using the standard normal table.

  3. Subtract this value from 1 to get .

Try solving on your own before revealing the answer!

Q12. 60 < x < 105

Background

Topic: Normal Distribution (Probability Between Two Values)

This question asks for the probability that a value falls between two values.

Key Terms and Formulas

  • Calculate the z-scores for and .

  • Find .

Step-by-Step Guidance

  1. Calculate the z-score for .

  2. Calculate the z-score for .

  3. Find and using the standard normal table.

  4. Subtract from to get the probability between the two values.

Try solving on your own before revealing the answer!

Q13. 83 < x < 123

Background

Topic: Normal Distribution (Probability Between Two Values)

This question is similar to Q12, but with different bounds.

Key Terms and Formulas

  • Calculate the z-scores for and .

  • Find .

Step-by-Step Guidance

  1. Calculate the z-score for .

  2. Calculate the z-score for .

  3. Find and using the standard normal table.

  4. Subtract from to get the probability between the two values.

Try solving on your own before revealing the answer!

Pearson Logo

Study Prep