BackStatistics Homework: Correlation, Regression, and Normal Distribution Guidance
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Q1. Determine the correlation, R-value and write it below
Background
Topic: Correlation Coefficient (Pearson's r)
This question is testing your ability to calculate the correlation coefficient, which measures the strength and direction of the linear relationship between two quantitative variables (here: study hours and test grades).
Key Terms and Formulas
Correlation coefficient (r): A value between -1 and 1 that indicates the strength and direction of a linear relationship.
Formula for r:
= individual study hours
= individual test grades
= mean of study hours
= mean of test grades
Step-by-Step Guidance
Calculate the mean of the study hours () and the mean of the test grades ().
For each data pair, subtract the mean from each value to get and .
Multiply these differences for each pair and sum them to get .
Calculate and separately.
Plug these sums into the formula for above, but stop before calculating the final value.
Try solving on your own before revealing the answer!
Q2. Based on the R-value is the correlation strong? Is it positive or negative?
Background
Topic: Interpreting Correlation Coefficient
This question asks you to interpret the value of r you found in Q1. The sign and magnitude of r tell you about the strength and direction of the relationship.
Key Terms
Positive correlation: As one variable increases, so does the other.
Negative correlation: As one variable increases, the other decreases.
Strength: |r| close to 1 is strong, close to 0 is weak.
Step-by-Step Guidance
Look at the sign of your r value from Q1 to determine if the correlation is positive or negative.
Compare the absolute value of r to common thresholds (e.g., 0.7–1.0 is strong, 0.3–0.7 is moderate, below 0.3 is weak).
Write a sentence describing the strength and direction of the correlation based on your findings.
Try solving on your own before revealing the answer!
Q3. Create a line of best fit for the data and write it below
Background
Topic: Linear Regression (Least Squares Line)
This question is about finding the equation of the regression line (line of best fit) that models the relationship between study hours and test grades.
Key Terms and Formulas
Regression line:
Slope (m):
Intercept (b):
Step-by-Step Guidance
Use the means and you calculated earlier.
Calculate the slope using the formula above.
Calculate the y-intercept using the formula above.
Write the equation in the form , but do not substitute the final values yet.
Try solving on your own before revealing the answer!
Q4. Describe the slope and y-intercept in the context of the situation
Background
Topic: Interpreting Regression Coefficients
This question asks you to explain what the slope and y-intercept mean in the context of study hours and test grades.
Key Terms
Slope (m): The expected change in test grade for each additional hour studied.
Y-intercept (b): The predicted test grade for a student who studied 0 hours.
Step-by-Step Guidance
Interpret the slope: For every 1 hour increase in study time, the test grade is expected to increase by m points.
Interpret the y-intercept: If a student studies 0 hours, their predicted test grade is b.
Write these interpretations in the context of the problem, using the variables given.
Try solving on your own before revealing the answer!
Q5. Use your equation to predict the following:
a) At 3.1 hours
b) At 4.4 hours
c) At 6 hours
d) At 5.75 hours
Background
Topic: Making Predictions with Regression
This question asks you to use your regression equation to predict test grades for given study hours.
Key Formula
Step-by-Step Guidance
For each value of x (study hours), substitute it into your regression equation.
Multiply the slope (m) by the given x value.
Add the y-intercept (b) to the result from step 2.
Set up the calculation for each value, but do not compute the final result.
Try solving on your own before revealing the answer!
Q6. Use rguroo to find the residuals and write down how many are positive and how many are negative. Use this information to determine if the line is a good fit for the data.
Background
Topic: Residuals and Model Fit
This question is about analyzing the residuals (differences between observed and predicted values) to assess the fit of your regression model.
Key Terms
Residual:
Positive residual: Actual value is above the predicted value.
Negative residual: Actual value is below the predicted value.
Step-by-Step Guidance
For each data point, use your regression equation to calculate the predicted value.
Subtract the predicted value from the actual value to get the residual for each point.
Count how many residuals are positive and how many are negative.
Consider whether the residuals are randomly distributed (good fit) or show a pattern (potentially poor fit).
Try solving on your own before revealing the answer!
Q7. Calculate the coefficient of determination, and interpret it
Background
Topic: Coefficient of Determination ()
This question asks you to calculate , which tells you the proportion of variance in the dependent variable explained by the independent variable.
Key Formula
Step-by-Step Guidance
Take the correlation coefficient (r) you calculated earlier.
Square the value of r to get .
Interpret as the percentage of variation in test grades explained by study hours.
Try solving on your own before revealing the answer!
Q8. The data for the time to take the test was gathered and normalized so that the information formed a curve that is N(90, 12). Using this information, determine the percentage of students who would fall into the following criteria: x < 80
Background
Topic: Normal Distribution and Z-scores
This question is about finding the probability that a value from a normal distribution falls below a certain value.
Key Terms and Formulas
Normal distribution: where is the mean and is the standard deviation.
Z-score:
Step-by-Step Guidance
Identify the mean () and standard deviation ().
Calculate the z-score for using the formula above.
Use a standard normal table or calculator to find the probability corresponding to this z-score.
Try solving on your own before revealing the answer!
Q9. x < 107
Background
Topic: Normal Distribution and Z-scores
This question is similar to Q8, but with a different value for x.
Key Terms and Formulas
Use the same formulas as in Q8.
Step-by-Step Guidance
Calculate the z-score for .
Look up the corresponding probability in the standard normal table.
Try solving on your own before revealing the answer!
Q10. x > 122
Background
Topic: Normal Distribution (Upper Tail Probability)
This question asks for the probability that a value is greater than a certain value.
Key Terms and Formulas
Calculate the z-score for .
Find the probability for , which is .
Step-by-Step Guidance
Calculate the z-score for .
Find using the standard normal table.
Subtract this value from 1 to get .
Try solving on your own before revealing the answer!
Q11. x > 53
Background
Topic: Normal Distribution (Lower Bound Probability)
This question asks for the probability that a value is greater than a low value.
Key Terms and Formulas
Calculate the z-score for .
Find .
Step-by-Step Guidance
Calculate the z-score for .
Find using the standard normal table.
Subtract this value from 1 to get .
Try solving on your own before revealing the answer!
Q12. 60 < x < 105
Background
Topic: Normal Distribution (Probability Between Two Values)
This question asks for the probability that a value falls between two values.
Key Terms and Formulas
Calculate the z-scores for and .
Find .
Step-by-Step Guidance
Calculate the z-score for .
Calculate the z-score for .
Find and using the standard normal table.
Subtract from to get the probability between the two values.
Try solving on your own before revealing the answer!
Q13. 83 < x < 123
Background
Topic: Normal Distribution (Probability Between Two Values)
This question is similar to Q12, but with different bounds.
Key Terms and Formulas
Calculate the z-scores for and .
Find .
Step-by-Step Guidance
Calculate the z-score for .
Calculate the z-score for .
Find and using the standard normal table.
Subtract from to get the probability between the two values.