Skip to main content
Ch. 10 - Correlation and Regression
Triola - Elementary Statistics 14th Edition
Triola14th EditionElementary StatisticsISBN: 9780137366446Not the one you use?Change textbook
Chapter 10, Problem 10.4.7

Interpreting a Computer Display
In Exercises 5–8, we want to consider the correlation between heights of fathers and mothers and the heights of their sons. Refer to the StatCrunch display and answer the given questions or identify the indicated items. The display is based on Data Set 10 “Family Heights” in Appendix B. (The response y variable represents heights of sons.)
[IMAGE]


Height of Son Should the multiple regression equation be used for predicting the height of a son based on the height of his father and mother? Why or why not?

Verified step by step guidance
1
Step 1: Understand the context of the problem. The goal is to determine whether the multiple regression equation should be used to predict the height of a son based on the heights of his father and mother. This involves assessing the appropriateness of the regression model.
Step 2: Review the assumptions of multiple regression. For a multiple regression model to be valid, certain conditions must be met, such as linearity, independence of errors, homoscedasticity (constant variance of errors), and normality of residuals. Check if these assumptions are satisfied based on the provided data or display.
Step 3: Examine the statistical significance of the regression model. Look at the p-values associated with the regression coefficients for the father's and mother's heights. If the p-values are small (typically less than 0.05), it suggests that these variables are significant predictors of the son's height.
Step 4: Evaluate the goodness-of-fit of the model. Check the R-squared value from the display. A higher R-squared value indicates that a larger proportion of the variability in the son's height is explained by the heights of the father and mother. This helps assess the model's predictive power.
Step 5: Consider practical significance and potential limitations. Even if the model is statistically significant, consider whether the relationship is strong enough to be practically useful. Additionally, assess whether there are any outliers or influential points that might affect the reliability of the predictions.

Verified video answer for a similar problem:

This video solution was recommended by our tutors as helpful for the problem above.
Video duration:
3m
Was this helpful?

Key Concepts

Here are the essential concepts you must grasp in order to answer the question correctly.

Correlation

Correlation measures the strength and direction of a linear relationship between two variables. In this context, it helps to understand how the heights of fathers and mothers relate to the heights of their sons. A positive correlation indicates that as one variable increases, the other tends to increase as well, while a negative correlation suggests the opposite. Understanding correlation is essential for determining whether a relationship exists before applying regression analysis.
Recommended video:
Guided course
05:43
Correlation Coefficient

Multiple Regression

Multiple regression is a statistical technique used to model the relationship between one dependent variable and two or more independent variables. In this case, the height of the son is the dependent variable, while the heights of the father and mother are the independent variables. This method allows for the assessment of how well the combination of parental heights can predict the son's height, taking into account the influence of both parents simultaneously.
Recommended video:
05:54
Probability of Multiple Independent Events

Predictive Validity

Predictive validity refers to the extent to which a model accurately predicts outcomes based on input variables. In the context of predicting a son's height from parental heights, it is crucial to evaluate whether the multiple regression model provides reliable predictions. If the model shows strong predictive validity, it indicates that the heights of the parents are significant predictors of the son's height, justifying the use of the regression equation for predictions.
Recommended video:
Guided course
05:18
How to Create Frequency Distributions Example 2
Related Practice
Textbook Question

Finding the Best Model

In Exercises 5–16, construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and consider only linear, quadratic, logarithmic, exponential, and power models.

Dirt Cheap The Cherry Hill Construction company in Branford, CT sells screened topsoil by the “yard,” which is actually a cubic yard. Let the variable x be the length (yd) of each side of a cube of screened topsoil. The table below lists the values of x along with the corresponding cost (dollars).

132
views
Textbook Question

Exercises 1–10 are based on the following sample data consisting of costs of dinner (dollars) and the amounts of tips (dollars) left by diners. The data were collected by students of the author.

[IMAGE]

Change in Scale Exercise 1 stated that for the given paired data, r = 0.846. How does that value change if all of the amounts of dinners are left unchanged but all of the tips are expressed in cents instead of dollars?

255
views
Textbook Question

Testing for a Linear Correlation

In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of α = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Taxis Using the data from Exercise 15, is there sufficient evidence to support the claim that there is a linear correlation between the distance of the ride and the tip amount? Does it appear that riders base their tips on the distance of the ride?

190
views
Textbook Question

Notation The author conducted an experiment in which the height of each student was measured in centimeters and those heights were matched with the same students’ scores on the first statistics test. If we find that r = 0, does that indicate that there is no association between those two variables?

269
views
Textbook Question

Testing for a Linear Correlation

In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of α = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Powerball Jackpots and Tickets Sold Listed below are the same data from Table 10-1 in the Chapter Problem, but an additional pair of values has been added from actual Powerball results. Is there sufficient evidence to conclude that there is a linear correlation between lottery jackpots and numbers of tickets sold? Comment on the effect of the added pair of values in the last column. Compare the results to those obtained in Example 4.

153
views
Textbook Question

Variation and Prediction Intervals

In Exercises 17–20, find the (a) explained variation, (b) unexplained variation, and (c) indicated prediction interval. In each case, there is sufficient evidence to support a claim of a linear correlation, so it is reasonable to use the regression equation when making predictions.

Weighing Seals with a Camera The table below lists overhead widths (cm) of seals measured from photographs and the weights (kg) of the seals (based on “Mass Estimation of Weddell Seals Using Techniques of Photogrammetry,” by R. Garrott of Montana State University). For the prediction interval, use a 99% confidence level with an overhead width of 9.0 cm.

108
views