Skip to main content
Ch. 9 - Correlation and Regression
Larson - Elementary Statistics: Picturing the World 8th Edition
Larson8th EditionElementary Statistics: Picturing the WorldISBN: 9780137493470Not the one you use?Change textbook
Chapter 9, Problem 9.1.31

In Exercise 25, remove the data for the international soccer player with a maximum weight of 170 kilograms and a jump height of 64 centimeters. Describe how this affects the correlation coefficient r.

Verified step by step guidance
1
Recall that the correlation coefficient r measures the strength and direction of the linear relationship between two variables—in this case, weight and jump height.
Identify that the data point with a weight of 170 kilograms and a jump height of 64 centimeters is an outlier because the weight is unusually high compared to typical values.
Understand that outliers can have a strong influence on the correlation coefficient, often pulling the value of r toward themselves and potentially inflating or deflating the correlation.
By removing this outlier, recalculate the correlation coefficient r using the remaining data points to see how the linear relationship changes without the extreme value.
Compare the new correlation coefficient to the original one to describe whether the strength of the linear relationship has increased, decreased, or remained about the same after removing the outlier.

Verified video answer for a similar problem:

This video solution was recommended by our tutors as helpful for the problem above.
Video duration:
2m
Was this helpful?

Key Concepts

Here are the essential concepts you must grasp in order to answer the question correctly.

Correlation Coefficient (r)

The correlation coefficient measures the strength and direction of a linear relationship between two variables, ranging from -1 to 1. A value close to 1 or -1 indicates a strong linear relationship, while a value near 0 suggests little to no linear association.
Recommended video:
Guided course
05:43
Correlation Coefficient

Influence of Outliers on Correlation

Outliers are data points that differ significantly from others and can disproportionately affect the correlation coefficient. Removing an outlier can increase or decrease the value of r, depending on whether the outlier was strengthening or weakening the linear relationship.
Recommended video:
Guided course
05:43
Correlation Coefficient

Data Cleaning and Its Impact on Statistical Measures

Data cleaning involves removing or correcting inaccurate or irrelevant data points to improve analysis quality. Eliminating extreme values, like the player with max weight and low jump height, can lead to a more representative correlation coefficient that better reflects the typical relationship in the dataset.
Recommended video:
Guided course
04:39
Visualizing Qualitative vs. Quantitative Data