In statistics, what does the represent in a set of numbers?
Table of contents
- 1. Intro to Stats and Collecting Data1h 14m
- 2. Describing Data with Tables and Graphs1h 55m
- 3. Describing Data Numerically2h 5m
- 4. Probability2h 16m
- 5. Binomial Distribution & Discrete Random Variables3h 6m
- 6. Normal Distribution and Continuous Random Variables2h 11m
- 7. Sampling Distributions & Confidence Intervals: Mean3h 23m
- Sampling Distribution of the Sample Mean and Central Limit Theorem19m
- Distribution of Sample Mean - Excel23m
- Introduction to Confidence Intervals15m
- Confidence Intervals for Population Mean1h 18m
- Determining the Minimum Sample Size Required12m
- Finding Probabilities and T Critical Values - Excel28m
- Confidence Intervals for Population Means - Excel25m
- 8. Sampling Distributions & Confidence Intervals: Proportion1h 25m
- 9. Hypothesis Testing for One Sample3h 29m
- 10. Hypothesis Testing for Two Samples4h 50m
- Two Proportions1h 13m
- Two Proportions Hypothesis Test - Excel28m
- Two Means - Unknown, Unequal Variance1h 3m
- Two Means - Unknown Variances Hypothesis Test - Excel12m
- Two Means - Unknown, Equal Variance15m
- Two Means - Unknown, Equal Variances Hypothesis Test - Excel9m
- Two Means - Known Variance12m
- Two Means - Sigma Known Hypothesis Test - Excel21m
- Two Means - Matched Pairs (Dependent Samples)42m
- Matched Pairs Hypothesis Test - Excel12m
- 11. Correlation1h 24m
- 12. Regression1h 50m
- 13. Chi-Square Tests & Goodness of Fit2h 21m
- 14. ANOVA1h 57m
3. Describing Data Numerically
Median
Problem 3.1.4
Textbook Question
Resistant Measures Listed below are 10 wait times (minutes) for “Rock ‘n’ Roller Coaster” at 10 AM (from Data Set 33 “Disney World Wait Times”). The data are listed in order from lowest to highest. Find the mean and median of these ten values. Then find the mean and median after excluding the value of 180, which appears to be an outlier. Compare the two sets of results. How much was the mean affected by the inclusion of the outlier? How much is the median affected by the inclusion of the outlier?
15 20 25 30 30 35 45 50 50 180
Verified step by step guidance1
Step 1: Calculate the mean of the original dataset. To find the mean, sum all the values in the dataset and divide by the total number of values. Use the formula: , where represents each data point and is the number of data points.
Step 2: Calculate the median of the original dataset. Since the dataset is already ordered, locate the middle value(s). If the number of values is odd, the median is the middle value. If the number of values is even, the median is the average of the two middle values. Use the formula: , where and are the two middle values.
Step 3: Exclude the outlier value of 180 from the dataset. Recalculate the mean using the same formula as in Step 1, but now sum the remaining values and divide by the new total number of values ( becomes 9).
Step 4: Recalculate the median after excluding the outlier. With the new dataset, locate the middle value(s) as described in Step 2. Since the number of values is now odd, the median will be the middle value.
Step 5: Compare the mean and median from the original dataset to the mean and median of the modified dataset (after excluding the outlier). Determine how much the mean and median were affected by the inclusion of the outlier by calculating the difference between the original and modified values for both measures.
Verified video answer for a similar problem:This video solution was recommended by our tutors as helpful for the problem above
Video duration:
5mPlay a video:
Was this helpful?
Key Concepts
Here are the essential concepts you must grasp in order to answer the question correctly.
Mean
The mean, or average, is calculated by summing all values in a data set and dividing by the number of values. It provides a central value that represents the data set but can be heavily influenced by outliers, which are extreme values that differ significantly from other observations. In this question, calculating the mean both with and without the outlier (180) will illustrate how the outlier skews the average.
Recommended video:
Guided course
Calculating the Mean
Median
The median is the middle value of a data set when arranged in ascending order. If there is an even number of observations, the median is the average of the two middle numbers. Unlike the mean, the median is resistant to outliers, making it a better measure of central tendency when extreme values are present. This question requires comparing the median with and without the outlier to assess its impact.
Recommended video:
Guided course
Calculating the Median
Outliers
Outliers are data points that lie significantly outside the range of the rest of the data. They can arise from variability in the data or may indicate measurement errors. In this scenario, the value of 180 is considered an outlier among the wait times, and its presence can distort the mean while having a minimal effect on the median. Understanding how to identify and handle outliers is crucial in statistical analysis.
Recommended video:
Guided course
Comparing Mean vs. Median
Watch next
Master Calculating the Median with a bite sized video explanation from Patrick
Start learningRelated Videos
Related Practice
Multiple Choice
11
views
