Skip to main content
Back

Measures of Center: Describing, Exploring, and Comparing Data

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Describing, Exploring, and Comparing Data

Measures of Center

Measures of center are statistical values that describe the central point or typical value of a data set. Understanding these measures is fundamental for summarizing and interpreting data in statistics. The most common measures of center are the mean, median, mode, and midrange.

Mean (Arithmetic Mean)

  • Definition: The mean of a set of data is the sum of all data values divided by the number of data values.

  • Formula: Where represents individual data values and is the number of data values.

  • If the data are a sample, the mean is denoted by ("x-bar"). If the data are the entire population, the mean is denoted by ("mu").

  • Caution: The term "average" is not used by statisticians for the mean, as it can refer to other measures of center.

  • Properties:

    • Sample means from the same population tend to vary less than other measures of center.

    • The mean uses every data value in its calculation.

    • The mean is not resistant; a single outlier can substantially affect its value.

  • Example: For the wait times (minutes) for "Space Mountain" at 10 AM: 50, 25, 75, 35, 50, 25, 30, 50, 45, 25, 20 minutes

Median

  • Definition: The median is the middle value of a data set when the values are arranged in order of increasing (or decreasing) magnitude.

  • Properties:

    • The median is resistant to extreme values (outliers).

    • The median does not directly use every data value; changing the largest value does not affect the median.

  • Calculation:

    1. If the number of data values is odd, the median is the value in the exact middle of the sorted list.

    2. If the number of data values is even, the median is the mean of the two middle values in the sorted list.

  • Notation: The median may be denoted by ("x-tilde"), , or Med.

  • Example (Odd): For the wait times: 50, 25, 75, 35, 50, 25, 30, 50, 45, 25, 20 Sorted: 20, 25, 25, 25, 30, 35, 45, 50, 50, 50, 75 Median = 35.0 minutes

  • Example (Even): For the wait times: 50, 25, 75, 35, 50, 25, 30, 50, 45, 25, 20, 50 Sorted: 20, 25, 25, 25, 30, 35, 45, 50, 50, 50, 50, 75 Median = minutes

Mode

  • Definition: The mode of a data set is the value(s) that occur(s) with the greatest frequency.

  • Properties:

    • The mode can be found for qualitative (categorical) data.

    • A data set can have no mode, one mode, or multiple modes.

  • Types:

    • Bimodal: Two values occur with the same greatest frequency.

    • Multimodal: More than two values occur with the same greatest frequency.

    • No mode: No value is repeated.

  • Example: For "Tower of Terror" wait times: 35, 35, 20, 50, 95, 75, 45, 50, 30, 35, 30 Sorted: 20, 30, 30, 35, 35, 35, 45, 50, 50, 75, 95 Mode = 35 minutes (occurs three times)

  • Other Examples:

    • Two modes: 30, 30, 50, 50, 75 → Modes: 30 and 50

    • No mode: 20, 30, 35, 50, 75 → No value is repeated

Midrange

  • Definition: The midrange is the value midway between the maximum and minimum values in a data set.

  • Properties:

    • The midrange is not resistant; it is very sensitive to extreme values.

    • Rarely used in practice, but easy to compute.

    • Highlights that there are multiple ways to define the center of a data set.

    • Sometimes confused with the median; clear definitions help avoid confusion.

  • Example: For "Space Mountain" wait times: 50, 25, 75, 35, 50, 25, 30, 50, 45, 25, 20 minutes

Round-Off Rules for Measures of Center

  • For the mean, median, and midrange, carry one more decimal place than is present in the original data values.

  • For the mode, leave the value as is without rounding.

Critical Thinking in Measures of Center

While measures of center can always be calculated, it is important to consider whether their use is meaningful for the data at hand. The sampling method and the nature of the data should be evaluated before interpreting these statistics.

  • Examples where mean and median are not meaningful:

    • Zip codes: Numbers are labels, not measurements.

    • Ranks: Reflect order, not quantity.

    • Jersey numbers: Substitute for names, not measurements.

    • Top 5 CEO compensation: Not representative of the larger population.

    • Mean of state mean ages: Does not account for population sizes; use weighted mean instead.

Calculating the Mean from a Frequency Distribution

When data are summarized in a frequency distribution, the mean can be approximated by multiplying each class midpoint by its frequency, summing these products, and dividing by the total frequency.

  • Formula: Where is the frequency and is the class midpoint.

  • This method provides an approximation, as it assumes all values in a class are equal to the class midpoint.

Weighted Mean

When data values are assigned different weights, the weighted mean is used. This is common in calculations such as grade-point averages.

  • Formula: Where is the weight assigned to each value .

  • Example: Calculating GPA with course credits as weights and grade points as values.

Pearson Logo

Study Prep