Skip to main content
Back

Descriptive Statistics: Measures of Central Tendency

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Descriptive Statistics

Measures of Central Tendency

Measures of central tendency are statistical values that represent a typical or central entry of a data set. They are fundamental for summarizing and understanding data distributions. The most common measures are mean, median, and mode.

  • Mean: The arithmetic average, calculated as the sum of all data entries divided by the number of entries.

  • Median: The middle value when the data set is ordered. If the number of entries is odd, it is the middle entry; if even, it is the mean of the two middle entries.

  • Mode: The value that occurs with the greatest frequency. A data set may have no mode, one mode, or multiple modes (bimodal, multimodal).

Mean

The mean is a reliable measure because it considers every entry in the data set, but it is sensitive to outliers.

  • Population Mean Formula:

  • Sample Mean Formula:

  • Example: For the sample weights 274, 235, 223, 268, 290, 285, 235, the mean is pounds.

Median

The median divides the ordered data set into two equal parts. It is less affected by outliers than the mean.

  • Odd number of entries: Median is the middle entry.

  • Even number of entries: Median is the mean of the two middle entries.

  • Example: For ordered weights 223, 235, 235, 268, 274, 285, 290, the median is 268 pounds (middle entry).

  • Example (even entries): For 223, 235, 235, 268, 274, 290, the median is pounds.

Mode

The mode is the value that appears most frequently in the data set. It is useful for categorical data.

  • No mode: If no entry repeats.

  • Bimodal: If two entries share the highest frequency.

  • Example: In 223, 235, 235, 268, 274, 285, 290, the mode is 235 (appears twice).

  • Example (categorical): If "Democrat" is the most frequent response in a survey, the mode is "Democrat".

Comparing Mean, Median, and Mode

Each measure describes a typical entry, but their suitability depends on the data distribution and presence of outliers.

  • Mean: Reliable for symmetric distributions, but affected by outliers.

  • Median: Robust against outliers; best for skewed distributions.

  • Mode: Useful for categorical or discrete data.

  • Example: In a class age data set with an outlier (e.g., age 65), the mean is higher than most entries, the mode is lower, and the median best represents a typical entry.

Weighted Mean

The weighted mean is used when data entries have different weights or importance. It is calculated as:

  • Formula: , where is the weight of each entry .

  • Example: Grade point average calculation, where grades are weighted by credit hours.

Mean of Grouped Data

When data is grouped into classes, the mean can be estimated using class midpoints and frequencies.

  • Formula: , where is the class midpoint and is the frequency.

  • Example: Estimating mean screen time from a frequency distribution yields an approximate value close to the actual mean.

The Shape of Distributions

The shape of a distribution affects the interpretation of central tendency measures.

  • Symmetric Distribution: The graph is mirrored around the center; mean, median, and mode are equal.

  • Uniform Distribution: All values have equal frequency; symmetric.

  • Skewed Left (Negatively Skewed): Tail extends left; mean < median < mode.

  • Skewed Right (Positively Skewed): Tail extends right; mean > median > mode.

Elementary Statistics textbook cover

Additional info: The textbook cover is included as it directly identifies the source and context of the study notes, reinforcing the academic content.

Pearson Logo

Study Prep