Skip to main content
Back

Measures of Central Tendency in Descriptive Statistics

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Measures of Central Tendency

Introduction

Measures of central tendency are statistical values that represent a typical or central entry of a data set. They are fundamental in summarizing and describing data, providing insight into the general trend of the data. The most common measures are mean, median, and mode.

  • Mean: The arithmetic average of all data entries.

  • Median: The middle value when the data set is ordered.

  • Mode: The value that occurs most frequently in the data set.

Mean

The mean is calculated by summing all data entries and dividing by the number of entries. It is a reliable measure because it considers every value in the data set, but it can be affected by outliers.

  • Population Mean Formula:

  • Sample Mean Formula:

  • Example: For the weights 274, 235, 223, 268, 290, 285, 235, the mean is pounds.

Median

The median divides the ordered data set into two equal parts. It is less affected by outliers and skewed data.

  • If the number of entries is odd, the median is the middle value.

  • If the number of entries is even, the median is the mean of the two middle values.

  • Example: For the ordered weights 223, 235, 235, 268, 274, 285, 290, the median is 268 pounds.

Mode

The mode is the value that appears most frequently in the data set. It is useful for categorical data and can be unimodal, bimodal, or multimodal.

  • Unimodal: One value occurs most frequently.

  • Bimodal: Two values occur with the same highest frequency.

  • Multimodal: Three or more values occur with the same highest frequency.

  • No mode: If no value repeats or more than three values share the highest frequency.

  • Example: In the weights 223, 235, 235, 268, 274, 285, 290, the mode is 235 pounds.

Comparing Mean, Median, and Mode

Each measure describes a typical entry, but their suitability depends on the data's characteristics.

  • Mean: Reliable for symmetric data without outliers.

  • Median: Best for skewed data or data with outliers.

  • Mode: Useful for categorical data or when identifying the most common value.

  • Example: In a class with ages 20, 20, 21, 22, 23, 65, the mean is influenced by the outlier (65), so the median (20.5) best represents the data.

When to Use Mean vs. Median

Graphical representation, such as dot plots, helps determine which measure is most appropriate.

  • Use the Mean: When data is symmetric and has no outliers.

  • Use the Median: When data is skewed or contains outliers.

Weighted Mean

The weighted mean accounts for entries with varying importance or frequency. It is commonly used in calculating grade point averages (GPA).

  • Weighted Mean Formula: where w is the weight and x is the value.

  • Example: Mark's GPA is calculated using the weighted mean of his grades.

Mean of Grouped Data

When data is presented in a frequency distribution, the mean can be estimated using class midpoints and frequencies.

  • Formula: where x is the class midpoint and f is the frequency.

  • Example: For cell phone screen times grouped by frequency, the estimated mean is 287.7 minutes.

Shape of Distributions

The shape of a distribution affects the choice of central tendency measure.

  • Symmetric Distribution: The halves of the graph are mirror images.

  • Uniform Distribution: All classes have equal frequencies; also symmetric.

  • Normal Distribution: Symmetric and unimodal; mean, median, and mode are equal.

  • Skewed Left: Tail extends to the left.

  • Skewed Right: Tail extends to the right.

Using StatCrunch to Find Mean and Median

Statistical software like StatCrunch can be used to calculate mean and median efficiently. Data is entered, and summary statistics are generated through menu options.

StatCrunch screenshot showing calculation of mean and median

Summary Table: Comparison of Central Tendency Measures

Measure

Definition

Best Use

Limitation

Mean

Average of all values

Symmetric data, no outliers

Affected by outliers

Median

Middle value in ordered data

Skewed data, outliers present

Ignores extreme values

Mode

Most frequent value

Categorical data

May not exist or may be multiple

Conclusion

Understanding and selecting the appropriate measure of central tendency is crucial for accurately summarizing data. The mean, median, and mode each have unique strengths and limitations, and their use depends on the data's distribution and characteristics.

Pearson Logo

Study Prep