BackMeasures of Center in Statistics: Mean, Median, and Mode
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Measures of Center
Introduction
Measures of center are statistical values that describe the central tendency of a data set. They help summarize and represent a typical value within a collection of data. The three primary measures of center are the mean, median, and mode.
Resistant and Non-Resistant Statistics
Understanding the concept of resistance is crucial when selecting a measure of center. A statistic is resistant if it is not significantly affected by extreme values (outliers).
Mean is not resistant to outliers; a single extreme value can greatly affect its value.
Median is resistant to outliers; it remains relatively stable even when outliers are present.
Mean (Arithmetic Mean)
Definition
The mean (or arithmetic mean) is the sum of all data values divided by the number of values. It is the most commonly used measure of center for quantitative data.
Formulas and Notation
Sample Mean ():
Population Mean ():
Where represents individual data values, is the sample size, and is the population size.
Important Properties
Sample means drawn from the same population tend to vary less than other measures of center.
The mean uses every data value in its calculation.
It is sensitive to outliers; a single extreme value can substantially change the mean.
Caution: Use of the Term "Average"
The term average is often used for the mean but can also refer to other measures of center.
Statisticians avoid using "average" in professional contexts to prevent confusion.
Median
Definition
The median is the middle value of a data set when the values are arranged in order of increasing (or decreasing) magnitude. It divides the data into two equal halves.
Important Properties
The median is resistant to outliers; it does not change significantly when extreme values are added.
The median does not directly use every data value; only the order matters.
Calculation and Notation
The sample median is often denoted by , , or Med.
Steps to find the median:
Sort the data values in order.
If the number of data values () is odd, the median is the middle value.
If is even, the median is the mean of the two middle values.
Mode
Definition
The mode of a data set is the value(s) that occur(s) with the greatest frequency. It is the only measure of center that can be used with qualitative (categorical) data.
Important Properties
The mode can be found with qualitative or quantitative data.
A data set can have:
No mode (if no value repeats)
One mode (unimodal)
Two modes (bimodal)
More than two modes (multimodal)
Finding the Mode
Identify the value(s) that appear most frequently in the data set.
If two values tie for the highest frequency, the set is bimodal.
If more than two values tie, the set is multimodal.
If no value repeats, there is no mode.
Summary Table: Comparison of Measures of Center
Measure | Definition | Resistance to Outliers | Data Type |
|---|---|---|---|
Mean | Sum of values divided by number of values | Not resistant | Quantitative |
Median | Middle value when data is ordered | Resistant | Quantitative |
Mode | Value(s) with greatest frequency | Resistant | Quantitative or Qualitative |
Examples
Mean Example: For the data set {3, 5, 7}, the mean is .
Median Example: For the data set {3, 5, 7}, the median is 5. For {3, 5, 7, 9}, the median is .
Mode Example: For the data set {2, 4, 4, 6}, the mode is 4.
Additional info: These notes cover the foundational concepts of measures of center, which are essential for describing and analyzing data in statistics. Understanding the properties and appropriate use of each measure helps in selecting the best summary statistic for a given data set.