Skip to main content
Back

Descriptive Measures: Measures of Center and Variation

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 3: Descriptive Measures

Measures of Center

Measures of center are statistical values that describe the central tendency of a dataset. The most common measures are the mode, median, and mean. Each provides a different perspective on what is considered 'typical' for a set of data.

Mode

  • Definition: The mode of a data set is its most frequently occurring value.

  • If more than one value occurs with the greatest frequency, the data set is multimodal.

  • If no value repeats, the data set has no mode.

  • Example: In Data Set II: 400, 400, 300, 940, 300, 300, 400, 300, 400, 450, 800, 450, 1050, the modes are 300 and 400.

Median

  • Definition: The median is the middle value in an ordered list of data.

  • If the number of observations is odd, the median is the observation in the middle.

  • If the number of observations is even, the median is the average of the two middle observations.

  • Example: For Data Set I: 300, 300, 300, 400, 400, 450, 800, 940, 1050, the median is 400.

Mean

  • Definition: The mean (arithmetic average) is the sum of all data values divided by the number of observations.

  • Formula:

  • Example: For Data Set I: Mean = 484; for Data Set II: Mean = 474.

Comparing Mean and Median in Skewed Distributions

  • Right-skewed distribution: Very large data points result in a relatively large mean compared to the median.

  • Left-skewed distribution: Very small data points result in a relatively small mean compared to the median.

  • Symmetric distribution: Mean and median are approximately equal.

  • Sensitivity: The mean is sensitive to extreme observations (outliers), while the median is resistant to them.

Summary Table: Properties of Measures of Center

Measure

Definition

Sensitivity to Outliers

Mean

Arithmetic average

Sensitive

Median

Middle value

Resistant

Mode

Most frequent value

Resistant

Measures of Variation

Measures of variation describe the spread or dispersion of data values in a dataset. Common measures include the range and standard deviation.

Range

  • Definition: The range is the difference between the largest and smallest values in a dataset.

  • Formula:

  • Example: For Team I: Heights = 72, 78; Range = 6 inches. For Team II: Heights = 67, 84; Range = 17 inches.

Standard Deviation

  • Definition: The standard deviation measures the average distance of each data point from the mean. It quantifies the amount of variation or dispersion in a dataset.

  • Method #1 (Direct Calculation):

  • Method #2 (Shortcut Formula):

  • Example: For Team I: Heights = 72, 73, 76, 76, 78; Mean = 75. Calculate deviations: -3, -2, 1, 1, 3. Variance: Standard deviation: inches. For Team II: Heights = 67, 72, 76, 76, 84; Mean = 75. Standard deviation: inches.

Interpreting Standard Deviation

  • The larger the standard deviation, the greater the variation in the data set.

  • Standard deviation is not resistant to extreme observations (outliers).

Summary Table: Comparison of Variation Measures

Measure

Definition

Resistant to Outliers

Range

Max - Min

No

Standard Deviation

Average distance from mean

No

Applications and Examples

  • Sports Teams: Comparing the heights of players on two teams illustrates how measures of center and variation can describe differences in typical height and diversity of heights.

  • Data Analysis: Understanding the sensitivity of mean and standard deviation to outliers is crucial when interpreting real-world data.

Additional info: The notes infer that the median and mode are more robust to outliers than the mean, and that standard deviation is a preferred measure of spread for symmetric distributions but less so for skewed data.

Pearson Logo

Study Prep