BackDescriptive Statistics: Measures of Central Tendency
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Descriptive Statistics
Measures of Central Tendency
Measures of central tendency are statistical values that represent a typical or central entry of a data set. They are fundamental for summarizing and understanding data distributions. The most common measures are mean, median, and mode.
Mean: The arithmetic average, calculated as the sum of all data entries divided by the number of entries.
Median: The middle value when the data set is ordered. If the number of entries is odd, it is the middle entry; if even, it is the mean of the two middle entries.
Mode: The value that occurs with the greatest frequency. A data set may have no mode, one mode, or multiple modes (bimodal, multimodal).
Mean
The mean is a reliable measure because it considers every entry in the data set, but it is sensitive to outliers.
Population Mean Formula:
Sample Mean Formula:
Example: For the sample weights 274, 235, 223, 268, 290, 285, 235, the mean is pounds.
Median
The median divides the ordered data set into two equal parts. It is less affected by outliers than the mean.
Odd number of entries: Median is the middle entry.
Even number of entries: Median is the mean of the two middle entries.
Example: For ordered weights 223, 235, 235, 268, 274, 285, 290, the median is 268 pounds (middle entry).
Example (even entries): For 223, 235, 235, 268, 274, 290, the median is pounds.
Mode
The mode is the value that appears most frequently in the data set. It is useful for categorical data.
No mode: If no entry repeats.
Bimodal: If two entries share the highest frequency.
Example: In 223, 235, 235, 268, 274, 285, 290, the mode is 235 (appears twice).
Example (categorical): If "Democrat" is the most frequent response in a survey, the mode is "Democrat".
Comparing Mean, Median, and Mode
Each measure describes a typical entry, but their suitability depends on the data distribution and presence of outliers.
Mean: Reliable for symmetric distributions, but affected by outliers.
Median: Robust against outliers; best for skewed distributions.
Mode: Useful for categorical or discrete data.
Example: In a class age data set with an outlier (e.g., age 65), the mean is higher than most entries, the mode is lower, and the median best represents a typical entry.
Weighted Mean
The weighted mean is used when data entries have different weights or importance. It is calculated as:
Formula: , where is the weight of each entry .
Example: Grade point average calculation, where grades are weighted by credit hours.
Mean of Grouped Data
When data is grouped into classes, the mean can be estimated using class midpoints and frequencies.
Formula: , where is the class midpoint and is the frequency.
Example: Estimating mean screen time from a frequency distribution yields an approximate value close to the actual mean.
The Shape of Distributions
The shape of a distribution affects the interpretation of central tendency measures.
Symmetric Distribution: The graph is mirrored around the center; mean, median, and mode are equal.
Uniform Distribution: All values have equal frequency; symmetric.
Skewed Left (Negatively Skewed): Tail extends left; mean < median < mode.
Skewed Right (Positively Skewed): Tail extends right; mean > median > mode.

Additional info: The textbook cover is included as it directly identifies the source and context of the study notes, reinforcing the academic content.