Skip to main content
Back

Measures of Central Tendency and Distribution Shapes in Statistics

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Measures of Central Tendency

Definitions

Measures of central tendency are statistical values that represent the center or typical value of a dataset. They are essential for summarizing and understanding data distributions.

  • Sample: Any measurement taken from a sample (denoted by Roman letters).

  • Population: Any measurement taken from a population (denoted by Greek letters).

  • Frequency Distribution Table: Organizes raw data into table form; the frequency indicates the number of occurrences of any given data point.

Example Frequency Table:

x (Data Pt)

f (frequency)

0

1

1

2

2

3

(sample size)

Notation

Measure

Statistic (Sample)

Parameter (Population)

Mean

Median

MD

Mode

MO

Midrange

MR

  • : Sum of all data values x

  • n: sample size

  • f: frequency

Calculating Measures of Central Tendency

Mean

The mean is the arithmetic average of a dataset and is calculated as:

  • Mean cannot be calculated for nominal data (qualitative data).

Median

The median is the middle value in a ranked dataset. If the dataset has an odd number of values, the median is the middle value. If even, it is the average of the two middle values.

  • Rank data in order (low to high).

  • Median position:

Mode

The mode is the value that occurs most frequently in the dataset. A dataset may have one mode (unimodal), more than one mode (multimodal), or no mode.

  • Unimodal: One value occurs most often.

  • Bimodal: Two values occur most often.

  • No mode: All values occur with equal frequency.

Midrange

The midrange is the average of the lowest and highest values in a dataset:

Rounding Rule

  • When calculating measures of central tendency, express the answer with one additional decimal place than the raw data.

Electronic Tools

  • For tabulated data, use frequency tables and calculators/statistical software to compute mean, median, mode, and midrange.

  • Calculator steps: Enter data and frequencies, then use statistical functions to compute measures.

Raw Data and Outliers

Raw Data Example

Sample data: 3.7, 4.8, 4.8, 4.0, 4.7, 5.1, 4.3

  • Mean:

  • Median:

  • Mode:

  • Midrange:

Raw Data Example with Outlier

Sample data: 3.7, 4.8, 4.8, 4.0, 4.7, 8.1, 4.3

  • Mean:

  • Median:

Outliers can cause the mean and median to be significantly different. The mean is more sensitive to outliers than the median or mode.

  • If the outlier is high, the mean will be higher than the median.

  • If the outlier is low, the mean will be lower than the median.

Trimmed Mean: To mitigate the effects of outliers, a trimmed mean can be used, where a certain percentage of the lowest and highest data values are removed before calculating the mean.

Frequency Data Examples

Frequency Table Example

Number of books read by 19 students:

x (books)

f (students)

f*x

c.f. (cumulative freq.)

0

2

0

2

1

3

3

5

2

5

10

10

3

4

12

14

4

5

20

19

  • Mean:

  • Median: Median position is th value; median is 2.

  • Mode: 4 (most frequent value)

  • Midrange:

Frequency Data Example Using Classes

Endurance times for 80 students:

Class Bounds (hrs)

f (frequency)

x (midpt of class)

f*x

c.f.

52.5 - 63.5

6

58

348

6

63.5 - 74.5

12

69

828

18

74.5 - 85.5

25

80

2000

43

85.5 - 96.5

18

91

1638

61

96.5 - 107.5

14

102

1428

75

107.5 - 118.5

5

113

565

80

  • Mean:

  • Median: Median position is th value; median is 80.

  • Mode: 80 (most frequent class midpoint)

  • Midrange:

Frequency Table for Median and Mode

x (Data Value)

f (Frequency)

c.f. (cumulative)

8

112

112

17

102

214

27

197

411

37

520

931

43

186

1217

  • Median: ; median position is th value; median is 37.

  • Mode: 37 (largest frequency, 520 occurrences)

Weighted Mean

Definition and Calculation

Weighted means are used when not all values are equally represented. The formula is:

  • Weights (w) represent the relative importance or frequency of each value.

Weighted Mean Example: Stock Portfolio

Price, x

Stocks, w

Product, w x

10

8

80

12

20

240

17

15

255

20

30

600

35

7

245

  • Weighted mean:

Weighted Mean Example: Camera Ratings

Category

w (weight)

Cony X (score)

Cony w x

Sanon X (score)

Sanon w x

Image Quality

0.5

8

4

9

4.5

Battery Life

0.3

6

1.8

6

1.8

Zoom Range

0.2

7

1.4

6

1.2

  • Cony weighted mean:

  • Sanon weighted mean:

Weighted Mean Example: GPA Calculation

Letter Grade

Numeric Grade (x)

Number of Classes

Credit per class (w)

Total Credits

Total Points Earned (w x)

A

4

1

4

4

16

B

3

3

3

9

27

C

2

1

3

3

6

D

1

1

4

4

4

  • Weighted mean (GPA):

Shapes of Distributions

Symmetric Distributions

When a vertical line can be drawn through the middle of the graph and the resulting halves are approximately mirror images, the distribution is symmetric.

  • Mode = Mean = Median

Uniform Distribution

All data values have the same (or nearly the same) frequency. The graph is rectangular.

Skewed Distributions

  • Right-skewed (Positively Skewed): Disproportionately large amount of small values; long tail on right. Outliers at high values. Mean > Median > Mode.

  • Left-skewed (Negatively Skewed): Disproportionately large amount of large values; long tail on left. Outliers at low values. Mean < Median < Mode.

Graphical Examples

  • Uniform: Bar graph with equal heights.

  • Symmetric (Normal): Bell-shaped curve.

  • Positively Skewed: Tail extends to the right.

  • Negatively Skewed: Tail extends to the left.

Summary Table: Measures of Central Tendency

Measure

Definition

Formula

Sensitivity to Outliers

Mean

Arithmetic average

High

Median

Middle value

Middle position in ranked data

Low

Mode

Most frequent value

Value with highest frequency

Low

Midrange

Average of min and max

High

Weighted Mean

Mean with weights

Depends on weights

Key Points

  • Central tendency measures summarize the center of a dataset.

  • Mean is sensitive to outliers; median and mode are more robust.

  • Weighted mean accounts for varying importance or frequency of data points.

  • Distribution shape affects the relationship between mean, median, and mode.

Pearson Logo

Study Prep