Skip to main content
Back

Central Tendency and Variability: Key Concepts and Calculations

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Central Tendency and Variability

Introduction

Central tendency and variability are foundational concepts in statistics, describing the center and spread of a data set. These measures are essential for summarizing and interpreting data in both the behavioral sciences and scientific research, including chemistry and related fields.

Central Tendency

Definition and Purpose

  • Central tendency: A descriptive statistic that best represents the center of a data set, or the value around which other data points cluster.

  • Common measures: Mean, Median, and Mode.

Types of Central Tendency

  • Mean: The arithmetic average of a set of values.

  • Median: The middle score when all values are arranged in ascending order.

  • Mode: The most frequently occurring value in a data set.

Estimating Central Tendency with Histograms

  • Histograms visually display the frequency of data points and help estimate the central tendency by showing where data cluster.

Calculating the Mean

  • Add all scores together.

  • Divide the sum by the total number of scores.

Formula:

Statistics vs. Parameters

  • Statistic: A number based on a sample, usually symbolized by Latin letters (e.g., M or \( \bar{X} \)).

  • Parameter: A number based on the entire population, symbolized by Greek letters (e.g., \( \mu \)).

Number

Used for

Symbol

Pronounced

Statistic

Sample

M or \( \bar{X} \)

"M" or "X bar"

Parameter

Population

\( \mu \)

"Mew"

Calculating the Median

  • Arrange scores in ascending order.

  • For an odd number of scores, the median is the middle value.

  • For an even number of scores, the median is the average of the two middle values.

Calculating the Mode

  • Arrange scores in ascending order.

  • Identify the most frequent score.

  • Neither the median nor the mode has a standard abbreviation.

Types of Modes

  • Unimodal distribution: One mode (most common score).

  • Bimodal distribution: Two modes (two most common scores).

  • Multimodal distribution: More than two modes.

Example: Bimodal Distribution

  • The age distribution of American girls named Violet is bimodal, with two peaks in the frequency of ages.

Outliers and the Mean

  • Outliers can significantly affect the mean, making it less representative of the data set's center.

  • Choosing the best measure of central tendency depends on the data's distribution and the presence of outliers.

Choosing the Best Measure of Central Tendency

  • Mean: Best for symmetric distributions without outliers.

  • Median: Best for skewed distributions or those with outliers.

  • Mode: Used when one score dominates, the distribution is bimodal/multimodal, or data are nominal (categorical).

Measures of Variability

Definition and Purpose

  • Variability: Describes how much spread exists in a distribution.

  • Key measures: Range, Variance, and Standard deviation.

Calculating the Range

  • Identify the highest and lowest scores.

  • Subtract the lowest score from the highest score.

Formula:

Interquartile Range (IQR)

  • Measures the distance between the first (Q1) and third (Q3) quartiles.

  • 1st quartile (Q1): 25th percentile.

  • Median: 50th percentile.

  • 3rd quartile (Q3): 75th percentile.

Calculating the Interquartile Range

  • Calculate the median.

  • Q1: Median of scores below the overall median.

  • Q3: Median of scores above the overall median.

  • Subtract Q1 from Q3:

Calculating the Variance

  • Subtract the mean from each score (deviation).

  • Square each deviation.

  • Sum all squared deviations.

  • Divide by the total number of scores (N) for a sample.

Formula:

Variance and Standard Deviation: Symbols

Number

Used for

Standard Deviation Symbol

Variance Symbol

Statistic

Sample

SD or s

SD2, s2, or MS

Parameter

Population

\( \sigma \)

\( \sigma^2 \)

Calculating the Standard Deviation

  • Standard deviation is the typical amount that scores deviate from the sample mean.

  • It is the square root of the variance.

Formula:

Summary Table: Measures of Central Tendency and Variability

Measure

Definition

Formula

Mean

Arithmetic average

Median

Middle value

--

Mode

Most frequent value

--

Range

Difference between highest and lowest

Variance

Average squared deviation from mean

Standard Deviation

Square root of variance

Additional info: These concepts are foundational for understanding data analysis in all sciences, including chemistry, where summarizing experimental results and understanding variability are crucial for interpreting measurements and drawing valid conclusions.

Pearson Logo

Study Prep