Skip to main content
Back

Fundamentals of Descriptive Statistics: Populations, Frequency Distributions, and Data Variation

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 1: Introduction to Statistics

Population and Sample

Statistics is the science of collecting, analyzing, and interpreting data. Two fundamental concepts are population and sample:

  • Population: The entire group of individuals or items of interest.

  • Sample: A subset of the population, selected for analysis.

Parameter and Statistic

  • Parameter: A numerical summary describing a characteristic of a population.

  • Statistic: A numerical summary describing a characteristic of a sample.

Types of Statistics

  • Descriptive Statistics: Methods for summarizing and organizing data (e.g., mean, median, mode).

  • Inferential Statistics: Methods for making predictions or inferences about a population based on sample data.

Types of Data

  • Qualitative Data: Non-numerical, categorical data (e.g., colors, names).

  • Quantitative Data: Numerical data (e.g., heights, weights).

Chapter 2: Frequency Distributions and Histograms

Frequency Distribution

A frequency distribution is a table that displays the frequency of various outcomes in a sample.

  • Class: A group or interval into which data are divided.

  • Frequency (freq.): The number of data values in each class.

  • Lower and Upper Class Limits: The smallest and largest data values that can belong to a class.

  • Class Width (): The difference between the lower limits of consecutive classes.

  • Class Midpoints: The average of the lower and upper class limits for each class.

  • Class Boundaries: Values that separate classes without gaps.

  • Sample Size (): The total number of data values.

  • Relative Frequency: The proportion of data values in each class.

Constructing a Frequency Distribution

  • Determine the number of classes.

  • Calculate class width:

  • Set class limits and boundaries.

  • Tally data into classes.

Class Width Calculation (Given Frequency Distribution)

  • Or

Histograms

A histogram is a graphical representation of a frequency distribution, using bars to show the frequency of data in each class.

  • Approximate Symmetry: Bell-shaped, normal distribution.

  • Left-Skewed: Median is left of mean; tail extends to the left.

  • Right-Skewed: Median is right of mean; tail extends to the right.

Chapter 2.3: Measures of Central Tendency

Mean, Median, and Mode

  • Mean (): The arithmetic average of data values.

  • Median: The middle value when data are ordered.

  • Mode: The value(s) that occur most frequently.

  • Weighted Mean: Used when data values have different weights.

Calculating with Technology

  • For a list of data values: Use 1Var Stats on calculators.

  • For frequency tables: Use 1Var Stats with List:L1 and FreqList:L2.

Chapter 2.4: Variation and Spread

Standard Deviation and Variance

  • Standard Deviation (SD): Measures the average distance of data values from the mean. Sample SD (): Population SD ():

  • Variance: The square of the standard deviation. Sample Variance (): Population Variance ():

  • Range: The difference between the maximum and minimum data values.

Empirical Rule (for Normal Distributions)

  • Approximately 68% of data values are within 1 SD of the mean.

  • Approximately 95% of data values are within 2 SDs of the mean.

  • Approximately 99.7% of data values are within 3 SDs of the mean.

Chapter 2.5: Quartiles, Five-Number Summary, and Outliers

Quartiles

  • Q1: 25% of data are below this value.

  • Q2 (Median): 50% of data are below this value.

  • Q3: 75% of data are below this value.

Five-Number Summary

  • Minimum

  • Q1

  • Median (Q2)

  • Q3

  • Maximum

Interquartile Range (IQR)

  • IQR: The range of the middle 50% of data.

Outliers

  • Data values above are considered outliers.

  • Data values below are considered outliers.

Z-Score

  • Z-score: Indicates how many standard deviations a data value is from the mean.

  • Unusual values: or

Summary Table: Measures of Central Tendency and Variation

Measure

Definition

Formula

Mean

Arithmetic average

Median

Middle value

-

Mode

Most frequent value

-

Range

Max - Min

Standard Deviation

Average distance from mean

Variance

SD squared

IQR

Q3 - Q1

Z-score

SDs from mean

Additional info:

  • Calculator instructions (e.g., 1Var Stats, List:L1, FreqList:L2) are for TI-series calculators and similar statistical tools.

  • Histograms and frequency distributions are foundational for understanding data shape and spread.

Pearson Logo

Study Prep