Skip to main content
Back

Analyzing and Representing Data with Graphs: An Introduction to Descriptive Statistics

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 2: Analyzing and Representing Data with Graphs

Introduction

In statistics, the same data set can be represented in multiple graphical forms to reveal different aspects of its distribution. This chapter introduces common graphical methods for summarizing and interpreting data, using the example of ages from an introductory statistics class (30 students).

Graphical Representations of Data

Dotplot

A dotplot is a simple way to display individual data points along a number line. Each dot represents one observation. Dotplots are useful for small to moderate-sized data sets and allow for easy identification of clusters, gaps, and outliers.

  • Key Point: Each dot corresponds to a single data value.

  • Key Point: Useful for visualizing the distribution and identifying outliers.

  • Example: The ages of students are plotted as dots above their corresponding values on the number line.

Frequency Table

A frequency table summarizes data by showing the number of times each value (or range of values) occurs.

  • Key Point: Organizes raw data into a more interpretable format.

  • Key Point: Can be used to construct other graphs such as histograms and bar charts.

  • Example: The table below shows the frequency of each age in the data set.

Age

17

18

19

20

21

22

23

25

28

30

32

35

43

Frequency

1

3

6

3

3

3

2

2

2

1

1

1

1

Pie Chart

A pie chart displays categorical data as slices of a circle, with each slice representing a proportion of the whole.

  • Key Point: Useful for showing the relative frequencies of categories.

  • Key Point: Best for data divided into a small number of categories.

  • Example: The age data is grouped into intervals (e.g., 15-20, 20-25) and displayed as slices representing the percentage of students in each interval.

Age Interval

Percentage

(15, 20]

33.33%

(20, 25]

36.67%

(25, 30]

16.67%

(30, 35]

6.67%

(35, 40]

3.33%

(40, 45]

3.33%

Histogram

A histogram is a bar graph that shows the frequency of data within equal intervals (bins). Unlike bar charts, histograms are used for quantitative data and the bars touch each other to indicate continuous intervals.

  • Key Point: Reveals the shape, center, and spread of the data distribution.

  • Key Point: Can be used to identify skewness, modality, and outliers.

  • Example: The ages are grouped into intervals, and the frequency of each interval is shown as a bar.

Relative Frequency Histogram

A relative frequency histogram is similar to a histogram, but the y-axis shows the proportion (relative frequency) of data in each interval instead of the count.

  • Key Point: Useful for comparing distributions with different sample sizes.

  • Formula:

Boxplot

A boxplot (or box-and-whisker plot) summarizes data using five-number summary: minimum, first quartile (Q1), median, third quartile (Q3), and maximum. Outliers are often shown as individual points.

  • Key Point: Useful for visualizing the spread, center, and outliers of the data.

  • Key Point: Shows skewness and identifies potential outliers.

  • Example: The boxplot of ages shows most data between 18 and 25, with outliers at higher ages.

Bar Diagram (Bar Chart)

A bar diagram is used for categorical variables. Each bar represents a category, and the height shows the frequency or relative frequency.

  • Key Point: Bars are separated to emphasize discrete categories.

  • Key Point: Useful for comparing frequencies across categories.

  • Example: Bar charts show the distribution of students by class (e.g., "Youngest to Oldest") or by parent’s class.

Describing the Shape of Distributions

When analyzing a graph, it is important to describe its center, shape, and possible outliers. The shape of a distribution can be classified using the following terms:

  • Left-skewed (Negatively Skewed): The tail on the left side is longer; most data are concentrated on the right.

  • Right-skewed (Positively Skewed): The tail on the right side is longer; most data are concentrated on the left.

  • Symmetric: Both sides of the distribution are mirror images.

  • Uniform: All values have approximately the same frequency.

  • Multimodal: The distribution has more than one peak.

  • Bell-shaped: The distribution is symmetric and resembles a bell curve (normal distribution).

Exercise: Linking Shape Terms to Graphs

Match each of the following terms to the appropriate graphs:

  • Left-skewed: Histogram with a long left tail.

  • Right-skewed: Histogram with a long right tail.

  • Symmetric: Histogram or dotplot with mirror-image sides.

  • Uniform: Bar chart or histogram with bars of equal height.

  • Multimodal: Histogram with multiple peaks.

  • Bell-shaped: Histogram with a single, symmetric peak in the center.

Summary Table: Graph Types and Their Uses

Graph Type

Data Type

Main Purpose

Dotplot

Quantitative

Display individual data points

Histogram

Quantitative

Show distribution shape, center, spread

Boxplot

Quantitative

Summarize spread, center, outliers

Pie Chart

Categorical

Show proportions of categories

Bar Chart

Categorical

Compare frequencies of categories

Key Takeaways

  • Different graphs highlight different aspects of the data.

  • Always describe the center, shape, and outliers when interpreting graphs.

  • Choose the appropriate graph based on the type of data (quantitative or categorical).

Additional info: The notes are based on introductory statistics concepts and are suitable for exam preparation or review of descriptive statistics and graphical data representation.

Pearson Logo

Study Prep