Skip to main content
Back

Graphical Methods for Describing Quantitative Data

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Graphical Methods for Describing Quantitative Data

Introduction to Quantitative Data Visualization

Quantitative data consists of measurements recorded on a meaningful numerical scale. To effectively describe, summarize, and detect patterns in such data, statisticians use graphical methods. The three primary graphical techniques are dot plots, stem-and-leaf plots, and histograms. Each method provides unique insights into the distribution and characteristics of the data set.

Dot Plots

Dot plots are simple visualizations where each data value is represented by a dot on a horizontal axis. When values repeat, dots are stacked vertically above one another. Dot plots are especially useful for small to moderate-sized data sets and allow for easy identification of clusters, gaps, and outliers.

  • Definition: Each quantitative measurement is shown as a dot along a number line.

  • Interpretation: The vertical stacking of dots indicates frequency of repeated values.

  • Application: Useful for visualizing distributions and spotting patterns in data.

  • Example: Dot plot of miles per gallon (MPG) ratings for 100 cars.

Dotplot of MPG

Stem-and-Leaf Plots

Stem-and-leaf plots partition each data value into a "stem" (typically the leading digit(s)) and a "leaf" (usually the last digit). Stems are listed in a column, and leaves are placed in rows corresponding to their stem. This method preserves the original data values while providing a visual summary of the distribution.

  • Definition: Data values are split into stems and leaves; stems form the rows, leaves are listed horizontally.

  • Interpretation: Allows for quick identification of the shape, center, and spread of the data.

  • Modification: The choice of stem and leaf units can be adjusted for clarity (e.g., using tens digit as stem).

  • Example: Stem-and-leaf display of MPG ratings for 100 cars.

Stem-and-leaf display of MPG

  • Modification Example: Using tens digit as stem for values 36.3 and 32.7.

Modified stem-and-leaf display

Histograms

Histograms group data into class intervals of equal width and display the frequency or relative frequency of observations in each interval using vertical bars. The horizontal axis represents the intervals, while the height of each bar corresponds to the number of observations (or proportion) in that interval.

  • Definition: Data is divided into intervals; bars show frequency or relative frequency for each interval.

  • Interpretation: The area above each interval is proportional to the relative frequency of measurements in that interval.

  • Application: Useful for visualizing the overall shape, center, and spread of large data sets.

  • Example: Histogram of MPG ratings for 100 cars.

Histogram of MPG

  • Relative Frequency: For example, if the relative frequency for the interval 37-38 is 0.20, then 20% of the data falls within this interval.

  • Large Data Sets: As the number of measurements increases and interval width decreases, the histogram approaches a smooth curve, representing the population distribution.

Histograms for different data set sizes

Choosing the Number of Intervals in a Histogram

Selecting the appropriate number of intervals (classes) is crucial for accurately representing the data distribution. The following table provides recommendations based on the size of the data set:

Number of Observations in Data Set

Number of Classes

Fewer than 25

5–6

25–50

7–14

More than 50

15–20

Table for determining number of classes in a histogram

Examples of Graphical Methods

Example 1: EPA Mileage Ratings

The following table presents EPA mileage ratings (in miles per gallon) for 100 cars. This data set is used to illustrate dot plots, stem-and-leaf plots, and histograms.

EPA Mileage Ratings on 100 Cars

36.3, 41.0, 36.9, 37.1, 44.9, ... (see image for full data)

EPA Mileage Ratings Table

Example 2: Water-Level Task Deviations

In a psychological study, deviations (in angle degrees) from the true water level were measured for two groups: bartenders and waitresses. The data is summarized in the following table and visualized using histograms and stem-and-leaf plots.

Group

Deviations (angle degrees)

Bartenders

9, 6, 10, 6, 10, 3, 7, 8, 6, 14, 7, 8, 5, 2, -1, 0, 2, 3, 0, 2

Waitresses

7, 10, 25, 8, 10, 12, 9, 35, 10, 12, 11, 4, 0, 16, -1

Water-Level Task Deviations Table

  • Histogram: Shows the frequency distribution of deviations for both groups.

Histogram of Water-Level Task Deviations

  • Stem-and-Leaf Display: Provides a detailed breakdown of deviation values.

Stem-and-leaf display of deviations

Summary Table: Comparison of Graphical Methods

Method

Best For

Advantages

Limitations

Dot Plot

Small to moderate data sets

Shows individual values, easy to spot clusters/outliers

Not practical for large data sets

Stem-and-Leaf Plot

Moderate data sets

Retains original data, shows distribution shape

Less effective for very large or highly variable data

Histogram

Large data sets

Shows overall distribution, easy to interpret

Does not retain individual data values

Key Takeaways

  • Graphical methods are essential for summarizing and interpreting quantitative data.

  • Choice of method depends on data size and the level of detail required.

  • Histograms are preferred for large data sets, while dot plots and stem-and-leaf plots are useful for smaller sets.

  • Proper selection of class intervals in histograms is crucial for accurate representation.

Pearson Logo

Study Prep