Skip to main content
Back

Chapter 2: Displaying Descriptive Statistics – Business Statistics Study Notes

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 2: Displaying Descriptive Statistics

Overview

This chapter introduces the fundamental methods for displaying descriptive statistics in business contexts. It emphasizes the use of technology, particularly Microsoft Excel, for data visualization and analysis. The main topics include the role of technology, displaying quantitative and qualitative data, contingency tables, stem-and-leaf displays, and scatter plots.

The Role Technology Plays in Statistics

Using Microsoft Excel for Statistical Analysis

Modern business statistics relies heavily on technology for efficient data analysis and presentation. Microsoft Excel is a widely used tool that offers built-in options for statistical analysis and data visualization.

  • Excel's Analysis ToolPak: An add-in that provides advanced statistical analysis tools, such as histograms, regression, and descriptive statistics.

  • Activation: Users may need to activate the Analysis ToolPak add-in to access these features.

Steps to Activate Analysis ToolPak in Excel 2016:

  1. Open Excel and click on the File tab.

  2. Select Options from the drop-down menu to open the Excel Options dialog box.

  3. Click on Add-Ins in the left margin.

  4. At the bottom, click Go next to Manage Excel Add-ins.

  5. Check the boxes for Analysis ToolPak and Analysis ToolPak - VBA, then click OK.

Once activated, the Data Analysis option appears under the Data tab, providing access to various statistical tools.

Displaying Quantitative Data

Frequency Distributions

A frequency distribution is a summary table that shows the number of observations falling into each of several categories (classes). It helps to visualize patterns in data that are not immediately apparent from raw data tables.

  • Class: A category or interval in a frequency distribution.

  • Example: Number of iPads sold per day, grouped into classes such as 0, 1, 2, etc.

Discrete vs. Continuous Data

  • Discrete Data: Values that can be counted (e.g., number of children, cars, or vacation days). These are typically whole numbers.

  • Continuous Data: Values that can take on any value within a range, including decimals (e.g., time, weight, distance).

Relative and Cumulative Frequency Distributions

  • Relative Frequency: The proportion of observations in each class, calculated as .

  • Cumulative Relative Frequency: The sum of relative frequencies for all classes up to and including the current class.

Example Table: Cumulative Relative Frequency Distribution

Number Sold Per Day

Frequency

Relative Frequency

Cumulative Relative Frequency

0

5

0.10

0.10

1

8

0.16

0.26

2

14

0.28

0.54

4

6

0.12

0.92

5

4

0.08

1.00

Histograms

A histogram is a graphical representation of a frequency distribution for quantitative data. Each bar represents the frequency (or relative frequency) of observations within a class interval (bin).

  • Bins are the intervals or classes used in the distribution.

  • Histograms are useful for visualizing the shape, center, and spread of data.

Constructing a Histogram in Excel

  1. Select the Data tab and click Data Analysis.

  2. Choose Histogram from the menu and click OK.

  3. Specify the Input Range (data) and Bin Range (class intervals).

  4. Select output options and click OK to generate the histogram.

Shape of Histograms

  • Symmetric: Both sides are mirror images.

  • Skewed: One side is longer or fatter than the other.

Grouped Quantitative Data

  • For large or continuous data sets, group values into classes to avoid too many categories.

  • Recommended number of classes: between 4 and 20.

  • Use the formula to determine the number of classes, where is the number of classes and is the number of data points.

  • Class width:

Rules for Grouped Data Classes

  • Equal-size classes

  • Mutually exclusive (no overlap)

  • Include all data values

  • Avoid empty or open-ended classes

Ogive

An ogive is a line graph that displays the cumulative relative frequency distribution. It is useful for determining the number or proportion of observations below a particular value.

Displaying Qualitative Data

Qualitative Data and Frequency Distributions

Qualitative data are categorical and can be nominal (no order) or ordinal (ordered). Frequency distributions for qualitative data show the number of occurrences in each category.

  • Use Excel's COUNTIF() function to count occurrences of each category.

Bar Charts

  • Bar charts display frequencies for categorical data.

  • Can be vertical or horizontal.

  • Clustered bar charts group values side by side within categories.

  • Stacked bar charts show subcategories within a single bar.

Pareto Charts

A Pareto chart is a bar chart that displays categories in descending order of frequency, often used in quality control. It also includes a cumulative relative frequency line (ogive).

Example Table: Pareto Chart Data

Reason for Return

Frequency

Product defective

46

Disappointed with product

22

Product no longer wanted

14

Late delivery

5

Other

3

Pie Charts

  • Pie charts show the proportion of each category as a segment of a circle.

  • All categories must be included, and the total represents the whole.

  • Useful for comparing relative sizes of categories.

Contingency Tables

Definition and Construction

Contingency tables (also called cross-tabulations) display the frequency distribution of variables that have more than one attribute. Rows and columns represent different variables, and the table summarizes the data efficiently.

Example Table: Contingency Table

Female

Male

Grand Total

Cash

2

5

7

Credit

7

2

9

Debit

3

1

4

Grand Total

12

8

20

Constructing Contingency Tables in Excel

  1. Select any cell within your data.

  2. Go to the Insert tab and select Pivot Table.

  3. In the dialog box, choose where to place the table and click OK.

  4. Drag variable names into the Row Labels and Column Labels boxes as desired.

  5. Drag the variable to be summarized into the Values box.

Stem and Leaf Display

Definition and Construction

A stem-and-leaf display splits each data value into a "stem" (the leading digit(s)) and a "leaf" (the trailing digit). This method preserves the original data while providing a graphical view similar to a histogram.

  • All original data points are visible.

  • Easy to construct by hand.

  • Can split stems for more detail (e.g., 80–84, 85–89).

Scatter Plots

Definition and Use

A scatter plot is a graph that shows the relationship between two quantitative variables. The independent variable is plotted on the horizontal axis, and the dependent variable on the vertical axis.

  • Useful for identifying correlations or patterns between variables.

  • Each point represents a pair of values.

Line Charts and Sparklines

Line Charts

A line chart connects data points with lines, often used for time series data. Time is typically plotted on the horizontal axis.

Sparklines

Sparklines are small, cell-sized line charts in Excel that provide a compact visual summary of data trends within a worksheet cell.

Additional info: These notes are based on textbook slides for a college-level Business Statistics course, focusing on Chapter 2: Displaying Descriptive Statistics. The content is structured to provide a comprehensive yet concise study guide for exam preparation.

Pearson Logo

Study Prep