BackChapter 2: Displaying Descriptive Statistics – Business Statistics Study Notes
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Chapter 2: Displaying Descriptive Statistics
Overview
This chapter introduces the fundamental methods for displaying descriptive statistics in business contexts. It emphasizes the use of technology, particularly Microsoft Excel, for data visualization and analysis. The main topics include the role of technology, displaying quantitative and qualitative data, contingency tables, stem-and-leaf displays, and scatter plots.
The Role Technology Plays in Statistics
Using Microsoft Excel for Statistical Analysis
Modern business statistics relies heavily on technology for efficient data analysis and presentation. Microsoft Excel is a widely used tool that offers built-in options for statistical analysis and data visualization.
Excel's Analysis ToolPak: An add-in that provides advanced statistical analysis tools, such as histograms, regression, and descriptive statistics.
Activation: Users may need to activate the Analysis ToolPak add-in to access these features.
Steps to Activate Analysis ToolPak in Excel 2016:
Open Excel and click on the File tab.
Select Options from the drop-down menu to open the Excel Options dialog box.
Click on Add-Ins in the left margin.
At the bottom, click Go next to Manage Excel Add-ins.
Check the boxes for Analysis ToolPak and Analysis ToolPak - VBA, then click OK.
Once activated, the Data Analysis option appears under the Data tab, providing access to various statistical tools.
Displaying Quantitative Data
Frequency Distributions
A frequency distribution is a summary table that shows the number of observations falling into each of several categories (classes). It helps to visualize patterns in data that are not immediately apparent from raw data tables.
Class: A category or interval in a frequency distribution.
Example: Number of iPads sold per day, grouped into classes such as 0, 1, 2, etc.
Discrete vs. Continuous Data
Discrete Data: Values that can be counted (e.g., number of children, cars, or vacation days). These are typically whole numbers.
Continuous Data: Values that can take on any value within a range, including decimals (e.g., time, weight, distance).
Relative and Cumulative Frequency Distributions
Relative Frequency: The proportion of observations in each class, calculated as .
Cumulative Relative Frequency: The sum of relative frequencies for all classes up to and including the current class.
Example Table: Cumulative Relative Frequency Distribution
Number Sold Per Day | Frequency | Relative Frequency | Cumulative Relative Frequency |
|---|---|---|---|
0 | 5 | 0.10 | 0.10 |
1 | 8 | 0.16 | 0.26 |
2 | 14 | 0.28 | 0.54 |
4 | 6 | 0.12 | 0.92 |
5 | 4 | 0.08 | 1.00 |
Histograms
A histogram is a graphical representation of a frequency distribution for quantitative data. Each bar represents the frequency (or relative frequency) of observations within a class interval (bin).
Bins are the intervals or classes used in the distribution.
Histograms are useful for visualizing the shape, center, and spread of data.
Constructing a Histogram in Excel
Select the Data tab and click Data Analysis.
Choose Histogram from the menu and click OK.
Specify the Input Range (data) and Bin Range (class intervals).
Select output options and click OK to generate the histogram.
Shape of Histograms
Symmetric: Both sides are mirror images.
Skewed: One side is longer or fatter than the other.
Grouped Quantitative Data
For large or continuous data sets, group values into classes to avoid too many categories.
Recommended number of classes: between 4 and 20.
Use the formula to determine the number of classes, where is the number of classes and is the number of data points.
Class width:
Rules for Grouped Data Classes
Equal-size classes
Mutually exclusive (no overlap)
Include all data values
Avoid empty or open-ended classes
Ogive
An ogive is a line graph that displays the cumulative relative frequency distribution. It is useful for determining the number or proportion of observations below a particular value.
Displaying Qualitative Data
Qualitative Data and Frequency Distributions
Qualitative data are categorical and can be nominal (no order) or ordinal (ordered). Frequency distributions for qualitative data show the number of occurrences in each category.
Use Excel's COUNTIF() function to count occurrences of each category.
Bar Charts
Bar charts display frequencies for categorical data.
Can be vertical or horizontal.
Clustered bar charts group values side by side within categories.
Stacked bar charts show subcategories within a single bar.
Pareto Charts
A Pareto chart is a bar chart that displays categories in descending order of frequency, often used in quality control. It also includes a cumulative relative frequency line (ogive).
Example Table: Pareto Chart Data
Reason for Return | Frequency |
|---|---|
Product defective | 46 |
Disappointed with product | 22 |
Product no longer wanted | 14 |
Late delivery | 5 |
Other | 3 |
Pie Charts
Pie charts show the proportion of each category as a segment of a circle.
All categories must be included, and the total represents the whole.
Useful for comparing relative sizes of categories.
Contingency Tables
Definition and Construction
Contingency tables (also called cross-tabulations) display the frequency distribution of variables that have more than one attribute. Rows and columns represent different variables, and the table summarizes the data efficiently.
Example Table: Contingency Table
Female | Male | Grand Total | |
|---|---|---|---|
Cash | 2 | 5 | 7 |
Credit | 7 | 2 | 9 |
Debit | 3 | 1 | 4 |
Grand Total | 12 | 8 | 20 |
Constructing Contingency Tables in Excel
Select any cell within your data.
Go to the Insert tab and select Pivot Table.
In the dialog box, choose where to place the table and click OK.
Drag variable names into the Row Labels and Column Labels boxes as desired.
Drag the variable to be summarized into the Values box.
Stem and Leaf Display
Definition and Construction
A stem-and-leaf display splits each data value into a "stem" (the leading digit(s)) and a "leaf" (the trailing digit). This method preserves the original data while providing a graphical view similar to a histogram.
All original data points are visible.
Easy to construct by hand.
Can split stems for more detail (e.g., 80–84, 85–89).
Scatter Plots
Definition and Use
A scatter plot is a graph that shows the relationship between two quantitative variables. The independent variable is plotted on the horizontal axis, and the dependent variable on the vertical axis.
Useful for identifying correlations or patterns between variables.
Each point represents a pair of values.
Line Charts and Sparklines
Line Charts
A line chart connects data points with lines, often used for time series data. Time is typically plotted on the horizontal axis.
Sparklines
Sparklines are small, cell-sized line charts in Excel that provide a compact visual summary of data trends within a worksheet cell.
Additional info: These notes are based on textbook slides for a college-level Business Statistics course, focusing on Chapter 2: Displaying Descriptive Statistics. The content is structured to provide a comprehensive yet concise study guide for exam preparation.