Skip to main content
Back

Organizing and Visualizing Variables in Business Statistics

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Organizing and Visualizing Variables

Introduction

In business statistics, organizing and visualizing variables is essential for understanding data and making informed decisions. Variables can be categorized as either categorical or numerical, and each type requires specific methods for effective presentation and analysis.

Organizing Categorical Data

Summary Tables

A summary table is used to tally the frequencies or percentages of items in different categories, allowing for easy comparison between groups.

  • Definition: A summary table displays the count or percentage of observations for each category of a categorical variable.

  • Purpose: To highlight differences and trends among categories.

  • Example: Reasons young adults shop online.

Reason For Shopping Online?

Percent

Better Prices

37%

Avoiding holiday crowds or hassles

29%

Convenience

18%

Better selection

13%

Ships directly

3%

Contingency Tables

A contingency table is used to study patterns between two or more categorical variables by cross-tabulating their responses.

  • Definition: A table that displays the frequency distribution of variables to analyze the relationship between them.

  • Structure: Rows represent one variable, columns represent another.

  • Application: Used to identify associations or dependencies between categorical variables.

No Errors

Errors

Total

Small Amount

170

20

190

Medium Amount

100

40

140

Large Amount

65

5

70

Total

335

65

400

Contingency Table: Percentage of Overall Total

Percentages can be calculated for each cell to show the proportion of the overall total.

No Errors

Errors

Total

Small Amount

42.50%

5.00%

47.50%

Medium Amount

25.00%

10.00%

35.00%

Large Amount

16.25%

1.25%

17.50%

Total

83.75%

16.25%

100.00%

Organizing Numerical Data

Ordered Array

An ordered array is a sequence of data arranged in rank order, from the smallest to the largest value.

  • Purpose: Shows the range and helps identify outliers.

  • Application: Useful for preliminary data analysis.

Day Students

Night Students

16, 17, 17, 18, 18, 18, 19, 19, 20, 20, 21, 21, 22, 22, 25, 27, 32, 38, 42

18, 18, 19, 19, 20, 21, 23, 28, 32, 33, 41, 45

Frequency Distribution

A frequency distribution summarizes data by grouping values into classes and counting the number of observations in each class.

  • Class Interval:

  • Relative Frequency:

  • Cumulative Frequency: Running total of frequencies up to a given class.

Example Frequency Distribution

Class

Midpoint

Frequency

Less than 20

15

3

Less than 30

25

6

Less than 40

35

5

Less than 50

45

4

Less than 60

55

2

Total

20

Relative & Percent Frequency Distribution

Class

Frequency

Percent

Less than 20

3

15%

Less than 30

6

30%

Less than 40

5

25%

Less than 50

4

20%

Total

20

100%

Cumulative Frequency Distribution

Class

Cumulative Frequency

Cumulative Percent

Less than 20

3

15%

Less than 30

9

45%

Less than 40

14

70%

Less than 50

18

90%

Less than 60

20

100%

Why Use a Frequency Distribution?

  • Condenses raw data into a more useful form

  • Allows for quick visual interpretation

  • Enables determination of major characteristics, such as clustering

Visualizing Categorical Data

Bar Chart

A bar chart displays categorical data as bars, with the length representing frequency or percentage. Bars are separated by gaps.

Pie Chart

A pie chart divides a circle into slices representing categories, with slice size proportional to the category's percentage.

Pareto Chart

A Pareto chart is a vertical bar chart showing categories in descending order of frequency, often with a cumulative polygon. It helps identify the "vital few" causes among many.

Cause

Frequency

Percent

Card jammed

365

50.41%

Unreadable

234

32.32%

Malfunctions

32

4.42%

Out of cash

28

3.87%

Amount requested

23

3.18%

Keystroke

23

3.18%

Insufficient funds

19

2.62%

Total

724

100.00%

Side-by-Side Bar Charts

Side-by-side bar charts are used to compare data from a contingency table, showing frequencies or percentages for each group.

Visualizing Numerical Data

Stem-and-Leaf Display

A stem-and-leaf display organizes data into groups (stems) with individual values (leaves) branching out, preserving the original data values and showing distribution.

Histogram

A histogram is a vertical bar chart of a frequency distribution for numerical data. Bars touch each other, and the horizontal axis shows class boundaries or midpoints.

  • Vertical axis: Frequency, relative frequency, or percentage

  • Height of bars: Represents the frequency or proportion in each class

Visualizing Two Numerical Variables

Scatter Plot

A scatter plot displays paired observations from two numerical variables, with one variable on each axis. It is used to examine relationships or correlations.

Time Series Plot

A time series plot shows patterns in a numerical variable over time, with time on the horizontal axis and the variable on the vertical axis.

  • Useful for identifying trends, cycles, or seasonal patterns

Summary

Organizing and visualizing data are foundational skills in business statistics. Proper use of tables and charts enables clear communication, effective analysis, and better decision-making.

Additional info: Some table entries and examples were inferred or expanded for clarity and completeness.

Pearson Logo

Study Prep