Skip to main content
Back

Organizing and Visualizing Variables in Business Statistics

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 2: Organizing and Visualizing Variables

Categorical Data: Organization and Visualization

Categorical data refers to variables that represent categories or groups, such as product type, gender, or payment method. Organizing categorical data is essential for identifying patterns and making business decisions.

  • Definition: Categorical variables are qualitative and describe membership in a group or category.

  • Organization Methods:

    • Summary Table: Used for one categorical variable. Tallies frequencies or percentages for each category.

    • Contingency Table: Used for two or more categorical variables. Cross-tabulates responses to show relationships between categories.

Summary Table Example

A summary table displays the frequency or percentage of items in each category, allowing for easy comparison.

Reason For Shopping Online?

Percent

Better Prices

37%

Avoiding holiday crowds or hassles

29%

Convenience

18%

Better selection

13%

Ships directly

3%

Contingency Table Example

Contingency tables help study patterns between two categorical variables by showing joint frequencies.

Amount

No Errors

Errors

Total

Small Amount

170

20

190

Medium Amount

100

40

140

Large Amount

65

5

70

Total

335

65

400

Contingency Table: Percentage Calculations

Contingency tables can be expressed as percentages of overall total, row totals, or column totals to facilitate interpretation.

Percentage of Overall Total:

Amount

No Errors

Errors

Total

Small Amount

42.50%

5.00%

47.50%

Medium Amount

25.00%

10.00%

35.00%

Large Amount

16.25%

1.25%

17.50%

Total

83.75%

16.25%

100.00%

Percentage of Row Totals:

Amount

No Errors

Errors

Total

Small Amount

89.47%

10.53%

100.00%

Medium Amount

71.43%

28.57%

100.00%

Large Amount

92.86%

7.14%

100.00%

Total

83.75%

16.25%

100.00%

Percentage of Column Totals:

Amount

No Errors

Errors

Total

Small Amount

50.75%

30.77%

47.50%

Medium Amount

29.85%

61.54%

35.00%

Large Amount

19.40%

7.69%

17.50%

Total

100.00%

100.00%

100.00%

Numerical Data: Organization and Visualization

Numerical data consists of quantitative variables, such as sales figures, temperatures, or production volumes. Organizing numerical data helps reveal distributions and trends.

  • Ordered Array: A sequence of data arranged from smallest to largest value.

  • Frequency Distribution: A summary table where data are grouped into numerically ordered classes, showing the frequency of observations in each class.

  • Cumulative Distribution: Shows the cumulative frequency or percentage up to each class boundary.

Frequency Distribution Example

Steps to create a frequency distribution:

  1. Arrange raw data in ascending order.

  2. Calculate the range:

  3. Select number of classes (usually 5-15).

  4. Compute class interval (width):

  5. Determine class boundaries and midpoints.

  6. Count observations in each class.

Class

Midpoint

Frequency

10 but less than 20

15

3

20 but less than 30

25

6

30 but less than 40

35

5

40 but less than 50

45

4

50 but less than 60

55

2

Relative and Percent Frequency Distribution

  • Relative Frequency:

  • Percent Frequency:

Class

Percent Frequency

10 but less than 20

15%

20 but less than 30

30%

30 but less than 40

25%

40 but less than 50

20%

50 but less than 60

10%

Cumulative Frequency Distribution

  • Cumulative Frequency: The sum of frequencies for all classes up to and including the current class.

  • Cumulative Percent Frequency:

Class

Cumulative Frequency

Cumulative Percent

10 but less than 20

3

15%

20 but less than 30

9

45%

30 but less than 40

14

70%

40 but less than 50

18

90%

50 but less than 60

20

100%

Summary

  • Organizing data into tables (summary, contingency, frequency) is foundational for statistical analysis in business.

  • Percentages and cumulative frequencies provide deeper insights into data distributions and relationships.

  • Proper organization and visualization of data support better decision-making and clearer communication of results.

Additional info: These notes are based on textbook slides for "Statistics for Managers Using Microsoft Excel," focusing on Chapter 2 content relevant to organizing and visualizing categorical and numerical variables for business statistics.

Pearson Logo

Study Prep