BackOrganizing and Visualizing Variables in Business Statistics
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Organizing and Visualizing Variables
Introduction
In business statistics, organizing and visualizing variables is essential for understanding data and making informed decisions. Variables can be categorized as either categorical or numerical, and each type requires specific methods for effective presentation and analysis.
Organizing Categorical Data
Summary Tables
A summary table is used to tally the frequencies or percentages of items in different categories, allowing for easy comparison between groups.
Definition: A summary table displays the count or percentage of observations for each category of a categorical variable.
Purpose: To highlight differences and trends among categories.
Example: Reasons young adults shop online.
Reason For Shopping Online? | Percent |
|---|---|
Better Prices | 37% |
Avoiding holiday crowds or hassles | 29% |
Convenience | 18% |
Better selection | 13% |
Ships directly | 3% |
Contingency Tables
A contingency table is used to study patterns between two or more categorical variables by cross-tabulating their responses.
Definition: A table that displays the frequency distribution of variables to analyze the relationship between them.
Structure: Rows represent one variable, columns represent another.
Application: Used to identify associations or dependencies between categorical variables.
No Errors | Errors | Total | |
|---|---|---|---|
Small Amount | 170 | 20 | 190 |
Medium Amount | 100 | 40 | 140 |
Large Amount | 65 | 5 | 70 |
Total | 335 | 65 | 400 |
Contingency Table: Percentage of Overall Total
Percentages can be calculated for each cell to show the proportion of the overall total.
No Errors | Errors | Total | |
|---|---|---|---|
Small Amount | 42.50% | 5.00% | 47.50% |
Medium Amount | 25.00% | 10.00% | 35.00% |
Large Amount | 16.25% | 1.25% | 17.50% |
Total | 83.75% | 16.25% | 100.00% |
Organizing Numerical Data
Ordered Array
An ordered array is a sequence of data arranged in rank order, from the smallest to the largest value.
Purpose: Shows the range and helps identify outliers.
Application: Useful for preliminary data analysis.
Day Students | Night Students |
|---|---|
16, 17, 17, 18, 18, 18, 19, 19, 20, 20, 21, 21, 22, 22, 25, 27, 32, 38, 42 | 18, 18, 19, 19, 20, 21, 23, 28, 32, 33, 41, 45 |
Frequency Distribution
A frequency distribution summarizes data by grouping values into classes and counting the number of observations in each class.
Class Interval:
Relative Frequency:
Cumulative Frequency: Running total of frequencies up to a given class.
Example Frequency Distribution
Class | Midpoint | Frequency |
|---|---|---|
Less than 20 | 15 | 3 |
Less than 30 | 25 | 6 |
Less than 40 | 35 | 5 |
Less than 50 | 45 | 4 |
Less than 60 | 55 | 2 |
Total | 20 |
Relative & Percent Frequency Distribution
Class | Frequency | Percent |
|---|---|---|
Less than 20 | 3 | 15% |
Less than 30 | 6 | 30% |
Less than 40 | 5 | 25% |
Less than 50 | 4 | 20% |
Total | 20 | 100% |
Cumulative Frequency Distribution
Class | Cumulative Frequency | Cumulative Percent |
|---|---|---|
Less than 20 | 3 | 15% |
Less than 30 | 9 | 45% |
Less than 40 | 14 | 70% |
Less than 50 | 18 | 90% |
Less than 60 | 20 | 100% |
Why Use a Frequency Distribution?
Condenses raw data into a more useful form
Allows for quick visual interpretation
Enables determination of major characteristics, such as clustering
Visualizing Categorical Data
Bar Chart
A bar chart displays categorical data as bars, with the length representing frequency or percentage. Bars are separated by gaps.
Pie Chart
A pie chart divides a circle into slices representing categories, with slice size proportional to the category's percentage.
Pareto Chart
A Pareto chart is a vertical bar chart showing categories in descending order of frequency, often with a cumulative polygon. It helps identify the "vital few" causes among many.
Cause | Frequency | Percent |
|---|---|---|
Card jammed | 365 | 50.41% |
Unreadable | 234 | 32.32% |
Malfunctions | 32 | 4.42% |
Out of cash | 28 | 3.87% |
Amount requested | 23 | 3.18% |
Keystroke | 23 | 3.18% |
Insufficient funds | 19 | 2.62% |
Total | 724 | 100.00% |
Side-by-Side Bar Charts
Side-by-side bar charts are used to compare data from a contingency table, showing frequencies or percentages for each group.
Visualizing Numerical Data
Stem-and-Leaf Display
A stem-and-leaf display organizes data into groups (stems) with individual values (leaves) branching out, preserving the original data values and showing distribution.
Histogram
A histogram is a vertical bar chart of a frequency distribution for numerical data. Bars touch each other, and the horizontal axis shows class boundaries or midpoints.
Vertical axis: Frequency, relative frequency, or percentage
Height of bars: Represents the frequency or proportion in each class
Visualizing Two Numerical Variables
Scatter Plot
A scatter plot displays paired observations from two numerical variables, with one variable on each axis. It is used to examine relationships or correlations.
Time Series Plot
A time series plot shows patterns in a numerical variable over time, with time on the horizontal axis and the variable on the vertical axis.
Useful for identifying trends, cycles, or seasonal patterns
Summary
Organizing and visualizing data are foundational skills in business statistics. Proper use of tables and charts enables clear communication, effective analysis, and better decision-making.
Additional info: Some table entries and examples were inferred or expanded for clarity and completeness.