BackFrequency Distributions and Graphs in Statistics: Study Notes
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Frequency Distributions & Graphs
Definitions
Understanding how data is organized and visualized is fundamental in statistics. Frequency distributions and graphs help summarize and present data in a meaningful way.
Frequency: How often something occurs, denoted by f.
Frequency Distribution: Arrangement of data based on counting frequencies.
Example: Frequency distribution of soccer goals scored in recent games:
Goals | Frequency |
|---|---|
1 | 2 |
2 | 3 |
3 | 5 |
4 | 4 |
5 | 1 |
Class Interval: A range of values grouped together, defined by lower and upper limits.
Class Limits: Determined by the number of desired classes and the range of the data set.
Range:
Class Width: Difference between two lower class limits.
Papers Sold | Frequency |
|---|---|
20 | 2 |
21 | 1 |
22 | 4 |
23 | 2 |
24 | 1 |
25 | 1 |
Grouped Frequency Distribution Example:
Papers Sold | Frequency | Lower Limit | Upper Limit |
|---|---|---|---|
15–19 | 2 | 15 | 19 |
20–24 | 7 | 20 | 24 |
25–29 | 1 | 25 | 29 |
Additional info: Grouping data into classes helps reveal patterns and trends, especially for large data sets.
Constructing a Frequency Class Table
Steps to Construct a Frequency Class Table
Frequency class tables organize data into intervals (classes) to simplify analysis and visualization.
Identify the smallest and largest values in the data set.
Calculate the range:
Determine the class width: (rounded up)
List the lower class interval limits, then add the class width to get the next interval.
Continue until all data points are included.
Example: Constructing a frequency class table for farm acreage data:
Lower Limit | Upper Limit | Frequency |
|---|---|---|
59 | 114 | 5 |
115 | 170 | 6 |
171 | 226 | 6 |
227 | 282 | 5 |
283 | 338 | 2 |
339 | 394 | 3 |
395 | 450 | 3 |
Additional info: Class width is typically chosen so that there are 5–20 classes, depending on data size.
More Definitions
Key Terms in Frequency Distributions
Class Midpoint:
Relative Frequency: The percentage of data that falls in a particular class. , where is class frequency and is sample size.
Cumulative Frequency: The sum of the frequency of a class and all previous classes.
Class Boundaries: Numbers that separate classes without forming gaps between them. Important for continuous data.
Class | Class Boundaries | Frequency | Class Midpt. | Relative Frequency | Cumulative Frequency |
|---|---|---|---|---|---|
59–114.5 | 58.5–114.5 | 5 | 86.5 | .166 | 5 |
115–170 | 114.5–170.5 | 6 | 142.5 | .2 | 11 |
171–226 | 170.5–226.5 | 6 | 198.5 | .2 | 17 |
227–282 | 226.5–282.5 | 5 | 254.5 | .166 | 22 |
283–338 | 282.5–338.5 | 2 | 310.5 | .066 | 24 |
339–394 | 338.5–394.5 | 3 | 366.5 | .1 | 27 |
395–450 | 394.5–450.5 | 3 | 422.5 | .1 | 30 |
Sum of frequencies:
Sum of relative frequencies:
Graphing Frequency Data
Types of Graphs
Frequency data can be visualized using several types of graphs, each serving a specific purpose.
Bar Graph: Displays frequency for each class using bars.
Relative Frequency Bar Graph: Uses relative frequency for the y-axis.
Pareto Chart: Bar chart with frequencies in descending order, useful for identifying the most common class.
Frequency Polygon: Line graph connecting midpoints of each class interval.
Cumulative Frequency Graph (Ogive): Line graph showing cumulative frequency across class boundaries.
Key Points:
Frequency polygons should begin and end on the x-axis, using class width for endpoints.
Ogives help identify the greatest increase in cumulative frequency.
Stem & Leaf Plots
Displaying Data with Stem-and-Leaf Plots
Stem-and-leaf plots provide a way to organize data while retaining actual data values.
Stem: Leftmost digit(s) of a data point.
Leaf: Rightmost digit(s) of a data point.
Example: Grades on a statistics test:
Grade | Frequency |
|---|---|
53 | 1 |
57 | 1 |
61 | 1 |
68 | 1 |
72 | 1 |
75 | 1 |
80 | 1 |
83 | 2 |
87 | 1 |
89 | 2 |
91 | 1 |
92 | 1 |
94 | 1 |
96 | 1 |
99 | 1 |
100 | 1 |
Stem-and-leaf plots can easily organize data of all sizes and reflect actual data values.
Scatter Plots
Visualizing Relationships Between Two Variables
Scatter plots are used to display paired data sets, showing possible relationships or correlations between variables.
Example: Internet usage vs. age.
Correlation: Indicates whether variables are related. Can be positive (both increase together) or negative (one increases as the other decreases).
Scatter plots can help identify causation, trends, and outliers.
Additional info: Fisher's Iris Data Set is a classic example used to demonstrate scatter plots in statistics.
Pie Charts
Displaying Relative Frequencies
Pie charts are circular graphs used to display relative frequencies of categorical data.
Example: Movie preferences of 20 individuals.
Movie Genre | Frequency | Relative Frequency |
|---|---|---|
Comedy | 4 | 0.2 |
Romance | 6 | 0.3 |
Action | 5 | 0.25 |
Drama | 1 | 0.05 |
SciFi | 4 | 0.2 |
Relative frequency is calculated as
Pie charts visually represent the proportion of each category.
Electronic Tools
Using Calculators for Frequency Analysis
Statistical calculators can be used to input data, sort, and calculate frequency distributions and summary statistics.
Steps include entering data, sorting, and using built-in functions to calculate minimum, maximum, and other statistics.
Electronic tools streamline the process for large data sets.