Skip to main content
Back

Frequency Distributions and Graphical Representation in Statistics

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Frequency Distributions

Introduction to Frequency Distributions

A frequency distribution is a summary of how often different values occur within a dataset. It organizes raw data into classes or intervals, making it easier to interpret and analyze large sets of data.

  • Class: A group or interval into which data values are grouped.

  • Frequency: The number of data values that fall within each class.

  • Class Limits: The smallest and largest data values that can belong to a class.

  • Class Boundaries: The values that separate classes without gaps, typically found by averaging the upper limit of one class and the lower limit of the next.

  • Class Width: The difference between the lower (or upper) limits of consecutive classes.

Constructing a Frequency Distribution

To construct a frequency distribution, follow these steps:

  1. Determine the range of the data:

  2. Choose the number of classes (often between 5 and 20 for most datasets).

  3. Calculate the class width: (round up to a convenient number).

  4. Select the lower limit of the first class, then add the class width to get subsequent lower limits.

  5. Determine the upper limits for each class.

  6. Tally the data into the appropriate classes to find the frequency for each.

Example: Constructing Class Limits and Boundaries

Given: Minimum = 10, Maximum = 93, Number of classes = 6

  • Class width:

  • Lower class limits: 10, 24, 38, 52, 66, 80

  • Upper class limits: 23, 37, 51, 65, 79, 93

Class Boundaries

Class boundaries are calculated to avoid gaps between classes:

  • Lower boundary of first class:

  • Upper boundary of last class:

  • Example: For class 10-23, boundaries are 9.5 and 23.5

Frequency Tables

Expanded Frequency Table

A frequency table may include additional columns such as midpoints, relative frequency, and cumulative frequency.

Class

Frequency (f)

Midpoint

Relative Frequency

Cumulative Frequency

19-29

19

24

0.05

19

30-40

42

35

0.12

61

41-51

68

46

0.19

129

52-62

69

57

0.19

198

63-73

75

68

0.21

273

74-84

68

79

0.19

341

85-95

24

90

0.07

365

  • Midpoint:

  • Relative Frequency: , where is the class frequency and is the sample size.

  • Cumulative Frequency: The sum of the frequencies for that class and all previous classes.

Graphical Representation of Data

Histograms

A histogram is a bar graph that represents the frequency distribution of a dataset. The horizontal axis shows the classes, and the vertical axis shows the frequency.

  • Bars must touch each other, indicating continuous data.

  • Height of each bar corresponds to the frequency of the class.

  • Used for quantitative data.

Frequency Polygon

A frequency polygon is a line graph that connects the midpoints of each class at the height corresponding to the frequency.

  • Useful for comparing distributions.

  • Shows the shape of the data distribution.

Ogive (Cumulative Frequency Graph)

An ogive is a line graph that displays cumulative frequencies. The horizontal axis shows the upper class boundaries, and the vertical axis shows cumulative frequency.

  • Helps determine medians, percentiles, and the number of values below a certain point.

Relative Frequency Histogram

A relative frequency histogram is similar to a histogram, but the vertical axis shows relative frequency (proportion or percentage) instead of absolute frequency.

Interpreting Graphs and Tables

Patterns and Analysis

  • Identify the class with the greatest and least frequency.

  • Describe the distribution (e.g., symmetric, skewed, bimodal).

  • Use cumulative frequency to find where the greatest increase occurs.

  • Relative frequency helps compare classes regardless of sample size.

Example: Employee Salaries Histogram

  • Number of classes: 7

  • Class width: 15

  • Greatest frequency: 270

  • About half of the employees make between $40,000 and $49,000.

Summary Table: Key Terms and Formulas

Term

Definition

Formula

Class Width

Difference between consecutive class limits

Midpoint

Center value of a class

Relative Frequency

Proportion of data in a class

Cumulative Frequency

Sum of frequencies up to a class

Sum of frequencies for all classes up to and including the current class

Applications

  • Frequency distributions are used in summarizing exam scores, salaries, biological measurements, and more.

  • Histograms and polygons help visualize data distributions and identify trends or outliers.

  • Relative and cumulative frequencies are essential for probability calculations and statistical inference.

Additional info:

  • Some examples and tables were inferred and expanded for clarity and completeness.

  • Graphical representations (histograms, polygons, ogives) are essential tools in exploratory data analysis.

Pearson Logo

Study Prep