BackFundamentals of Descriptive Statistics: Populations, Frequency Distributions, and Data Variation
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Chapter 1: Introduction to Statistics
Population and Sample
Statistics is the science of collecting, analyzing, and interpreting data. Two fundamental concepts are population and sample:
Population: The entire group of individuals or items of interest.
Sample: A subset of the population, selected for analysis.
Parameter and Statistic
Parameter: A numerical summary describing a characteristic of a population.
Statistic: A numerical summary describing a characteristic of a sample.
Types of Statistics
Descriptive Statistics: Methods for summarizing and organizing data (e.g., mean, median, mode).
Inferential Statistics: Methods for making predictions or inferences about a population based on sample data.
Types of Data
Qualitative Data: Non-numerical, categorical data (e.g., colors, names).
Quantitative Data: Numerical data (e.g., heights, weights).
Chapter 2: Frequency Distributions and Histograms
Frequency Distribution
A frequency distribution is a table that displays the frequency of various outcomes in a sample.
Class: A group or interval into which data are divided.
Frequency (freq.): The number of data values in each class.
Lower and Upper Class Limits: The smallest and largest data values that can belong to a class.
Class Width (): The difference between the lower limits of consecutive classes.
Class Midpoints: The average of the lower and upper class limits for each class.
Class Boundaries: Values that separate classes without gaps.
Sample Size (): The total number of data values.
Relative Frequency: The proportion of data values in each class.
Constructing a Frequency Distribution
Determine the number of classes.
Calculate class width:
Set class limits and boundaries.
Tally data into classes.
Class Width Calculation (Given Frequency Distribution)
Or
Histograms
A histogram is a graphical representation of a frequency distribution, using bars to show the frequency of data in each class.
Approximate Symmetry: Bell-shaped, normal distribution.
Left-Skewed: Median is left of mean; tail extends to the left.
Right-Skewed: Median is right of mean; tail extends to the right.
Chapter 2.3: Measures of Central Tendency
Mean, Median, and Mode
Mean (): The arithmetic average of data values.
Median: The middle value when data are ordered.
Mode: The value(s) that occur most frequently.
Weighted Mean: Used when data values have different weights.
Calculating with Technology
For a list of data values: Use 1Var Stats on calculators.
For frequency tables: Use 1Var Stats with List:L1 and FreqList:L2.
Chapter 2.4: Variation and Spread
Standard Deviation and Variance
Standard Deviation (SD): Measures the average distance of data values from the mean. Sample SD (): Population SD ():
Variance: The square of the standard deviation. Sample Variance (): Population Variance ():
Range: The difference between the maximum and minimum data values.
Empirical Rule (for Normal Distributions)
Approximately 68% of data values are within 1 SD of the mean.
Approximately 95% of data values are within 2 SDs of the mean.
Approximately 99.7% of data values are within 3 SDs of the mean.
Chapter 2.5: Quartiles, Five-Number Summary, and Outliers
Quartiles
Q1: 25% of data are below this value.
Q2 (Median): 50% of data are below this value.
Q3: 75% of data are below this value.
Five-Number Summary
Minimum
Q1
Median (Q2)
Q3
Maximum
Interquartile Range (IQR)
IQR: The range of the middle 50% of data.
Outliers
Data values above are considered outliers.
Data values below are considered outliers.
Z-Score
Z-score: Indicates how many standard deviations a data value is from the mean.
Unusual values: or
Summary Table: Measures of Central Tendency and Variation
Measure | Definition | Formula |
|---|---|---|
Mean | Arithmetic average | |
Median | Middle value | - |
Mode | Most frequent value | - |
Range | Max - Min | |
Standard Deviation | Average distance from mean | |
Variance | SD squared | |
IQR | Q3 - Q1 | |
Z-score | SDs from mean |
Additional info:
Calculator instructions (e.g., 1Var Stats, List:L1, FreqList:L2) are for TI-series calculators and similar statistical tools.
Histograms and frequency distributions are foundational for understanding data shape and spread.