Skip to main content
Back

Exam 1 Review: Descriptive Statistics, Distributions, and Correlation

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Descriptive Statistics and Data Types

Quantitative vs. Qualitative Data

Descriptive statistics involve summarizing and organizing data to understand its main features. Data can be classified as quantitative (numerical) or qualitative (categorical).

  • Quantitative Data: Consists of numbers representing counts or measurements (e.g., test scores).

  • Qualitative Data: Consists of categories or labels (e.g., class A, B, C).

  • Example: The data set {10, 19, 25, 28, 36, 39, 41, 43, 43, 46, 60, 64, 71, 80, 100} is quantitative.

Measures of Central Tendency and Spread

Central tendency and spread are key concepts in describing data numerically.

  • Mean: The arithmetic average of the data set.

  • Median: The middle value when the data is ordered.

  • Mode: The value that appears most frequently.

  • Standard Deviation: Measures the spread of data around the mean.

  • Quartiles: Divide the data into four equal parts (Q1, Q2/median, Q3).

  • Five Number Summary: Minimum, Q1, Median, Q3, Maximum.

  • Example: For the data set above, calculate the mean, median, and mode using formulas: is the middle value in the ordered list. is the most frequent value.

Frequency Distributions and Tables

Frequency Distribution

A frequency distribution shows how often each value or category occurs in a data set.

  • Steps:

    1. List all possible values or categories.

    2. Count the number of occurrences for each.

  • Example: For classes A, B, and C, count the number of students in each class and create a table.

Class

Frequency

A

Count of A

B

Count of B

C

Count of C

Additional info:

Fill in actual counts from data provided.

Histograms and Distribution Shapes

Histograms

A histogram is a graphical representation of the distribution of numerical data, showing the frequency of data within certain ranges (bins).

  • Class Width: The range of values in each bin.

  • Shape of Distribution: Can be symmetric, skewed left/right, or uniform.

  • Example: The histogram of RDER values shows the distribution of scores among subjects.

Frequency Table from Histogram

Convert histogram data into a frequency table by counting the number of observations in each bin.

Bin Range

Frequency

75-85

Count

85-95

Count

95-105

Count

Additional info:

Fill in actual counts from histogram.

Standard Scores (z-scores)

Calculating z-scores

A z-score indicates how many standard deviations a value is from the mean.

  • Formula: where is the value, is the mean, and is the standard deviation.

  • Example: If Maria scored 98, mean is 80, and standard deviation is 5:

Correlation and Regression

Scatterplots and Correlation Coefficient

A scatterplot visually displays the relationship between two quantitative variables. The correlation coefficient () measures the strength and direction of a linear relationship.

  • Formula for :

  • Interpretation: ranges from -1 (perfect negative) to +1 (perfect positive).

Least Squares Regression Line

The least squares regression line models the relationship between two variables.

  • Formula: where is the intercept and is the slope.

  • Interpretation of : The change in for a one-unit increase in .

  • Interpretation of : The predicted value of when .

Critical Value for Correlation

To determine if a correlation is statistically significant, compare the observed to a critical value from a table (based on sample size and significance level).

  • Application: If exceeds the critical value, the correlation is significant.

Summary Table: Key Concepts

Concept

Definition

Formula

Mean

Average value

Median

Middle value

--

Mode

Most frequent value

--

Standard Deviation

Spread of data

z-score

Standardized value

Correlation Coefficient

Strength of linear relationship

Regression Line

Best fit line

Additional info: Some frequency counts and table entries should be filled in with actual data from the questions. The notes cover topics from chapters 2, 3, 11, and 12 of a typical college statistics course.

Pearson Logo

Study Prep