Skip to main content
Back

Hypothesis Testing, Confidence Intervals, and Correlation: Study Notes for Statistics Exam Prep

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Hypothesis Testing for Two Samples

Formulating and Testing Hypotheses

Hypothesis testing is a fundamental statistical method used to determine whether there is enough evidence in a sample of data to infer that a certain condition holds for the entire population. When comparing two samples, we often test whether their means or proportions are equal.

  • Null Hypothesis (H0): The default assumption, typically stating that there is no difference between the two population parameters (e.g., means or proportions).

  • Alternative Hypothesis (Ha): The claim we are testing, often that there is a difference between the two parameters.

  • Significance Level (α): The probability of rejecting the null hypothesis when it is actually true. Commonly set at 0.05.

  • Test Statistic: A value calculated from sample data, used to decide whether to reject H0.

  • Critical Value: The threshold value that the test statistic must exceed to reject H0.

  • Decision Rule: If the test statistic falls in the rejection region, reject H0; otherwise, do not reject H0.

Example: Testing whether the proportion of successes in two samples are equal.

  • Sample 1: ,

  • Sample 2: ,

  • Test statistic for proportions:

  • Where , , is the pooled proportion.

Additional info: The test involves calculating the z-score and comparing it to the critical value for the chosen significance level.

Comparing Means of Two Populations

Independent Samples t-Test

When comparing the means of two independent samples, the t-test is used to determine if the difference between sample means is statistically significant.

  • Assumptions: Samples are independent, populations are normally distributed, and variances may or may not be equal.

  • Test Statistic:

  • Degrees of Freedom: Calculated using the formula for two samples (may use the Welch-Satterthwaite equation if variances are unequal).

  • Decision: Compare the calculated t-value to the critical t-value from the t-distribution table.

Example: Testing the effectiveness of a training technique by comparing pre- and post-training scores.

  • Calculate sample means and standard deviations for each group.

  • Apply the t-test formula and interpret the result.

Confidence Intervals for the Difference Between Means

Constructing Confidence Intervals

A confidence interval estimates the range in which the true difference between two population means lies, with a specified level of confidence (e.g., 95%).

  • Formula:

  • Interpretation: If the interval includes 0, there may be no significant difference between the means.

  • Example: Heights of women from two countries are compared, and the confidence interval for the mean difference is calculated.

Correlation and Regression Analysis

Linear Correlation Coefficient

The linear correlation coefficient (r) measures the strength and direction of a linear relationship between two variables.

  • Formula:

  • Range: -1 (perfect negative) to +1 (perfect positive).

  • Significance Test: Use a t-test to determine if the correlation is statistically significant.

  • Example: Calculating r for test scores and determining significance at α = 0.05.

Linear Regression Equation

Regression analysis is used to model the relationship between a dependent variable and one or more independent variables.

  • Regression Equation:

  • Where: ,

  • Application: Predicting one variable based on the value of another.

  • Example: Predicting dexterity from productivity scores using the regression equation.

Summary Table: Hypothesis Testing Steps

Step

Description

1. State Hypotheses

Formulate H0 and Ha

2. Select Significance Level

Choose α (e.g., 0.05)

3. Compute Test Statistic

Calculate z or t value

4. Determine Critical Value

Find from statistical tables

5. Make Decision

Reject or do not reject H0

Summary Table: Correlation and Regression

Concept

Formula

Interpretation

Correlation Coefficient (r)

Strength and direction of linear relationship

Regression Equation

Predicts y from x

Additional info: These notes cover hypothesis testing for two samples, confidence intervals, correlation, and regression, which correspond to Chapters 8, 9, and 10 of a college statistics course.

Pearson Logo

Study Prep