BackExam 3 Review: Hypothesis Testing, Confidence Intervals, and Sampling Distributions
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Hypothesis Testing
Types of Errors and Symbols
In hypothesis testing, we use specific symbols and terminology to describe errors and test statistics. Understanding these is crucial for interpreting results and making decisions.
Type I Error (α): Rejecting the null hypothesis when it is actually true. The probability of a Type I error is denoted by .
Type II Error (β): Failing to reject the null hypothesis when it is actually false. The probability of a Type II error is denoted by .
Test Statistic (z, t): Used to determine whether to reject the null hypothesis. Common test statistics include the z-score and t-score.
Sample Mean (): The average value in a sample, used in test statistics.
Sample Standard Deviation (s): Measures the spread of sample data.
Population Standard Deviation (): Measures the spread of population data.
Sample Proportion (): The proportion of successes in a sample.
Example: If , there is a 5% chance of committing a Type I error.
Decision Rules in Hypothesis Testing
Decisions in hypothesis testing are based on the comparison of the test statistic to critical values or p-values.
Reject : If the test statistic falls in the rejection region or if the p-value is less than .
Fail to Reject : If the test statistic does not fall in the rejection region or if the p-value is greater than .
Example: For a two-tailed test with , reject if .
Sampling Distributions and Estimation
Sampling Distributions
A sampling distribution is the probability distribution of a given statistic based on a random sample. It is fundamental for making inferences about populations.
Sample Mean Distribution: If the population is normal or the sample size is large (), the sampling distribution of the sample mean is approximately normal.
Standard Error: The standard deviation of the sampling distribution. For the mean, .
Example: For and , .
Confidence Intervals
Confidence intervals provide a range of values within which the population parameter is likely to fall, with a certain level of confidence.
Formula for Mean (Known ):
Formula for Mean (Unknown ):
Formula for Proportion:
Example: For , , , and (from t-table for 15 df, 95% CI):
Testing Hypotheses for Two Variables
Comparing Two Means
When comparing two means, we use different formulas depending on whether samples are independent or dependent.
Independent Samples:
Dependent Samples (Paired): , where is the mean of the differences.
Example: For paired data, calculate the difference for each pair, then compute and .
Comparing Two Proportions
To test the difference between two proportions:
Test Statistic: where is the pooled proportion.
Example: If , , , , calculate pooled and substitute.
Critical Values and Statistical Tables
z-Table, t-Table, and Chi-Square Table
Statistical tables are used to find critical values for hypothesis tests and confidence intervals.
Distribution | Purpose | Key Parameters |
|---|---|---|
z-Table | Critical values for normal distribution | Mean, standard deviation |
t-Table | Critical values for t-distribution (small samples) | Degrees of freedom |
Chi-Square Table | Critical values for chi-square tests | Degrees of freedom |
Example: For a two-tailed test at , the critical z-value is .
Assumptions in Hypothesis Testing
Key Assumptions
Correct application of statistical tests requires certain assumptions:
Random sampling
Independence of observations
Normality of population (for t-tests)
Equal variances (for some two-sample tests)
Example: For a two-sample t-test, check that both samples are independent and variances are equal.
Formulating Hypotheses
Null and Alternative Hypotheses
Every hypothesis test begins with formulating null () and alternative () hypotheses.
Null Hypothesis (): States that there is no effect or difference.
Alternative Hypothesis (): States that there is an effect or difference.
One-tailed vs. Two-tailed: One-tailed tests look for a difference in a specific direction; two-tailed tests look for any difference.
Example: vs. (two-tailed)
Application: Interpreting Results and Making Decisions
Using p-values and Critical Regions
Decisions are made by comparing the p-value to the significance level or by checking if the test statistic falls in the critical region.
If , reject .
If , fail to reject .
Example: If and , reject .
Summary Table: Key Symbols and Their Meanings
Symbol | Meaning |
|---|---|
Type I error rate (significance level) | |
Type II error rate | |
Sample mean | |
Sample standard deviation | |
Population standard deviation | |
Sample proportion | |
z-score (standard normal test statistic) | |
t-score (Student's t test statistic) |
Additional info:
Some content inferred from context and standard statistics curriculum, such as formulas and definitions.
Tables reconstructed to summarize critical values and symbols.