AP Statistics 2nd Semester TEKS Test Review: Comprehensive Study Notes

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Probability Distributions and Random Variables

Linear Transformations of Random Variables

When working with random variables, it is important to understand how their means and standard deviations change under linear transformations and combinations.

Linear Transformation: If is a random variable with mean and standard deviation , and and are constants, then for :
- Mean:
- Standard deviation:
Sum and Difference of Independent Random Variables: If and are independent:
- Mean of :
- Standard deviation:
Example: If has mean 75 and SD 12, has mean 14 and SD 5, then:
- : Mean = 45, SD = 12
- : Mean = 28, SD = 10
- : Mean = 89, SD = 13
- : Mean = 61, SD = 13

Discrete Probability Distributions

Expected Value, Variance, and Standard Deviation

The expected value (mean) of a discrete random variable with possible values and probabilities is:

The variance is:

The standard deviation is:

Example: For a probability distribution of glass flame temperatures or number of toys played with, use the above formulas to compute mean, variance, and standard deviation.

Binomial and Geometric Distributions

Binomial Distribution

A binomial random variable counts the number of successes in independent trials, each with probability of success.

Probability of exactly successes:
Mean:
Standard deviation:

Geometric Distribution

The geometric distribution models the number of trials until the first success.

Probability that the first success occurs on the th trial:

Normal Distribution and Sampling Distributions

Properties of the Normal Distribution

A normal distribution is symmetric, bell-shaped, and described by its mean and standard deviation .
Standardizing:
Probabilities can be found using standard normal tables or technology.

Sampling Distribution of the Sample Mean

If is normal with mean and SD , the sample mean of an SRS of size has mean $\mu$ and SD .
Central Limit Theorem: For large , the sampling distribution of is approximately normal, regardless of the population distribution.

Confidence Intervals and Hypothesis Testing

Confidence Intervals for Proportions and Means

For a proportion from a sample of size :
For a mean from a sample of size with sample SD :
Confidence interval for a mean:
Confidence interval for a proportion:

Hypothesis Testing

State hypotheses: (null), (alternative)
Calculate test statistic (z or t)
Find p-value and compare to significance level
Draw a conclusion: reject or fail to reject

Chi-Square Tests

Goodness-of-Fit, Independence, and Homogeneity

Goodness-of-fit: Tests if observed frequencies match expected frequencies for a single categorical variable.
Test of independence: Tests if two categorical variables are independent in a population.
Test of homogeneity: Tests if the distribution of a categorical variable is the same in several populations.
Test statistic:

Regression and Correlation

Least Squares Regression

Regression line:
Slope , where is the correlation coefficient
Standard error of the slope:

Correlation

Measures the strength and direction of a linear relationship between two quantitative variables.
Correlation coefficient ranges from -1 to 1.

Residuals in Regression Analysis

Definition and Interpretation of Residuals

In regression analysis, a residual is the difference between an observed value and the value predicted by the regression model. Residuals are used to assess the fit of the model and to check for patterns that might indicate non-linearity or other issues.

For each data point, the residual is calculated as:
Interpretation:
- Positive residual: The observed value is above the predicted value.
- Negative residual: The observed value is below the predicted value.
Sum of residuals in least squares regression is always zero (up to rounding error).

Table of residuals e1 for a regression model

Example: The table above shows a set of residuals from a regression analysis. These values can be used to check for randomness (no pattern) and to verify that the sum is approximately zero, confirming a good fit for a linear model.

Inference Procedures, Power, and Errors

Type I and Type II Errors

Type I error ($\alpha$): Rejecting when it is true.
Type II error ($\beta$): Failing to reject when is true.
Power: Probability of correctly rejecting when is true.

Making Confidence Intervals Wider

Decrease sample size
Increase confidence level
Increase standard deviation

Formulas for Standard Errors

Standard error about the least squares line:
Standard error for a 2-sample proportion:
Standard error of the sample mean:
Standard error of the proportion:

Choosing the Correct Inference Procedure

Different statistical questions require different inference procedures, such as t-tests, z-tests, chi-square tests, and regression analysis. Always check the assumptions and conditions before applying a procedure.

Additional info: These notes are structured to cover the main topics in a college-level statistics course, including probability, distributions, inference, regression, and categorical data analysis. The included image is a table of residuals, directly relevant to regression diagnostics.