BackStatistics Study Guide: Probability, Experimental Design, Chi-Square Tests, and Statistical Inference
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Chapter 11: Probability and Randomness
Learning Objectives
This chapter introduces the foundational concepts of probability and randomness, essential for understanding statistical inference and data analysis.
Randomness and Probability: Correctly interpret randomness and probability in the long run. Randomness refers to outcomes that cannot be predicted with certainty, while probability quantifies the likelihood of these outcomes.
Sample Space: Identify the possible outcomes of an experiment, known as the sample space. The sample space is the set of all possible results of a random process.
Complement of an Event: Identify the complement of an event, which consists of all outcomes in the sample space that are not part of the event.
Probability Rules: Correctly assign probabilities to possible outcomes by applying the probability rules, such as the addition and multiplication rules.
Key Formulas
Probability of an event:
Complement Rule:
Addition Rule (for mutually exclusive events):
Example
If a die is rolled, the sample space is {1, 2, 3, 4, 5, 6}. The probability of rolling an even number is .
Chapter 22: Chi-Square Tests and Categorical Data Analysis
Learning Objectives
This chapter covers the use of chi-square tests for analyzing categorical data, including goodness-of-fit, homogeneity, and independence tests.
Hypothesis for Chi-Square Test: Identify whether a hypothesis calls for a chi-square test of goodness-of-fit, homogeneity, or independence.
Assumptions: Check whether the assumptions for performing a chi-square test are met, such as expected cell counts and independence of observations.
Degrees of Freedom: Determine the chi-square statistic and the degrees of freedom for a chi-square test.
Interpreting Output: Interpret the output of a chi-square test, including the p-value and test statistic.
Standardized Residuals: Use standardized residuals to interpret the association between two categorical variables.
Key Formulas
Chi-Square Statistic: , where is the observed frequency and is the expected frequency.
Degrees of Freedom (for contingency table): , where is the number of rows and is the number of columns.
Example
Testing whether a die is fair: compare observed counts of each face to expected counts using the chi-square statistic.
Chapter 17: Statistical Inference and Error Types
Learning Objectives
This chapter focuses on statistical inference, including p-values, confidence intervals, and the distinction between statistical and biological significance.
P-value: Explain the meaning of the p-value, which quantifies the probability of observing data as extreme as the sample, assuming the null hypothesis is true.
Confidence Interval: Use a confidence interval to approximate a one- or two-sided hypothesis test about a proportion.
Statistical vs. Biological Relevance: Explain the relationship between statistical significance and biological relevance.
Type I and Type II Errors: Explain the difference between Type I errors (false positives) and Type II errors (false negatives), and how these errors affect hypothesis testing.
Key Formulas
Confidence Interval for a Proportion:
Type I Error Rate (): Probability of rejecting the null hypothesis when it is true.
Type II Error Rate (): Probability of failing to reject the null hypothesis when it is false.
Example
In a clinical trial, a p-value of 0.03 suggests statistical significance at the 0.05 level, but the effect size should be considered for biological relevance.
Chapter 10: Experimental Design and Epidemiological Studies
Learning Objectives
This chapter introduces experimental design, observational studies, and epidemiological concepts relevant to statistical analysis.
Experiment vs. Observational Study: Explain the difference between an experiment and an observational study.
Experimental Units and Factors: Identify the experimental units or subjects, the factors, the treatments, the response variable, and the number of replications.
Principles of Experimental Design: Explain the purposes of the four principles of experimental design: control, randomization, replication, and blocking.
Epidemiological/Clinical Study Designs: Identify different epidemiological or clinical study designs, such as cohort, case-control, and randomized controlled trials.
Risk Ratios and Odds Ratios: Decide whether risk ratio (hazard ratio) can be calculated and interpret results with risk ratios or odds ratios.
Strength of Evidence: Evaluate the strength of evidence at the result level (direction of effect, effect size, statistical significance) and at the study design level (evidence hierarchy).
Key Formulas
Risk Ratio:
Odds Ratio:
Example
A randomized controlled trial compares the incidence of disease in treatment and control groups to calculate the risk ratio.
Summary Table: Key Statistical Concepts
Concept | Definition | Example |
|---|---|---|
Probability | Likelihood of an event occurring | Chance of rolling a 6 on a die: |
Chi-Square Test | Test for association between categorical variables | Testing independence in a contingency table |
P-value | Probability of observed data under null hypothesis | P-value of 0.03 indicates statistical significance |
Type I Error | False positive | Rejecting a true null hypothesis |
Type II Error | False negative | Failing to reject a false null hypothesis |
Risk Ratio | Relative risk between groups | Incidence in exposed vs. unexposed |
Odds Ratio | Relative odds between groups | Odds of disease in exposed vs. unexposed |
Additional info:
Supporting materials include chapter videos, external links, and recommended articles for deeper understanding.
Topics covered are highly relevant for college-level statistics, including probability, distributions, conditional probabilities, chi-square tests, effect size, power, experimental design, and strength of evidence.