Skip to main content
Back

Statistics Exam Study Guide: Sampling, Experiments, Probability, and Regression

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Sampling and Experimental Design

Subjects and Treatments in Experiments

In statistics, experiments are designed to test the effect of treatments on subjects. Understanding the distinction between subjects and treatments is fundamental for interpreting experimental results.

  • Subjects: The individuals or objects being studied. For example, in a diet study, the subjects are the adults participating in the experiment.

  • Treatments: The specific conditions or interventions applied to subjects. In the diet study, the treatments are the new diet plan and the usual diet.

  • Random Assignment: Assigning subjects to treatments randomly helps reduce bias and confounding variables.

  • Example: A nutritionist randomly assigns 25 adults to a new diet and 25 to continue their usual diet, then measures cholesterol levels after 3 months.

Population and Sample

Distinguishing between the population and the sample is essential in statistical studies.

  • Population: The entire group of individuals or items of interest. For example, all crabs in the Gulf of Mexico.

  • Sample: A subset of the population selected for study. For example, the 60 crabs collected and measured.

  • Parameter: A numerical summary of a population (e.g., population mean).

  • Statistic: A numerical summary of a sample (e.g., sample mean).

  • Example: A marine biologist measures the lengths of 60 crabs to estimate the average length of all crabs in the Gulf of Mexico.

Benefits of Random Sampling and Random Assignment

Random sampling and random assignment are key techniques in statistics to ensure validity and reliability.

  • Random Sampling: Ensures that the sample is representative of the population, reducing selection bias.

  • Random Assignment: Helps control for confounding variables, allowing for causal inference in experiments.

  • Example: Randomly selecting adults for a sugar intake study increases the generalizability of the results.

Types of Studies: Observational vs. Experimental

Statistical studies can be classified as observational or experimental.

  • Observational Study: Researchers observe subjects without intervening. Example: Measuring blood pressure and exercise hours without assigning treatments.

  • Experimental Study: Researchers apply treatments and observe effects. Example: Giving a vitamin supplement to one group of mice and not to another, then measuring antibody levels.

Probability and Random Variables

Probability Distributions

A probability distribution lists all possible outcomes of a random experiment and their associated probabilities.

  • Discrete Random Variable: Takes on a countable number of distinct values (e.g., outcomes of rolling a die).

  • Continuous Random Variable: Takes on an infinite number of possible values within a range (e.g., heights of adults).

Die Outcome (X)

Probability P(X)

1

0.1

2

0.2

3

0.3

4

0.2

5

?

6

0.1

Additional info: The missing probability for outcome 5 can be found by ensuring the total probability sums to 1.

  • Sample Space: The set of all possible outcomes. For a die: {1, 2, 3, 4, 5, 6}

  • Probability of an Event: Sum the probabilities of the outcomes in the event.

  • Example: Probability of rolling a 1 or 3:

Discrete vs. Continuous Random Variables

Random variables are classified based on the type of values they can take.

  • Discrete Random Variable: Has specific, countable outcomes (e.g., number of heads in coin tosses).

  • Continuous Random Variable: Can take any value within a range (e.g., height, weight).

  • Example: The outcome of rolling a die is a discrete random variable.

Central Limit Theorem and Law of Large Numbers

Central Limit Theorem (CLT)

The Central Limit Theorem is a fundamental concept in inferential statistics.

  • Statement: For a large enough sample size, the sampling distribution of the sample mean will be approximately normal, regardless of the population's distribution.

  • Formula: If are independent and identically distributed random variables with mean and standard deviation , then the sample mean approaches normality as increases.

  • Application: Allows use of normal probability methods for inference about means.

Law of Large Numbers

The Law of Large Numbers describes the result of performing the same experiment many times.

  • Statement: As the number of trials increases, the sample mean approaches the population mean.

  • Formula:

  • Application: Justifies using sample statistics to estimate population parameters.

Regression Analysis

Linear Regression Equation

Linear regression is used to model the relationship between two quantitative variables.

  • Equation: , where is the dependent variable, is the independent variable, is the intercept, and is the slope.

  • Interpretation: The slope indicates the change in for a one-unit increase in .

  • Example: In the equation , is anxiety score, is hours of daily exercise, , .

  • Intercept (): The value of when .

  • Slope (): The rate of change of with respect to .

  • Calculation: If Sarah exercises 8 hours per week, .

Normal Distribution and Probability Calculations

Normal Distribution

The normal distribution is a continuous probability distribution characterized by its mean and standard deviation.

  • Mean (): The center of the distribution.

  • Standard Deviation (): Measures the spread of the distribution.

  • Probability Calculation: To find the probability that a value is less than a certain threshold, use the standard normal () transformation:

  • Example: For cm, cm, cm:

  • Use standard normal tables to find .

Summary Table: Key Concepts

Concept

Definition

Example

Population

Entire group of interest

All adults in a city

Sample

Subset of the population

40 randomly selected adults

Parameter

Numerical summary of population

Population mean sugar intake

Statistic

Numerical summary of sample

Sample mean sugar intake

Discrete Random Variable

Countable outcomes

Die roll outcome

Continuous Random Variable

Infinite possible values

Height of adults

Additional info: Some questions in the original file were incomplete or missing multiple-choice options. Academic context and examples have been added to ensure completeness and clarity.

Pearson Logo

Study Prep