Skip to main content
Back

Lesson 2

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 4: Gathering Data

Section 4.1: Study Design – Experiment or Observation?

Understanding how data is gathered is fundamental in statistics. The choice between conducting an experiment or merely observing can significantly affect the conclusions drawn from a study.

  • Population: The entire group of subjects of interest.

  • Sample: A subset of the population, selected for measurement.

  • Observational Study: The researcher observes values of variables without imposing treatments.

  • Experimental Study: The researcher assigns subjects to conditions (treatments) and observes outcomes.

Key Terms

  • Explanatory Variable: The variable manipulated or categorized to explain changes in the response variable.

  • Response Variable: The outcome measured in the study.

Example: Alcohol Consumption and Type II Diabetes

An observational study followed individuals for 12 years, examining the relationship between alcohol consumption and the risk of developing type II diabetes. The study found a ~30% reduced risk in moderate alcohol consumers, but no risk reduction in heavy or non-consumers.

  • Observational Study: No assignment of alcohol consumption schedules; researchers merely observed existing habits.

  • Limitation: Cannot distinguish effects of drinking from other factors (diet, lifestyle, etc.).

Diagram showing explanatory and lurking variables affecting response variable

Comparing Experimental and Observational Studies

Experiments provide more control over outside influences and can establish cause and effect, while observational studies can only identify associations.

  • Experiment: Reduces potential for lurking variables.

  • Observational Study: Cannot establish causation.

  • Ethical and Practical Constraints: Sometimes experiments are not feasible.

Section 4.2: Sampling Methods

Sampling Frame & Sampling Design

The sampling design determines how subjects are selected. Ideally, every subject should have an equal chance of being chosen.

  • Sampling Frame: The list of subjects from which the sample is drawn.

  • Sampling Design: The method used to select the sample.

Diagram of population and sample selection

Simple Random Sampling (SRS)

Simple random sampling is the gold standard for obtaining a representative sample.

  • SRS: Each possible sample of size n has the same chance of being selected.

  • Random Number Table: Used to select subjects randomly.

Table of random numbers

Steps to Select a SRS

  1. Number the subjects in the sampling frame.

  2. Select numbers from a random number table or generator.

  3. Include subjects whose numbers match the selected random numbers.

Margin of Error

Sample surveys estimate population percentages and include a margin of error, which quantifies the uncertainty in the estimate.

  • Margin of Error: Indicates how well the sample estimate predicts the population percentage.

  • Formula (approximate):

Poor Sampling Methods: Convenience and Volunteer Samples

Convenience samples are easy to obtain but often biased and unrepresentative. Volunteer samples are a common form of convenience sampling.

  • Convenience Sample: Results apply only to observed subjects.

  • Volunteer Sample: Volunteers may not represent the population.

Illustration of a shopping mall, representing convenience sampling

Types of Bias in Sample Surveys

Bias is a systematic tendency to favor certain parts of the population.

  • Sampling Bias: Due to nonrandom samples or undercoverage.

  • Nonresponse Bias: When subjects cannot be reached or refuse to participate.

  • Response Bias: When subjects give incorrect responses or questions are misleading.

Key Parts of a Sample Survey

  • Identify the population of interest.

  • Construct a sampling frame.

  • Use random sampling to select subjects.

  • Be cautious of sampling bias.

Section 4.3: Experimental Design

Elements of an Experiment

Experiments involve deliberate assignment of treatments to subjects to observe their responses.

  • Experimental Units: Subjects measured in the experiment.

  • Treatment: Experimental condition imposed on subjects.

  • Explanatory Variable: Defines groups to be compared.

  • Response Variable: Outcome measured to assess treatment effect.

Components of a Good Experiment

  • Control/Comparison Group: Allows analysis of treatment effectiveness.

  • Randomization: Eliminates researcher bias and balances groups.

  • Replication: Assigns several units to each treatment, increasing confidence in results.

Example: TV Advertising Experiment

College students viewed ads of varying length and frequency. Their recall, attitude, and purchase intention were measured.

  • Subjects: College students

  • Factors: Length and repetition of ads

  • Treatments: Combinations of ad length and repetition

  • Response Variables: Recall, attitude, purchase intention

Principle 1: Control or Comparison Group

  • Placebo: Dummy treatment used in control groups.

  • Control Group: Receives placebo or standard treatment for comparison.

Principle 2: Randomization

  • Random assignment eliminates bias and balances groups on known and unknown variables.

Principle 3: Replication

  • Assigning multiple units to each treatment reduces ordinary variation and increases confidence in results.

Blinding the Experiment

  • Blinding: Subjects are unaware of their treatment.

  • Double-Blinding: Neither subjects nor investigators know the treatment assignments, controlling response bias.

Statistical Significance

  • If the observed difference is larger than expected by chance, it is statistically significant.

  • Practical significance should also be considered.

Generalizing Results

  • Results should be generalized only to the population represented by the study.

Lesson Summary

  • Two primary methods for gathering data: observational study and experiment.

  • Observational studies identify associations but not causation.

  • Randomization minimizes effects of lurking variables.

  • Biases can arise from undercoverage, nonresponse, and response bias.

  • Experiments assign subjects to treatments and often use placebos in control groups.

Pearson Logo

Study Prep