Skip to main content
Back

Chapter 1: Introduction to Statistics - Study Notes

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Introduction to Statistics

Statistical and Critical Thinking

Statistics is the science of collecting, analyzing, interpreting, and presenting data. Critical thinking is essential in statistics to ensure that data collection and analysis are valid and meaningful.

  • Key Concept: The method of collecting sample data is crucial. If data are not collected appropriately, the results may be unreliable.

  • Gold Standard: Random assignment with placebo/treatment groups is considered the gold standard in experimental design. A placebo is a harmless, inactive substance used for comparison.

Elementary Statistics textbook cover

Basics of Collecting Data

Data can be collected through observational studies or experiments. The choice of method affects the validity of conclusions.

  • Experiment: Apply a treatment and observe its effects on subjects (experimental units).

  • Observational Study: Observe and measure characteristics without modifying the subjects.

Examples: Ice Cream and Drownings

Observational studies may lead to incorrect conclusions due to lurking variables. Experiments can help clarify causation.

  • Observational Study Example: Data shows ice cream sales and drownings both increase with temperature, but temperature is the lurking variable.

  • Experiment Example: Groups treated with and without ice cream show no difference in drowning rates, demonstrating no causal effect.

Design of Experiments

Replication

Replication involves repeating an experiment on multiple subjects to ensure results are reliable and not due to chance.

  • Sample Size: Large enough sample sizes are needed to detect treatment effects.

Blinding and Double-Blind Designs

Blinding prevents subjects from knowing whether they receive treatment or placebo, reducing bias. Double-blind designs extend blinding to both subjects and experimenters.

  • Blinding: Subjects do not know their group assignment.

  • Double-Blind: Both subjects and experimenters are unaware of group assignments.

Randomization

Randomization assigns subjects to groups by chance, ensuring groups are similar and reducing bias.

  • Logic: Chance creates comparable groups for valid comparisons.

Sampling Methods

Simple Random Sample

A simple random sample ensures every possible sample of size n has an equal chance of being chosen.

  • Definition: All samples of the same size are equally likely.

  • Random Sample: All members have the same chance of selection (weaker requirement).

Other Sampling Methods

  • Systematic Sampling: Select a starting point, then every kth element.

  • Convenience Sampling: Use data that are easy to obtain.

  • Stratified Sampling: Divide population into subgroups (strata) and sample from each.

  • Cluster Sampling: Divide population into clusters, randomly select clusters, and sample all members in selected clusters.

  • Multistage Sampling: Combine multiple sampling methods in stages.

Types of Observational Studies

  • Cross-sectional Study: Data collected at one point in time.

  • Retrospective (Case Control) Study: Data collected from past records.

  • Prospective (Cohort) Study: Data collected in the future from groups sharing common factors.

Confounding and Controlling Variables

Confounding

Confounding occurs when it is unclear which factor caused an observed effect. Proper experimental design aims to avoid confounding.

Experimental Designs

  • Completely Randomized Design: Subjects assigned to groups randomly.

  • Randomized Block Design: Subjects grouped into blocks with similar characteristics; treatments assigned within blocks.

  • Matched Pairs Design: Subjects matched in pairs based on similarities; each pair receives different treatments.

  • Rigorously Controlled Design: Subjects assigned to groups to ensure similarity in important characteristics (difficult to implement).

Sampling Errors

Types of Errors

  • Sampling Error: Random discrepancies between sample and population results due to chance.

  • Nonsampling Error: Human errors such as incorrect data entry, biased questions, or inappropriate statistical methods.

  • Nonrandom Sampling Error: Errors from using nonrandom sampling methods (e.g., convenience samples).

Summary Table: Sampling Methods

Sampling Method

Description

Example

Simple Random Sample

Every sample of size n has equal chance

Randomly select 50 students from a class

Systematic Sampling

Select every kth element after a random start

Choose every 10th person on a list

Convenience Sampling

Use easily available data

Survey people at a shopping mall

Stratified Sampling

Divide into strata, sample from each

Sample from each grade level in a school

Cluster Sampling

Divide into clusters, sample all in selected clusters

Randomly select classrooms, survey all students in them

Multistage Sampling

Combine methods in stages

Randomly select schools, then classes, then students

Summary Table: Types of Observational Studies

Type

Description

Example

Cross-sectional

Data at one point in time

Survey on current eating habits

Retrospective

Data from past records

Study of past medical records

Prospective

Data collected in the future

Follow a cohort over several years

Key Formulas

  • Probability of Simple Random Sample:

  • Sampling Error:

Additional info: Expanded explanations and examples were added for clarity and completeness. Tables were inferred and constructed to summarize sampling methods and types of observational studies.

Pearson Logo

Study Prep