BackChapter 1: Introduction to Statistics - Study Notes
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Introduction to Statistics
Statistical and Critical Thinking
Statistics is the science of collecting, analyzing, interpreting, and presenting data. Critical thinking is essential in statistics to ensure that data collection and analysis are valid and meaningful.
Key Concept: The method of collecting sample data is crucial. If data are not collected appropriately, the results may be unreliable.
Gold Standard: Random assignment with placebo/treatment groups is considered the gold standard in experimental design. A placebo is a harmless, inactive substance used for comparison.

Basics of Collecting Data
Data can be collected through observational studies or experiments. The choice of method affects the validity of conclusions.
Experiment: Apply a treatment and observe its effects on subjects (experimental units).
Observational Study: Observe and measure characteristics without modifying the subjects.
Examples: Ice Cream and Drownings
Observational studies may lead to incorrect conclusions due to lurking variables. Experiments can help clarify causation.
Observational Study Example: Data shows ice cream sales and drownings both increase with temperature, but temperature is the lurking variable.
Experiment Example: Groups treated with and without ice cream show no difference in drowning rates, demonstrating no causal effect.
Design of Experiments
Replication
Replication involves repeating an experiment on multiple subjects to ensure results are reliable and not due to chance.
Sample Size: Large enough sample sizes are needed to detect treatment effects.
Blinding and Double-Blind Designs
Blinding prevents subjects from knowing whether they receive treatment or placebo, reducing bias. Double-blind designs extend blinding to both subjects and experimenters.
Blinding: Subjects do not know their group assignment.
Double-Blind: Both subjects and experimenters are unaware of group assignments.
Randomization
Randomization assigns subjects to groups by chance, ensuring groups are similar and reducing bias.
Logic: Chance creates comparable groups for valid comparisons.
Sampling Methods
Simple Random Sample
A simple random sample ensures every possible sample of size n has an equal chance of being chosen.
Definition: All samples of the same size are equally likely.
Random Sample: All members have the same chance of selection (weaker requirement).
Other Sampling Methods
Systematic Sampling: Select a starting point, then every kth element.
Convenience Sampling: Use data that are easy to obtain.
Stratified Sampling: Divide population into subgroups (strata) and sample from each.
Cluster Sampling: Divide population into clusters, randomly select clusters, and sample all members in selected clusters.
Multistage Sampling: Combine multiple sampling methods in stages.
Types of Observational Studies
Cross-sectional Study: Data collected at one point in time.
Retrospective (Case Control) Study: Data collected from past records.
Prospective (Cohort) Study: Data collected in the future from groups sharing common factors.
Confounding and Controlling Variables
Confounding
Confounding occurs when it is unclear which factor caused an observed effect. Proper experimental design aims to avoid confounding.
Experimental Designs
Completely Randomized Design: Subjects assigned to groups randomly.
Randomized Block Design: Subjects grouped into blocks with similar characteristics; treatments assigned within blocks.
Matched Pairs Design: Subjects matched in pairs based on similarities; each pair receives different treatments.
Rigorously Controlled Design: Subjects assigned to groups to ensure similarity in important characteristics (difficult to implement).
Sampling Errors
Types of Errors
Sampling Error: Random discrepancies between sample and population results due to chance.
Nonsampling Error: Human errors such as incorrect data entry, biased questions, or inappropriate statistical methods.
Nonrandom Sampling Error: Errors from using nonrandom sampling methods (e.g., convenience samples).
Summary Table: Sampling Methods
Sampling Method | Description | Example |
|---|---|---|
Simple Random Sample | Every sample of size n has equal chance | Randomly select 50 students from a class |
Systematic Sampling | Select every kth element after a random start | Choose every 10th person on a list |
Convenience Sampling | Use easily available data | Survey people at a shopping mall |
Stratified Sampling | Divide into strata, sample from each | Sample from each grade level in a school |
Cluster Sampling | Divide into clusters, sample all in selected clusters | Randomly select classrooms, survey all students in them |
Multistage Sampling | Combine methods in stages | Randomly select schools, then classes, then students |
Summary Table: Types of Observational Studies
Type | Description | Example |
|---|---|---|
Cross-sectional | Data at one point in time | Survey on current eating habits |
Retrospective | Data from past records | Study of past medical records |
Prospective | Data collected in the future | Follow a cohort over several years |
Key Formulas
Probability of Simple Random Sample:
Sampling Error:
Additional info: Expanded explanations and examples were added for clarity and completeness. Tables were inferred and constructed to summarize sampling methods and types of observational studies.