Skip to main content
Back

Chapter 1: Introduction to Statistics – Structured Study Notes

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Introduction to Statistics

1.1 Review and Preview

Statistics is the science of collecting, analyzing, presenting, and interpreting data. It is foundational for making informed decisions in various fields.

  • Data: Observations such as measurements, genders, or survey responses that have been collected.

  • Statistics (the subject): A collection of methods for planning experiments, obtaining data, and then organizing, summarizing, presenting, analyzing, and interpreting those data, as well as drawing conclusions based on them.

  • Population: The complete collection of all elements (individuals, items, or data) to be studied.

  • Sample: A subset of elements selected from the population.

Example: Determining whether a group is a population or a sample:

  • All American citizens – Population

  • Marketing interns at a new division – Sample

  • A surgical ward’s patients – Sample

  • All college football fields – Population

  • 5 out of 1000 testing bottles of hypertensive potion – Sample

Important Note: Always identify the population of interest and the sample used in a study. The sample must be representative of the population for valid conclusions.

1.2 Statistical and Critical Thinking

Statistical analysis requires careful consideration of context, data source, sampling method, and the distinction between statistical and practical significance.

  • Context: What do the data mean? What is the goal of the study?

  • Source of Data: Who collected the data? Is there bias?

  • Sampling Method: Was the data collected in a way that is unbiased or biased?

  • Graph the Data: Visualize the data to identify patterns.

  • Statistical Methods: Use appropriate methods (mean, median, correlation, hypothesis tests, etc.).

  • Statistical Significance: Results are unlikely to occur by chance.

  • Practical Significance: Results have real-world importance.

Example: A study finds a very strong correlation between shoe size and reading score. Does this mean that having a bigger shoe size causes one to be a better reader? Correlation does not imply causation.

1.3 Types of Data

Data can be classified based on the nature of the variables and the level of measurement.

  • Individuals: Objects described by a set of data (people, animals, things).

  • Variable: A characteristic of an individual that can take different values.

  • Parameter: A numerical measurement describing a characteristic of a population.

  • Statistic: A numerical measurement describing a characteristic of a sample.

Qualitative vs. Quantitative Data:

Type

Description

Examples

Qualitative (Categorical)

Describes attributes or categories

Gender, political party, class ranking

Quantitative (Numerical)

Describes numerical measurements

Height, weight, number of cars

Quantitative variables can be further classified as:

  • Discrete: Countable values (e.g., number of students)

  • Continuous: Infinite possible values within a range (e.g., height, gallons of gasoline)

1.4 Levels of Measurement

Variables can be measured at different levels, which determine the type of statistical analysis possible.

Level

Description

Examples

Nominal

Categories only, no order

Political party, gender

Ordinal

Categories with order, no meaningful differences

Class ranking

Interval

Ordered, meaningful differences, no true zero

Temperature (°F)

Ratio

Ordered, meaningful differences, true zero

Height, age, income

1.5 Collecting Sample Data

Sampling methods are crucial for obtaining representative data. Common methods include:

  • Random Sampling: Every member of the population has an equal chance of being selected.

  • Stratified Sampling: Population divided into subgroups (strata), then random samples taken from each stratum.

  • Cluster Sampling: Population divided into clusters, some clusters are randomly selected, and all members of selected clusters are sampled.

  • Systematic Sampling: Select every k-th member from a list after a random start.

  • Multistage Sampling: Combines several sampling methods, often used for large populations.

Example: To study student opinions, you might randomly select classes (clusters) and survey all students in those classes.

1.6 Experimental Design

Experiments are designed to study the effect of treatments on subjects. Key components include:

  • Individuals: Subjects being studied.

  • Factors: Explanatory variables manipulated by the researcher.

  • Levels: Different values of the factors.

  • Treatments: Combinations of factor levels applied to subjects.

  • Response Variable: Outcome measured in the experiment.

Types of Experimental Design:

  • Completely Randomized Design: Subjects randomly assigned to treatments.

  • Block Design: Subjects grouped by similarity, then randomly assigned within blocks.

  • Matched Pairs Design: Subjects paired based on similarity, then each receives different treatments.

Important Terminology:

  • Blinding: Subjects do not know which treatment they receive.

  • Double-Blind: Both subjects and researchers do not know treatment assignments.

  • Placebo Effect: Subjects respond to a fake treatment as if it were real.

  • Confounding: When effects of multiple factors cannot be distinguished.

Key Formulas and Equations

  • Sample Mean:

  • Population Mean:

  • Sample Proportion:

Summary Table: Sampling Methods

Method

Description

Example

Random

Equal chance for all

Lottery draw

Stratified

Divide by strata, sample each

Sample by age group

Cluster

Divide by clusters, sample all in selected clusters

Sample all students in selected classes

Systematic

Every k-th member

Every 10th person on a list

Multistage

Combination of methods

Randomly select schools, then classes, then students

Additional info:

  • These notes cover foundational concepts from Chapter 1 of a college-level Statistics course, including definitions, types of data, sampling methods, and experimental design.

  • Examples and tables have been expanded for clarity and completeness.

Pearson Logo

Study Prep