Skip to main content
Back

Chapter 1: Data Collection – Fundamentals of Statistics

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Introduction to Statistics

What is Statistics?

Statistics is the science of collecting, organizing, analyzing, and interpreting data to make informed decisions. It provides tools for understanding data and drawing conclusions from it.

  • Data: Information gathered from counting, measuring, or collecting responses.

  • Population: The entire set containing all data of interest ("every," "each").

  • Sample: A subset of the population, used to draw conclusions about the whole.

  • Parameter: A numerical value that describes a characteristic of a population.

  • Statistic: A numerical value that describes a characteristic of a sample.

Example: If you measure the salary of every employee at a marketing firm, you have population data. If you measure the salary of 12 out of 100 employees, you have sample data. The average salary of all employees is a parameter; the average salary of the 12 employees is a statistic.

Practice Questions

  • Collecting test scores of every other student in a class: Sample

  • 46.5% of all registered voters are registered democrats: Parameter

  • Amount spent by each customer in a grocery store: Population

  • Average workout duration from a survey of 40 gym members: Statistic

Types of Data

Qualitative vs. Quantitative Data

Data can be categorized as either qualitative or quantitative, each with distinct characteristics and uses.

  • Qualitative Data: Describes qualities or categories (e.g., favorite color, eye color).

  • Quantitative Data: Describes quantities or amounts (e.g., number of students, dice roll).

  • Discrete Data: Quantitative data that cannot be broken down further (e.g., number of students).

  • Continuous Data: Quantitative data that can be measured to any degree of precision (e.g., time, temperature).

Examples:

  • Surveying nationalities: Qualitative

  • Measuring distances walked: Quantitative; Continuous

Clock representing time (continuous data) Thermometer representing temperature (continuous data)

Practice Questions

  • Which is NOT quantitative data? The brands of smartphones owned by students

  • Which is a discrete quantitative set? The number of goals scored by a soccer team in a match

Levels of Measurement

Understanding Levels of Measurement

Levels of measurement describe the nature of information within the values assigned to variables. They determine what kinds of statistical analysis are appropriate.

Level

Description

Qualitative or Quantitative

Example

Nominal

Categories, names, or labels; no order or calculations

Either

Hair color

Ordinal

Data can be ordered; differences are not meaningful

Either

Letter grades

Interval

Ordered; differences are meaningful; no true zero

Quantitative

Temperature

Ratio

Ordered; differences and ratios are meaningful; true zero exists

Quantitative

Heights, distances

Example: Birth years (interval), satisfaction ratings (ordinal), working hours (ratio), favorite music genre (nominal).

Nominal data example Ordinal data example Interval data example Ratio data example

Practice Questions

  • Which level could describe both quantitative or qualitative data? Nominal, Ordinal

  • Participants rate symptoms as mild, moderate, or severe: Ordinal

  • Birth weights of newborns: Ratio

Ordinal data practice Interval data practice Nominal data practice Ratio data practice

Example: Recording water temperature at multiple points is interval data. Saying 80°F is twice as hot as 40°F is incorrect because the interval scale does not have a true zero.

Temperature measurement example Interval scale explanation

Collecting Data

Observational Studies vs. Experiments

There are two main ways to collect data:

  • Experiment: Apply a treatment and measure its effects; can assume causation if well-designed.

  • Observational Study: Observe and measure characteristics without influencing them; cannot assume causation.

Examples:

  • Testing medication with a placebo group: Experiment; Causation possible

  • Surveying students about sleep habits: Observational Study; No causation

  • Comparing fair and loaded dice: Experiment; Causation possible

Experiment example Observational study example Survey example Dice experiment example

Practice Questions

  • Randomly selecting stores to stay open later and comparing profits: Experiment; Causation possible

  • Surveying customers about advertising: Observational Study; No causation

  • Surveying employees about personal growth: Observational Study

  • Testing a fitness app for weight loss: Experiment

Sampling Methods

Simple Random Sampling (SRS)

Sampling is the process of selecting a smaller group (sample) from a larger group (population). A representative sample accurately reflects the characteristics of the population.

  • Simple Random Sampling (SRS): Each subject and each possible group is equally likely to be chosen.

Example: Randomly selecting marbles from a bag or students from a class.

Random selection of marbles Sample of people Sample of people

Practice Questions

  • Surveying only gym members in fitness classes: Not a representative sample

  • Polling all people entering a shop on a random day: Not a representative sample

  • Randomly selecting teachers across grades and disciplines: Representative sample

  • Surveying random employees in each branch: Representative sample

Other Sampling Methods

When SRS is not practical, other methods can be used:

Method

Description

Example

Systematic

Select every nth subject

Testing every 12th cookie

Cluster

Divide population into groups (clusters), randomly select clusters, survey all in selected clusters

Randomly select 1 class per grade and survey all students

Stratified

Divide population into groups (strata) with similar characteristics, randomly select subjects from each stratum

Surveying 50 undergrads and 50 grad students

Systematic sampling example Cluster sampling example Stratified sampling example Simple random sampling example

Practice Questions

  • Selecting random cases at the end of the day: Cluster sampling

  • Selecting every tenth unit: Systematic sampling

  • Randomly selecting 100 out of 1500 units: Simple random sampling

  • Taking 10 random units from each of 10 machines: Stratified sampling

Summary Table: Sampling Methods

Sampling Method

How It Works

When to Use

Simple Random

Randomly select from whole population

When every subject/group should have equal chance

Systematic

Select every nth subject

When population is ordered or production line

Cluster

Randomly select groups, survey all in group

When population is naturally divided into groups

Stratified

Divide by characteristic, randomly select from each

When subgroups are important to represent

Additional info: These foundational concepts are essential for understanding how to properly collect and analyze data in statistics, ensuring valid and reliable results for further statistical inference.

Pearson Logo

Study Prep