BackChapter 1: Data Collection – Fundamentals of Statistics
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Introduction to Statistics
What is Statistics?
Statistics is the science of collecting, organizing, analyzing, and interpreting data to make informed decisions. It provides tools for understanding data and drawing conclusions from it.
Data: Information gathered from counting, measuring, or collecting responses.
Population: The entire set containing all data of interest ("every," "each").
Sample: A subset of the population, used to draw conclusions about the whole.
Parameter: A numerical value that describes a characteristic of a population.
Statistic: A numerical value that describes a characteristic of a sample.
Example: If you measure the salary of every employee at a marketing firm, you have population data. If you measure the salary of 12 out of 100 employees, you have sample data. The average salary of all employees is a parameter; the average salary of the 12 employees is a statistic.
Practice Questions
Collecting test scores of every other student in a class: Sample
46.5% of all registered voters are registered democrats: Parameter
Amount spent by each customer in a grocery store: Population
Average workout duration from a survey of 40 gym members: Statistic
Types of Data
Qualitative vs. Quantitative Data
Data can be categorized as either qualitative or quantitative, each with distinct characteristics and uses.
Qualitative Data: Describes qualities or categories (e.g., favorite color, eye color).
Quantitative Data: Describes quantities or amounts (e.g., number of students, dice roll).
Discrete Data: Quantitative data that cannot be broken down further (e.g., number of students).
Continuous Data: Quantitative data that can be measured to any degree of precision (e.g., time, temperature).
Examples:
Surveying nationalities: Qualitative
Measuring distances walked: Quantitative; Continuous

Practice Questions
Which is NOT quantitative data? The brands of smartphones owned by students
Which is a discrete quantitative set? The number of goals scored by a soccer team in a match
Levels of Measurement
Understanding Levels of Measurement
Levels of measurement describe the nature of information within the values assigned to variables. They determine what kinds of statistical analysis are appropriate.
Level | Description | Qualitative or Quantitative | Example |
|---|---|---|---|
Nominal | Categories, names, or labels; no order or calculations | Either | Hair color |
Ordinal | Data can be ordered; differences are not meaningful | Either | Letter grades |
Interval | Ordered; differences are meaningful; no true zero | Quantitative | Temperature |
Ratio | Ordered; differences and ratios are meaningful; true zero exists | Quantitative | Heights, distances |
Example: Birth years (interval), satisfaction ratings (ordinal), working hours (ratio), favorite music genre (nominal).

Practice Questions
Which level could describe both quantitative or qualitative data? Nominal, Ordinal
Participants rate symptoms as mild, moderate, or severe: Ordinal
Birth weights of newborns: Ratio

Example: Recording water temperature at multiple points is interval data. Saying 80°F is twice as hot as 40°F is incorrect because the interval scale does not have a true zero.

Collecting Data
Observational Studies vs. Experiments
There are two main ways to collect data:
Experiment: Apply a treatment and measure its effects; can assume causation if well-designed.
Observational Study: Observe and measure characteristics without influencing them; cannot assume causation.
Examples:
Testing medication with a placebo group: Experiment; Causation possible
Surveying students about sleep habits: Observational Study; No causation
Comparing fair and loaded dice: Experiment; Causation possible

Practice Questions
Randomly selecting stores to stay open later and comparing profits: Experiment; Causation possible
Surveying customers about advertising: Observational Study; No causation
Surveying employees about personal growth: Observational Study
Testing a fitness app for weight loss: Experiment
Sampling Methods
Simple Random Sampling (SRS)
Sampling is the process of selecting a smaller group (sample) from a larger group (population). A representative sample accurately reflects the characteristics of the population.
Simple Random Sampling (SRS): Each subject and each possible group is equally likely to be chosen.
Example: Randomly selecting marbles from a bag or students from a class.

Practice Questions
Surveying only gym members in fitness classes: Not a representative sample
Polling all people entering a shop on a random day: Not a representative sample
Randomly selecting teachers across grades and disciplines: Representative sample
Surveying random employees in each branch: Representative sample
Other Sampling Methods
When SRS is not practical, other methods can be used:
Method | Description | Example |
|---|---|---|
Systematic | Select every nth subject | Testing every 12th cookie |
Cluster | Divide population into groups (clusters), randomly select clusters, survey all in selected clusters | Randomly select 1 class per grade and survey all students |
Stratified | Divide population into groups (strata) with similar characteristics, randomly select subjects from each stratum | Surveying 50 undergrads and 50 grad students |

Practice Questions
Selecting random cases at the end of the day: Cluster sampling
Selecting every tenth unit: Systematic sampling
Randomly selecting 100 out of 1500 units: Simple random sampling
Taking 10 random units from each of 10 machines: Stratified sampling
Summary Table: Sampling Methods
Sampling Method | How It Works | When to Use |
|---|---|---|
Simple Random | Randomly select from whole population | When every subject/group should have equal chance |
Systematic | Select every nth subject | When population is ordered or production line |
Cluster | Randomly select groups, survey all in group | When population is naturally divided into groups |
Stratified | Divide by characteristic, randomly select from each | When subgroups are important to represent |
Additional info: These foundational concepts are essential for understanding how to properly collect and analyze data in statistics, ensuring valid and reliable results for further statistical inference.