BackClassifying Data: Types, Levels of Measurement, and Study Designs
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Classifying Data
Introduction to Statistics and Data
Statistics is the science of collecting, organizing, analyzing, and interpreting data to make decisions or draw conclusions under uncertainty. Data are} systematically collected observations, measurements, or facts that are studied to gain insights.
Data: Observations, measurements, or facts collected for analysis.
Statistics: The science of working with data to make informed decisions.
Example: Social media platforms collect data from user profiles, posts, likes, and connections to tailor content and advertisements.
Types of Data
Data can be classified as either categorical (qualitative) or quantitative (numerical).
Categorical Data: Represents categories or groups (e.g., account type, platform, preferred language).
Quantitative Data: Represents measurable quantities (e.g., number of followers, time spent on app, age).
Note: Numeric codes assigned to categories (e.g., 1 for personal, 2 for business) are still categorical, not quantitative.
Further Classification of Data
Quantitative Data:
Discrete: Has gaps between possible values (e.g., 0, 1, 2, 3). All finite data sets are discrete.
Continuous: No gaps between possible values (e.g., height, weight). Values can be infinitely precise.
Categorical Variables:
Nominal: Categories with no natural order (e.g., Samsung, Apple, Nokia).
Ordinal: Categories with a natural order, but no fixed numerical spacing (e.g., small, medium, large).
Levels of Measurement
Data can be further classified by its level of measurement, which determines the types of statistical analysis that are appropriate.
Nominal: Categories without order (e.g., brands, jersey numbers).
Ordinal: Ordered categories, but differences between ranks are not equal (e.g., chili pepper scale, survey ratings).
Interval: Ordered, equal spacing between values, but no true zero (e.g., temperature in Celsius, IQ scores).
Ratio: Ordered, equal spacing, and a true zero; multiples are meaningful (e.g., number of likes, physical measurements).
Example: Chili Pepper Scale
The chili pepper scale visually represents levels of spiciness, which are ordered categories. This is an example of ordinal data because the peppers are ranked from least to most spicy, but the difference between each level is not necessarily equal.

Example: Jersey Numbers
Jersey numbers assigned to players are nominal data because the numbers are used as labels and do not imply any order or ranking.
Example: IQ Scores
IQ scores are interval data because the differences between scores are meaningful, but there is no true zero point.
Example: Number of Likes
The number of likes on a social media post is ratio data because there is a true zero (no likes), and multiples are meaningful (e.g., 10 likes is twice as many as 5 likes).
Data Matrix Example
A data matrix can contain various types of variables. Consider the following example:
Variable | Type | Level of Measurement |
|---|---|---|
Location ID | Categorical | Nominal |
Location Type | Categorical | Nominal |
Country | Categorical | Nominal |
Elevation (meters) | Quantitative | Ratio |
Mean Monthly Temp (°C) | Quantitative | Interval |
# days/year above mean | Quantitative | Ratio |
General Description | Categorical | Ordinal |
Populations, Samples, Statistics, and Parameters
Definitions and Relationships
Understanding the distinction between populations and samples is fundamental in statistics.
Population: The complete set of individuals or items of interest.
Sample: A subset of the population, selected for study.
Parameter: A numerical value describing a characteristic of the population.
Statistic: A numerical value describing a characteristic of the sample.
Example: To estimate the average number of sick days taken by Canadian workers (population), a researcher studies a random sample of 500 workers (sample) and calculates the average for this group (statistic).
Statistics is concerned with using sample statistics to estimate population parameters and to test claims about those parameters.
Methods of Data Collection
Surveys, Observational Studies, Experiments, and Census
Data can be collected through various methods, each with its own strengths and limitations.
Survey: Collects data from a sample using questionnaires or interviews.
Observational Study: Researchers observe and record data without manipulating variables. Can demonstrate associations but not causation.
Experiment: Researchers manipulate variables (treatments) to measure their effect on a response. Can establish cause-and-effect relationships.
Census: Collects data from the entire population.
Observational Study vs. Experiment
Observational Study: Example: Recording diets and health outcomes. Can show association but not causation.
Experiment: Example: Randomly assigning participants to medication or placebo. Can establish causality.
Key Point: Only well-designed experiments can demonstrate cause-and-effect relationships; observational studies are limited to identifying associations.