Classifying Data: Types, Levels of Measurement, and Study Designs

Classifying Data

Introduction to Statistics and Data

Statistics is the science of collecting, organizing, analyzing, and interpreting data to make decisions or draw conclusions under uncertainty. Data are systematically collected observations, measurements, or facts that are studied to gain insights.

  • Data: Observations, measurements, or facts collected for analysis.

  • Statistics: The science of working with data to make informed decisions.

  • Example: Social media platforms collect data from user profiles, posts, likes, and connections to tailor content and advertisements.

Types of Data

Data can be classified as either categorical (qualitative) or quantitative (numerical).

  • Categorical Data: Represents categories or groups (e.g., account type, platform, preferred language).

  • Quantitative Data: Represents measurable quantities (e.g., number of followers, time spent on app, age).

  • Note: Numeric codes assigned to categories (e.g., 1 for personal, 2 for business) are still categorical, not quantitative.
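The note about numeric codes can be illustrated with a minimal Python sketch. The coded column and its values are hypothetical, invented only for illustration; the point is that arithmetic on category codes runs without error yet produces a number with no interpretation, whereas counting category membership is meaningful.

```python
from collections import Counter
from statistics import mean

# Hypothetical coded column: 1 = personal, 2 = business (labels, not amounts).
ACCOUNT_TYPE_LABELS = {1: "personal", 2: "business"}
account_type_codes = [1, 1, 2, 1, 2, 1]

# The arithmetic works, but the result describes nothing:
# there is no account type "1.33".
meaningless_average = mean(account_type_codes)

# Counting how many observations fall in each category is the meaningful summary.
counts = Counter(ACCOUNT_TYPE_LABELS[code] for code in account_type_codes)
print(counts)
```

Swapping the codes (2 for personal, 1 for business) would change the average but not the counts, which is another sign that the codes are only labels.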

Further Classification of Data

  • Quantitative Data:

    • Discrete: Has gaps between possible values (e.g., 0, 1, 2, 3); typically the result of counting.

    • Continuous: Can take any value within an interval, with no gaps between possible values (e.g., height, weight); typically the result of measuring.

  • Categorical Data:

    • Nominal: Categories with no natural order (e.g., Samsung, Apple, Nokia).

    • Ordinal: Categories with a natural order, but no fixed numerical spacing (e.g., small, medium, large).
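As a rough sketch of what "natural order" buys you, the Python snippet below ranks an ordinal variable and reads off its median category. The observed sizes are made-up values, and the helper names (`SIZE_ORDER`, `RANK`) are assumptions introduced for this example.

```python
# Hypothetical ordinal variable: sizes with a natural order.
SIZE_ORDER = ["small", "medium", "large"]
RANK = {size: position for position, size in enumerate(SIZE_ORDER)}

observed_sizes = ["large", "small", "medium", "medium", "small", "medium", "large"]

# Sorting by rank is meaningful for ordinal data; nominal categories
# such as phone brands have no comparable ranking to sort by.
sorted_sizes = sorted(observed_sizes, key=RANK.__getitem__)

# With an odd number of observations, the middle value is the median category.
median_size = sorted_sizes[len(sorted_sizes) // 2]
print(median_size)
```

The ranks 0, 1, 2 are used only for ordering. Averaging them would assume equal spacing between sizes, which ordinal data does not guarantee.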

Levels of Measurement

Data can be further classified by its level of measurement, which determines the types of statistical analysis that are appropriate.

  • Nominal: Categories without order (e.g., brands, jersey numbers).

  • Ordinal: Ordered categories, but differences between ranks are not necessarily equal (e.g., chili pepper scale, survey ratings).

  • Interval: Ordered, equal spacing between values, but no true zero (e.g., temperature in Celsius, IQ scores).

  • Ratio: Ordered, equal spacing, and a true zero; multiples are meaningful (e.g., number of likes, physical measurements).

Example: Chili Pepper Scale

The chili pepper scale visually represents levels of spiciness, which are ordered categories. This is an example of ordinal data because the peppers are ranked from least to most spicy, but the difference between each level is not necessarily equal.

[Figure: Chili pepper scale representing levels of spiciness]

Example: Jersey Numbers

Jersey numbers assigned to players are nominal data because the numbers are used as labels and do not imply any order or ranking.

Example: IQ Scores

IQ scores are interval data because the differences between scores are meaningful, but there is no true zero point.

Example: Number of Likes

The number of likes on a social media post is ratio data because there is a true zero (no likes), and multiples are meaningful (e.g., 10 likes is twice as many as 5 likes).
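One way to see why a true zero matters is to check whether a ratio survives a change of units. The sketch below uses the standard Celsius-to-Fahrenheit conversion; the specific temperatures and like counts are arbitrary illustrative values.

```python
def celsius_to_fahrenheit(celsius: float) -> float:
    """Standard conversion: F = C * 9/5 + 32."""
    return celsius * 9 / 5 + 32


warm_c, cool_c = 20.0, 10.0
warm_f, cool_f = celsius_to_fahrenheit(warm_c), celsius_to_fahrenheit(cool_c)

# Interval scale: the ratio depends on the arbitrary zero point,
# so "twice as hot" is not a meaningful statement.
ratio_in_celsius = warm_c / cool_c      # 2.0
ratio_in_fahrenheit = warm_f / cool_f   # 68 / 50 = 1.36

# Differences, however, stay consistent up to the unit factor 9/5.
difference_c = warm_c - cool_c          # 10.0
difference_f = warm_f - cool_f          # 18.0

# Ratio scale: zero likes really means "none", so the ratio is meaningful.
likes_ratio = 10 / 5                    # 2.0
print(ratio_in_celsius, ratio_in_fahrenheit, likes_ratio)
```

The same two temperatures give a ratio of 2.0 in Celsius and 1.36 in Fahrenheit, while the difference simply rescales by 9/5. That is the practical meaning of "equal spacing, but no true zero".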

Data Matrix Example

A data matrix can contain various types of variables. Consider the following example:

Variable                  Type            Level of Measurement
Location ID               Categorical     Nominal
Location Type             Categorical     Nominal
Country                   Categorical     Nominal
Elevation (meters)        Quantitative    Ratio
Mean Monthly Temp (°C)    Quantitative    Interval
# days/year above mean    Quantitative    Ratio
General Description       Categorical     Ordinal
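The classification in the data matrix above can also be written down as a small lookup structure, which makes it easy to tally levels or pull out the quantitative variables. This is only a sketch: the dictionary name and layout are assumptions, while the variable names and classifications are copied from the table.

```python
from collections import Counter

# (type, level of measurement) for each variable in the data matrix above.
VARIABLE_CLASSIFICATION = {
    "Location ID": ("categorical", "nominal"),
    "Location Type": ("categorical", "nominal"),
    "Country": ("categorical", "nominal"),
    "Elevation (meters)": ("quantitative", "ratio"),
    "Mean Monthly Temp (°C)": ("quantitative", "interval"),
    "# days/year above mean": ("quantitative", "ratio"),
    "General Description": ("categorical", "ordinal"),
}

# How many variables sit at each level of measurement.
level_counts = Counter(level for _, level in VARIABLE_CLASSIFICATION.values())

# Variables for which arithmetic summaries such as a mean are meaningful.
quantitative_variables = [
    name for name, (kind, _) in VARIABLE_CLASSIFICATION.items()
    if kind == "quantitative"
]

# Sanity check: quantitative data is interval or ratio, categorical is nominal or ordinal.
for name, (kind, level) in VARIABLE_CLASSIFICATION.items():
    expected = {"interval", "ratio"} if kind == "quantitative" else {"nominal", "ordinal"}
    assert level in expected, name

print(level_counts)
print(quantitative_variables)
```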

Populations, Samples, Statistics, and Parameters

Definitions and Relationships

Understanding the distinction between populations and samples is fundamental in statistics.

  • Population: The complete set of individuals or items of interest.

  • Sample: A subset of the population, selected for study.

  • Parameter: A numerical value describing a characteristic of the population.

  • Statistic: A numerical value describing a characteristic of the sample.

  • Example: To estimate the average number of sick days taken by all Canadian workers (a parameter of the population), a researcher studies a random sample of 500 workers (sample) and calculates the average for this group (statistic).

Statistics is concerned with using sample statistics to estimate population parameters and to test claims about those parameters.
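The relationship between a parameter and a statistic can be sketched with a simulation. Everything here is synthetic: the population of 20,000 workers and their sick-day values are invented with a seeded random generator, purely so that the population mean (normally unknown) can be compared with the mean of a random sample of 500.

```python
import random
from statistics import mean

rng = random.Random(42)  # fixed seed so the sketch is reproducible

# Synthetic population: sick days per year for 20,000 hypothetical workers.
population = [rng.randint(0, 15) for _ in range(20_000)]

# Parameter: describes the whole population (usually unknown in practice).
parameter = mean(population)

# Sample: 500 workers drawn at random, without replacement.
sample = rng.sample(population, 500)

# Statistic: describes the sample and serves as an estimate of the parameter.
statistic = mean(sample)

print(f"parameter (population mean): {parameter:.2f}")
print(f"statistic (sample mean):     {statistic:.2f}")
```

The two values will usually be close but not identical. That gap is the sampling variability that estimation and hypothesis testing are designed to account for.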

Methods of Data Collection

Surveys, Observational Studies, Experiments, and Census

Data can be collected through various methods, each with its own strengths and limitations.

  • Survey: Collects data from a sample using questionnaires or interviews.

  • Observational Study: Researchers observe and record data without manipulating variables. Can demonstrate associations but not causation.

  • Experiment: Researchers manipulate variables (treatments) to measure their effect on a response. Can establish cause-and-effect relationships.

  • Census: Collects data from the entire population.

Observational Study vs. Experiment

  • Observational Study: For example, recording participants' diets and health outcomes without intervening. Can show association but not causation.

  • Experiment: For example, randomly assigning participants to receive either a medication or a placebo. Can establish causality.

Key Point: Only well-designed experiments can demonstrate cause-and-effect relationships; observational studies are limited to identifying associations.
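Random assignment, the feature that separates the experiment from the observational study, can be sketched in a few lines of Python. The participant IDs, group size, and seed are hypothetical. A real trial would follow a pre-registered randomization protocol rather than this simple shuffle.

```python
import random

rng = random.Random(0)  # fixed seed so the assignment can be reproduced

# Hypothetical participant IDs 1..20.
participant_ids = list(range(1, 21))

# Shuffle a copy, then split it in half: chance alone decides who gets which
# treatment, not the participants or the researcher.
shuffled = participant_ids[:]
rng.shuffle(shuffled)
half = len(shuffled) // 2

medication_group = sorted(shuffled[:half])
placebo_group = sorted(shuffled[half:])

print("medication:", medication_group)
print("placebo:   ", placebo_group)
```

Because assignment is random, other factors such as diet or age tend to balance out across the two groups on average, which is what allows a difference in outcomes to be attributed to the treatment.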
