Skip to main content
Back

Descriptive Statistics and Data Collection Methods in Health Sciences

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Introduction to Statistics

Why Statistics Matter in Health Sciences

Statistics is essential for transforming health data into meaningful evidence. It enables researchers to test treatments, identify risk factors, and evaluate research outcomes with confidence. By mastering statistical methods, students gain critical thinking skills and the ability to make evidence-based decisions that improve health outcomes.

Populations and Samples

Definitions and Relationships

A population is the entire group of individuals of interest in a study, while a sample is a subset of the population selected for analysis. The goal is to use sample data to make inferences about the population.

  • Population: All individuals of interest (e.g., all students at a university).

  • Sample: Individuals selected to participate in the research study.

  • Inferential Statistics: Using sample data to draw conclusions about the population.

Cycle of population and sample selection and generalization

Example: Population vs. Sample

In a survey of 250 students at the University of Ottawa, 35 reported smoking regularly. The population is all students at the university, while the sample is the 250 surveyed students.

Population parameters and sample statistics example

Parameters and Statistics

Key Definitions

  • Parameter: A numerical description of a population characteristic (e.g., average age of all students).

  • Statistic: A numerical description of a sample characteristic (e.g., average age of surveyed students).

Example

  • If a survey of 450 students reports an average weekly income of $325, this is a sample statistic.

  • If the average weekly income for all students is $405, this is a population parameter.

Sampling Error

Definition and Importance

Sampling error is the discrepancy between a sample statistic and its corresponding population parameter. It arises naturally due to chance and is a central concept in inferential statistics.

  • Sampling error quantifies the uncertainty in generalizing sample results to the population.

  • Measuring and minimizing sampling error is crucial for reliable statistical inference.

Random numbers for sampling Sample selection from population

Methods of Sample Collection

Simple Random Sampling (SRS)

In SRS, every possible sample of size n from a population of size N has an equally likely chance of being selected. This method ensures unbiased representation.

  • Random number generators or tables are often used to select samples.

Random number generation for SRS Visual representation of SRS

Systematic Sampling

Systematic sampling involves selecting a starting point and then choosing every kth element in the population.

  • Example: Selecting every 3rd person from a list.

Systematic sampling example

Stratified Sampling

Stratified sampling divides the population into subgroups (strata) that share specific characteristics, then samples are drawn from each subgroup.

  • Ensures representation of all subgroups (e.g., gender, age).

Stratified sampling example

Cluster Sampling

Cluster sampling divides the population into clusters, randomly selects some clusters, and includes all members from selected clusters.

  • Useful for geographically dispersed populations.

Cluster sampling example

Variables in Research

Definition and Types

A variable is any characteristic or condition that can change or take on different values. Research often investigates relationships between two or more variables.

  • Discrete Variables: Take on a finite or countable number of values (e.g., number of patients).

  • Continuous Variables: Can take on infinitely many values within a range (e.g., weight, blood pressure).

Levels of Measurement

Overview

The level of measurement determines which statistical calculations are meaningful. There are four levels: nominal, ordinal, interval, and ratio.

Level

Description

Examples

Nominal

Qualitative; categories only, no order, no math

Colors, names, types

Ordinal

Ordered categories, but differences not meaningful

Class standings, rankings

Interval

Ordered, equal intervals, no true zero

Temperature, years

Ratio

Ordered, equal intervals, true zero

Height, weight, length

Nominal Level

Nominal data are qualitative and consist of names, labels, or categories. No mathematical computations are possible.

  • Examples: Colors in a flag, student names, textbook titles.

Ordinal Level

Ordinal data are qualitative and can be arranged in order, but differences between entries are not meaningful.

  • Examples: Class standings, player numbers, song rankings.

Interval Level

Interval data are quantitative, ordered, and have equal intervals, but zero does not represent the absolute lowest value.

  • Examples: Temperatures (Fahrenheit), years on a timeline.

Ratio Level

Ratio data are quantitative, ordered, have equal intervals, and a true zero. Zero means none of the variable exists.

  • Examples: Height, weight, length.

Key Distinctions

  • Nominal: Categories only, no order, no math

  • Ordinal: Ordered categories, no meaningful differences

  • Interval: Ordered, equal intervals, no true zero

  • Ratio: Ordered, equal intervals, true zero

Class activity on levels of measurement

Summary Table

Concept

Definition

Population

Entire group of interest

Sample

Subset of population

Parameter

Numerical value describing population

Statistic

Numerical value describing sample

Sampling Error

Difference between statistic and parameter

Simple Random Sample

Every sample equally likely

Systematic Sampling

Select every kth element

Stratified Sampling

Sample from each subgroup

Cluster Sampling

Sample all members from selected clusters

Discrete Variable

Finite/countable values

Continuous Variable

Infinite values within range

Nominal

Categories only

Ordinal

Ordered categories

Interval

Ordered, equal intervals, no true zero

Ratio

Ordered, equal intervals, true zero

Key Formulas

Sampling Error Formula

The sampling error is calculated as:

Probability of Simple Random Sample

For a population of size N and sample size n:

Conclusion

Understanding populations, samples, sampling methods, variables, and levels of measurement is foundational for descriptive statistics and for conducting meaningful health science research. These concepts enable researchers to collect, analyze, and interpret data accurately, ensuring valid and reliable conclusions.

Pearson Logo

Study Prep