Skip to main content
Back

Foundations of Statistics: Populations, Samples, Data Types, and Sampling Methods

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Parameters vs. Statistics

Introduction to Populations and Samples

Statistics is the science of collecting, organizing, analyzing, and interpreting data to make informed decisions. Understanding the distinction between populations and samples is fundamental in statistical analysis.

  • Population: The entire set of individuals or items of interest in a study. Denoted as the complete group from which data may be collected.

  • Sample: A subset of the population, selected for analysis to draw conclusions about the population.

  • Parameter: A numerical value that describes a characteristic of a population (e.g., population mean).

  • Statistic: A numerical value that describes a characteristic of a sample (e.g., sample mean).

Example:

Scenario

Population or Sample?

Parameter or Statistic?

The salary of every employee at a marketing firm (A)

Population

Parameter

The salaries of 12 out of 100 total employees at a marketing firm (B)

Sample

Statistic

The average salary of all employees at a marketing firm is $41,000 (C)

Population

Parameter

The average salary of 12 out of 100 employees at a marketing firm is $58,000 (D)

Sample

Statistic

Additional info: Parameters are typically unknown and estimated using statistics from samples.

Types of Data

Qualitative vs. Quantitative Data

Data can be categorized based on its nature and the way it is measured. This classification is essential for choosing appropriate statistical methods.

  • Qualitative Data: Describes qualities or categories (e.g., favorite color, eye color).

  • Quantitative Data: Describes numerical values and can be further divided into:

    • Discrete Data: Consists of countable values (e.g., number of students in a classroom, dice roll outcomes).

    • Continuous Data: Can take any value within a range (e.g., time, temperature).

Example Table:

Type

Definition

Examples

Qualitative

Describes qualities or categories

Favorite color, eye color

Quantitative - Discrete

Countable numbers

Dice roll, number of students

Quantitative - Continuous

Any value in a range

Time, temperature

Example:

  • Surveying the nationalities of 10 people on a plane: Qualitative

  • Measuring the distances people walk each day with GPS watches: Quantitative, Continuous

Intro to Collecting Data

Experimental vs. Observational Studies

There are two main ways to collect data in statistics: experiments and observational studies. The choice affects the conclusions that can be drawn, especially regarding causation.

  • Experiment: Apply a treatment and measure its effects; allows for causation to be inferred.

  • Observational Study: Observe and measure characteristics without intervention; does not allow for causation to be inferred.

Example Table:

Scenario

Type

Causation?

Testing a medication by giving 15 subjects a placebo, 15 the actual medication

Experiment

Yes

Surveying 30 college students about their sleep habits and grades

Observational Study

No

Rolling a fair & loaded die 10 times each and comparing results

Experiment

Yes

Simple Random Sampling

Sampling Methods and Representativeness

Sampling is the process of selecting a subset (sample) from a larger group (population) for analysis. The method of sampling affects the validity and generalizability of statistical conclusions.

  • Representative Sample: A sample made up of equal or proportional characteristics to the original population.

  • Simple Random Sample (SRS): Each subject has an equal chance of being selected.

Example Table:

Scenario

Representative Sample?

Simple Random Sample?

Randomly select 3 marbles from a bag with 2 red & 4 blue marbles; all selected are blue

No

Yes

University with 60% undergraduates & 40% graduates surveys 60% undergrads & 40% grads

Yes

Yes

Example:

  • To generate a simple random sample of 5 out of 20 students, assign each student a number from 1 to 20, then use a random number generator to select 5 unique numbers.

Additional info: Representative samples are crucial for generalizing findings to the population. Simple random sampling helps minimize selection bias.

Pearson Logo

Study Prep