BackFoundations of Statistics: Populations, Samples, Data Types, and Sampling Methods
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Parameters vs. Statistics
Introduction to Populations and Samples
Statistics is the science of collecting, organizing, analyzing, and interpreting data to make informed decisions. Understanding the distinction between populations and samples is fundamental in statistical analysis.
Population: The entire set of individuals or items of interest in a study. Denoted as the complete group from which data may be collected.
Sample: A subset of the population, selected for analysis to draw conclusions about the population.
Parameter: A numerical value that describes a characteristic of a population (e.g., population mean).
Statistic: A numerical value that describes a characteristic of a sample (e.g., sample mean).
Example:
Scenario | Population or Sample? | Parameter or Statistic? |
|---|---|---|
The salary of every employee at a marketing firm (A) | Population | Parameter |
The salaries of 12 out of 100 total employees at a marketing firm (B) | Sample | Statistic |
The average salary of all employees at a marketing firm is $41,000 (C) | Population | Parameter |
The average salary of 12 out of 100 employees at a marketing firm is $58,000 (D) | Sample | Statistic |
Additional info: Parameters are typically unknown and estimated using statistics from samples.
Types of Data
Qualitative vs. Quantitative Data
Data can be categorized based on its nature and the way it is measured. This classification is essential for choosing appropriate statistical methods.
Qualitative Data: Describes qualities or categories (e.g., favorite color, eye color).
Quantitative Data: Describes numerical values and can be further divided into:
Discrete Data: Consists of countable values (e.g., number of students in a classroom, dice roll outcomes).
Continuous Data: Can take any value within a range (e.g., time, temperature).
Example Table:
Type | Definition | Examples |
|---|---|---|
Qualitative | Describes qualities or categories | Favorite color, eye color |
Quantitative - Discrete | Countable numbers | Dice roll, number of students |
Quantitative - Continuous | Any value in a range | Time, temperature |
Example:
Surveying the nationalities of 10 people on a plane: Qualitative
Measuring the distances people walk each day with GPS watches: Quantitative, Continuous
Intro to Collecting Data
Experimental vs. Observational Studies
There are two main ways to collect data in statistics: experiments and observational studies. The choice affects the conclusions that can be drawn, especially regarding causation.
Experiment: Apply a treatment and measure its effects; allows for causation to be inferred.
Observational Study: Observe and measure characteristics without intervention; does not allow for causation to be inferred.
Example Table:
Scenario | Type | Causation? |
|---|---|---|
Testing a medication by giving 15 subjects a placebo, 15 the actual medication | Experiment | Yes |
Surveying 30 college students about their sleep habits and grades | Observational Study | No |
Rolling a fair & loaded die 10 times each and comparing results | Experiment | Yes |
Simple Random Sampling
Sampling Methods and Representativeness
Sampling is the process of selecting a subset (sample) from a larger group (population) for analysis. The method of sampling affects the validity and generalizability of statistical conclusions.
Representative Sample: A sample made up of equal or proportional characteristics to the original population.
Simple Random Sample (SRS): Each subject has an equal chance of being selected.
Example Table:
Scenario | Representative Sample? | Simple Random Sample? |
|---|---|---|
Randomly select 3 marbles from a bag with 2 red & 4 blue marbles; all selected are blue | No | Yes |
University with 60% undergraduates & 40% graduates surveys 60% undergrads & 40% grads | Yes | Yes |
Example:
To generate a simple random sample of 5 out of 20 students, assign each student a number from 1 to 20, then use a random number generator to select 5 unique numbers.
Additional info: Representative samples are crucial for generalizing findings to the population. Simple random sampling helps minimize selection bias.