BackFoundations of Statistics: Populations, Samples, Data Types, and Sampling Methods
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Parameters vs. Statistics
Definitions and Distinctions
Statistics is the science of collecting, organizing, analyzing, and interpreting data to make informed decisions. Understanding the difference between populations and samples, as well as parameters and statistics, is fundamental in statistical analysis.
Data: Information gathered from counting, measuring, or collecting responses.
Population: The entire set containing all data points ("every" member of interest).
Sample: A subset of the population, selected for analysis.
Parameter: A numerical value that describes a characteristic of a population.
Statistic: A numerical value that describes a characteristic of a sample.
Example:
Scenario | Population or Sample? | Parameter or Statistic? |
|---|---|---|
The salary of every employee at a marketing firm (A) | Population | Parameter |
The salaries of 12 out of 100 total employees at a marketing firm (B) | Sample | Statistic |
The average salary of all employees at a marketing firm is $41,000 (C) | Population | Parameter |
The average salary of 12 out of 100 employees at a marketing firm is $58,000 (D) | Sample | Statistic |
Types of Data
Qualitative vs. Quantitative Data
Data can be categorized based on its nature and the way it is measured. The two main types are qualitative and quantitative data.
Qualitative Data: Describes qualities or categories (e.g., favorite color, eye color).
Quantitative Data: Describes quantities and can be measured numerically.
Subtypes of Quantitative Data
Discrete Data: Quantitative data that can take only specific, separate values (e.g., number of students in a classroom, dice roll outcomes).
Continuous Data: Quantitative data that can take any value within a range (e.g., time, temperature).
Type | Description | Examples |
|---|---|---|
Qualitative | Qualities, categories | Favorite color, eye color |
Quantitative - Discrete | Countable, separate values | Dice roll, number of students |
Quantitative - Continuous | Any value in a range | Time, temperature |
Example: Surveying the nationalities of 10 people on a plane yields qualitative data. Measuring the distance people walk each day with GPS watches yields quantitative, continuous data.
Intro to Collecting Data
Methods of Data Collection
There are two main ways to collect data in statistics: experiments and observational studies.
Experiment: Apply a treatment and measure its effects; you can assume causation.
Observational Study: Observe without changing anything; you cannot assume causation.
Example:
Testing a medication by giving 15 subjects a placebo and 15 the actual medication is an experiment (causation possible).
Surveying 30 college students about their sleep habits and grades is an observational study (no causation assumed).
Rolling a fair and a loaded die 10 times each and comparing results is an experiment (causation possible).
Simple Random Sampling
Sampling Methods and Representativeness
Sampling is the process of selecting a smaller group (sample) from a larger group (population) for analysis. The goal is often to obtain a representative sample that reflects the characteristics of the population.
Representative Sample: Made up of equal proportions of characteristics as the original population.
Simple Random Sample (SRS): Each subject has an equal chance of being selected.
Example:
Scenario | Representative Sample? | Simple Random Sample? |
|---|---|---|
Randomly select 3 marbles from a bag with 2 red & 4 blue marbles; all selected are blue | No | Yes |
University with 60% undergraduates & 40% graduates surveys 60% undergrads & 40% grads | Yes | Yes (if selection is random) |
Example Process for SRS: To select 5 random students out of 20, assign each student a number from 1 to 20, then use a random number generator to select 5 unique numbers. The students corresponding to those numbers form the simple random sample.
Practice Questions and Applications
Identifying Populations, Samples, Parameters, and Statistics
Collecting test scores of every other student in a class: Sample
Report showing amount spent by each customer in a grocery store: Population (if all customers included)
46.5% of all registered voters in a country are registered democrats: Parameter
Survey of 40 gym members finds average workout duration is 52 minutes: Statistic
Identifying Data Types
Amount of hours students study per week: Quantitative, Continuous
Heights of basketball players: Quantitative, Continuous
Brands of smartphones owned: Qualitative
Outcomes of ten rolls of a standard six-sided die: Quantitative, Discrete
Discrete Quantitative Data Examples
Weight of apples: Continuous
Temperature in classroom: Continuous
Time to complete a lap: Continuous
Number of goals scored by a soccer team: Discrete
Experiment vs. Observational Study
Surveying target demographic for product interest: Observational Study
Determining employee feelings about growth: Observational Study
Testing a fitness app for weight loss: Experiment
Sampling Scenarios
Surveying gym members about rowing machine: Not representative if only afternoon classes are surveyed
Surveying all people entering a shop during a day: May not be representative if time of day affects customer type
Surveying teachers from each grade: Representative if grades are equally sampled
Surveying random employees in chain restaurant: Representative if locations and processes are equally sampled
Additional info: Some explanations and examples have been expanded for clarity and completeness.