BackFoundations of Statistics: Populations, Samples, Data Types, and Sampling Methods
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Parameters vs. Statistics
Introduction to Statistics
Statistics is the science of collecting, organizing, analyzing, and interpreting data to make informed decisions. Understanding the distinction between populations and samples, as well as parameters and statistics, is fundamental in statistical analysis.
Data: Information gathered from counting, measuring, or collecting responses.
Population: The entire set containing all data points ("every," "each").
Sample: A subset of the population, selected for analysis.
Parameter: A numerical value that describes a characteristic of a population.
Statistic: A numerical value that describes a characteristic of a sample.
Example:
Label | Description | Population or Sample? | Parameter or Statistic? |
|---|---|---|---|
A | The salary of every employee at a marketing firm | Population | Parameter |
B | The salaries of 12 out of 100 total employees at a marketing firm | Sample | Statistic |
C | The average salary of all employees at a marketing firm is $41,000 | Population | Parameter |
D | The average salary of 12 out of 100 employees at a marketing firm is $58,000 | Sample | Statistic |
Additional info: Parameters are fixed values for populations, while statistics can vary from sample to sample.
Types of Data
Qualitative vs. Quantitative Data
Data can be categorized as qualitative or quantitative, each with distinct properties and uses in statistical analysis.
Qualitative Data: Describes qualities or categories (e.g., favorite color, eye color).
Quantitative Data: Describes quantities or amounts and can be measured numerically.
Subtypes of Quantitative Data
Discrete Data: Consists of countable values (e.g., number of students in a classroom, dice roll outcomes).
Continuous Data: Can take any value within a range and is measurable (e.g., time, temperature).
Type | Description | Examples |
|---|---|---|
Qualitative | Qualities, categories | Favorite color, eye color |
Quantitative (Discrete) | Countable quantities | Dice roll, number of students |
Quantitative (Continuous) | Measurable quantities | Time, temperature |
Example: Surveying the nationalities of 10 people on a plane yields qualitative data. Measuring the distances people walk each day with GPS-enabled watches yields quantitative, continuous data.
Intro to Collecting Data
Methods of Data Collection
There are two main ways to collect data in statistics: experiments and observational studies.
Experiment: Apply a treatment and measure its effects; can establish causation.
Observational Study: Observe and measure characteristics without intervention; cannot establish causation.
Example:
Testing a medication by giving 15 subjects a placebo and 15 the actual medication is an experiment and can establish causation.
Surveying 30 college students about their sleep habits and grades is an observational study and cannot establish causation.
Rolling a fair and a loaded die 10 times each and comparing results is an experiment and can establish causation.
Additional info: Experiments require random assignment to control for confounding variables.
Simple Random Sampling
Sampling Methods
Sampling is the process of selecting a smaller group (sample) from a larger group (population) for analysis. The goal is to obtain a sample that accurately represents the population.
Representative Sample: Made up of equal proportions of characteristics as the original population.
Simple Random Sample (SRS): Each subject has an equal chance of being selected.
Example:
Scenario | Representative Sample? | Simple Random Sample? |
|---|---|---|
Randomly select 3 marbles from a bag with 2 red & 4 blue marbles; all selected are blue | No | Yes |
University with 60% undergraduates & 40% graduates surveys 60% undergrads & 40% grads | Yes | Yes |
Additional info: SRS is often implemented using random number generators or drawing lots.
Practice Questions and Applications
Identifying Populations, Samples, Parameters, and Statistics
Collecting test scores of every other student in a class: Sample
Amount spent by each customer in a grocery store: Population
46.5% of all registered voters in a country are registered democrats: Parameter
Survey of 40 gym members finds average work out duration is 52 minutes: Statistic
Identifying Data Types
Brands of smartphones owned by students: Qualitative
Outcomes of ten rolls of a standard six-sided die: Quantitative, Discrete
Temperature in a classroom: Quantitative, Continuous
Number of goals scored by a soccer team in a match: Quantitative, Discrete
Distinguishing Experiments from Observational Studies
Surveying target demographic about product interest: Observational Study
Testing effects of extra work hours on store profits by randomly assigning stores: Experiment
Determining employee feelings about personal growth: Observational Study
Testing a fitness app for weight loss and strength: Experiment
Evaluating Sampling Methods
Surveying gym members about rowing machine purchase using random number generator: Simple Random Sample, Representative Sample if all groups are equally likely to be selected.
Surveying random employees in chain restaurant branches: Simple Random Sample, Representative Sample if all branches and roles are equally represented.
Surveying random students from a statistics course: Use random number assignment to select 5 out of 20 students for a Simple Random Sample.
Key Formulas
Sample Mean:
Population Mean:
Sample Proportion:
Population Proportion: