BackIntroduction to Statistics: Populations, Samples, Data Types, and Sampling Methods
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Parameters vs. Statistics
Definitions and Key Concepts
Statistics is the science of collecting, organizing, analyzing, and interpreting data to make informed decisions. Understanding the distinction between populations and samples, as well as parameters and statistics, is fundamental in statistical analysis.
Data: Information gathered from counting, measuring, or collecting responses.
Population: The entire set containing all individuals or items of interest. Denoted as the set of all data ("every," "each").
Sample: A subset of the population, selected for analysis.
Parameter: A numerical value that describes a characteristic of a population.
Statistic: A numerical value that describes a characteristic of a sample.
Example:
Scenario | Population or Sample? | Parameter or Statistic? |
|---|---|---|
The salary of every employee at a marketing firm | Population | Parameter |
The salaries of 12 out of 100 total employees at a marketing firm | Sample | Statistic |
The average salary of all employees at a marketing firm is $41,000 | Population | Parameter |
The average salary of 12 out of 100 employees at a marketing firm is $58,000 | Sample | Statistic |
Types of Data
Qualitative vs. Quantitative Data
Data can be categorized based on its nature and measurement. Recognizing the type of data is essential for choosing appropriate statistical methods.
Qualitative Data: Data that describes qualities or categories (e.g., favorite color, eye color).
Quantitative Data: Data that represents quantities and can be measured numerically.
Subtypes of Quantitative Data
Discrete Data: Quantitative data that can take only specific, separate values (e.g., number of students in a classroom, dice roll outcomes).
Continuous Data: Quantitative data that can take any value within a range (e.g., time, temperature).
Type | Description | Examples |
|---|---|---|
Qualitative | Qualities, categories | Favorite color, eye color |
Quantitative (Discrete) | Countable quantities | Dice roll, number of students |
Quantitative (Continuous) | Measurable quantities | Time, temperature |
Example: Surveying the nationalities of 10 people on a plane yields qualitative data. Measuring the distances people walk each day with GPS-enabled watches yields quantitative, continuous data.
Intro to Collecting Data
Methods of Data Collection
There are two main ways to collect data in statistics: experiments and observational studies.
Experiment: Apply a treatment and measure its effects; you can assume causation.
Observational Study: Observe and measure characteristics without applying treatments; you cannot assume causation.
Example:
Testing a medication by giving 15 subjects a placebo and 15 the actual medication is an experiment (causation can be inferred).
Surveying 30 college students about their sleep habits and grades is an observational study (no causation inferred).
Rolling a fair and a loaded die 10 times each and comparing results is an experiment (causation can be inferred).
Simple Random Sampling
Sampling Methods and Representativeness
Sampling is the process of selecting a smaller group (sample) from a larger group (population). The goal is to obtain a sample that accurately represents the population.
Representative Sample: A sample made up of equal proportions of characteristics as the original population.
Simple Random Sample (SRS): Each subject in the population has an equal chance of being selected.
Scenario | Representative Sample? | Simple Random Sample? |
|---|---|---|
Randomly select 3 marbles from a bag with 2 red and 4 blue marbles; all selected are blue | No | Yes |
University with 60% undergraduates & 40% graduates surveys 60% undergrads & 40% grads | Yes | Yes |
Example: To generate a simple random sample of 5 out of 20 students, assign each student a number from 1 to 20, then use a random number generator to select 5 unique numbers. The students corresponding to those numbers form the sample.
Summary Table: Key Terms and Concepts
Term | Definition | Example |
|---|---|---|
Population | Entire group of interest | All employees at a company |
Sample | Subset of the population | 12 employees selected from the company |
Parameter | Numerical summary of a population | Average salary of all employees |
Statistic | Numerical summary of a sample | Average salary of 12 selected employees |
Qualitative Data | Describes qualities or categories | Eye color |
Quantitative Data | Describes quantities | Height in centimeters |
Discrete Data | Countable values | Number of students |
Continuous Data | Any value within a range | Time taken to run a lap |
Experiment | Apply treatment, measure effect | Testing a new drug |
Observational Study | Observe without intervention | Surveying sleep habits |
Simple Random Sample | Equal chance for all subjects | Randomly selecting students |
Key Formulas
Sample Mean:
Population Mean:
Additional info: These notes provide foundational concepts for introductory statistics, including definitions, examples, and basic formulas. Practice questions and examples are included to reinforce understanding of key terms and methods.