Skip to main content
Back

Introduction to Statistics: Populations, Samples, and Data Collection

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 1: Introduction to Statistics

Populations and Samples

Statistics is the science of collecting, analyzing, and interpreting data. Two foundational concepts are populations and samples. Understanding these helps in designing studies and interpreting results.

  • Population: The entire group of individuals or items that we want to study. For example, all middle and high school students in Kansas City.

  • Sample: A subset of the population, selected for actual study. For example, 2500 students chosen from the population.

  • Parameter: A numerical summary describing a characteristic of a population (e.g., the true proportion of students who drink soda).

  • Statistic: A numerical summary describing a characteristic of a sample (e.g., the proportion found in the sample).

Example: If 1/3 of 2500 surveyed students drink soda, this is a statistic. If it is known for all students, it is a parameter.

Statistical Significance vs. Practical Significance

When analyzing data, it is important to distinguish between statistical and practical significance.

  • Statistical Significance: Results are unlikely to have occurred by chance, as determined by statistical tests (e.g., p-value).

  • Practical Significance: Results are large enough to be meaningful in real-world terms.

Example: A diet program shows a statistically significant average weight loss of 10 pounds, but if the cost or effort is high, the practical significance may be low.

Consider the Source

Always evaluate the credibility and potential bias of the source providing statistical information.

  • Sources may have vested interests that affect the presentation or interpretation of data.

  • Independent verification and critical thinking are essential.

Example: A candy manufacturer claims their product does not cause tooth decay, but the claim should be scrutinized for bias.

Classifying Data

Types of Data

Data can be classified as qualitative (categorical) or quantitative (numerical).

  • Qualitative Data: Describes categories or qualities (e.g., car brands, colors).

  • Quantitative Data: Describes numerical values (e.g., number of tests taken).

  • Discrete Data: Countable values (e.g., number of students).

  • Continuous Data: Measurable values within a range (e.g., height, weight).

Example: The number of tests students take is discrete; their scores could be continuous.

Levels of Measurement

Measurement Scales

Data can be measured at different levels, which determine the types of statistical analysis possible.

  • Nominal: Categories without order (e.g., car types: sedan, SUV).

  • Ordinal: Categories with a meaningful order but no consistent difference between ranks (e.g., class rankings).

  • Interval: Numerical data with meaningful differences but no true zero (e.g., temperature in Celsius).

  • Ratio: Numerical data with meaningful differences and a true zero (e.g., weight, height).

Example: The number of wheels on a vehicle is ratio; car categories are nominal.

Sampling Methods

Types of Sampling

Sampling methods affect the representativeness and reliability of results.

  • Simple Random Sampling: Every member of the population has an equal chance of being selected.

  • Stratified Sampling: Population divided into subgroups (strata), and random samples taken from each.

  • Cluster Sampling: Population divided into clusters, some clusters are randomly selected, and all members of selected clusters are studied.

  • Systematic Sampling: Every nth member of the population is selected.

  • Convenience Sampling: Samples are taken from members who are easiest to reach.

Example: Surveying every 5th student entering a cafeteria is systematic sampling.

Vocabulary and Study Types

Key Terms and Study Designs

  • Voluntary Response Sample: Participants choose to respond, often leading to bias.

  • Observational Study: Researchers observe subjects without intervention.

  • Experimental Study: Researchers apply treatments and observe effects.

  • Retrospective Study: Looks back at past data.

  • Prospective Study: Follows subjects into the future.

  • Cross-sectional Study: Data collected at one point in time.

Study Type

Description

Retrospective

Go back in time to collect data over some past period.

Prospective

Go forward in time and observe groups sharing common factors.

Cross-sectional

Data are measured at one moment in time.

Short Answer Concepts

Sampling and Data Collection

  • Reasons for Sampling: Cost, time, and practicality often make sampling preferable to studying entire populations.

  • Self-selected Sample: Participants choose themselves, which may introduce bias.

  • Correlation vs. Causation: Correlation does not imply causation; other factors may be involved.

  • Sample Size: Larger samples generally yield more reliable results, but only if sampling is unbiased.

Example: Calculating a baseball player's batting average:

Example: In a survey, if 52.4% of 170 students agree with a statement, the number is students.

Summary Table: Sampling Methods

Sampling Method

Description

Example

Simple Random

Equal chance for all members

Randomly select students from a list

Stratified

Divide into subgroups, sample from each

Sample students from each grade level

Cluster

Divide into clusters, sample entire clusters

Sample all students in selected classrooms

Systematic

Select every nth member

Survey every 10th shopper

Convenience

Sample easiest to reach

Survey students in the cafeteria

Additional info: Some examples and explanations have been expanded for clarity and completeness.

Pearson Logo

Study Prep