Skip to main content
Back

Introduction to Statistics: Collecting and Classifying Data

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Introduction to Statistics

Definition and Scope

Statistics is the science of collecting, organizing, analyzing, and interpreting data. It involves the collection of observations or measurements for analysis.

  • Never make conclusions off of 1 piece of data. Reliable statistical analysis requires sufficient data.

  • Sample: A subset of the population selected for the study.

  • Individual: One person or object selected from the population.

  • Population: The entire set of all individuals or items under study.

Random sampling removes bias and ensures that every member of the population has an equal chance of being selected.

Types of Statistics

Statistics can be classified as descriptive or inferential.

  • Descriptive Statistics: Summarizes and describes data collected from a sample or population.

  • Inferential Statistics: Draws conclusions or makes predictions about a population based on sample data.

Comparison Table: Descriptive vs Inferential Statistics

Aspect

Descriptive Statistics

Inferential Statistics

Purpose

Summarize and describe data

Draw conclusions about populations

Data Used

Entire sample or population

Typically a sample

Examples

Mean, Median, Mode, Graphs

Estimation, Hypothesis testing

Key Question

"What does the data show?"

"What can we infer about the whole?"

Certainty

No uncertainty

Uncertainty & probability

Collecting Data

Sampling Methods

Sampling is the process of selecting a subset of individuals from a population to estimate characteristics of the whole population.

  • Simple Random Sample: Every member of the population has an equal chance of being selected.

  • Stratified Sample: Population is divided into subgroups (strata) and random samples are taken from each stratum.

  • Cluster Sample: Population is divided into clusters, some clusters are randomly selected, and all individuals in chosen clusters are studied.

  • Systematic Sample: Every nth individual is selected from a list of the population.

Bias in Sampling

Bias occurs when the sample does not accurately represent the population. Random sampling helps minimize bias and improve the reliability of statistical conclusions.

Descriptive vs Inferential Statistics

Descriptive Statistics

Descriptive statistics are used to summarize and organize data. Common measures include:

  • Mean: The average value of a dataset.

  • Median: The middle value when data are ordered.

  • Mode: The value that appears most frequently.

  • Range: The difference between the highest and lowest values.

Inferential Statistics

Inferential statistics use sample data to make generalizations about a population. Key concepts include:

  • Estimation: Using sample statistics to estimate population parameters.

  • Hypothesis Testing: Assessing evidence to support or refute a claim about a population.

  • Confidence Intervals: Range of values within which a population parameter is likely to fall.

Example

If a researcher wants to know the average height of students in a university, they may select a random sample of students and calculate the mean height. This is descriptive statistics. If they use this sample mean to estimate the average height of all students at the university, this is inferential statistics.

Additional info: The notes also emphasize the importance of random sampling and the distinction between descriptive and inferential statistics, which are foundational concepts in introductory statistics courses.

Pearson Logo

Study Prep