Skip to main content
Back

Chapter 1: Stats Starts Here – Introduction to Statistics, Data, and Variables

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Stats Starts Here

Introduction to Statistics

Statistics is the science of collecting, organizing, analyzing, and interpreting data to make informed decisions. It helps us make sense of information in a world full of variability and uncertainty.

  • Data: Any collection of numbers, characters, images, or other items that provide information about something.

  • Data can come from surveys, experiments, or observational studies and can vary widely in form and content.

  • Statistics provides tools to understand and interpret this variability.

Examples of Data in Everyday Life

  • Facebook: Collects data such as age, gender, education, and interests to personalize ads and content.

  • Texting While Driving: Studies use data to assess the dangers of texting while driving, such as measuring reaction times in simulated emergencies.

Example: A University of Utah study found that texting drivers had slower reaction times than sober or drunk drivers, highlighting the importance of data in evaluating safety.

Learning Outcomes

  • Interpret data and communicate results effectively.

  • Identify flaws in conclusions presented in articles or studies.

  • Become a more informed and critical consumer of information.

Data and Its Organization

Organizing Data

Raw data can be difficult to interpret without proper organization. Presenting data clearly is essential for meaningful analysis.

  • Tables and charts help make sense of complex data sets.

Order Number

Name

State/Country

Price

Area Code

Destination

GMT

ABN

Artist

105-298843-3759464

Katherine H.

Ohio

0.99

440

Amsterdam

Y

B000002KVA

Cold Play

105-371243-4200364

Samuel R.

Detroit

1.99

313

New York

N

B000002KVA

Red Hot Chili Peppers

105-137250-0198464

Chris G.

Massachusetts

0.99

612

Los Angeles

N

B000002KVA

Frank Sinatra

103-262343-6536846

Monique D.

Canada

0.99

902

Beverly Hills

N

B000002KVA

Weezer

Additional info: Table purpose is to show how organizing data makes interpretation easier.

The Five "W"s and One "H"

To understand data, always consider:

  • Who: The individuals or cases surveyed or studied.

  • What: The variables measured or recorded.

  • When: The time period of data collection.

  • Where: The location of the study or survey.

  • Why: The purpose of the data collection.

  • How: The method used to collect the data.

Who and What: Key Terms

  • Respondents: Individuals who answer surveys (e.g., customers at Amazon).

  • Subjects/Participants: People on whom experiments are conducted (e.g., patients in a clinical trial).

  • Experimental Units: Objects of study when not people (e.g., rats in a maze).

  • Records: Rows in a database, each representing an individual case or transaction.

Sample and Population

Statistics often aims to make inferences about a large group (population) based on a smaller group (sample).

  • Population: The entire group of interest.

  • Sample: A subset of the population, used to draw conclusions about the whole.

  • Representative Sample: The sample should accurately reflect the population.

Think, Show, and Tell

  • Think: Decide what information you need.

  • Show: Present data clearly and accurately.

  • Tell: Explain what the data reveal.

Variables and Their Types

Categorical Variables

Categorical variables (also called nominal or qualitative) indicate group membership.

  • Examples: Favorite color, country of birth, area code.

  • Drawback: Harder to analyze with mathematical operations.

Quantitative Variables

Quantitative variables have measured numerical values with units, representing amounts or degrees.

  • Examples: Ounces, dollars, degrees Fahrenheit.

Categorical or Quantitative?

  • Some variables, like age, can be treated as either categorical (e.g., child, teen, adult) or quantitative (e.g., 21 years old), depending on context.

Identifier Variables

Identifier variables uniquely identify individuals but do not describe them.

  • Examples: Login ID, customer number, transaction number, social security number.

Ordinal Variables

Ordinal variables report order but not the exact difference between values.

  • Examples: Likert scale (Strongly Disagree to Strongly Agree), Olympic rank (Gold, Silver, Bronze).

  • Can be treated as quantitative by assigning numbers to ranks (e.g., 1 = Strongly Disagree, 4 = Strongly Agree).

Models in Statistics

What is a Model?

A model is a simplified representation of reality, used to explain or predict phenomena.

  • Example: A model airplane in a wind tunnel simulates real flight conditions.

  • Statistical Example: Kepler's Laws model planetary motion using observed data.

Critical Thinking in Statistics

What Can Go Wrong?

  • Do not label variables as categorical or quantitative without considering their meaning and context.

  • Do not assume a variable is quantitative just because it uses numbers.

  • Always approach data with skepticism and critical thinking.

Chapter Review

  • Data are values (numerical or labels) with context.

  • The Five W's and One H help clarify the context and meaning of data.

  • Always identify the cases (who), variables (what), and purpose (why) before analyzing data.

  • Consider the source and reason for data collection.

  • Variables can be categorical or quantitative, and sometimes the same variable can be treated as either, depending on the research question.

Pearson Logo

Study Prep