Skip to main content
Back

Introduction to Statistics: Understanding Data and Variables

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 1: Introduction to Statistics

Section 1.1: What Is Statistics?

Statistics is the science of collecting, analyzing, interpreting, and presenting data. It helps us make sense of information in a world where data is abundant and often variable.

  • Definition of Data: Any collection of numbers, characters, images, or other items that provide information about something.

  • Variation in Data: Data from surveys and experiments can vary, producing a range of outcomes.

  • Purpose of Statistics: To help us understand and interpret data, especially when it varies.

Example: Data Uses in Social Media

  • Platforms like Facebook collect data such as age, gender, education, and interests.

  • Statistics is used to determine which ads you see, based on your data and interactions.

  • Your information is valuable to companies for targeted advertising.

Example: Texting While Driving

  • Question: Is texting while driving dangerous?

  • Despite a rise in texting, driving fatalities have decreased in recent years.

  • University of Utah Study: Measured reaction times of sober, drunk, and texting drivers in simulated emergencies. Result: Texting drivers had the slowest reaction times.

  • Statistics helps us evaluate safety by analyzing such data.

Learning Outcomes:

  • Design and analyze experiments.

  • Interpret data and communicate results.

  • Identify deficiencies in conclusions from articles.

  • Become a more informed citizen.

Section 1.2: Data

Data must be organized and described to be useful. Proper organization allows for easier interpretation and analysis.

  • Raw Data: Unorganized data can be difficult to interpret.

  • Data Presentation: Effective presentation (tables, charts) makes data more understandable.

The Five "W"s and One "H" of Data

  • Who: Describe the individuals or objects studied.

  • What: Determine what is being measured (variables).

  • When: When was the research conducted?

  • Where: Where was the research conducted?

  • Why: What was the purpose of the survey or experiment?

  • How: How was the survey or experiment conducted?

Types of Individuals in Data

  • Respondents: Individuals who answer surveys (e.g., customers at Amazon).

  • Subjects/Participants: People on whom experiments are conducted (e.g., patients in a clinical trial).

  • Experimental Units: Objects of the experiment when not people (e.g., rats in a maze).

  • Records: Rows in a database, representing individual data entries (e.g., purchase records).

Sample and Population

  • Population: The entire group of interest.

  • Sample: A subset of the population, used to make inferences about the whole.

  • Representativeness: The sample should accurately reflect the population.

Think, Show, and Tell

  • Think: Decide what information you want to know.

  • Show: Display results professionally and accurately.

  • Tell: Describe conclusions drawn from the data.

Example: Identifying the Who

  • Scenario: Consumer Reports evaluates tablets from various manufacturers.

  • Population: All tablets currently offered for sale.

  • Sample: 16 tablets tested.

  • Who: The 16 tablets selected for testing.

Section 1.3: Variables

Variables are characteristics recorded about each individual or object in a study. They can be classified into different types based on their nature and measurement.

Categorical Variables

  • Definition: Variables that indicate group or category membership.

  • Synonyms: Nominal, qualitative.

  • Examples: Favorite color, country of birth, type of vehicle.

  • Drawback: Difficult to analyze with computation.

Quantitative Variables

  • Definition: Variables with measured numerical values and units, representing amounts or degrees.

  • Examples: Ounces, dollars, degrees Fahrenheit.

Categorical or Quantitative?

  • Some variables, like age, can be treated as categorical (e.g., child, teen, adult) or quantitative (numerical age).

Identifier Variables

  • Definition: Variables used to uniquely identify individuals, not to describe them.

  • Examples: Login ID, customer number, transaction number, social security number.

Ordinal Variables

  • Definition: Variables that report order without natural units.

  • Examples: Likert scale (Strongly Disagree to Strongly Agree), Olympic rank (Gold, Silver, Bronze).

  • Can be treated as quantitative by assigning ranks (e.g., 1 = Strongly Disagree, 4 = Strongly Agree).

Example: Identifying Variables in a Tablet Study

  • Variables: Manufacturer (categorical), price (quantitative, $), battery life (quantitative, hours), operating system (categorical), quality score (quantitative, no units), memory card reader (categorical).

  • Purpose: To help consumers choose a good tablet.

Section 1.4: Models

Models are simplified representations of reality, used to understand and predict phenomena.

  • Example: Model airplane in a wind tunnel to study flight dynamics.

  • Example: Kepler's Laws as models for planetary motion.

What Can Go Wrong?

  • Do not label a variable as categorical or quantitative without considering the data and its meaning.

  • Do not assume a variable is quantitative just because its values are numbers.

  • Always be skeptical and critically evaluate data and variables.

Pearson Logo

Study Prep