Introduction to Statistics: Essentials and Critical Thinking

Introduction to Statistics

1-1 Statistical and Critical Thinking

Statistical thinking is essential for making sense of data and drawing valid conclusions. The process of conducting a statistical study consists of three main steps: prepare, analyze, and conclude.

  • Prepare: Understand the context, the source of the data, and the sampling method. Check that the data are relevant and were collected appropriately.

  • Analyze: Begin with graphical exploration and apply sound statistical methods. Use common sense and critical thinking, not just computational skills.

  • Conclude: Distinguish between statistical significance (results unlikely to occur by chance) and practical significance (results large enough to matter in a real-world context).

Key Terms

  • Statistical Significance: Achieved when a result is very unlikely to occur by chance. A common criterion is a probability of 5% or less of getting a result at least that extreme by chance alone. Example: Getting 98 girls in 100 births is statistically significant; getting 52 girls is not.

  • Practical Significance: A result may be statistically significant but not large enough to be useful in practice. Example: A diet program resulting in a mean weight loss of 2.1 kg may be statistically significant but not practically significant for most dieters.
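The births example under Statistical Significance can be checked with a short calculation. The sketch below is illustrative only: it assumes girls and boys are equally likely (p = 0.5), that births are independent, and that "at least as extreme" means "this many girls or more". None of these assumptions is stated in the notes, and the helper name prob_at_least is hypothetical.

```python
from math import comb

def prob_at_least(k, n=100, p=0.5):
    """P(X >= k) for X ~ Binomial(n, p); p = 0.5 is an assumption."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))

# Probability of a result at least this extreme, by chance alone
p_98 = prob_at_least(98)
p_52 = prob_at_least(52)

print(f"P(98 or more girls in 100 births) = {p_98:.3g}")
print(f"P(52 or more girls in 100 births) = {p_52:.3g}")

# Compare each probability with the 5% criterion from the notes
print("98 girls significant at the 5% level:", p_98 <= 0.05)
print("52 girls significant at the 5% level:", p_52 <= 0.05)
```

Under these assumptions the probability for 98 or more girls is vanishingly small, while the probability for 52 or more girls is far above 0.05, which matches the conclusion in the example.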

1-2 Types of Data

Data are collections of observations, such as measurements, genders, or survey responses. Understanding the type of data is crucial for selecting appropriate statistical methods.

  • Data: Collections of observations (measurements, survey responses, etc.).

  • Statistics: The science of planning studies and experiments; obtaining, organizing, summarizing, presenting, analyzing, and interpreting data to draw conclusions.

1-3 Collecting Sample Data

Proper data collection is fundamental to valid statistical analysis. The distinction between population and sample is central to statistical inference.

  • Population: The complete collection of all measurements or data being considered. Inferences are typically made about the population.

  • Census: Data collected from every member of a population.

  • Sample: A subcollection of members selected from a population.

Example: Watch What You Post Online

In a survey of 410 human resource professionals, 148 reported disqualifying job candidates due to social media postings. Here, the population is all human resource professionals, and the sample is the 410 surveyed. The goal is to use the sample to infer conclusions about the population.
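The sample statistic behind this example is a simple proportion. A minimal sketch of the arithmetic, using only the two counts given above (the variable names are illustrative):

```python
surveyed = 410       # sample size: human resource professionals surveyed
disqualified = 148   # respondents who reported disqualifying a candidate

# Sample proportion: the share of the sample with the characteristic
sample_proportion = disqualified / surveyed

print(f"Sample proportion: {sample_proportion:.3f}")
print(f"As a percentage:   {sample_proportion:.1%}")
```

The proportion of about 36% describes the 410 people surveyed. Treating it as an estimate for all human resource professionals is an inference, and it is only as good as the sampling method.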

Sampling Methods

  • Random Sampling: Individuals are selected randomly, ensuring each member of the population has an equal chance of being chosen.

  • Voluntary Response Sample (Self-Selected Sample): Respondents decide whether to participate. Common in internet, mail-in, and call-in polls. These samples are often biased and should not be used to make population-wide conclusions.

Example: Voluntary Response Sample

In a TV poll, 67% of 186,000 volunteer respondents wanted the UN headquarters moved, while only 38% of 500 randomly selected respondents agreed. Although the random sample is far smaller, it gives the more reliable result, because its respondents were chosen at random rather than choosing themselves.
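A small simulation can show why a large voluntary response sample can mislead while a small random sample does not. Everything in the sketch below is hypothetical: the population size, the true 40% share, and the response rates (people who hold the opinion are assumed to be three times as likely to respond) are invented for illustration and are not taken from the TV poll.

```python
import random

random.seed(0)  # fixed seed so the sketch is reproducible

# Hypothetical population: 100,000 people, 40% of whom hold the opinion
population = [True] * 40_000 + [False] * 60_000

# Random sample: each member has the same chance of being chosen
random_sample = random.sample(population, 500)
random_share = sum(random_sample) / len(random_sample)

# Voluntary response: each person decides whether to respond.
# Assumed response rates: 30% for opinion holders, 10% for everyone else.
def responds(holds_opinion):
    return random.random() < (0.30 if holds_opinion else 0.10)

volunteers = [person for person in population if responds(person)]
volunteer_share = sum(volunteers) / len(volunteers)

print(f"True share in population:        {0.40:.2f}")
print(f"Random sample (n = 500):         {random_share:.2f}")
print(f"Voluntary response (n = {len(volunteers)}): {volunteer_share:.2f}")
```

In this setup the volunteer sample is many times larger than the random sample, yet it overstates the share because the people who respond differ systematically from the people who do not.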

Statistical and Critical Thinking Process

  • Prepare: Assess context, source, and sampling method.

  • Analyze: Graph and explore data, apply statistical methods.

  • Conclude: Evaluate statistical and practical significance.

Table: Shoe Print Lengths and Heights of Eight Males

This table is used to explore the relationship between shoe print length and height, a common forensic application.

Shoe Print (cm)    Height (cm)
27.6               172.7
29.7               175.3
29.7               177.8
31.0               175.3
31.3               180.3
31.4               182.3
31.8               177.8
34.5               193.7

Purpose: To determine if there is a relationship between shoe print length and height. Forensic scientists use such data to estimate the height of individuals from crime scene evidence.
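One common way to quantify such a relationship is the linear correlation coefficient r. The notes do not say which method is intended, so the sketch below is an assumption: it computes Pearson's r directly from the eight data pairs in the table, and the function name pearson_r is illustrative.

```python
from math import sqrt

# The eight (shoe print, height) pairs from the table, in cm
shoe_print_cm = [27.6, 29.7, 29.7, 31.0, 31.3, 31.4, 31.8, 34.5]
height_cm = [172.7, 175.3, 177.8, 175.3, 180.3, 182.3, 177.8, 193.7]

def pearson_r(xs, ys):
    """Pearson linear correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations, and sums of squared deviations
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

r = pearson_r(shoe_print_cm, height_cm)
print(f"r = {r:.3f}")
```

A value of r near 1 suggests a strong positive linear association in this sample. With only eight pairs, and one of them far from the rest (34.5 cm, 193.7 cm), the data should be graphed before any conclusion is drawn.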

Analyzing Data: Potential Pitfalls

  • Misleading Conclusions: Ensure statements are clear and understandable to those unfamiliar with statistics.

  • Sample Data Reported Instead of Measured: Prefer direct measurements over self-reported data.

  • Loaded Questions: Poorly worded survey questions can bias results.

  • Order of Questions: The sequence of survey questions can unintentionally influence responses.

  • Nonresponse: Occurs when selected individuals do not respond, potentially biasing results.

  • Low Response Rates: Low response rates decrease reliability and increase the risk of bias.

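  • Percentages: Be cautious of misleading percentages, especially those exceeding 100%. Example: A 100% reduction would eliminate something entirely, so a claimed reduction of more than 100% is impossible.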

Formulas and Equations

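  • Statistical Significance (p-value): A result is statistically significant when the probability of getting a result at least as extreme by chance alone is 0.05 or less, that is, when p-value ≤ 0.05.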

Additional info: These notes cover foundational concepts in statistics, including definitions, sampling methods, and critical thinking, which are essential for further study in statistical analysis and inference.
