Introduction to Statistics: Key Concepts and Critical Thinking
Introduction to Statistics
What is Statistics?
Statistics is the science of collecting, organizing, analyzing, interpreting, and presenting data to draw meaningful conclusions. It is essential for making informed decisions in various fields, including science, business, and public policy.
Definition: Statistics involves planning studies and experiments, obtaining data, and then organizing, summarizing, presenting, analyzing, and interpreting those data.
Purpose: The ultimate goal is to draw conclusions and make inferences about populations based on sample data.
Applications: Used in quality control, medical research, social sciences, and more.
Key Terms in Statistics
Data: Collections of observations, such as measurements, genders, or survey responses.
Population: The complete collection of all measurements or data that are being considered. Typically, a population is the group about which we want to make inferences.
Census: The collection of data from every member of a population.
Sample: A subcollection of members selected from a population.
Example: Population vs. Sample
In a survey of 410 human resource professionals, 148 said job candidates were disqualified due to social media postings.
Population: All human resource professionals.
Sample: The 410 professionals who were surveyed.
Objective: Use the sample to draw conclusions about the entire population.
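As a small illustration of using a sample to describe a population, the sketch below computes the sample proportion from the survey counts above. The variable names are chosen here for illustration only. The result is just a point estimate: on its own it says nothing about how far the true population proportion might be from it.

```python
# Counts taken from the survey example above.
sample_size = 410    # human resource professionals surveyed
disqualified = 148   # said candidates were disqualified over social media postings

# Sample proportion: the share of the sample that gave this answer.
sample_proportion = disqualified / sample_size

print(f"Sample proportion: {sample_proportion:.3f} ({sample_proportion:.1%})")
```

The printed value is the sample's best single guess at the share of all human resource professionals who would give the same answer.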
Statistical and Critical Thinking
The Statistical Process
The process of conducting a statistical study consists of three main steps: prepare, analyze, and conclude.
Prepare: Understand the context, source of the data, and sampling method.
Analyze: Graph and explore the data, apply statistical methods.
Conclude: Assess statistical and practical significance of results.
Step 1: Prepare
Context: What do the data represent? What is the goal of the study?
Source of the Data: Are the data from a reputable source? Is there any bias?
Sampling Method: Were individuals randomly selected? Is the sample unbiased?
Step 2: Analyze
Graph the Data: Use appropriate graphs to visualize the data.
Explore the Data: Look for outliers, summarize with statistics (mean, standard deviation), check for missing data, and examine distribution.
Apply Statistical Methods: Use technology and sound statistical techniques to obtain results.
Step 3: Conclude
Statistical Significance: Results are statistically significant if they are very unlikely to occur by chance; a common criterion is a likelihood of 5% or less.
Practical Significance: Even if results are statistically significant, they may not be meaningful or useful in practice.
Example: Shoe Print Lengths and Heights
Forensic scientists use shoe print lengths at crime scenes to estimate the height of a criminal. The following table shows shoe print lengths and heights of eight males:
| Shoe Print (cm) | 27.6 | 29.7 | 29.7 | 31.0 | 31.3 | 31.4 | 31.8 | 34.5 |
|---|---|---|---|---|---|---|---|---|
| Height (cm) | 172.7 | 175.3 | 177.8 | 175.3 | 180.3 | 182.3 | 177.8 | 193.7 |
Context: The goal is to determine if there is a relationship between shoe print length and height.
Source: Data from a reputable source (Appendix B, Data Set 9 "Foot and Height").
Sampling Method: Individuals were randomly selected.
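The "Analyze" step for this example might be sketched as follows. The code summarizes each variable with its mean and sample standard deviation, then computes the linear correlation coefficient r as one possible way to quantify the relationship. The choice of r and the helper name `pearson_r` are assumptions made for this sketch; the notes themselves do not say which method is applied to these data.

```python
from math import sqrt
from statistics import mean, stdev

# Data from the table above (eight males).
shoe_print_cm = [27.6, 29.7, 29.7, 31.0, 31.3, 31.4, 31.8, 34.5]
height_cm = [172.7, 175.3, 177.8, 175.3, 180.3, 182.3, 177.8, 193.7]


def pearson_r(xs, ys):
    """Linear correlation coefficient r, computed from deviations about the means."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Explore: summarize each variable with its mean and sample standard deviation.
print(f"Shoe print: mean = {mean(shoe_print_cm):.2f} cm, s = {stdev(shoe_print_cm):.2f} cm")
print(f"Height:     mean = {mean(height_cm):.2f} cm, s = {stdev(height_cm):.2f} cm")

# Apply a method: r close to +1 suggests a strong positive linear association.
r = pearson_r(shoe_print_cm, height_cm)
print(f"Correlation r = {r:.3f}")
```

With only eight pairs of values, a scatterplot should still be inspected before drawing conclusions, because a single unusual point can have a large effect on r.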
Types of Samples
Voluntary Response Sample
A voluntary response sample (or self-selected sample) is one in which respondents themselves decide whether to participate. These samples are often biased and unreliable for making generalizations about a population.
Examples:
Internet polls
Mail-in polls
Telephone call-in polls
Problem: Results may be skewed due to self-selection bias.
Example: Voluntary Response Sample
Nightline poll: 186,000 volunteer respondents, 67% wanted the UN moved out of the US.
Random survey: 500 randomly selected respondents, 38% wanted the UN moved.
Conclusion: The smaller, randomly selected sample is more reliable because its sampling method is sound; a large number of respondents does not compensate for self-selection.
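A small simulation can show why self-selection distorts results even when the sample is large. Everything in this sketch is hypothetical: the population size, the assumed true share of 38%, and especially the two response rates are invented for illustration and are not data from either poll. The rates were picked so that the expected share among volunteers is about 67%, mirroring the example.

```python
import random

random.seed(0)  # fixed seed so the run is repeatable

# Hypothetical population: 38% hold the opinion (True), 62% do not (False).
POPULATION_SIZE = 100_000
TRUE_SHARE = 0.38
n_in_favor = round(POPULATION_SIZE * TRUE_SHARE)
population = [True] * n_in_favor + [False] * (POPULATION_SIZE - n_in_favor)

# Invented response rates: people holding the opinion are assumed to be
# more motivated to respond than everyone else.
RESPONSE_RATE_IN_FAVOR = 0.05
RESPONSE_RATE_OTHERS = 0.015

# Voluntary response sample: each person decides whether to respond.
voluntary = [
    opinion
    for opinion in population
    if random.random() < (RESPONSE_RATE_IN_FAVOR if opinion else RESPONSE_RATE_OTHERS)
]

# Simple random sample: 500 people chosen by chance, regardless of opinion.
random_sample = random.sample(population, 500)

voluntary_share = sum(voluntary) / len(voluntary)
random_share = sum(random_sample) / len(random_sample)

print(f"True share in population:  {TRUE_SHARE:.0%}")
print(f"Voluntary response sample: {voluntary_share:.0%} of {len(voluntary)} respondents")
print(f"Random sample:             {random_share:.0%} of {len(random_sample)} respondents")
```

Under these assumptions the voluntary sample contains several times as many respondents as the random sample, yet it overstates the share badly, while the random sample of 500 lands near the true value.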
Statistical Significance vs. Practical Significance
Definitions
Statistical Significance: Achieved when a result is very unlikely to occur by chance; a common criterion is a likelihood of 5% or less.
Practical Significance: Refers to whether the result is large enough to be meaningful in real-world terms.
Example
Weight loss study: 21 subjects on the Atkins diet lost an average of 2.1 kg in one year.
Statistical significance: The result is unlikely to be due to chance.
Practical significance: Many dieters may find a 2.1 kg loss not worth the effort, so the result lacks practical significance.
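To make the 5% criterion concrete, consider the "98 girls in 100 births" example listed in the summary table of these notes. The sketch below computes how likely such an extreme outcome would be by chance alone. It assumes a simple binomial model in which each birth is independently a girl with probability 0.5; that model and the function name `prob_at_least` are assumptions of this sketch, not something specified in the notes.

```python
from math import comb


def prob_at_least(k, n, p=0.5):
    """P(X >= k) for X ~ Binomial(n, p): the chance of k or more successes in n trials."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


# Assumption: each birth is independently a girl with probability 0.5.
p_extreme = prob_at_least(98, 100)

print(f"P(98 or more girls in 100 births) = {p_extreme:.2e}")
print("Statistically significant at the 5% level:", p_extreme <= 0.05)
```

The probability is far below 5%, so under this model such a result would be statistically significant. Whether a result matters in practice is a separate judgment that no probability calculation settles.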
Analyzing Data: Potential Pitfalls
Common Pitfalls
Misleading Conclusions: Conclusions should be clear and understandable to all audiences.
Sample Data Reported Instead of Measured: Direct measurement is preferred over self-reported data.
Loaded Questions: Poorly worded survey questions can bias results.
Order of Questions: The sequence of questions can unintentionally influence responses.
Nonresponse: Occurs when selected subjects do not respond, potentially biasing results.
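Low Response Rates: A low response rate reduces reliability and increases the risk of bias.
Percentages: Be cautious of misleading or unclear percentages. Because 100% of a quantity is all of it, claims such as a reduction of more than 100% are usually not justified.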
Examples of Loaded Questions
How great do you think FAU is?
How bad do you think FAU is?
Will you continue to support our amazing company?
Do you really intend to vote for that candidate?
Have you stopped procrastinating?
Summary Table: Key Statistical Concepts
| Term | Definition | Example |
|---|---|---|
| Population | Entire group being studied | All human resource professionals |
| Sample | Subset of the population | 410 surveyed professionals |
| Census | Data from every member of the population | National census |
| Voluntary Response Sample | Respondents choose to participate | Internet poll |
| Statistical Significance | Unlikely to occur by chance (≤5%) | 98 girls in 100 births |
| Practical Significance | Result is large enough to be meaningful in practice | A 2.1 kg weight loss may lack practical significance |