BackChapter 1: Introduction to Statistics – Structured Study Notes
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Introduction to Statistics
Overview
This chapter introduces the foundational concepts of statistics, including the definition of statistics, types of data, and methods for collecting sample data. Understanding these basics is essential for further study and application of statistical methods.
Statistical and Critical Thinking
Key Concepts
Statistical Study Process: The process involves three main steps: prepare, analyze, and conclude.
Statistical Thinking: Involves critical thinking and the ability to interpret results, not just perform calculations.
Steps in Statistical Study
Prepare: Understand the context, source, and sampling method of the data.
Analyze: Graph and explore the data, apply statistical methods, and interpret results.
Conclude: Assess the significance and practical implications of the findings.
Basic Definitions
Data
Data: Collections of observations, such as measurements, genders, or survey responses.
Statistics
Statistics: The science of planning studies and experiments; obtaining, organizing, summarizing, presenting, analyzing, and interpreting data to draw conclusions.
Population and Sample
Population: The complete collection of all measurements or data being considered, typically the group about which inferences are made.
Census: Data collected from every member of a population.
Sample: A subcollection of members selected from a population.
Example
Population: All human resource professionals.
Sample: The 410 human resource professionals surveyed.
Types of Data
Statistic vs. Parameter
Parameter: A numerical measurement describing a characteristic of a population.
Statistic: A numerical measurement describing a characteristic of a sample.
Quantitative vs. Categorical Data
Quantitative (Numerical) Data: Numbers representing counts or measurements. Examples: Weights of supermodels, ages of respondents.
Categorical (Qualitative) Data: Names or labels, not numbers. Examples: Gender of athletes, shirt numbers on uniforms.
Discrete vs. Continuous Data
Discrete Data: Quantitative data with a finite or countable number of values. Example: Number of coin tosses before getting tails.
Continuous Data: Quantitative data with infinitely many possible values, not countable. Example: Lengths of distances from 0 cm to 12 cm.
Levels of Measurement
Classification
Nominal: Categories only; no order. Example: Survey responses: yes, no, undecided.
Ordinal: Categories with some order; differences not meaningful. Example: Course grades: A, B, C, D, F.
Interval: Ordered, meaningful differences; no natural zero. Example: Years: 1000, 2000, 1776, 1492.
Ratio: Ordered, meaningful differences and ratios; natural zero. Example: Class times: 50 minutes, 100 minutes.
Summary Table: Levels of Measurement
Level | Description | Example |
|---|---|---|
Nominal | Categories only | Yes/No/Undecided |
Ordinal | Categories with order | Grades (A, B, C, D, F) |
Interval | Differences, no natural zero | Years (1000, 2000) |
Ratio | Differences and natural zero | Class times (50, 100 min) |
Collecting Sample Data
Observational Studies vs. Experiments
Observational Study: Observe and measure characteristics without modifying subjects.
Experiment: Apply treatment and observe effects on subjects (experimental units).
Example: Ice Cream and Drownings
Observational Study: Correlation between ice cream sales and drownings due to lurking variable (temperature).
Experiment: Groups treated with/without ice cream show no effect on drownings, demonstrating the importance of experimental design.
Sampling Methods
Simple Random Sample: Every possible sample of size has the same chance of being chosen.
Systematic Sampling: Select a starting point, then every th element.
Convenience Sampling: Use data that are easy to obtain.
Stratified Sampling: Subdivide population into subgroups (strata), sample from each.
Cluster Sampling: Divide population into clusters, randomly select clusters, sample all members in selected clusters.
Multistage Sampling: Combine multiple sampling methods in stages.
Types of Observational Studies
Cross-sectional Study: Data collected at one point in time.
Retrospective (Case Control) Study: Data collected from past records.
Prospective (Cohort) Study: Data collected in the future from groups sharing common factors.
Cross-Section vs. Time-Series Data
Definitions
Cross-Section Data: Data collected on different elements at the same point in time or for the same period.
Time-Series Data: Data collected on the same element for the same variable at different points in time.
Example Tables
Company | 2010 Total Revenue (millions of dollars) |
|---|---|
Wal-Mart Stores | 421,849 |
Royal Dutch Shell | 378,152 |
Exxon Mobil | 354,674 |
BP | 308,928 |
Sinopec Group | 273,422 |
China National Petroleum | 240,192 |
Year | Money Recovered (billions of dollars) |
|---|---|
2006 | 2.2 |
2007 | 1.8 |
2008 | 1.0 |
2009 | 1.6 |
2010 | 2.5 |
Key Formulas
Sample Mean:
Population Mean:
Additional info: These notes expand on the brief points in the slides and text, providing definitions, examples, and structured tables for clarity and completeness.