BackIntroduction to Statistics: Key Concepts and Foundations
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Chapter 1: Introduction to Statistics
Chapter Outline
1. An Overview of Statistics
2. Data Classification
3. Data Collection and Experimental Design
An Overview of Statistics
Section 1.1 Objectives
Define statistics and data
Distinguish between a population and a sample
Distinguish between a parameter and a statistic
Distinguish between descriptive statistics and inferential statistics
What is Data?
Data consists of information collected from observations, counts, measurements, or responses. In statistics, data is the foundation for analysis and decision-making.
Observation: Recording what is seen or experienced.
Measurement: Assigning numbers or values to characteristics.
Response: Answers or reactions from subjects in a study.
Example: "7 in 10 Americans believe the arts unify their communities, and 2 in 5 Americans have changed opinion or perception based on an arts experience." (Source: Americans for the Arts)
Example: "21% of 8–11 year-olds have a social media profile." (Source: Smart Insights, Ltd)
What is Statistics?
Statistics is the science of collecting, organizing, analyzing, and interpreting data to make decisions. It provides methods for understanding and describing variability in data and for making informed conclusions.
Collecting: Gathering data from various sources.
Organizing: Arranging data in a meaningful way.
Analyzing: Applying mathematical or graphical techniques to summarize data.
Interpreting: Drawing conclusions and making decisions based on data analysis.
Data Sets: Population and Sample
Understanding the difference between a population and a sample is fundamental in statistics.
Population: The collection of all outcomes, responses, measurements, or counts that are of interest.
Sample: A subset, or part, of the population, selected for analysis.
Example: In a survey of 834 employees in the United States, the population is all employees in the U.S., while the sample is the 834 employees who responded.
Parameter and Statistic
Statistics distinguishes between values that describe populations and those that describe samples.
Parameter: A numerical description of a population characteristic.
Statistic: A numerical description of a sample characteristic.
Example: The average age of all people in the United States is a parameter. The average age of people from a sample of three states is a statistic.
Example: If a survey of 9400 individuals finds an average of 5.19 hours per day spent on leisure, this is a sample statistic because it is based on a subset of the population.
Example: If the freshman class at a university has an average SAT math score of 514, this is a population parameter because it describes the entire class.
Example: If the FDA finds that 34% of randomly checked retail stores are not storing fish at the proper temperature, this is a sample statistic.
Branches of Statistics
Statistics is divided into two main branches: descriptive statistics and inferential statistics.
Descriptive Statistics: Involves the organization, summarization, and display of data. Common tools include tables, charts, and graphs.
Inferential Statistics: Involves using sample data to draw conclusions or make inferences about a population.
Comparison Table: Descriptive vs. Inferential Statistics
Branch | Main Purpose | Examples |
|---|---|---|
Descriptive Statistics | Summarize and present data | Mean, median, mode, charts, tables |
Inferential Statistics | Draw conclusions about populations from samples | Hypothesis testing, confidence intervals, regression |
Examples: Descriptive and Inferential Statistics
In studies, it is important to identify the population, sample, and which branch of statistics is being used.
Descriptive: "18% of adults from households earning less than $30,000 annually do not use the Internet." (Based on a sample of 1502 U.S. adults)
Inferential: A possible inference is that the Internet may be less accessible to lower-income households.
Descriptive: "32% of 1000 U.S. 401(k) retirement plan participants do not know how many years their retirement savings might last."
Inferential: It may be inferred that determining the amount of money needed for retirement is difficult for many people.
Key Formulas and Notation
Population Mean:
Sample Mean:
Note: denotes the population mean, denotes the sample mean, is the population size, and is the sample size.