BackIntroduction to Statistics: Concepts, Data, and Types
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Chapter 1: Introduction to Statistics
Overview of Statistics
Statistics is a foundational discipline in data analysis, focusing on the collection, organization, analysis, and interpretation of data to inform decision-making. This chapter introduces the basic concepts and terminology essential for understanding statistics.
What is Data?
Data refers to a collection of information obtained from observations, counts, measurements, or responses. It forms the basis for statistical analysis and can be quantitative or qualitative.
Definition: Data is a list of information coming from observations, counts, measurements, or responses.
Examples:
"1 in 10 Americans believe the arts unify their community."
"2 in 5 Americans have changed an opinion or perception based on an arts experience."
"21% of 8–11 year-olds have a social media profile."
What is Statistics?
Statistics is the science of collecting, organizing, analyzing, and interpreting data to make decisions. It provides tools for understanding and drawing conclusions from data.
Definition: Statistics is the science of collecting, organizing, analyzing, and interpreting data to make decisions.
Applications: Used in fields such as business, healthcare, social sciences, and government to inform policy and practice.
Data Sets: Populations and Samples
Population
A population is the complete collection of all outcomes, responses, measurements, or counts that are of interest in a statistical study.
Definition: The entire group of individuals or items under consideration.
Example: All employees in the United States.
Sample
A sample is a subset, or part, of the population. Samples are used to make inferences about the population when it is impractical to collect data from every member.
Definition: A portion of the population selected for analysis.
Example: 834 employees surveyed out of all U.S. employees.
Example: Identifying Data Sets
In a recent survey, 834 employees in the United States were asked if they thought their jobs were highly stressful. Of the 834 respondents, 517 said yes.
Population: All employees in the United States.
Sample: The 834 employees who responded to the survey.
Sample Data Set: 517 "yes" responses and 317 "no" responses.
Parameters and Statistics
Parameter
A parameter is a numerical description of a population characteristic.
Definition: A value that describes a characteristic of the entire population.
Example: The average age of all people in the United States.
Statistic
A statistic is a numerical description of a sample characteristic.
Definition: A value that describes a characteristic of a sample.
Example: The average age of people from a sample of three states.
Examples: Distinguishing Parameters and Statistics
Sample Statistic: In the United States, a survey of about 9,400 individuals aged 15 and over found that such individuals spent an average of 5.19 hours per day engaged in leisure and sports activities. (Based on a subset of the population.)
Population Parameter: The freshman class at a university has an average SAT math score of 514. (Based on the entire freshman class.)
Sample Statistic: In a random check of several hundred retail stores, the Food and Drug Administration found that 34% of the stores were not storing fish at the proper temperature. (Based on a subset of all stores.)
Branches of Statistics
Branch | Description | Examples |
|---|---|---|
Descriptive Statistics | Involves the organization, summarization, and display of data. | Tables, charts, averages |
Inferential Statistics | Uses sample data to draw conclusions about a population. | Making predictions, testing hypotheses |
Example: Descriptive and Inferential Statistics
Descriptive: A study of 1,502 U.S. adults found that 18% of adults from households earning less than $30,000 annually do not use the Internet.
Inferential: A possible inference is that the Internet has been made less accessible to lower-income households.
Descriptive: A study of 1,000 U.S. 401(k) retirement plan participants found that 32% do not know how many years their retirement savings might last.
Inferential: A possible inference is that the amount of money a person needs for retirement is difficult to determine.
Summary Table: Key Terms
Term | Definition | Example |
|---|---|---|
Population | Entire group under study | All U.S. employees |
Sample | Subset of the population | 834 surveyed employees |
Parameter | Numerical description of a population | Average SAT score of all freshmen |
Statistic | Numerical description of a sample | Average SAT score from a sample |
Descriptive Statistics | Summarizes and displays data | Percentage of Internet non-users |
Inferential Statistics | Draws conclusions about populations | Inferring accessibility issues |
Key Formulas
Sample Mean:
Population Mean:
Additional info: These notes expand on the brief points in the slides to provide full definitions, examples, and context for each concept, suitable for introductory statistics students.