BackFundamentals of Statistics: Data Collection and Statistical Thinking
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Introduction to the Practice of Statistics
Objectives
Define statistics and statistical thinking
Explain the process of statistics
Distinguish between qualitative and quantitative variables
Distinguish between discrete and continuous variables
Determine the level of measurement of a variable
Define Statistics and Statistical Thinking
Statistics is the science of collecting, organizing, summarizing, and analyzing information to draw conclusions or answer questions. It also involves providing a measure of confidence in any conclusions drawn from data.
Data: Facts or propositions used to draw a conclusion or make a decision. Data describe the characteristics of an individual.
Variability: Data vary among individuals and even within the same individual over time. Understanding and describing sources of variability is a key goal of statistics.
Example: Not everyone in a class has the same height or hair color, and an individual's sleep hours or calorie intake can vary from day to day.
Explain the Process of Statistics
Populations, Samples, and Individuals
The process of statistics involves working with groups and subsets:
Population: The entire group of individuals to be studied.
Individual: A single person or object that is a member of the population.
Sample: A subset of the population that is being studied.
Descriptive and Inferential Statistics
Statistic: A numerical summary based on a sample.
Parameter: A numerical summary of a population.
Descriptive statistics: Organizing and summarizing data through numerical summaries, tables, and graphs.
Inferential statistics: Using methods that take results from a sample, extend them to the population, and measure the reliability of the result.
Example: If 48.2% of all students on a campus own a car, this is a parameter. If a sample of 100 students shows 46% own a car, this is a statistic.
The Statistical Process
Identify the research objective: Clearly state the question and identify the population.
Collect the data: Gather data needed to answer the question, often from a sample due to practical constraints.
Describe the data: Use descriptive statistics to summarize and understand the data.
Perform inference: Apply inferential techniques to generalize from the sample to the population and report the reliability (e.g., margin of error).
Example: A poll surveys 1,628 adults, finding 52% trust their neighbors. The margin of error is reported to account for uncertainty in generalizing to all adults.
Types of Variables
Qualitative vs. Quantitative Variables
Variables are characteristics of individuals that can vary.
Qualitative (Categorical) Variables: Allow for classification based on attributes or characteristics (e.g., gender, hair color).
Quantitative Variables: Provide numerical measures of individuals. Arithmetic operations are meaningful (e.g., height, number of days studied).
Example: Gender is qualitative; number of days studied is quantitative.
Discrete vs. Continuous Variables
Discrete Variable: Quantitative variable with a finite or countable number of possible values (e.g., number of cars, number of heads in coin flips).
Continuous Variable: Quantitative variable with an infinite number of possible values, measurable to any desired level of accuracy (e.g., distance traveled, temperature).
Example: Number of cars is discrete; distance a car can travel is continuous.
Levels of Measurement
Nominal: Values name, label, or categorize; no inherent order (e.g., gender).
Ordinal: Values can be ranked or ordered (e.g., letter grades).
Interval: Differences between values have meaning; zero does not indicate absence (e.g., temperature in Celsius).
Ratio: Ratios of values have meaning; zero indicates absence (e.g., number of days studied).
Example: Temperature is interval; number of days studied is ratio.