BackIntroduction to the Practice of Statistics: Key Concepts and Methods
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
1. Introduction to the Practice of Statistics
1.1 Define Statistics and Statistical Thinking
Statistics is the science of collecting, organizing, summarizing, and analyzing information to draw conclusions or answer questions. It provides a measure of confidence in any conclusions.
Data: Facts or propositions used to draw a conclusion or make a decision. Data describe characteristics of an individual.
Variability: Data often vary; understanding and describing sources of variability is a key goal of statistics.
Example: If you keep track of the number of hours you sleep each night, you may notice variability from day to day. Statistics helps describe and understand such variability.
1.2 Populations, Samples, and Parameters
In statistics, it is often impractical to study an entire group of interest (the population), so a sample is selected for study.
Population: The entire group of individuals to be studied.
Sample: A subset of the population that is being studied.
Individual: A person or object that is a member of the population.
Parameter: A numerical summary of a population.
Statistic: A numerical summary based on a sample.
Example: If the proportion of all students on campus who have a job is 0.849, this is a parameter. If a sample of 260 students is taken and the proportion with a job is 0.864, this is a statistic.
1.3 The Process of Statistics
The process of statistics involves several key steps:
Identify the research objective: Clearly state the question(s) to be answered.
Collect the information needed: Obtain relevant data to answer the question.
Describe the data: Organize and summarize the information using tables, graphs, and numerical summaries.
Draw conclusions from the data: Use statistical methods to make inferences and answer the research question.
Example: In a study of high school start times and sleep duration, researchers identified the objective, collected data from 383 adolescents, described the data, and concluded that later start times were associated with longer sleep duration.
2. Types of Variables
2.1 Qualitative vs. Quantitative Variables
Variables are characteristics of individuals within the population. They can be classified as:
Qualitative (Categorical) Variables: Allow for classification based on attribute or characteristic (e.g., gender, name of university).
Quantitative Variables: Provide numerical measures of individuals (e.g., height, number of vending machines).
Example: Education level is qualitative; daily intake of whole grains is quantitative.
2.2 Discrete vs. Continuous Variables
Quantitative variables can be further classified as:
Discrete Variables: Have a finite or countable number of possible values (e.g., number of children in a classroom).
Continuous Variables: Have an infinite number of possible values, often measured (e.g., income, temperature).
Example: Grade earned in Algebra (as a percentage) is continuous; number of vending machines is discrete.
3. Levels of Measurement
3.1 Nominal, Ordinal, Interval, and Ratio Levels
Variables can be measured at different levels:
Nominal Level: Values are names, labels, or categories; no order (e.g., gender).
Ordinal Level: Values can be ranked or ordered (e.g., income status: low, middle, high).
Interval Level: Ordered values with meaningful differences, but no true zero (e.g., temperature in Celsius).
Ratio Level: Ordered values with meaningful differences and a true zero; ratios are meaningful (e.g., income).
Example: Number of children in a classroom is ratio; income status is ordinal.
4. Observational Studies vs. Designed Experiments
4.1 Distinguishing Study Types
Statistical studies can be classified as observational studies or designed experiments:
Observational Study: Measures the value of the response variable without attempting to influence the value of either the response or explanatory variables.
Designed Experiment: Researcher intentionally manipulates explanatory variables and controls other variables to observe effects on the response variable.
Example: Studying the effect of music on intelligence by observing groups is observational; assigning groups and controlling exposure is experimental.
4.2 Confounding and Lurking Variables
Confounding occurs when the effects of two or more variables are not separated, making it unclear which variable is responsible for an observed effect. A lurking variable is an unconsidered variable that may affect the response variable.
Confounding Variable: May be related to both explanatory and response variables, obscuring causal relationships.
Lurking Variable: Not included in the study but may influence results.
Example: In a study of flu shots and seniors, age and health status may be lurking variables affecting the outcome.
5. Summary Table: Types of Variables
Type of Variable | Description | Example |
|---|---|---|
Qualitative | Classification based on attribute or characteristic | Gender, name of university |
Quantitative | Numerical measures; can be discrete or continuous | Height, number of vending machines |
Discrete | Countable number of possible values | Number of children in a classroom |
Continuous | Infinite number of possible values | Income, temperature |
6. Key Formulas and Notation
Population Proportion:
Sample Proportion:
Mean (Average):
Standard Deviation:
*Additional info: Academic context and examples have been expanded for clarity and completeness.*