Skip to main content
Back

Introduction to the Practice of Statistics: Key Concepts and Methods

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

1. Introduction to the Practice of Statistics

1.1 Define Statistics and Statistical Thinking

Statistics is the science of collecting, organizing, summarizing, and analyzing information to draw conclusions or answer questions. It provides a measure of confidence in any conclusions.

  • Data: Facts or propositions used to draw a conclusion or make a decision. Data describe characteristics of an individual.

  • Variability: Data often vary; understanding and describing sources of variability is a key goal of statistics.

Example: If you keep track of the number of hours you sleep each night, you may notice variability from day to day. Statistics helps describe and understand such variability.

1.2 Populations, Samples, and Parameters

In statistics, it is often impractical to study an entire group of interest (the population), so a sample is selected for study.

  • Population: The entire group of individuals to be studied.

  • Sample: A subset of the population that is being studied.

  • Individual: A person or object that is a member of the population.

  • Parameter: A numerical summary of a population.

  • Statistic: A numerical summary based on a sample.

Example: If the proportion of all students on campus who have a job is 0.849, this is a parameter. If a sample of 260 students is taken and the proportion with a job is 0.864, this is a statistic.

1.3 The Process of Statistics

The process of statistics involves several key steps:

  1. Identify the research objective: Clearly state the question(s) to be answered.

  2. Collect the information needed: Obtain relevant data to answer the question.

  3. Describe the data: Organize and summarize the information using tables, graphs, and numerical summaries.

  4. Draw conclusions from the data: Use statistical methods to make inferences and answer the research question.

Example: In a study of high school start times and sleep duration, researchers identified the objective, collected data from 383 adolescents, described the data, and concluded that later start times were associated with longer sleep duration.

2. Types of Variables

2.1 Qualitative vs. Quantitative Variables

Variables are characteristics of individuals within the population. They can be classified as:

  • Qualitative (Categorical) Variables: Allow for classification based on attribute or characteristic (e.g., gender, name of university).

  • Quantitative Variables: Provide numerical measures of individuals (e.g., height, number of vending machines).

Example: Education level is qualitative; daily intake of whole grains is quantitative.

2.2 Discrete vs. Continuous Variables

Quantitative variables can be further classified as:

  • Discrete Variables: Have a finite or countable number of possible values (e.g., number of children in a classroom).

  • Continuous Variables: Have an infinite number of possible values, often measured (e.g., income, temperature).

Example: Grade earned in Algebra (as a percentage) is continuous; number of vending machines is discrete.

3. Levels of Measurement

3.1 Nominal, Ordinal, Interval, and Ratio Levels

Variables can be measured at different levels:

  • Nominal Level: Values are names, labels, or categories; no order (e.g., gender).

  • Ordinal Level: Values can be ranked or ordered (e.g., income status: low, middle, high).

  • Interval Level: Ordered values with meaningful differences, but no true zero (e.g., temperature in Celsius).

  • Ratio Level: Ordered values with meaningful differences and a true zero; ratios are meaningful (e.g., income).

Example: Number of children in a classroom is ratio; income status is ordinal.

4. Observational Studies vs. Designed Experiments

4.1 Distinguishing Study Types

Statistical studies can be classified as observational studies or designed experiments:

  • Observational Study: Measures the value of the response variable without attempting to influence the value of either the response or explanatory variables.

  • Designed Experiment: Researcher intentionally manipulates explanatory variables and controls other variables to observe effects on the response variable.

Example: Studying the effect of music on intelligence by observing groups is observational; assigning groups and controlling exposure is experimental.

4.2 Confounding and Lurking Variables

Confounding occurs when the effects of two or more variables are not separated, making it unclear which variable is responsible for an observed effect. A lurking variable is an unconsidered variable that may affect the response variable.

  • Confounding Variable: May be related to both explanatory and response variables, obscuring causal relationships.

  • Lurking Variable: Not included in the study but may influence results.

Example: In a study of flu shots and seniors, age and health status may be lurking variables affecting the outcome.

5. Summary Table: Types of Variables

Type of Variable

Description

Example

Qualitative

Classification based on attribute or characteristic

Gender, name of university

Quantitative

Numerical measures; can be discrete or continuous

Height, number of vending machines

Discrete

Countable number of possible values

Number of children in a classroom

Continuous

Infinite number of possible values

Income, temperature

6. Key Formulas and Notation

  • Population Proportion:

  • Sample Proportion:

  • Mean (Average):

  • Standard Deviation:

*Additional info: Academic context and examples have been expanded for clarity and completeness.*

Pearson Logo

Study Prep