Skip to main content
Back

Introduction to the Practice of Statistics: Key Concepts, Data Types, Study Designs, and Sampling Methods

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Introduction to the Practice of Statistics

Objectives

  • Define Statistics

  • Explain the process of statistics

  • Distinguish between Qualitative and Quantitative variables

  • Distinguish between Discrete and Continuous variables

Definition of Statistics

Statistics is the science of collecting, organizing, summarizing, and analyzing information to draw conclusions or answer questions.

Four-Step Statistical Process

  1. Plan (Identify a question): Formulate a statistical question that can be answered with data. This is a crucial step in the process.

  2. Collect (Produce Data): Design and implement a plan to collect appropriate data. Data can be gathered through observations, interviews, questionnaires, databases, sampling, or experimentation.

  3. Process (Analyze the Data): Organize and summarize the data using graphical or numerical methods. This may include histograms, dot plots, and box plots.

  4. Conclusion (Interpret the Results): Interpret findings in the context of the original question and explain how the data answers it.

Key Terms in the Process of Statistics

  • Population: The entire group of individuals to be studied.

  • Individual: A person or object that is a member of the population.

  • Sample: A subset of the population that is being studied.

Types of Statistics

  • Descriptive statistics: Methods for organizing and summarizing data, often using tables, graphs, and numerical summaries.

  • Inferential statistics: Methods that use sample data to make generalizations about a population and measure the reliability of the results.

Example

  • The average score for a class of 28 students taking a calculus midterm exam was 72%. Parameter

  • Interviews of 100 adults 18 years of age or older found that 44% could state the minimum age required for the office of U.S. president. Statistic

Types of Data: Qualitative and Quantitative Variables

Qualitative vs. Quantitative Variables

  • Qualitative (Categorical) variables: Classify individuals based on some attribute or characteristic (e.g., gender, nationality).

  • Quantitative variables: Provide numerical measures of individuals. Arithmetic operations can be performed on these values (e.g., age, income).

Discrete vs. Continuous Variables

  • Discrete variable: A quantitative variable with a finite or countable number of possible values (e.g., number of children).

  • Continuous variable: A quantitative variable with an infinite number of possible values that are not countable (e.g., daily intake of whole grains in grams).

Example: Classification

  • Nationality: Qualitative

  • Number of children: Discrete

  • Household income in the previous year: Quantitative

  • Daily intake of whole grains: Continuous

Observational Studies versus Designed Experiments

Objectives

  • Distinguish between an observational study and an experiment

  • Explain the various types of observational studies

Definitions

  • Observational study: Measures the value of the response variable without attempting to influence the value of either the response or explanatory variables.

  • Designed experiment: Researcher intentionally changes the value of the explanatory variable and records the value of the response variable.

  • Lurking variable: An explanatory variable not considered in a study but may affect the value of the response variable.

Key Point

Observational studies do not allow a researcher to claim causation, only association. Only well-designed experiments can prove the cause and effect.

Types of Observational Studies

  • Cross-sectional studies: Collect information about individuals at a specific point in time or over a very short period.

  • Case-control studies: Retrospective studies that require individuals to look back in time or require researchers to look at existing records. Individuals are matched based on certain characteristics.

  • Cohort studies: Identify a group of individuals to participate in a study (the cohort) and are observed over a long period. Characteristics are recorded, and data is collected over time.

Example Table: Types of Observational Studies

Study Type

Key Feature

Example

Cross-sectional

Snapshot at one point in time

Daily coffee consumption and skin cancer

Case-control

Retrospective, matched groups

Tanning bed use and skin cancer

Cohort

Follow group over time

Doll and Hill smoking study

Sampling Techniques

Simple Random Sampling

Random sampling is the process of using chance to select individuals from a population to be included in the sample. A sample of size n from a population of size N is obtained through simple random sampling if every possible sample of size n has an equally likely chance of occurring.

Steps for Simple Random Sampling

  1. Put members in alphabetical order and number them.

  2. Select random numbers using a random number generator or table.

  3. Match generated random numbers to the corresponding individuals.

Example: Simple Random Sampling

  • From 80 students, select 5 using random digits: 05, 16, 62, 77, 48.

Other Sampling Techniques

  • Stratified sample: Population is separated into homogeneous, nonoverlapping groups (strata), and a simple random sample is taken from each stratum.

  • Systematic sample: Every kth individual is selected from the population. The first individual is selected at random between 1 and k.

  • Cluster sample: All individuals within a randomly selected collection or group are included in the sample.

Comparison Table: Sampling Techniques

Sampling Method

Key Feature

Example

Simple Random

Every individual has equal chance

Randomly select 5 students from a class

Stratified

Divide into strata, sample from each

Sample students by grade level

Systematic

Select every k-th individual

Every 10th person entering a store

Cluster

Sample all individuals in selected groups

Sample all students in selected classrooms

Strata vs. Clusters

Strata are groups of similar individuals, and samples are taken from each stratum. Clusters are groups selected at random, and all individuals in the selected clusters are sampled.

Summary of Key Formulas

  • Parameter vs. Statistic: A parameter describes a population, a statistic describes a sample.

  • Simple Random Sampling Probability:

Additional info: Some explanations and examples have been expanded for clarity and completeness.

Pearson Logo

Study Prep