Skip to main content
Back

Elementary Statistics: Chapter 1 – Data Collection and Statistical Thinking

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 1: Data Collection

Introduction to the Practice of Statistics

Statistics is a foundational discipline for making informed decisions using data. This chapter introduces the basic concepts, terminology, and processes essential for understanding and applying statistics in real-world contexts.

Define Statistics and Statistical Thinking

  • Statistics is the science of collecting, organizing, summarizing, and analyzing information to draw conclusions or answer questions.

  • It also involves providing a measure of confidence in any conclusions drawn.

  • The information used in statistics is called data. Data are facts or propositions used to draw a conclusion or make a decision, and they represent characteristics of individuals.

  • One key aspect of data is variability—data values can differ among individuals.

  • The primary goal of statistics is to describe and understand sources of variability.

Explain the Process of Statistics

The process of statistics involves several key steps to ensure valid and reliable conclusions:

  1. Identify the Research Objective: Clearly define the question(s) to be answered and the population of interest.

  2. Collect the Data: Gather data relevant to the research objective. Proper data collection is crucial for meaningful results.

  3. Describe the Data: Use descriptive statistics (numerical summaries, tables, graphs) to summarize and explore the data.

  4. Perform Inference: Apply inferential statistical methods to extend results from the sample to the population and report the reliability of these results.

Key Statistical Terms

  • Population: The entire group of individuals to be studied.

  • Sample: A subset of the population that is actually studied.

  • Individual: A single person or object that is a member of the population.

  • Statistic: A numerical summary based on a sample.

  • Parameter: A numerical summary of a population.

  • Descriptive Statistics: Methods for organizing and summarizing data.

  • Inferential Statistics: Methods for making generalizations from a sample to a population and measuring the reliability of those generalizations.

Example: Statistic vs. Parameter

  • If a study of all 3,628 students at a college finds that 50% own a television, this is a parameter (since it describes the entire population).

  • If the average annual salary of 50 out of 80 employees is $50,000, this is a statistic (since it is based on a sample).

Types of Variables

Variables are characteristics of individuals within a population. Understanding the types of variables is essential for selecting appropriate statistical methods.

Qualitative vs. Quantitative Variables

  • Qualitative (Categorical) Variables: Classify individuals based on some attribute or characteristic (e.g., gender, color, type).

  • Quantitative Variables: Provide numerical measures of individuals. The values can be added or subtracted and provide meaningful results (e.g., height, weight, age).

Discrete vs. Continuous Variables

  • Discrete Variable: A quantitative variable with either a finite or countable number of possible values (e.g., number of students, number of cars).

  • Continuous Variable: A quantitative variable with an infinite number of possible values, measurable to any desired level of accuracy (e.g., time, distance, temperature).

Examples: Identifying Variable Types

  • Number of snack and soft drink vending machines in a school: Quantitative, Discrete

  • Whether the school has a closed campus policy during lunch: Qualitative

  • Class rank (Freshman, Sophomore, Junior, Senior): Qualitative

  • Number of days per week a student eats school lunch: Quantitative, Discrete

Sampling Methods

Sampling is the process of selecting a subset of individuals from a population to estimate characteristics of the whole population.

Simple Random Sampling

  • Simple Random Sampling is a method where every possible sample of a given size has an equally likely chance of being selected.

  • Steps to obtain a simple random sample:

    1. Obtain a frame (list) of all individuals in the population and number them from 1 to N.

    2. Use a random number table, calculator, or statistical software to randomly select n numbers (where n is the desired sample size).

    3. Set a seed for reproducibility when using random number generators.

Example: Simple Random Sampling Methods

  • Number the names from 1 to 25 and use a random number generator to select 4 different numbers corresponding to the names selected.

  • Put each name on a separate piece of paper, place them all in a hat, and pick four.

  • Listing members alphabetically and taking the first four is not a simple random sample.

Bias in Sampling

Bias occurs when the results of a sample are not representative of the population.

  • For example, if a village manager selects 10 homes in the southwest corner of the village to determine household income, the sample may not represent the entire village, introducing bias.

Summary Table: Types of Variables

Type

Description

Examples

Qualitative (Categorical)

Classifies individuals based on attributes or characteristics

Gender, color, class rank

Quantitative (Discrete)

Numerical, countable values

Number of students, number of cars

Quantitative (Continuous)

Numerical, measurable values with infinite possibilities

Height, weight, time

Key Formulas

  • Sample Mean:

  • Population Mean:

  • Sample Proportion:

  • Population Proportion:

Additional info: This summary is based on the provided slides and standard introductory statistics content. Some explanations and formulas have been expanded for clarity and completeness.

Pearson Logo

Study Prep