Skip to main content
Back

Introduction to Elementary Statistics: Chapter 1 Study Notes

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Chapter 1: Introduction to Statistics

Overview of Statistics

This chapter introduces the foundational concepts of statistics, including definitions, types of data, and the distinction between populations and samples. Understanding these basics is essential for further study in statistics.

  • Statistics is the science of collecting, organizing, analyzing, and interpreting data to make decisions.

  • Data refers to information obtained from observations, counts, measurements, or responses.

  • Examples of data include survey results, measurements, and recorded observations.

Populations and Samples

In statistics, it is important to distinguish between the entire group of interest and a subset used for analysis.

  • Population: The complete collection of all outcomes, responses, measurements, or counts that are of interest.

  • Sample: A subset, or part, of the population.

  • Example: In a survey of 834 U.S. employees, the population is all U.S. employees, while the sample is the 834 surveyed employees. The data set consists of 517 'yes' and 317 'no' responses.

Parameters and Statistics

Statistical analysis often involves summarizing data using numerical values. These summaries can describe either populations or samples.

  • Parameter: A numerical description of a population characteristic (e.g., average age of all people in the United States).

  • Statistic: A numerical description of a sample characteristic (e.g., average age of people from a sample of three states).

  • Example: If a survey of 9,400 individuals aged 15 and over finds an average of 5.19 hours per day spent on leisure, this is a sample statistic because it is based on a subset of the population.

Descriptive vs. Inferential Statistics

Statistics is divided into two main branches: descriptive and inferential.

  • Descriptive Statistics: Involves the organization, summarization, and display of data (e.g., tables, charts, averages).

  • Inferential Statistics: Involves using sample data to draw conclusions about a population.

  • Example: In a study of 1,502 U.S. adults, 18% of adults from households earning less than $30,000 do not use the Internet. This percentage is descriptive; inferring that Internet access is less available to lower-income households is inferential.

Data Classification

Types of Data

Data can be classified as qualitative or quantitative, depending on its nature.

  • Qualitative Data: Consists of attributes, labels, or non-numerical entries (e.g., place of birth, eye color).

  • Quantitative Data: Consists of numerical measurements or counts (e.g., weight, temperature).

  • Example: In a table of sports-related head injuries, the types of sports are qualitative data, while the number of head injuries is quantitative data.

Levels of Measurement

Data can be further classified according to four levels of measurement: nominal, ordinal, interval, and ratio.

  • Nominal Level: Qualitative data only; categorized using names, labels, or qualities. No mathematical computations can be made.

  • Ordinal Level: Qualitative or quantitative data; can be arranged in order or ranked, but differences between data entries are not meaningful.

  • Interval Level: Quantitative data; can be ordered, and differences between data entries are meaningful. Zero represents a position on a scale, but not an inherent zero.

  • Ratio Level: Similar to interval level, but zero is an inherent zero (implies "none"). Ratios of two data values can be formed, and one data value can be expressed as a multiple of another.

Level of Measurement

Characteristics

Example

Nominal

Put data in categories

Types of TV shows (Comedy, Drama, Reality)

Ordinal

Put data in order, but differences not meaningful

Movie genres ranked by popularity

Interval

Order, meaningful differences, no true zero

Monthly temperatures in °F

Ratio

Order, meaningful differences, true zero, ratios possible

Monthly precipitation in inches

Data Collection and Experimental Design

Designing a Statistical Study

Proper design is crucial for valid statistical studies. The process involves several steps:

  1. Identify the variable(s) and the population.

  2. Develop a detailed plan for collecting data. Ensure the sample is representative.

  3. Collect the data using appropriate techniques.

  4. Interpret the data and make decisions about the population using inferential statistics.

  5. Identify any possible sources of error.

Methods of Data Collection

  • Observational Study: The researcher observes and measures characteristics of interest without influencing the subjects.

  • Experiment: A treatment is applied to part of a population, and responses are observed. Control groups and placebos may be used.

  • Simulation: Uses mathematical or physical models to reproduce conditions of a situation or process, often with computers.

  • Survey: Investigation of one or more characteristics of a population, commonly done by interview, Internet, phone, or mail.

Experimental Design Elements

  • Control: Managing variables to minimize confounding effects.

  • Randomization: Randomly assigning subjects to treatment groups.

  • Replication: Repeating the experiment with a large group to validate results.

  • Blinding: Subjects do not know whether they are receiving treatment or placebo. In double-blind experiments, neither subjects nor experimenters know.

Sampling Techniques

Sampling is used to select a subset of the population for study. Several techniques exist:

  • Random Sample: Every member of the population has an equal chance of being selected.

  • Simple Random Sample: Every possible sample of the same size has the same chance of being selected.

  • Stratified Sample: Divide the population into groups (strata) and select a random sample from each group.

  • Cluster Sample: Divide the population into clusters, then select all members from one or more clusters.

  • Systematic Sample: Choose a starting value at random, then select every kth member.

  • Convenience Sample: Select only members who are easy to reach; often leads to bias and is not recommended.

Sampling Technique

Description

Example

Simple Random

Equal chance for all samples

Randomly select 8 students from 731 using random numbers

Stratified

Divide into strata, sample from each

Divide students by major, sample from each major

Cluster

Divide into clusters, select all from some clusters

Select all households from randomly chosen zip codes

Systematic

Select every kth member

Choose every 100th household

Convenience

Easy to reach, but biased

Survey students in your own class

Key Formulas and Concepts

  • Population Parameter: (mean of a population)

  • Sample Statistic: (mean of a sample)

  • Sampling Error: The difference between the results of a sample and those of the population.

*Additional info: Some examples and tables have been expanded for clarity and completeness. The summary tables are reconstructed based on the context of the original slides.*

Pearson Logo

Study Prep