Skip to main content
Back

Introduction to Statistics: Key Concepts and Types of Data

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Introduction to Statistics

Statistical and Critical Thinking

Statistics is the science of collecting, analyzing, interpreting, and presenting data. A major use of statistics is to collect and use sample data to make conclusions about populations. Critical thinking in statistics involves evaluating the validity of statistical methods and the reliability of conclusions drawn from data.

  • Population: The entire group of individuals or items that is the subject of a statistical study.

  • Sample: A subset of the population, selected for analysis to draw conclusions about the population.

  • Key Point: Statistical inference allows us to make generalizations about a population based on sample data.

Parameters and Statistics

Understanding the distinction between parameters and statistics is fundamental in statistics.

  • Parameter: A numerical measurement describing some characteristic of a population.

  • Statistic: A numerical measurement describing some characteristic of a sample.

  • Example: The average height of all students in a university is a parameter; the average height of a sample of 100 students is a statistic.

Types of Data

Data can be classified based on their nature and measurement. The two main types are quantitative and categorical data.

  • Quantitative Data: Consists of numbers representing counts or measurements. Examples include exam scores, height, temperature, weight, age, time, and ranking.

  • Categorical Data: Consists of names or labels (not numbers) that represent categories. Examples include gender, school type, sex, blood type, and car make/model.

Types of Variables

Variables are characteristics or properties that can take on different values. They are classified as follows:

Variable Type

Description

Examples

Qualitative (Categorical)

Describes qualities or categories

Gender, blood type, school type

Quantitative

Describes numerical values

Height, age, exam scores

Discrete (Quantitative)

Countable values, finite or countable

Number of coin tosses, number of people

Continuous (Quantitative)

Infinitely many possible values, not countable

Height, weight, time

Discrete and Continuous Data

Quantitative data can be further classified as discrete or continuous:

  • Discrete Data: Result when the data values are quantitative and the number of values is finite or countable. Example: Number of exam questions answered correctly, number of people in a room.

  • Continuous Data: Result from infinitely many possible quantitative values, where the collection of values is not countable. Example: The length of a desk, time taken to complete a task.

Big Data and Data Science

Modern statistics often deals with very large and complex data sets, known as big data. The analysis of big data may require advanced software and parallel computing.

  • Big Data: Data sets so large and complex that traditional software tools are insufficient for analysis.

  • Data Science: An interdisciplinary field involving statistics, computer science, software engineering, and other relevant domains to analyze and interpret big data.

Missing Data

Missing data occurs when some values in a data set are not recorded. Understanding the nature of missing data is important for proper analysis.

  • Missing Completely at Random (MCAR): The likelihood of a value being missing is independent of its value or any other values in the data set.

  • Missing Not at Random (MNAR): The missing value is related to the reason it is missing.

Correcting for Missing Data

There are two common methods for handling missing data:

  • Delete Cases: Remove all subjects with any missing values from the analysis.

  • Impute Missing Values: Substitute missing data values with estimated or predicted values.

Summary Table: Types of Data

Type

Description

Examples

Quantitative

Numerical values

Height, age, exam scores

Discrete

Countable, finite values

Number of people, coin tosses

Continuous

Infinitely many values, not countable

Weight, time, temperature

Categorical

Names or labels

Gender, blood type, car model

Key Formulas

  • Population Mean (Parameter):

  • Sample Mean (Statistic):

Additional info: The formulas for mean are provided for context, as they are fundamental to understanding parameters and statistics in data analysis.

Pearson Logo

Study Prep