Skip to main content
Back

Fundamental Concepts in Statistics: Descriptive, Inferential, Sampling Methods, and Random Sampling

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Descriptive and Inferential Statistics

Definitions and Distinctions

Statistics is broadly divided into two main branches: descriptive statistics and inferential statistics. Understanding the difference between these branches is essential for interpreting statistical studies and data analyses.

  • Descriptive Statistics: Involves organizing, summarizing, and displaying data. It describes the main features of a dataset, often through tables, charts, and summary measures such as mean, median, and mode.

  • Inferential Statistics: Uses data from a sample to make generalizations or draw conclusions about a larger population. It involves estimation, hypothesis testing, and prediction.

Example:

  • Given birth and birth rate data for the years 1990-1994, summarizing these rates is descriptive statistics.

  • Estimating the percentage of people lacking health insurance in the U.S. based on a sample is inferential statistics.

Table: Birth and Birth Rate Data (Descriptive Example)

Year

Births

Birth Rate

1990

4,158,212

16.7

1991

4,110,907

16.1

1992

4,065,014

15.5

1993

4,000,240

15.3

1994

3,979,000

15.2

Table: Percentage of People Lacking Health Insurance (Inferential Example)

Age

Percentage Not Covered

25-34

24.2

35-44

19.9

45-54

15.5

55-64

14.5

Populations and Samples

Definitions

In statistics, it is crucial to distinguish between a population and a sample:

  • Population: The entire group of individuals or items that is the subject of a statistical study.

  • Sample: A subset of the population, selected for analysis to draw conclusions about the population.

Example:

  • Population: All 400 Ethernet cables in a box.

  • Sample: The 10 cables tested for reliability.

Sampling Methods

Types of Sampling

Sampling is the process of selecting a subset of individuals from a population to estimate characteristics of the whole population. Several sampling methods exist, each with its own advantages and limitations.

  • Simple Random Sampling: Every member of the population has an equal chance of being selected. Selection is typically done using random numbers.

  • Cluster Sampling: The population is divided into clusters (often geographically), and entire clusters are randomly selected. All members of chosen clusters are included in the sample.

  • Stratified Sampling: The population is divided into strata (groups) based on a characteristic (e.g., age, grade), and random samples are taken from each stratum.

Examples:

  • Cluster Sampling: An education researcher randomly selects 38 schools from a district and interviews all teachers at those schools.

  • Stratified Sampling: A school administrator selects a random sample of students from each grade level (freshmen, sophomores, juniors, seniors).

Random Sampling and Simple Random Sampling

Definitions and Properties

Random Sampling ensures that every member of the population has a known, non-zero chance of being selected. Simple Random Sampling is a specific type of random sampling where every possible sample of a given size has an equal chance of being chosen.

  • Random Sample: All individuals have the same probability of selection.

  • Simple Random Sample: Every possible group of n individuals has an equal chance of being selected.

Example:

  • If a manager selects 20 employees at random from a group of 100, each employee has a 1 in 5 (20 in 100) chance of being selected. This is a random sample.

  • If the selection process does not allow for all possible combinations (e.g., cannot select 40 software engineers from a group of 100 employees split between software and hardware engineers), it is not a simple random sample.

Calculating Probability in Random Sampling

  • Probability of Selection: If there are N individuals and n are selected, the probability for each is .

  • Example: Selecting 1 out of 20 finalists: or 5% chance.

Using Random Number Tables

Application in Sampling

Random number tables are used to ensure unbiased selection in simple random sampling. The process involves:

  • Assigning numbers to each member of the population.

  • Using a random number table to select members for the sample.

Example Table: Random Numbers

Line

Random Numbers

1

270, 455, 415, 151, 310, 85, 105, 84, 129

2

Additional info: These numbers are used to select sample members from a population of 470 contestants.

Constructing Samples and Sample Spaces

Sample Space in Combinatorial Context

When selecting samples, especially in competitions or experiments, it is important to enumerate all possible samples (sample space).

  • Sample Space: The set of all possible samples that can be drawn from a population.

  • Example: For 5 finalists (Lisa, Melina, Ben, Danny, Ruth), all possible groups of 3 finalists can be listed.

Table: Possible Samples of 3 Finalists

Sample

LMB

LMD

LMR

LBD

LBR

LDR

MBD

MBR

MDR

BDR

Additional info: The above table lists all possible combinations of 3 finalists from a group of 5.

Summary Table: Sampling Methods Comparison

Sampling Method

Description

Example

Simple Random

Every possible sample of n individuals has equal chance

Randomly selecting 20 employees from 100

Cluster

Randomly select entire groups (clusters)

Selecting all teachers from 38 randomly chosen schools

Stratified

Randomly select from each subgroup (stratum)

Selecting students from each grade level

Key Formulas

  • Probability of Selection:

  • Combinations (number of possible samples):

Conclusion

Understanding the distinction between descriptive and inferential statistics, the concepts of populations and samples, and the various sampling methods is foundational for any statistics student. Mastery of these topics enables accurate data analysis and valid generalizations from sample data to broader populations.

Pearson Logo

Study Prep