Skip to main content
Back

Introduction to Statistics: Key Concepts, Sampling Methods, and Types of Variables

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Introduction to Statistics

Overview of Statistics

Statistics is a branch of mathematics focused on the collection, analysis, interpretation, and presentation of numerical data. It provides essential tools for making informed decisions based on data.

  • Collection: Gathering data through experiments, surveys, or observational studies.

  • Analysis: Examining data to identify patterns, trends, and relationships.

  • Interpretation: Drawing conclusions from analyzed data.

  • Presentation: Communicating findings using reports, graphs, and tables.

Population vs. Sample

Definitions and Importance

  • Population: The entire group of individuals or objects of interest in a study.

  • Sample: A subset of the population selected for actual analysis.

Sampling is crucial because studying an entire population is often impractical due to size, cost, or destructive testing.

Variables vs. Data

Understanding Variables and Data

  • Variables: Characteristics or properties that can take on different values (e.g., blood pressure, gender, body weight).

  • Data: Actual observed values of variables (e.g., 120 mmHg, Female, 48 kg).

Descriptive vs. Inferential Statistics

Key Differences and Applications

  • Descriptive Statistics: Techniques used to summarize and describe the main features of a dataset. Examples include measures of central tendency (mean, median) and measures of dispersion (range, standard deviation).

  • Inferential Statistics: Methods that use sample data to make generalizations or predictions about a population. Common techniques include estimation (confidence intervals) and hypothesis testing.

Example:

  • Descriptive: Calculating the average score of students in a class.

  • Inferential: Using a sample of students to estimate the average score of all students in the university.

Sampling: Concepts and Methods

What is Sampling?

Sampling is the process of selecting a portion of the population to collect information, aiming to generalize findings to the entire population.

  • Used when the population is too large, testing is destructive, or resources are limited.

Sample Size and Sampling Method

  • Sample Size: Larger samples generally provide more accurate results. Minimum sample size can be calculated statistically.

  • Sampling Method: The method must ensure the sample is representative of the population. Random sampling is preferred for representativeness.

Types of Sampling Methods

  • Random Sampling: Every subject has an equal chance of being selected. Types include:

    • Simple Random Sampling: Subjects are numbered and selected randomly (e.g., using a random number generator).

    • Systematic Sampling: Every nth subject is selected after numbering the population sequentially.

    • Cluster Sampling: The population is divided into groups (clusters), some clusters are randomly selected, and all subjects within chosen clusters are included.

    • Multi-Stage Sampling: Clusters are selected, then a random sample is taken within each selected cluster.

    • Stratified Sampling: The population is divided into strata (homogeneous groups), and random samples are taken from each stratum.

  • Non-Random Sampling: Not every subject has an equal chance of selection. Types include:

    • Convenience Sampling: Selecting easily available subjects (e.g., friends, first customers).

    • Quota Sampling: Similar to stratified sampling, but subjects within strata are chosen by convenience, not randomly.

Types of Variables

Classification of Variables

  • Qualitative (Categorical) Variables:

    • Nominal: Categories without order (e.g., gender, blood type).

    • Ordinal: Categories with a meaningful order but not equal intervals (e.g., cancer stage, pain level).

  • Quantitative (Numerical) Variables:

    • Discrete: Countable values, usually integers (e.g., number of children).

    • Continuous: Any value within a range, including fractions (e.g., weight, age).

Summary Table: Types of Variables

Type

Description

Examples

Nominal

Categories are mutually exclusive and unordered

Gender, blood group, eye color

Ordinal

Categories are mutually exclusive and ordered

Cancer stage, education level

Discrete

Integer values (counts)

Number of children, days sick per year

Continuous

Any value in a range (measured)

Weight, height, age

Practice Questions and Applications

Sample Questions

  • Which of the following is an example of non-random sampling? Answer: Convenience sampling

  • The variable "level of satisfaction" (with categories like very satisfied, satisfied, etc.) is an example of: Answer: Ordinal variable

  • Which of the following are examples of random sampling? Answer: Stratified sampling, Systematic sampling, Multi-stage sampling

  • What is the application of inferential statistics? Answer: To make assumptions about a population based on the description of data

Key Formulas and Concepts

Measures of Central Tendency and Dispersion

  • Mean (Arithmetic Average):

  • Median: The middle value when data are ordered.

  • Range: Difference between the maximum and minimum values.

  • Standard Deviation:

Additional info: These formulas are foundational for descriptive statistics and will be used throughout the course.

Pearson Logo

Study Prep