Skip to main content
Back

Business Statistics: Foundational Concepts and Descriptive Measures

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Defining and Collecting Data

Introduction to Business Analytics and Statistics

Business analytics involves using statistical methods and technologies to analyze historical data, gain insights, and improve decision-making. Statistics is a branch of mathematics focused on collecting, analyzing, interpreting, and presenting data. The DCOVA framework guides statistical analysis: Define, Collect, Organize, Visualize, and Analyze.

  • Descriptive Statistics: Summarize and present data (e.g., mean, median, variance).

  • Inferential Statistics: Draw conclusions about populations from samples (e.g., estimation, hypothesis testing).

  • Predictive Statistics: Use models to forecast future outcomes.

Variables and Types of Data

Variables are characteristics or attributes measured in a study. Data are the observed values of these variables. Variables can be classified as categorical (qualitative) or numerical (quantitative), with further subdivisions:

  • Categorical (Qualitative): Nominal (no order), Ordinal (ordered categories)

  • Numerical (Quantitative): Discrete (countable values), Continuous (any value within a range)

Classification of variables

Levels of Measurement

The level of measurement determines the mathematical properties of data:

  • Nominal: Categories without order (e.g., gender, eye color)

  • Ordinal: Ordered categories (e.g., satisfaction ratings)

  • Interval: Numeric, equal intervals, no true zero (e.g., temperature in Celsius)

  • Ratio: Numeric, equal intervals, true zero (e.g., height, weight, revenue)

Sampling Methods

Sampling is the process of selecting a subset of a population for analysis. There are two main types:

  • Non-probability Samples: Judgement, convenience, quota, self-selection (not random, may introduce bias)

  • Probability Samples: Simple random, stratified, systematic, cluster (random selection, allows inference to population)

Types of samples

Tabular and Visual Summarization of Variables

Measures of Central Tendency

Central tendency describes the center of a data set. The three main measures are:

  • Mean: Arithmetic average, sensitive to outliers.

  • Median: Middle value when data are ordered, robust to outliers.

  • Mode: Most frequent value, useful for categorical data.

Calculation of meanCalculation of median

Measures of Variation

Variation measures the spread or dispersion of data values. Key measures include:

  • Range: Difference between largest and smallest values. Sensitive to outliers.

  • Variance: Average squared deviation from the mean.

  • Standard Deviation: Square root of variance, in same units as data.

  • Coefficient of Variation (CV): Standard deviation divided by mean, expressed as a percentage. Useful for comparing variability across datasets with different units.

Measures of variationRange and sensitivity to outliersSample variance formulaSample standard deviation formulaCoefficient of variation formula

Descriptive Measures for Populations vs. Samples

Population parameters use Greek letters; sample statistics use Latin letters. For example:

Measure

Sample Statistic

Population Parameter

Mean

\( \bar{x} \)

\( \mu \)

Variance

\( s^2 \)

\( \sigma^2 \)

Standard Deviation

\( s \)

\( \sigma \)

Comparison of sample statistics and population parameters

Numerical Descriptive Measures

Standard Deviation and Variance

The formulas for standard deviation and variance differ for populations and samples:

  • Population Standard Deviation:

  • Sample Standard Deviation:

Standard deviation for a sample and population

Geometric Mean and Rate of Return

The geometric mean is used for data combined multiplicatively, such as growth rates. The geometric rate of return is:

Geometric rate of return formula

Z-Scores

A Z-score standardizes a data value by expressing it in terms of standard deviations from the mean:

Z-score formula

Skewness and Kurtosis

These statistics describe the shape of a distribution:

  • Skewness: Measures asymmetry. Left-skewed (mean < median), right-skewed (mean > median), symmetric (mean = median).

  • Kurtosis: Measures the concentration of values in the tails. High kurtosis indicates heavy tails; low kurtosis indicates light tails.

Skewness formulaSkewness interpretationKurtosis formulaKurtosis interpretation

Quartiles, Interquartile Range, and Boxplots

Quartiles divide data into four equal parts. The interquartile range (IQR) measures the spread of the middle 50% of data:

  • First Quartile (Q1):

  • Second Quartile (Median, Q2):

  • Third Quartile (Q3):

  • IQR:

Quartiles and IQRBoxplot and five-number summaryBoxplot shapes and distribution

Probability and Probability Distributions

Basic Probability Concepts

Probability quantifies the likelihood of an event, ranging from 0 (impossible) to 1 (certain). Events can be simple or joint, and probabilities can be assessed a priori, empirically, or subjectively.

  • Marginal Probability: Probability of a single event.

  • Joint Probability: Probability of two or more events occurring together.

  • Conditional Probability: Probability of one event given another has occurred.

Probability Rules and Tables

Key probability rules include the addition rule, multiplication rule, and Bayes' theorem. Probabilities are often organized in contingency tables for clarity.

Event

B1

B2

Total

A1

P(A1 and B1)

P(A1 and B2)

P(A1)

A2

P(A2 and B1)

P(A2 and B2)

P(A2)

Total

P(B1)

P(B2)

1

Contingency table for joint and marginal probabilities

Additional info:

  • All formulas are provided in LaTeX format for clarity and academic rigor.

  • Images are included only when directly relevant to the explanation and reinforce the educational content.

Pearson Logo

Study Prep