BackSampling Distributions and Estimators: Essentials of Statistics Chapter 6 Study Notes
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Sampling Distributions and Estimators
Introduction
Sampling distributions are fundamental in inferential statistics, allowing us to understand how sample statistics behave when repeatedly drawn from a population. This topic covers the behavior of sample proportions, means, and variances, and introduces the concepts of estimators, including unbiased and biased estimators.
General Behavior of Sampling Distributions
When samples of the same size are taken from the same population, certain properties emerge:
Sample proportions and sample means tend to be normally distributed.
The mean of all sample proportions (or sample means) equals the population proportion (or population mean).
Sampling Distributions for Proportions, Means, and Variances
Statistic | Sampling Procedure | Distribution Type |
|---|---|---|
Proportion () | Randomly select n values and find the proportion for each sample | Normal distribution |
Mean () | Randomly select n values and find the mean for each sample | Normal distribution |
Variance () | Randomly select n values and find the variance for each sample | Skewed distribution (right-skewed) |
Sampling Distribution of a Statistic
The sampling distribution of a statistic is the distribution of all values of the statistic when all possible samples of the same size n are taken from the same population.
Sampling Distribution of the Sample Proportion
The sampling distribution of the sample proportion () is the distribution of sample proportions for all samples of size n from the population.
Notation:
x: number of successes
n: sample size
N: population size
: sample proportion
: population proportion
Behavior:
The distribution of sample proportions tends to be normal.
The mean of all sample proportions equals the population proportion .
Example: Rolling a die 5 times and finding the proportion of odd numbers (1, 3, 5). Repeating this process many times, the sample proportions are approximately normally distributed with a mean of 0.5 (since half the numbers on a die are odd).
Sampling Distribution of the Sample Mean
The sampling distribution of the sample mean () is the distribution of all possible sample means for samples of size n from the population.
The distribution of sample means tends to be normal.
The mean of all sample means equals the population mean .
Example: Rolling a die 5 times and finding the mean. Repeating this process many times, the sample means are approximately normally distributed with a mean of 3.5 (the mean of the numbers 1 through 6).
Sampling Distribution of the Sample Variance
The sampling distribution of the sample variance () is the distribution of all sample variances for samples of size n from the population.
The distribution of sample variances is skewed to the right, not normal.
The mean of all sample variances equals the population variance .
Example: Rolling a die 5 times and finding the variance. Repeating this process many times, the sample variances have a mean of 2.9 (the population variance for a fair die), and the distribution is right-skewed.
Estimators
An estimator is a statistic used to infer or estimate the value of a population parameter.
Unbiased Estimator
An unbiased estimator is a statistic whose sampling distribution has a mean equal to the corresponding population parameter.
Unbiased estimators:
Sample proportion ()
Sample mean ()
Sample variance ()
Biased Estimator
A biased estimator is a statistic whose sampling distribution does not have a mean equal to the population parameter.
Biased estimators:
Median
Range
Sample standard deviation ()
Important Note: The sample standard deviation does not target the population standard deviation , but the bias is small for large samples. Thus, is often used to estimate even though it is technically a biased estimator.
Summary Table: Unbiased vs. Biased Estimators
Estimator | Unbiased? | Targets Parameter? |
|---|---|---|
Sample Proportion () | Yes | Population Proportion () |
Sample Mean () | Yes | Population Mean () |
Sample Variance () | Yes | Population Variance () |
Sample Median | No | Does not target |
Sample Range | No | Does not target |
Sample Standard Deviation () | No (but often used) | Population Standard Deviation () |
Key Formulas
Sample Proportion:
Population Proportion:
Sample Mean:
Sample Variance:
Applications and Importance
Understanding sampling distributions is essential for making inferences about populations from samples.
Unbiased estimators provide reliable estimates of population parameters, which is crucial for hypothesis testing and confidence intervals.
Recognizing the difference between unbiased and biased estimators helps in selecting appropriate statistics for estimation.
Additional info: The examples provided use dice rolls to illustrate the behavior of sampling distributions, which is a common pedagogical approach in introductory statistics. The notes are based on slides from "Essentials of Statistics" by Mario F. Triola, Chapter 6, Section 6-3.