Distribution of the Sample Proportion: Binomial and Bernoulli Distributions
Distribution of the Sample Proportion
Introduction
In statistics, researchers are often interested in studying a process or population to make inferences about a characteristic of interest. When the population is too large or infinite, a sample is taken to estimate the population parameters. The sample proportion is a key statistic used to estimate the proportion of individuals in a population with a given characteristic. This chapter explores the distribution of the sample proportion, its relationship to the binomial and Bernoulli distributions, and how sample size affects the precision of estimates.
Random Sample
Definition and Importance
A random sample is a subset of individuals chosen from a population such that every member has an equal chance of being selected. Because selection does not systematically favor any group, the sample is representative of the population on average, which is what allows valid statistical inference.
Random sampling reduces bias and increases the reliability of estimates.
Measurements on individuals in a random sample are assumed to be independent and identically distributed.
Random samples are essential for applying probability models to sample statistics.
Example: A researcher selects 100 individuals at random from a city to estimate the proportion who support a new policy.
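The example above can be sketched in a few lines of Python. This is only an illustration: the city size (50,000 residents) and the true level of support (30%) are hypothetical values chosen for the sketch, not figures from the notes.

```python
import random

random.seed(0)  # fixed seed so the sketch is reproducible

# Hypothetical city: 50,000 residents, 30% of whom support the policy (1 = supports).
population = [1] * 15_000 + [0] * 35_000

# Simple random sample without replacement: every resident is equally likely to be chosen.
sample = random.sample(population, 100)

# The sample proportion is the fraction of sampled residents who support the policy.
p_hat = sum(sample) / len(sample)
print(f"Sample proportion of supporters: {p_hat:.2f}")
```

Because the population is very large relative to the sample, the 100 draws behave approximately like independent trials with the same success probability.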
Bernoulli Distribution
Definition and Properties
The Bernoulli distribution models a random experiment with only two possible outcomes: "success" (usually coded as 1) and "failure" (coded as 0). If the probability of success is $p$, then the probability mass function of the Bernoulli random variable $X$ is:
$$P(X = x) = p^{x}(1-p)^{1-x}, \quad x \in \{0, 1\}$$
The mean and variance of a Bernoulli random variable are:
Mean: $E(X) = p$
Variance: $\mathrm{Var}(X) = p(1-p)$
Example: The probability that a randomly selected patient responds to a drug is 0.8. $X$ is 1 if the patient responds, 0 otherwise, so $X \sim \text{Bernoulli}(0.8)$.
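As a quick check of the Bernoulli formulas on the drug-response example, here is a minimal sketch. The function names are illustrative choices, not part of the notes.

```python
def bernoulli_pmf(x: int, p: float) -> float:
    """Return P(X = x) for a Bernoulli(p) variable; zero for any x other than 0 or 1."""
    if x == 1:
        return p
    if x == 0:
        return 1.0 - p
    return 0.0


def bernoulli_mean(p: float) -> float:
    """Mean of a Bernoulli(p) variable: E(X) = p."""
    return p


def bernoulli_variance(p: float) -> float:
    """Variance of a Bernoulli(p) variable: Var(X) = p(1 - p)."""
    return p * (1.0 - p)


p = 0.8  # probability that a patient responds to the drug
print(f"P(X = 1) = {bernoulli_pmf(1, p):.2f}")
print(f"P(X = 0) = {bernoulli_pmf(0, p):.2f}")
print(f"E(X)     = {bernoulli_mean(p):.2f}")
print(f"Var(X)   = {bernoulli_variance(p):.2f}")
```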
The Family of Binomial Distributions
Definition and Properties
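The binomial distribution generalizes the Bernoulli distribution to $n$ independent trials, each with probability $p$ of success. The binomial random variable $Y$ counts the number of successes in $n$ trials.
Probability mass function: $P(Y = y) = \binom{n}{y} p^{y}(1-p)^{n-y}$ for $y = 0, 1, \dots, n$
Mean: $E(Y) = np$
Variance: $\mathrm{Var}(Y) = np(1-p)$
Standard deviation: $\mathrm{SD}(Y) = \sqrt{np(1-p)}$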
Example: In a sample of 10 children, each has a 0.4 probability of inheriting a trait. The probability that exactly 3 children inherit the trait is:
$$P(Y = 3) = \binom{10}{3}(0.4)^{3}(0.6)^{7} \approx 0.215$$
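The binomial probability in this example can be verified numerically. The sketch below uses only the Python standard library; `binomial_pmf` is an illustrative helper name, and the count of children who inherit the trait is treated as a binomial variable with $n = 10$ and $p = 0.4$.

```python
from math import comb, sqrt


def binomial_pmf(k: int, n: int, p: float) -> float:
    """Return P(Y = k) when Y counts successes in n independent trials with success probability p."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


n, p = 10, 0.4  # 10 children, each inherits the trait with probability 0.4

prob_3 = binomial_pmf(3, n, p)
mean = n * p
variance = n * p * (1 - p)
sd = sqrt(variance)

print(f"P(Y = 3) = {prob_3:.4f}")  # 0.2150
print(f"mean = {mean:.1f}, variance = {variance:.1f}, sd = {sd:.3f}")
```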
Binomial Distribution Table
| Parameter | Symbol | Formula |
|---|---|---|
| Number of trials | $n$ | - |
| Probability of success | $p$ | - |
| Mean | $E(Y)$ | $np$ |
| Variance | $\mathrm{Var}(Y)$ | $np(1-p)$ |
| Standard deviation | $\mathrm{SD}(Y)$ | $\sqrt{np(1-p)}$ |
Distribution of the Sample Proportion
Definition and Calculation
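The sample proportion $\hat{p}$ is the proportion of individuals in the sample with the characteristic of interest. If $Y$ is the number of successes in a sample of size $n$, then:
$$\hat{p} = \frac{Y}{n}$$
The sample proportion is a random variable with its own distribution, mean, and variance:
Mean: $E(\hat{p}) = p$
Variance: $\mathrm{Var}(\hat{p}) = \dfrac{p(1-p)}{n}$
Standard deviation: $\mathrm{SD}(\hat{p}) = \sqrt{\dfrac{p(1-p)}{n}}$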
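These results follow directly from the binomial mean and variance. A short derivation, writing $Y$ for the binomial count of successes (the symbol $Y$ is a notational assumption of this sketch) and $\hat{p} = Y/n$ for the sample proportion:

```latex
\begin{aligned}
E(\hat{p}) &= E\!\left(\frac{Y}{n}\right) = \frac{E(Y)}{n} = \frac{np}{n} = p, \\
\operatorname{Var}(\hat{p}) &= \operatorname{Var}\!\left(\frac{Y}{n}\right) = \frac{\operatorname{Var}(Y)}{n^{2}} = \frac{np(1-p)}{n^{2}} = \frac{p(1-p)}{n}, \\
\operatorname{SD}(\hat{p}) &= \sqrt{\operatorname{Var}(\hat{p})} = \sqrt{\frac{p(1-p)}{n}}.
\end{aligned}
```

The variance step uses the rule $\operatorname{Var}(cY) = c^{2}\operatorname{Var}(Y)$ with $c = 1/n$, which is why $n$ appears in the denominator only once after simplification.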
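Example: In a sample of 10 individuals with success probability $p$, the sample proportion equals 0.2 exactly when there are 2 successes, so the probability that the sample proportion is 0.2 is:
$$P(\hat{p} = 0.2) = P(Y = 2) = \binom{10}{2} p^{2}(1-p)^{8}$$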
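Since the sample proportion takes the value $k/n$ exactly when the binomial count equals $k$, its whole distribution can be tabulated from binomial probabilities. The sketch below does this for $n = 10$; the value $p = 0.4$ is a hypothetical choice for illustration only and is not necessarily the value used in the example, and `sample_proportion_pmf` is an illustrative helper name.

```python
from math import comb


def sample_proportion_pmf(n: int, p: float) -> dict[float, float]:
    """Map each possible value k/n of the sample proportion to its probability P(Y = k)."""
    return {k / n: comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n + 1)}


n = 10
p = 0.4  # hypothetical success probability, assumed for illustration

dist = sample_proportion_pmf(n, p)

# P(p_hat = 0.2) is the probability of exactly 2 successes out of 10.
print(f"P(p_hat = 0.2) = {dist[2 / n]:.4f}")  # 0.1209

# The mean of the distribution recovers p, illustrating unbiasedness.
mean_p_hat = sum(value * prob for value, prob in dist.items())
print(f"E(p_hat) = {mean_p_hat:.4f}")
```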
Effect of Sample Size
As the sample size $n$ increases, the variance of the sample proportion, $\mathrm{Var}(\hat{p}) = p(1-p)/n$, decreases, resulting in more precise estimates of $p$.
For ,
For ,
Graph: The variance of $\hat{p}$ decreases as $n$ increases, as shown in the provided figure.
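The effect of sample size can be seen by evaluating the variance formula $p(1-p)/n$ at several sample sizes. In this sketch both the success probability $p = 0.5$ and the sample sizes are hypothetical values chosen for illustration; they are not the values from the notes or the figure.

```python
from math import sqrt


def var_p_hat(p: float, n: int) -> float:
    """Variance of the sample proportion for success probability p and sample size n."""
    return p * (1 - p) / n


p = 0.5  # hypothetical success probability, assumed for illustration
sample_sizes = (10, 100, 1000)  # hypothetical sample sizes

variances = [var_p_hat(p, n) for n in sample_sizes]
for n, v in zip(sample_sizes, variances):
    print(f"n = {n:>4}: Var(p_hat) = {v:.5f}, SD(p_hat) = {sqrt(v):.4f}")
```

Multiplying the sample size by 10 divides the variance by 10, but the standard deviation only by $\sqrt{10} \approx 3.16$, so precision improves more slowly than the sample grows.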
Conclusion
In summary, the distribution of the sample proportion is a key tool for making inferences about the proportion of individuals in a population with a given characteristic. Under the assumption of independence and a common Bernoulli distribution, $\hat{p}$ is an unbiased estimator of $p$. Increasing the sample size reduces the variance of $\hat{p}$, leading to more precise estimates.
Key Formulas
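Bernoulli mean: $E(X) = p$
Bernoulli variance: $\mathrm{Var}(X) = p(1-p)$
Binomial mean: $E(Y) = np$
Binomial variance: $\mathrm{Var}(Y) = np(1-p)$
Sample proportion: $\hat{p} = Y/n$
Sample proportion mean: $E(\hat{p}) = p$
Sample proportion variance: $\mathrm{Var}(\hat{p}) = p(1-p)/n$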
Summary Table: Distributions and Properties
| Distribution | Random Variable | Mean | Variance |
|---|---|---|---|
| Bernoulli | $X$ | $p$ | $p(1-p)$ |
| Binomial | $Y$ | $np$ | $np(1-p)$ |
| Sample Proportion | $\hat{p} = Y/n$ | $p$ | $p(1-p)/n$ |
Additional info: The notes include exercises and calculator instructions for computing binomial probabilities, which are useful for practical applications in statistics.