The hypergeometric distribution is a statistical model used when the trials are dependent, meaning the outcome of one trial affects the outcomes of subsequent trials. This contrasts with the binomial distribution, where trials are independent and have a constant probability of success. In the hypergeometric scenario, we draw items from a finite population without replacement, which alters the composition of the population with each draw.
To illustrate, consider an example where we draw marbles from a bag containing two red and four blue marbles. We want to determine the probability of drawing exactly one red marble in three draws, both with and without replacement. In the case of drawing with replacement, each draw is independent, and the probability of success remains constant. Here, we can apply the binomial distribution formula:
For a binomial distribution, the probability of getting exactly \( x \) successes in \( n \) trials is given by:
\[ P(X = x) = \binom{n}{x} p^x (1-p)^{n-x} \]
Where \( p \) is the probability of success on each trial. In our example, the probability of drawing one red marble with replacement simplifies to \( \frac{4}{9} \).
However, when drawing without replacement, we must use the hypergeometric distribution. The relevant parameters are:
- \( N \): Total number of items (6 marbles)
- \( n \): Number of draws (3)
- \( R \): Number of successes in the population (2 red marbles)
- \( r \): Number of successes we want (1 red marble)
The hypergeometric probability formula is expressed as:
\[ P(X = x) = \frac{\binom{R}{x} \binom{N-R}{n-x}}{\binom{N}{n}} \]
In our case, we calculate:
\[ P(X = 1) = \frac{\binom{2}{1} \binom{4}{2}}{\binom{6}{3}} \]
Calculating each term, we find:
- \( \binom{2}{1} = 2 \)
- \( \binom{4}{2} = 6 \)
- \( \binom{6}{3} = 20 \)
Thus, the probability becomes:
\[ P(X = 1) = \frac{2 \times 6}{20} = \frac{12}{20} = \frac{3}{5} \]
This result indicates that the probability of drawing exactly one red marble in three draws without replacement is \( \frac{3}{5} \). Understanding the differences between the binomial and hypergeometric distributions is crucial for accurately modeling scenarios where the independence of trials cannot be assumed.
