Question 1

What is the hypergeometric distribution?

Accepted Answer

The hypergeometric distribution models the probability of drawing a specific number of successes from a finite population without replacement. Unlike the binomial distribution which assumes replacement (or infinite population), the hypergeometric distribution accounts for the changing probability as items are drawn. A classic example is drawing cards from a deck: what is the probability of getting exactly 2 hearts in a 5-card hand from a standard 52-card deck? The distribution is defined by three parameters: the population size N, the number of success states K in the population, and the number of draws n. Each draw changes the composition of the remaining population.

Question 2

How is the hypergeometric distribution different from the binomial distribution?

Accepted Answer

The key difference is sampling with versus without replacement. The binomial distribution assumes each trial is independent with a constant probability of success, which applies when sampling with replacement or from an effectively infinite population. The hypergeometric distribution accounts for the fact that each draw changes the remaining population composition. For example, after drawing a heart from a deck, the probability of the next card being a heart changes from 13/52 to 12/51. When the population is very large relative to the sample size, the hypergeometric distribution approximates the binomial distribution because removing one item barely changes the probabilities.

Question 3

What is the formula for the hypergeometric probability?

Accepted Answer

The probability mass function is P(X = k) = C(K,k) * C(N-K, n-k) / C(N,n), where C(a,b) is the binomial coefficient (a choose b). Here N is the total population, K is the number of success items, n is the sample size, and k is the desired number of successes. The numerator counts the favorable outcomes: C(K,k) ways to choose k successes from K success items, times C(N-K, n-k) ways to choose the remaining n-k items from the N-K non-success items. The denominator C(N,n) counts all possible ways to draw n items from N. This ratio gives the exact probability.

Question 4

What are common applications of the hypergeometric distribution?

Accepted Answer

The hypergeometric distribution appears in quality control when inspecting a batch of products without replacement, such as testing 10 items from a batch of 100 to check for defects. It is used in ecology for capture-recapture methods to estimate animal populations. In card games, it calculates the probability of specific hands. In genetics, it models the likelihood of observing a certain number of genes of interest in a random sample. Statistical tests like Fisher's exact test use the hypergeometric distribution for analyzing contingency tables, especially with small sample sizes where chi-squared approximations are unreliable.

Hypergeometric Distribution Calculator

Formula

Worked Examples

Example 1: Drawing Hearts from a Deck

Example 2: Quality Control Inspection

Frequently Asked Questions

What is the hypergeometric distribution?

How is the hypergeometric distribution different from the binomial distribution?

What is the formula for the hypergeometric probability?

What are common applications of the hypergeometric distribution?

References