A probability distribution describes how the values of a random variable are distributed. Choosing the right distribution for your data is crucial — using the wrong one leads to incorrect probabilities and invalid statistical tests.

Discrete vs Continuous Distributions

Discrete distributions: The variable takes countable values (0, 1, 2, ...). Example: number of heads in 10 flips.

Continuous distributions: The variable can take any value in an interval. Example: height, weight, temperature.

Key Discrete Distributions

Binomial Distribution B(n, p)

Counts the number of successes in n independent trials, each with probability p. P(X=k) = C(n,k) × pᵏ × (1−p)ⁿ⁻ᵏ. Mean = np, Variance = np(1−p).

Use when: Fixed n trials, binary outcome (success/failure), constant p, independent trials.
Examples: Number of heads in 20 coin flips, number of defective items in a batch of 50, number of patients who respond to treatment in a clinical trial of 100.

Poisson Distribution Poisson(λ)

Counts events occurring in a fixed interval when events happen at constant average rate λ. P(X=k) = (λᵏ × e⁻λ)/k!. Mean = Variance = λ.

Use when: Counting rare events in time/space, events are independent, constant average rate.
Examples: Number of calls to a call centre per hour, number of accidents per month at a junction, number of typos per page.

Hypergeometric Distribution

Like binomial but for sampling WITHOUT replacement from a finite population. Use when the sample is a significant fraction of the population (>5%).

Key Continuous Distributions

Normal Distribution N(μ, σ²)

The most important distribution in statistics. Bell-shaped, symmetric, defined by mean and variance. Foundation of the Central Limit Theorem and most parametric tests.

Examples: Height, IQ scores, measurement errors, exam scores.

T-Distribution t(df)

Like the normal but with heavier tails. Used when σ is unknown and sample size is small. As df → ∞, t → normal.

Use for: T-tests, confidence intervals when σ is unknown.

Chi-Square Distribution χ²(df)

Sum of squared standard normal variables. Always non-negative, right-skewed. Used in goodness-of-fit tests, tests of independence, and variance tests.

F-Distribution F(df₁, df₂)

Ratio of two chi-square distributions. Used in ANOVA, regression F-tests, and comparing variances.

Exponential Distribution Exp(λ)

Time between events in a Poisson process. Memoryless: P(T > t+s | T > s) = P(T > t). Mean = 1/λ.

Examples: Time between customer arrivals, time until machine failure, radioactive decay times.

Uniform Distribution U(a, b)

All values in [a, b] equally likely. Mean = (a+b)/2. Used in simulation and as a reference distribution.

Choosing the Right Distribution

Data TypeQuestionDistribution
Count of successes in n trialsHow many heads in 20 flips?Binomial
Count of rare eventsCalls per hour?Poisson
Continuous, symmetricHeights, test scores?Normal
Time until eventWhen does machine fail?Exponential
Testing means (σ unknown)T-test p-value?T-distribution
Testing variances/ANOVAF-test p-value?F-distribution
Testing categorical dataChi-square p-value?Chi-square

Calculate probabilities for all these distributions using our free statistics calculators.