How to Determine Sample Size

One of the most common questions in research design: "How many participants do I need?" Too few and you miss real effects (low power). Too many and you waste resources. This guide shows you exactly how to calculate the right sample size for different study types.

Why Sample Size Matters

Too small: Low statistical power (high β error). Real effects remain undetected. Study is inconclusive.
Too large: Wasted time, money, and participant burden. Even trivially small, meaningless effects become statistically significant.
Just right: 80–90% power to detect your expected effect size at α = 0.05.

Key Inputs for Sample Size Calculation

Confidence level (1−α): Usually 95% (α = 0.05) or 99% (α = 0.01)
Desired power (1−β): Usually 80% or 90% (β = 0.20 or 0.10)
Effect size: How big a difference you expect or consider meaningful
Population variability (σ): From prior studies or a pilot study
Margin of error (E): For estimation studies — how precise do you need to be?

Formula 1: Estimating a Mean

n = (z × σ / E)²

Example: Estimate mean daily study time for students (σ = 1.5 hours from pilot study). Want 95% CI (z = 1.96) with margin of error E = 0.3 hours.

n = (1.96 × 1.5 / 0.3)² = (9.8)² = 96 students

Formula 2: Estimating a Proportion

n = z² × p(1−p) / E²

Most conservative: Use p = 0.5 when unknown (maximises required n).

Example: Survey to estimate proportion of people who support a policy. 95% CI, E = 0.05, p unknown.

n = (1.96)² × 0.5 × 0.5 / (0.05)² = 3.84 × 0.25 / 0.0025 = 384 respondents

Formula 3: Two-Sample T-Test (Power Analysis)

When comparing two group means with desired power 1−β:

n = 2(z_α/2 + z_β)² × σ² / δ²

Where δ = expected difference between means. This gives the required n per group.

Example: You expect a drug to reduce blood pressure by δ = 8 mmHg (σ = 12 mmHg). α = 0.05, power = 0.80 (z_α/2 = 1.96, z_β = 0.84).

n = 2(1.96 + 0.84)² × 144 / 64 = 2 × 7.84 × 2.25 = 35 per group (70 total)

Practical Adjustments

Expected dropout: Divide by retention rate. If 20% dropout expected: n_adjusted = n / 0.80
Finite population correction: If sample is >5% of population N: n_adjusted = n × N/(N + n − 1)
Design effect (DEFF): For cluster sampling, multiply n by DEFF (typically 1.5–2.5)

Common Sample Size Rules of Thumb

Study Type	Minimum Recommended n
Survey / opinion poll	≥ 384 (95% CI, ±5%)
T-test comparison	≥ 30 per group
Linear regression	≥ 10–20 observations per predictor
Chi-square test	≥ 5 expected per cell
Pilot study	≥ 12–30 per group

Use our free Sample Size Calculator to get exact required n for any combination of confidence level, margin of error, and power.

Why Sample Size Matters

Sample size fundamentally determines the precision and statistical power of your study. Too small a sample produces imprecise estimates, fails to detect real effects, and wastes participants' time and effort on inconclusive research. Too large a sample wastes resources and may detect trivially small effects that have no practical importance. Power analysis determines the minimum sample size needed to achieve desired precision and power before collecting data.

Key Components of Sample Size Calculation

Four quantities are interrelated in sample size calculations. Specify three and calculate the fourth:

Effect size: The minimum difference you want to detect. Based on practical importance, not statistical convention.
Significance level α: Acceptable Type I error rate (typically 0.05)
Power (1−β): Probability of detecting the effect if it exists (typically 0.80 or 0.90)
Sample size n: The unknown to solve for

Sample Size for Estimating a Mean

To achieve margin of error E with confidence level 1−α and known σ: n = (z_α/2 × σ/E)². For 95% confidence (z = 1.96), σ = 10, E = 2: n = (1.96 × 10/2)² = (9.8)² = 96.04, so n = 97. When σ is unknown, estimate from pilot data or literature. Note that n grows with the square of z but inversely with the square of E — halving the margin of error requires quadrupling the sample size.

Sample Size for Comparing Two Means

For a two-sample t-test detecting effect size d = |μ₁−μ₂|/σ with power 1−β at significance α: n per group ≈ 2(z_α/2 + z_β)²/d². For d=0.5 (medium effect), α=0.05, power=0.80: z₀.₀₅/₂ = 1.96, z₀.₂₀ = 0.84. n ≈ 2(1.96+0.84)²/0.25 ≈ 63 per group.

Sample Size for Proportions

To detect a difference between proportions p₁ and p₂: n per group ≈ (z_α/2√(2p̄(1−p̄)) + z_β√(p₁(1−p₁)+p₂(1−p₂)))²/(p₁−p₂)², where p̄=(p₁+p₂)/2. For estimating a single proportion with margin of error E: n = z²p(1−p)/E². Use p=0.5 if unknown (gives maximum/conservative estimate): n = 1.96²×0.25/0.05² = 384 for E=5% margin.

Finite Population Correction

The standard formulas assume an infinite population. When sampling more than 5-10% of a finite population, apply the finite population correction: n_adjusted = n/(1 + n/N), where N is the population size. This correction reduces the required sample size when the population is small — if you survey 100 out of 200 people, you are capturing half the population and need fewer additional observations to achieve the same precision.

Practical Considerations Beyond the Formula

The calculated n is the minimum needed for statistical goals. Practical design requires adjusting for expected non-response (increase n by 1/(1−non-response rate)), dropouts in longitudinal studies, and stratification (may need minimum per stratum). Cluster sampling often requires much larger samples than simple random sampling (the design effect). Budget constraints and ethical limits on participant burden also factor into final decisions.

Why Sample Size Matters

Key Components of Sample Size Calculation

Four quantities are interrelated in sample size calculations. Specify three and calculate the fourth:

Effect size: The minimum difference you want to detect. Based on practical importance, not statistical convention.
Significance level α: Acceptable Type I error rate (typically 0.05)
Power (1−β): Probability of detecting the effect if it exists (typically 0.80 or 0.90)
Sample size n: The unknown to solve for

Sample Size for Estimating a Mean

Sample Size for Comparing Two Means

Sample Size for Proportions

Finite Population Correction

Practical Considerations Beyond the Formula

Worked Example: Clinical Trial Sample Size

A clinical trial compares a new antidepressant to placebo using the Hamilton Depression Rating Scale (HDRS). The minimum clinically meaningful difference is 3 points (δ = 3). Based on prior studies, σ = 8 points. The team wants 90% power and α = 0.05 (two-tailed).

n per group = 2 × (z_α/2 + z_β)² × σ² / δ² = 2 × (1.96 + 1.282)² × 64 / 9 = 2 × 10.52 × 64 / 9 = 2 × 74.8 ≈ 150 per group.

Expecting 15% dropout, enrol 150/0.85 = 177 per group (355 total). This is a substantial investment — but the calculation shows exactly why: detecting a 3-point difference against background noise of σ=8 requires large samples. If the team reduces target power to 80%, n drops to 112 per group (132 with dropout adjustment), saving 90 participants but accepting higher risk of missing a real effect.

Sample Size for Surveys: A Practical Guide

A marketing team wants to estimate the proportion of customers who would pay for a premium subscription. They want a margin of error of ±4% with 95% confidence. Using p = 0.5 (maximum variance, most conservative): n = 1.96² × 0.5 × 0.5 / 0.04² = 3.8416 × 0.25 / 0.0016 = 600.25, so n = 601.

If they have prior data suggesting p ≈ 0.30: n = 1.96² × 0.30 × 0.70 / 0.04² = 3.8416 × 0.21 / 0.0016 = 504. They can save 97 surveys by using prior knowledge. For a finite population of N = 2,000 customers: adjusted n = 601/(1 + 601/2000) = 601/1.3005 = 462. The finite population correction saves an additional 139 surveys — significant when recruiting customers is expensive.

Consequences of Ignoring Sample Size Planning

A 2022 systematic review found that over 40% of published randomised trials in top medical journals were underpowered (less than 80% power). These studies frequently report null results when a real effect exists, leading to incorrect conclusions that treatments are ineffective. Conversely, some industry-funded trials use enormous samples that detect statistically significant but clinically meaningless effects, then advocate for adoption of expensive treatments. Transparent pre-registration of power analyses on platforms like ClinicalTrials.gov and OSF prevents these abuses and improves research credibility.

Calculate Instantly — 100% Free

45 statistics calculators with step-by-step solutions, interactive charts, and PDF export. No sign-up needed.

▶ Open Free Statistics Calculator

🔗 Related Resources

Research Methods Sample Size Calculator → Research Methods Confidence Interval Calculator → Research Methods T-Test Calculator → All Articles Browse All Statistics Articles →

Why Sample Size Matters

Key Inputs for Sample Size Calculation

Formula 1: Estimating a Mean

Formula 2: Estimating a Proportion

Formula 3: Two-Sample T-Test (Power Analysis)

Practical Adjustments

Common Sample Size Rules of Thumb

Why Sample Size Matters

Key Components of Sample Size Calculation

Sample Size for Estimating a Mean

Sample Size for Comparing Two Means

Sample Size for Proportions

Finite Population Correction

Practical Considerations Beyond the Formula

Why Sample Size Matters

Key Components of Sample Size Calculation

Sample Size for Estimating a Mean

Sample Size for Comparing Two Means

Sample Size for Proportions

Finite Population Correction

Practical Considerations Beyond the Formula

Worked Example: Clinical Trial Sample Size

Sample Size for Surveys: A Practical Guide

Consequences of Ignoring Sample Size Planning

Calculate Instantly — 100% Free

Deep Dive: Sample Size Determination — Theory, Assumptions, and Best Practices

Mathematical Foundation

Assumptions and Diagnostics

Interpreting Your Results Completely

Effect Size and Practical Significance

Common Errors and How to Avoid Them

When This Test Is Not Appropriate

Reporting in Academic and Professional Contexts