Stratified vs Cluster Sampling

Stratified and cluster sampling are both probability sampling methods that divide the population into groups before sampling — but they work in opposite ways and are used in very different situations. Confusing the two is a common mistake.

The Key Distinction

Stratified sampling: Groups (strata) are DIFFERENT from each other. You sample FROM EVERY stratum. Goal: ensure representation of every important subgroup.

Cluster sampling: Groups (clusters) are SIMILAR to each other (microcosms of the population). You sample SOME clusters and survey everyone (or a sample) within them. Goal: reduce cost of geographically dispersed populations.

Stratified Sampling

Divide the population into strata based on a characteristic relevant to your research variable. Take a random sample from each stratum.

When to use:

Subgroups differ significantly on the variable you are measuring
You need guaranteed representation of all subgroups
You want to make separate estimates for each subgroup
You want more precise estimates than simple random sampling

Example: Studying student satisfaction at a university with 1000 undergrads, 400 postgrads, 100 PhD students. Stratify by level, sample 50 from each stratum proportionally (or equally if comparing strata).

Precision: Stratified sampling is MORE precise than simple random sampling when strata differ — less variance within each stratum.

Cluster Sampling

Divide the population into clusters (usually geographic). Randomly select some clusters. Survey everyone (or a random sample) within selected clusters.

When to use:

Population is spread over a large geographic area
You do not have a complete list of individuals — only a list of clusters
Travelling to many locations is expensive or impractical

Example: National reading survey of primary school students. List all 5,000 schools (clusters). Randomly select 100 schools. Survey all students in those 100 schools. You only need to travel to 100 locations instead of having students dispersed across the country.

Precision: Cluster sampling is LESS precise than simple random sampling (design effect DEFF > 1). Individuals within a cluster tend to be similar — this reduces the effective sample size.

Side-by-Side Comparison

Feature	Stratified Sampling	Cluster Sampling
Groups differ?	YES — strata are heterogeneous	NO — clusters are homogeneous
Sample from all groups?	YES — every stratum sampled	NO — only selected clusters
Goal	Precision, representation	Cost reduction, feasibility
Precision vs SRS	More precise	Less precise (DEFF > 1)
Cost	Higher (sample from everywhere)	Lower (only visit selected clusters)
Requires full list?	Yes — list of individuals	No — list of clusters only
Analysis complexity	Moderate	Higher (account for DEFF)

Multistage Sampling

Large national surveys often combine both methods: first cluster to select geographic areas (cost-efficient), then stratify within selected areas (improve precision). This is called multistage stratified cluster sampling — the method used by most national statistical agencies.

Use our Sample Size Calculator to compute required sample sizes for different study designs, including adjustments for design effects in cluster sampling.

Why Random Sampling Comes in Different Flavours

Simple random sampling (SRS) — where every member of the population has an equal probability of selection — is the theoretical ideal but often impractical. When populations have meaningful subgroups, or when a complete population list is unavailable, alternative probability sampling methods provide practical solutions while maintaining statistical validity. Stratified and cluster sampling are the two most important alternatives.

Stratified Sampling: Ensuring Subgroup Representation

Stratified sampling divides the population into mutually exclusive subgroups (strata) based on a relevant characteristic (age group, region, income bracket, gender), then draws random samples from each stratum. This guarantees representation from every subgroup, unlike SRS which might by chance undersample some groups. The strata should be defined by variables related to the outcome of interest — this is what reduces variance.

Proportional vs Optimal Allocation

In proportional stratified sampling, the sample from each stratum is proportional to the stratum's size in the population. This produces estimates that are unbiased and easy to weight. In optimal (Neyman) allocation, sample sizes are also proportional to within-stratum variability — strata with more heterogeneity get larger samples. Optimal allocation minimises variance for a given total sample size, but requires knowledge of within-stratum variance in advance.

When Stratified Sampling Outperforms SRS

Stratified sampling is more precise than SRS when: strata are internally homogeneous (low within-stratum variance) but differ from each other (high between-stratum variance), you need reliable estimates for each subgroup separately, or you want to guarantee representation of small but important subpopulations. In a national health survey, stratifying by region and age ensures adequate coverage of rural elderly populations that might be missed by SRS.

Cluster Sampling: When a Population List Doesn't Exist

Cluster sampling randomly selects groups (clusters) from the population, then surveys all members within selected clusters. It is the practical choice when no complete population list exists but a list of groups does. For a national student survey, you might randomly select 100 schools (clusters) and survey all students in those schools — no complete student list is needed nationally.

Two-Stage Cluster Sampling

In two-stage (multi-stage) cluster sampling, you first select clusters randomly, then select a random sample of individuals within each selected cluster. This is more flexible than single-stage cluster sampling and is the basis for most large national surveys (census, health surveys, labour force surveys). The design effect (DEFF) quantifies how much less efficient cluster sampling is compared to SRS — cluster samples typically have DEFF > 1, meaning you need larger samples to achieve equivalent precision.

Key Differences at a Glance

Feature	Stratified	Cluster
Goal	Increase precision	Reduce cost/logistical complexity
Subgroups sampled	All strata (every group)	Randomly selected clusters (subset)
Within groups	Sample from each stratum	All (or sample) within selected clusters
Efficiency vs SRS	More efficient	Less efficient (higher DEFF)
Requires group list	Yes, complete	Only list of clusters, not individuals

Why Random Sampling Comes in Different Flavours

Stratified Sampling: Ensuring Subgroup Representation

Proportional vs Optimal Allocation

When Stratified Sampling Outperforms SRS

Cluster Sampling: When a Population List Doesn't Exist

Two-Stage Cluster Sampling

Key Differences at a Glance

Feature	Stratified	Cluster
Goal	Increase precision	Reduce cost/logistical complexity
Subgroups sampled	All strata (every group)	Randomly selected clusters (subset)
Within groups	Sample from each stratum	All (or sample) within selected clusters
Efficiency vs SRS	More efficient	Less efficient (higher DEFF)
Requires group list	Yes, complete	Only list of clusters, not individuals

Worked Example: National Education Survey

The government wants to estimate average mathematics scores for 5th-grade students nationwide. There are 15,000 schools with an average of 80 students each (1.2 million total). A complete list of all students does not exist, but a list of all schools does.

SRS would require: A list of all 1.2M students, random selection of, say, 1,200 students from across the country — logistically impossible to reach students scattered across thousands of schools.

Cluster sampling solution: Randomly select 60 schools (clusters), then test all 5th-graders in those schools (~80 students each = ~4,800 students total). The design effect (DEFF) for school clustering is typically 3–5 for academic outcomes, meaning you need 3–5× more students than SRS to achieve equivalent precision. With DEFF=4, the effective sample size is 4,800/4 = 1,200 — equivalent to a simple random sample of 1,200, but far more logistically feasible.

Stratified improvement: Instead of simple random selection of schools, stratify by state (50 strata) and select 1–2 schools per stratum. This guarantees every state is represented and often reduces the design effect, giving better precision for the same cost.

Systematic Sampling Pitfall: A Real Example

A military researcher analysing aircraft returning from combat missions noticed that bullet holes were concentrated in certain areas (wings, fuselage) and proposed reinforcing those areas. Statistician Abraham Wald famously pointed out the survivorship bias: the sample only included planes that returned. Planes hit in the engine or cockpit didn't return — they were shot down. The researcher should reinforce where the surviving planes were NOT hit. This is arguably the most famous example of how sampling method profoundly shapes conclusions: only sampling survivors systematically excludes the most informative cases. The correct population was all aircraft hit, not just returning aircraft.

Calculate Instantly — 100% Free

45 statistics calculators with step-by-step solutions, interactive charts, and PDF export. No sign-up needed.

▶ Open Free Statistics Calculator

🔗 Related Resources

Sampling Methods Sample Size Calculator → Sampling Methods Types of Sampling Methods → Sampling Methods Statistics Glossary → All Articles Browse All Statistics Articles →

The Key Distinction

Stratified Sampling

Cluster Sampling

Side-by-Side Comparison

Multistage Sampling

Why Random Sampling Comes in Different Flavours

Stratified Sampling: Ensuring Subgroup Representation

Proportional vs Optimal Allocation

When Stratified Sampling Outperforms SRS

Cluster Sampling: When a Population List Doesn't Exist

Two-Stage Cluster Sampling

Key Differences at a Glance

Why Random Sampling Comes in Different Flavours

Stratified Sampling: Ensuring Subgroup Representation

Proportional vs Optimal Allocation

When Stratified Sampling Outperforms SRS

Cluster Sampling: When a Population List Doesn't Exist

Two-Stage Cluster Sampling

Key Differences at a Glance

Worked Example: National Education Survey

Systematic Sampling Pitfall: A Real Example

Calculate Instantly — 100% Free

Deep Dive: Stratified Vs Cluster Sampling — Theory, Assumptions, and Best Practices

Mathematical Foundation

Assumptions and Diagnostics

Interpreting Your Results Completely

Effect Size and Practical Significance

Common Errors and How to Avoid Them

When This Test Is Not Appropriate

Reporting in Academic and Professional Contexts

Statistical Reasoning: Building Intuition Through Examples

Case Study 1: Healthcare Research Application

Case Study 2: Business Analytics Application

Case Study 3: Educational Assessment

Understanding Output from Statistical Software

Integrating Multiple Analyses

Statistical Software Commands Reference

Frequently Asked Questions: Advanced Topics

Can I use this test with non-normal data?

How do I handle missing data?

What is the difference between a one-sided and two-sided test?

How should I report results in a research paper?