ANOVA (Analysis of Variance) is one of the most widely used statistical tests. It compares the means of three or more groups simultaneously — answering the question: "Are all these group means equal, or does at least one group differ?"

Why Not Just Run Multiple T-Tests?

If you want to compare 4 groups, you could run 6 pairwise t-tests (4×3/2 = 6 pairs). But each test has a 5% chance of a false positive at α = 0.05. Running 6 tests inflates the overall error rate to 1 − (0.95)⁶ = 26%. You would expect one false significant result every four experiments just by chance. ANOVA avoids this by testing all groups in a single unified test.

The Core Idea of ANOVA

ANOVA works by comparing two sources of variance:

Between-group variance (MSB): How much do the group means differ from each other? If groups are truly different, MSB will be large.

Within-group variance (MSW): How much do observations vary within each group? This is "background noise" — variability not explained by group membership.

The F-statistic = MSB/MSW. A large F means the between-group differences are large relative to random noise — suggesting the groups are genuinely different.

Assumptions of One-Way ANOVA

ANOVA is robust to mild violations of normality, especially with equal group sizes (balanced design).

Reading the ANOVA Table

SourceSSdfMSFp-value
Between groupsSSBk−1MSB = SSB/(k−1)MSB/MSWFrom F-distribution
Within groups (Error)SSWN−kMSW = SSW/(N−k)
TotalSSTN−1

Where k = number of groups, N = total observations, SST = SSB + SSW.

Interpreting Results

If p < α: reject H₀. At least one group mean is significantly different from the others. But ANOVA does not tell you which groups differ — that requires post-hoc testing.

If p ≥ α: fail to reject H₀. Insufficient evidence that any group means differ.

Post-Hoc Tests After Significant ANOVA

After a significant ANOVA result, run pairwise comparisons with correction for multiple testing:

Effect Size for ANOVA

Always report effect size alongside the F-statistic:

When to Use Non-Parametric Alternatives

If ANOVA assumptions are severely violated: use the Kruskal-Wallis test — the non-parametric equivalent that uses ranks instead of raw values. Robust to non-normality and outliers.

Use our free ANOVA Calculator to get the full ANOVA table with F-statistic, p-value, and effect size instantly.