Effect Sizes
SciPy - Statistical Testing
2 min read
Published Nov 17 2025
Guide Sections
Guide Comments
Statistical significance (p-values) tells you whether an effect exists.
Effect sizes tell you how large the effect is.
Both are essential.
Effect sizes are widely used in:
- A/B testing
- Clinical research
- Behavioural science
- Social science
- Machine learning evaluation
- Business analytics
For mean differences:
- Cohen's d
- Glass's Δ
- Hedge’s g
For non-parametric tests:
- Rank-biserial correlation
- Cliff’s delta
For categorical tests:
- Cramér’s V
- Phi coefficient
For ANOVA:
- Eta-squared (η²)
- Partial eta-squared
Cohen’s d (Independent Samples)
For two independent groups, measures the standardised mean difference.
Formula (simplified):
Example Implementation
Interpretation
- 0.20 = small
- 0.50 = medium
- 0.80 = large
Cohen’s d (Paired Samples)
Used after a paired t-test.
Example
Hedge’s g (Corrected Cohen’s d)
More accurate for small samples (n < 20).
Function
Glass’s Δ
Used when group variances differ greatly.
Uses control group's standard deviation only.
Function
Rank-Biserial Correlation (Mann–Whitney U)
Effect size for Mann–Whitney U test.
Function
Example
Cliff’s Delta (Non-Parametric Effect Size)
Measures how often values in one group exceed values in another.
Function
Interpretation
- 0.147 = small
- 0.330 = medium
- 0.474 = large
Cramér’s V (Chi-Square Effect Size)
Used for chi-square tests of independence.
Function
Interpretation
- 0.10 = small
- 0.30 = medium
- 0.50 = large
Phi Coefficient (2×2 Tables Only)
Special case of Cramér’s V for 2×2 tables.
Function
Equivalent to Pearson correlation for binary variables.
Eta-Squared (η²) for ANOVA
Effect size for one-way ANOVA.
Function
Interpretation
- 0.01 = small
- 0.06 = medium
- 0.14 = large
Partial Eta-Squared
Used in multi-factor ANOVA (via Statsmodels).
Statsmodels output includes it automatically if using:
Compute manually from table:
Summary: When to Use Which Effect Size
Scenario | Best Effect Size |
Two independent means | Cohen’s d / Hedge’s g |
Two paired means | Cohen’s d (paired) |
Non-parametric 2-group | Rank-biserial / Cliff’s delta |
Chi-square independence | Cramér’s V |
2×2 categorical | Phi coefficient |
One-way ANOVA | Eta-squared |
Multi-factor ANOVA | Partial eta-squared |
Conversion rates (binary) | Cohen’s h |














