Kruskal–Wallis H Test

Maths: Statistics for machine learning

2 min read

Published Oct 22 2025, updated Oct 23 2025

40

0

0

0

Machine LearningMathsNumPyPandasPythonStatistics

The Kruskal–Wallis H test is a non-parametric statistical test used to determine whether there are statistically significant differences between the medians of three or more independent groups.

It’s the non-parametric alternative to a one-way ANOVA.

In simple terms:

“The Kruskal–Wallis test checks whether the distributions of three or more groups are the same — without assuming a normal distribution.”

When to Use It

Three or more groups - Independent samples
Ordinal or continuous data - That do not follow a normal distribution
Same shape of distribution - The test assumes group distributions have a similar shape

When not to Use It

Paired data - Use the Friedman test instead
Normal data - Use ANOVA instead

Example Question

“Do customers in different regions (North, South, East, West) spend the same amount on average?”

If spending data are skewed (e.g., non-normal, outliers), the Kruskal–Wallis test is the best choice.

Hypotheses

H₀ (Null Hypothesis) - All group medians are equal (no difference between groups)
H₁ (Alternative Hypothesis) - At least one group median is different

How It Works

Combine all group data together.
Rank all data from smallest to largest (1 = smallest).
Compute the sum of ranks (Rᵢ) for each group.
Calculate the test statistic H:

Kruskal formula

Where:

N = total number of observations
R_i = sum of ranks for group i
n_i = size of group i

H follows an approximate chi-squared (χ²) distribution with k − 1 degrees of freedom.

If H is large → group medians differ → reject H₀.

Example in Python

Let’s test if three different marketing campaigns lead to different customer spending.

import numpy as np

from scipy.stats import kruskal

# Example data (non-normal)

campaign_A = np.array([45, 50, 52, 60, 61, 62, 65, 70, 72])

campaign_B = np.array([40, 42, 48, 51, 55, 57, 59, 61, 63])

campaign_C = np.array([52, 55, 58, 60, 63, 65, 66, 68, 70])

# Perform Kruskal–Wallis Test

stat, p = kruskal(campaign_A, campaign_B, campaign_C)

print(f"Kruskal–Wallis H Statistic: {stat:.3f}")

print(f"P-value: {p:.4f}")

Interpretation:

p < 0.05 → Reject H₀ → At least one group’s median differs.
p ≥ 0.05 → Fail to reject H₀ → No significant difference between groups.

Output Example:

Kruskal–Wallis H Statistic: 8.274

P-value: 0.0159

Since p < 0.05 → there is a significant difference between at least one group’s distribution.

Visual Results:

Kruskal visualisation

This shows each group’s spread - if one boxplot is clearly higher/lower, that’s what drives the significant difference.

Python code

import pingouin as pg

import pandas as pd

import numpy as np

# Example: Comparing performance across 3 metrics (non-normal data)

np.random.seed(42)

df = pd.DataFrame({

'Metric': np.repeat(['A', 'B', 'C'], 20),

'Performance': np.concatenate([

np.random.exponential(scale=1.0, size=20),

np.random.exponential(scale=1.5, size=20),

np.random.exponential(scale=2.0, size=20)

])

})

# Run Kruskal-Wallis test

results = pg.kruskal(data=df, dv='Performance', between='Metric')

print(results)

Output:

Source ddof1 H p-unc

Kruskal Metric 2 3.649508 0.161257

Products from our shop

Docker Cheat Sheet - Print at Home Designs

Docker Cheat Sheet - Print at Home Designs

Docker Cheat Sheet Mouse Mat

Docker Cheat Sheet Mouse Mat

Docker Cheat Sheet Travel Mug

Docker Cheat Sheet Travel Mug

Docker Cheat Sheet Mug

Docker Cheat Sheet Mug

Vim Cheat Sheet - Print at Home Designs

Vim Cheat Sheet - Print at Home Designs

Vim Cheat Sheet Mouse Mat

Vim Cheat Sheet Mouse Mat

Vim Cheat Sheet Travel Mug

Vim Cheat Sheet Travel Mug

Vim Cheat Sheet Mug

Vim Cheat Sheet Mug

SimpleSteps.guide branded Travel Mug

SimpleSteps.guide branded Travel Mug

Developer Excuse Javascript - Travel Mug

Developer Excuse Javascript - Travel Mug

Developer Excuse Javascript Embroidered T-Shirt - Dark

Developer Excuse Javascript Embroidered T-Shirt - Dark

Developer Excuse Javascript Embroidered T-Shirt - Light

Developer Excuse Javascript Embroidered T-Shirt - Light

Developer Excuse Javascript Mug - White

Developer Excuse Javascript Mug - White

Developer Excuse Javascript Mug - Black

Developer Excuse Javascript Mug - Black

SimpleSteps.guide branded stainless steel water bottle

SimpleSteps.guide branded stainless steel water bottle

Developer Excuse Javascript Hoodie - Light

Developer Excuse Javascript Hoodie - Light

Developer Excuse Javascript Hoodie - Dark

Developer Excuse Javascript Hoodie - Dark

© 2025 SimpleSteps.guide

About Us FAQ Policies Contact Us