Power Law (Pareto) Distribution

Maths: Statistics for machine learning

2 min read

Published Oct 22 2025, updated Oct 23 2025


40
0
0
0

Machine LearningMathsNumPyPandasPythonStatistics

A Power Law Distribution describes a situation where small occurrences are extremely common,
but large occurrences are very rare, following the general rule:

Power formula

In simple terms:

“A few items account for most of the effect.”
Examples: a few rich people own most wealth, a few websites get most traffic, a few words dominate language use.

It is also known as the Pareto Distribution (after economist Vilfredo Pareto).




Probability Density Function (PDF)

Power PDF Formula

Where:

  • xm​ = minimum possible value (scale parameter)
  • α = shape parameter (also called the power law exponent)

The PDF decreases rapidly as x increases — forming a long right tail.




Cumulative Distribution Function (CDF)

Power CDF Formula

As x → ∞, F(x)→1



Intuition

  • Small values are very common (high probability near xm​)
  • Large values are rare, but not impossible — producing a long tail
  • The distribution is scale-invariant, meaning the shape looks the same at any scale:
Power Intuition Formula


Examples

  • Wealth distribution - A few individuals hold most wealth
  • Internet traffic - Few sites get most visits
  • City populations - Few cities are very large
  • Word frequencies - Few words used very often
  • Social networks - Few users have many followers


Power Law Distribution

  • PDF (left): High probability near the minimum (xₘ), with a long, slow-decaying right tail.
  • CDF (right): Increases quickly at first, then slowly approaches 1.

The Power Law shows that extreme events are rare but not negligible.
The tail never fully disappears — there’s always some chance of very large values.






Effect of α (Shape Parameter)

Power Law Distribution Effect

Smaller α → heavier tail (more big events)
Larger α → lighter tail (big events become rarer)






In Machine Learning and Data Science

  • Modelling heavy-tailed data - Wealth, web traffic, popularity, network degrees
  • Anomaly detection - Detecting rare extreme outliers
  • Natural language processing - Word frequencies (Zipf’s law)
  • Network science - Power-law degree distributions in social graphs
  • Economics / risk modelling - Financial returns, market volatility (tail risk)
  • Generative modelling - Sampling realistic “long-tail” distributions

Products from our shop

Docker Cheat Sheet - Print at Home Designs

Docker Cheat Sheet - Print at Home Designs

Docker Cheat Sheet Mouse Mat

Docker Cheat Sheet Mouse Mat

Docker Cheat Sheet Travel Mug

Docker Cheat Sheet Travel Mug

Docker Cheat Sheet Mug

Docker Cheat Sheet Mug

Vim Cheat Sheet - Print at Home Designs

Vim Cheat Sheet - Print at Home Designs

Vim Cheat Sheet Mouse Mat

Vim Cheat Sheet Mouse Mat

Vim Cheat Sheet Travel Mug

Vim Cheat Sheet Travel Mug

Vim Cheat Sheet Mug

Vim Cheat Sheet Mug

SimpleSteps.guide branded Travel Mug

SimpleSteps.guide branded Travel Mug

Developer Excuse Javascript - Travel Mug

Developer Excuse Javascript - Travel Mug

Developer Excuse Javascript Embroidered T-Shirt - Dark

Developer Excuse Javascript Embroidered T-Shirt - Dark

Developer Excuse Javascript Embroidered T-Shirt - Light

Developer Excuse Javascript Embroidered T-Shirt - Light

Developer Excuse Javascript Mug - White

Developer Excuse Javascript Mug - White

Developer Excuse Javascript Mug - Black

Developer Excuse Javascript Mug - Black

SimpleSteps.guide branded stainless steel water bottle

SimpleSteps.guide branded stainless steel water bottle

Developer Excuse Javascript Hoodie - Light

Developer Excuse Javascript Hoodie - Light

Developer Excuse Javascript Hoodie - Dark

Developer Excuse Javascript Hoodie - Dark

© 2025 SimpleSteps.guide
AboutFAQPoliciesContact