Standard Normal Distribution (Z-Score)

Maths: Statistics for machine learning

2 min read

Published Oct 22 2025, updated Oct 23 2025


40
0
0
0

Machine LearningMathsNumPyPandasPythonStatistics

The Standard Normal Distribution is a special case of the Normal Distribution where μ=0 and σ=1.


That means:

  • The mean (centre) = 0
  • The standard deviation (spread) = 1

It’s used to measure how far a value is from the mean in standard deviation units — this distance is called the Z-score.




Z-Score Formula

Z Score Formula

Where:

  • X = original data value
  • μ = population mean
  • σ = population standard deviation

The Z-score tells you how many standard deviations away a value is from the mean.


Interpretation

Z-Score

Meaning

0

Exactly at the mean

+1

1 standard deviation above mean

-1

1 standard deviation below mean

+2

2 standard deviations above mean

-2

2 standard deviations below mean

> +3 or < -3

Unusually extreme (outlier)




Why Use Z-Scores?

Z-scores standardise data — making different variables comparable even if they have different units or scales.

For example:
A test score of 75 in Math and 82 in English — which is better?
Z-scores let you compare them on the same relative scale.




Probability Density Function (PDF)

For the standard normal distribution:

standard normal PDF Formula
  • The curve is symmetric around 0
  • The total area = 1 (represents 100% probability)



Cumulative Distribution Function (CDF)

standard normal CDF Formula

Represents the probability that a standard normal variable is less than or equal to a given Z-score.



standard normal Distribution

  • Left plot (PDF):
    • A bell-shaped curve centered at 0
    • Colored areas show probabilities for ±1σ, ±2σ, ±3σ
  • Right plot (CDF):
    • Smooth S-shaped curve
    • Represents cumulative probability up to each Z-score

The area under the curve corresponds to probability
Z-scores make probability lookup easy using Z-tables





In Machine Learning

  • Feature standardisation - Convert all features to mean=0, std=1 before training
  • Distance-based algorithms - Important for SVMs, KNNs, PCA, clustering
  • Outlier detection - Extreme Z-scores
  • Probability modelling - Z-scores convert any normal variable to standard form
  • Evaluation metrics - Used in Z-tests, confidence intervals, etc.

Products from our shop

Docker Cheat Sheet - Print at Home Designs

Docker Cheat Sheet - Print at Home Designs

Docker Cheat Sheet Mouse Mat

Docker Cheat Sheet Mouse Mat

Docker Cheat Sheet Travel Mug

Docker Cheat Sheet Travel Mug

Docker Cheat Sheet Mug

Docker Cheat Sheet Mug

Vim Cheat Sheet - Print at Home Designs

Vim Cheat Sheet - Print at Home Designs

Vim Cheat Sheet Mouse Mat

Vim Cheat Sheet Mouse Mat

Vim Cheat Sheet Travel Mug

Vim Cheat Sheet Travel Mug

Vim Cheat Sheet Mug

Vim Cheat Sheet Mug

SimpleSteps.guide branded Travel Mug

SimpleSteps.guide branded Travel Mug

Developer Excuse Javascript - Travel Mug

Developer Excuse Javascript - Travel Mug

Developer Excuse Javascript Embroidered T-Shirt - Dark

Developer Excuse Javascript Embroidered T-Shirt - Dark

Developer Excuse Javascript Embroidered T-Shirt - Light

Developer Excuse Javascript Embroidered T-Shirt - Light

Developer Excuse Javascript Mug - White

Developer Excuse Javascript Mug - White

Developer Excuse Javascript Mug - Black

Developer Excuse Javascript Mug - Black

SimpleSteps.guide branded stainless steel water bottle

SimpleSteps.guide branded stainless steel water bottle

Developer Excuse Javascript Hoodie - Light

Developer Excuse Javascript Hoodie - Light

Developer Excuse Javascript Hoodie - Dark

Developer Excuse Javascript Hoodie - Dark

© 2025 SimpleSteps.guide
AboutFAQPoliciesContact