Standard Normal Distribution (Z-Score)

Maths: Statistics for machine learning

Machine LearningMathsNumPyPandasPythonStatistics

The Standard Normal Distribution is a special case of the Normal Distribution where μ=0 and σ=1.

That means:

It’s used to measure how far a value is from the mean in standard deviation units — this distance is called the Z-score.

Where:

The Z-score tells you how many standard deviations away a value is from the mean.

Interpretation

Z-Score	Meaning
0	Exactly at the mean
+1	1 standard deviation above mean
-1	1 standard deviation below mean
+2	2 standard deviations above mean
-2	2 standard deviations below mean
> +3 or < -3	Unusually extreme (outlier)

Z-scores standardise data — making different variables comparable even if they have different units or scales.

For example:
A test score of 75 in Math and 82 in English — which is better?
Z-scores let you compare them on the same relative scale.

For the standard normal distribution:

Represents the probability that a standard normal variable is less than or equal to a given Z-score.

Left plot (PDF):
- A bell-shaped curve centered at 0
- Colored areas show probabilities for ±1σ, ±2σ, ±3σ
Right plot (CDF):
- Smooth S-shaped curve
- Represents cumulative probability up to each Z-score

The area under the curve corresponds to probability
Z-scores make probability lookup easy using Z-tables