Member-only story

Kolmogorov-Smirnov (K-S) statistic

Sandeep Sharma
5 min readSep 15, 2023

In my previous blog we read about “Statistics importance in Regression Modeling”. In this blog we will understand one of the concept which is call KS Statistics or K-S statistics.

It is named after Andrey Kolmogorov and Nikolai Smirnov

  • The Kolmogorov-Smirnov (K-S) statistic is a non-parametric test used to compare a sample with a reference probability distribution or to compare two samples. It can be used to check the goodness of fit of a distribution to data.
  • In this context, we can use the K-S test to check whether the residuals (difference between actual and predicted values) of our regression model follow a normal distribution
Did not understand it yet?

Lets understand through a story

Imagine you have a big jar of jellybeans. You think all the jellybeans in this jar are blue. To check, you close your eyes and take a small handful of jellybeans. Now, you want to compare the colors in your hand to what you expect (all blue).

The Kolmogorov-Smirnov (let’s call it K-S) test is like a tool that helps you decide if the handful of jellybeans you picked is a good representation of the entire jar.

Using the K-S test, you compare your handful to the expected all-blue jellybeans. If the test says, “Hey, this…

--

--

Sandeep Sharma
Sandeep Sharma

Written by Sandeep Sharma

Manager Data Science — Coffee Lover — Machine Learning — Statistics — Management Consultant — Product Management — Business Analyst

No responses yet