k-Nearest Neighbors

This episode explores the k-nearest neighbors algorithm which is an unsupervised, non-parametric method that can be used for both classification and regression. The basic concept is that it leverages some distance function on your dataset to find the k closests other observations of the dataset and averaging them to impute an unknown value or unlabelled datapoint.

Enjoy this post? Sign up for our mailing list and don't miss any updates.

Have a word to say? Propose a specific change to the blog post.