K-Nearest Neighbors from Scratch in Python

Posted on March 16 2017 in Machine Learning • Tagged with k-nearest neighbors, classification, pythonLeave a comment

MNIST

The \(k\)-nearest neighbors algorithm is a simple, yet powerful machine learning technique used for classification and regression. The basic premise is to use closest known data points to make a prediction; for instance, if \(k = 3\), then we'd use 3 nearest neighbors of a point in the test set …

Continue reading

Multilevel URL Community Detection with Infomap

Posted on March 13 2017 in Machine Learning • Tagged with community detection, network, graph, machine learning, pythonLeave a comment

Recently I've discovered a dataset online that is a collection of 20M web queries from about 650K users over three months from March 1 to May 31 2006.

The columns are listed as follows:

  • User ID - an anonymous user ID number
  • Query - the query submitted by the user
  • Query Time …
Continue reading

Clustering Burger King Menu with the Dirichlet Process

Posted on August 18 2015 in Machine Learning • Tagged with clustering, machine learning, burger kingLeave a comment

an interesting graph

Burger King Cluster Sample

I was going to write part two of the previous post on A/B testing now using Bayesian methods, but I plan to do that in another time since today I'm going to write about clustering, a widely used machine learning technique, specifically clustering Burger King menu …

Continue reading