NOTE: This is a migration of an old post from my previous blog. Recently, I’ve been playing around with some competitions on Kaggle. Given that an inescapable fact of Machine Learning is Feature Selection, I’ve been finding myself in the situation of having to call a dozen or more functions that add synthetic features, infer missing values, etc.