Tutorial: Playing with the Boston Housing Data

From dftwiki3
Revision as of 05:28, 11 August 2016 by Thiebaut (talk | contribs) (TensorFlow NN with Hidden Layers: Regression on Boston Data)
Jump to: navigation, search

--D. Thiebaut (talk) 16:06, 8 August 2016 (EDT)


Deep Neural-Network Regressor (DNNRegressor from Tensorflow)


This tutorial uses SKFlow and TensorFlow, and follows very closely two other good tutorials and merges elements from both:


We use the Boston housing prices data for this tutorial.
The tutorial is best viewed as a Jupyter notebook (available in zipped form below), or as a static pdf (you'll have to retype all the commands...)



SKLearn Linear Regression Model on the Boston Data


This tutorial also uses SKFlow and follows very closely two other good tutorials and merges elements from both:


We also use the Boston housing prices data for this tutorial.
The tutorial is best viewed as a Jupyter notebook (available in zipped form below), or as a static pdf (you'll have to retype all the commands...)


TensorFlow NN with Hidden Layers: Regression on Boston Data


Here we take the same approach, but use the TensorFlow library to solve the problem of predicting the housing prices using the 13 features present in the Boston data. The code is longer, but offers insight into the "behind the scene" aspect of sklearn.

The result is quite good, as illustrated in the figures below, showing prediction versus test data, and residuals. The R2 coefficient of correlation obtained for a network taking 13 features and feeding them into a 52x39x26x13 architecture of layers is R2 = 0.8150.



TensorFlowNNBostonHousingPredVsReal.png TensorFlowNNBostonHousingResiduals.png TensorFlowNNBostonHousingCostVsEpochs.png