| TreeNet/MART |
OverviewTreeNet stochastic gradient boosting is Stanford University Professor Jerome Friedman's latest advance in data mining methodology. In TreeNet, classification and regression models are built up gradually through a potentially large collection of small trees, each of which improves on its predecessors through an error-correcting strategy. Although each tree may have only one split, the full model can be extraordinarily accurate. The final model takes the form of a series expansion in which every term is a (small) tree. TreeNet improves over conventional boosting in that:
Key innovations in stochastic gradient boosting include:
Learning outcomesAttendees will be introduced to the main concepts in boosting methods in data mining. They will also be presented with the core innovations behind TreeNet stochastic gradient boosting, including the concepts of slow learning, use of weak learners in every stage of model building, resampling from the training at every stage, and ignoring data considered too far from the decision boundary in classification problems. Although the TreeNet model can be complex, this tutorial will show how new graphical tools assist in interpreting the results. The graphical tools demonstrated include 2-D and 3-D graphs that exhibit the dependence of the target on any individual or pair of predictors, and variable importance rankings that are available separately for each target class of the multi-class problem. Content and instructional methodsAttendees will see examples of recent analysis of real world data. PowerPoint slides and live modeling runs will facilitate the learning process. Course Outline:
|