Data Mining with Decision Trees (CART)
Overview
Discover the power of tree-structured data mining during this popular introductory seminar, geared toward statisticians and IT audiences who are interested in understanding the conceptual basis of decision tree technology -- what it is, why it works, how it has been used, and how it can help you make better business decisions. Explore the practical use and application of decision trees for solving real world data mining problems and learn about:
- Decision tree fundamentals
- Decision tree applications
- How to build and interpret CART models
- How to use advanced options
- alternative splitting rules
- prior probabilities
- differential costs of misclassification
Attendees will see examples of analysis of real world data. PowerPoint slides and live modeling runs will facilitate the learning process.
Course Outline:
- Historical Background of CART
- Sample CART Model
- What is CART?
- How to read the tree
- Tree interpretation and use
- Introducing the CART Interface
- Setting up the model
- Understanding the results
- The Big Picture
- Binary recursive partitioning
- Workflow of a model
- Fundamentals of Tree Pruning
- Competitors & Surrogates
- Variable Importance
- Splitting Rules and Friends
- Introduction to splitting rules
- Penalizing variables
- Missing values
- Forced splits
- Constraints and StructuredTrees™
- Introduction to priors and costs
- Cross Validation
- Regression Trees
- Scoring & Deployment
- CART Automation (Batteries of runs)
- HotSpotDetector™
- Stable Trees & Consistency

