Download Now! Free 30 Day Trial of Salford System's Predictive Modeling Suite

Upcoming Tradeshows

  • JSM
    July 28, 2012 - August 02, 2012
    San Diego, CA, Booth TBA
  • KDD
    August 12, 2012 - August 16, 2012
    Beijing, China, Booth TBA
  • Statistical Learning and Data Mining III
    October 01, 2012
    Boston, MA
  • DMA
    October 13, 2012 - October 19, 2012
    Las Vegas, NV
  • INFORMS
    October 14, 2012 - October 16, 2012
    Phoenix, AZ
View full calendar
Home Blog Dan Steinberg Dan Steinberg Using Batteries in The Salford Predictive Modeling Suite

Using Batteries in The Salford Predictive Modeling Suite

Written by  Dan Steinberg Friday, September 16 2011
Rate this item
(0 votes)

The Salford Predictive Modeler™ suite (SPM) includes a number of automated tools to assist in the process of feature selection under the BATTERY mechanism. For example,

BATTERY KEEP

Selects a subset of features at random and builds a model from this random subset only. The GUI will guide you in how to use this option, but from the command line you would issue something like:

BATTERY KEEP=100, 15

Which requests 100 models, each of which includes 15 randomly-selected predictors. If we are sure that we want certain variables included in every such model, the command would look like:

BATTERY KEEP=100, 15 CORE= X1, X2, X3, X4, X5

 

The Salford Predictive Modeler™ suite (SPM) includes a number of automated tools to assist in the process of feature selection under the BATTERY mechanism. For example,

BATTERY KEEP

Selects a subset of features at random and builds a model from this random subset only. The GUI will guide you in how to use this option, but from the command line you would issue something like:

BATTERY KEEP=100, 15

Which requests 100 models, each of which includes 15 randomly-selected predictors. If we are sure that we want certain variables included in every such model, the command would look like:

BATTERY KEEP=100, 15 CORE= X1, X2, X3, X4, X5

Which would pre-select five of the 15 available predictors, leaving another 10 to be selected at random.

BATTERY SHAVE is another automated model-building mechanism that is intended to start with a list of predictors that we would like to pare down to a smaller subset. We "shave" variables from the list of predictors by running a model, reviewing the variable and then rebuild the model. In another mode, we can remove the most important variables sequentially instead. Both modes can be very helpful in discovering a good set of predictors to work with.

BATTERY KEEP=‹NK, NR› [CORE=‹predictor›, ‹predictor›,…]

Repeat the model NR times, selecting a subset of NK predictors from the KEEP list each time.

The CORE option defines a group of predictors (from the main KEEP list) that are included in each of the models of the battery.

The CORE option, if used, must be the final option on the BATTERY command.<.p>

Dan Steinberg

Dan Steinberg

Dan Steinberg, President and Founder of Salford Systems, is a well-respected member of the statistics and econometrics communities. In 1992, he developed the first PC-based implementation of the original CART procedure, working in concert with Leo Breiman, Richard Olshen, Charles Stone and Jerome Friedman. In addition, he has provided consulting services on a number of biomedical and market research projects, which have sparked further innovations in the CART program and methodology.

Dr. Steinberg received his Ph.D. in Economics from Harvard University, and has given full day presentations on data mining for the American Marketing Association, the Direct Marketing Association and the American Statistical Association. A book he co-authored on Classification and Regression Trees was awarded the 1999 Nikkei Quality Control Literature Prize in Japan for excellence in statistical literature promoting the improvement of industrial quality control and management.