TreeNet

Testimonials

Rawa Shamroukh, Union Bank, Vice President / Senior Strategic Modeling Manager, Predictive Modeling Department

Using Salford’s data mining software TreeNet and CART for the last 8 years, Union Bank has developed multiple in-house statistical models for the various business units within Retail Banking. These tools have greatly expanded our ability to synthesize massive amounts of data quickly, reduce our modeling turnaround times, and, above all, produce the most robust predictive models in terms of accuracy and performance. The main benefits of these tools from the bank’s perspective are time savings, increased productivity, superior analytics, and affordability. TreeNet and CART provided significant advantages and established a new paradigm for certain modeling challenges within our department. Benefits were obtained along each step of the overall data mining process, from data processing to model development and implementation. The tools demonstrate remarkable performance for both regression and classification models. When I first started using TreeNet for logistic regression modeling, I always developed a parallel model using basic SAS. TreeNet kept outperforming the SAS logistic regression models over and over. In my opinion, the most amazing features of these tools are their automatic handling of variable selection, collinearity, data transformation, outliers, and much more. In addition, you get on-the-spot model assessment by generating immediate Gain / Lift charts, ROC measures, train/test comparisons, misclassification / prediction success tables, etc. Using these tools, you have the option to deploy your final model in a SAS environment or a SQL environment, since they automatically translate your model into SAS code, C code, or PMML. I have used TreeNet in various applications. To name a few: attrition models, behavior scoring models, operational loss models, overdraft models, uncollected funds models, fraud models, etc.

Rawa Shamroukh
Union Bank
Vice President / Senior Strategic Modeling Manager
Predictive Modeling Department


Sassoon Kosian, Sr. Assistant Vice President and Head of Methodology at EXL Service

In the Decision Analytics practice of EXL Service we build many predictive models for our clients, representing a wide range of industries, levels of complexity and application environments. CART and MARS have been standard tools that our resources are trained on and apply in model development. I personally have been using Salford Systems products for nearly 8 years and I believe I will continue using them for the foreseeable future. I have mostly used CART and sometimes MARS, and in recent years I have also tried Random Forests and TreeNet. All of the products offer an intuitive GUI and versatility, but for me personally CART would be the winner.
When it comes to decision trees there are other options in the market, but none has come close to CART in terms of rich functionality, intuitive ease of use and affordability. These are the reasons why I continue recommending it to our modeling resources. There are several ways we take advantage of CART features. First of all, it is quite easy and quick to build a CART tree that has decent predictive power and that you can also explain to business users, something that's very important to our clients. Besides building tree models, we often use CART to quickly derive insights about the data by building a preliminary tree, exploring the variable importance list and important nodes, and making sense of individual variables. I find the data exploration functionality incredibly valuable, especially when you have hundreds or thousands of variables. CART and MARS have also helped us to create composite variables that we have used in regression models, another very useful feature we frequently use. CART also comes in handy to quickly identify segmentation strategies in complex scenarios. All of these features make CART a very important tool in our model development toolkit and help us bring value to our clients.

Sassoon Kosian, Sr. Assistant Vice President and Head of Methodology at EXL Service


Xu Jie, Nanjing University of Information Science & Technology

I am conducting a GIS project that requires a great deal of data analysis. Lacking useful tools, our project made slow progress. By chance I obtained a TreeNet trial version, and it astonished me with its powerful data analysis capabilities, friendly user interface and, most important of all, accuracy. After we started using it, our research benefited in many ways and our project moved much faster.
Among TreeNet's many features, the one we like most is the plots, which display graphs after a model is built. This feature is especially useful to us because it provides the most visual and easiest way to find shortcomings and improve the model. We also like the multiple ways of setting up a model; they are flexible and cover most aspects of our research.
What attracts me most is the amazing speed of TreeNet. We used some other software before TreeNet, and none of it could build a model in such a short time. What’s more, TreeNet spares us the painstaking procedures of data preprocessing, which greatly accelerated our research. Since our data contains over 100 million variables, common software takes weeks to produce an analysis result. With TreeNet, it takes just a couple of days.
In addition, the most important thing is accuracy. Due to defects in the sampling stage, there are some noisy variables in our data, which bring instability to our model. With TreeNet, we got a more stable model for our research than with other tools. What’s more, the model we built with TreeNet could be repeated and verified.
As to ROI, I cannot say how much money we have saved by using this tool, but we did consider buying a high-performance computer (about 5,000 US dollars) to assist our research. After using the software, we decided to postpone that purchase.
We have been using TreeNet for just a few months, but we are really impressed by its power. We know we are using just a few common features; many powerful features are still waiting for us to learn.

Xu Jie, Nanjing University of Information Science & Technology


Megan Sun, Data Mining Analyst, Marketing Department at Genworth Financial

I have 6 years of experience using SAS and other statistical software to conduct academic and business projects. I started using SPM to build predictive models in May 2014. Our team mainly uses SPM TreeNet to build models for direct mail campaigns. I think the SPM software (Salford Predictive Modeler) is S.P.M.: SMART, PRODUCTIVE and MANAGEABLE.

Megan Sun, Data Mining Analyst,
Marketing Department at Genworth Financial


Brian Griner, Chief Methodologist at Quintiles

The Salford Predictive Modeler Software Suite
Great product! Very easy to test different models, compare results and export code to score a database.

 Brian Griner, Chief Methodologist at Quintiles
New York, USA


Jim Kenyon, Director of Operations at Optimization Group

We use SPM because it lets us quickly and easily build predictive models that produce useful and usable results for our clients.

 Jim Kenyon, Director of Operations at Optimization Group
Ann Arbor, MI, USA


Bill Heavlin, Advanced Micro Devices, Inc.

MARS brings a new generation of statistical modeling technology to industrial statistics. MARS models are much more flexible than conventional response surface methods. The output is much more visual and has proven to be a source of insights in presentations to engineers. Finally, the Windows-type GUI opens the door to training engineers to use the analysis software effectively.

 Bill Heavlin, Advanced Micro Devices, Inc.


David Broadhurst, Assistant Professor of Biostatistics at University of Alberta

Salford Systems provides a fast and effective solution to many complex multivariate classification/regression tasks. It is particularly effective in isolating influential features in 'omics-based data sets (proteomics, metabolomics, etc.). Although there are open source versions of much of the underlying mechanics, Salford Systems have provided a very thorough interface which greatly shortens the learning curve for a set of very powerful algorithms. I'm a particular fan of MARS :-)

 David Broadhurst, Assistant Professor of Biostatistics at University of Alberta
Edmonton, Canada Area


Herb Edelstein, President, Two Crows Data Mining Consultancy

For years, I have been predicting that MARS would be one of the hottest algorithms and it will be. MARS addresses some shortcomings of decision trees, and it does so in a fairly elegant fashion.

 Herb Edelstein, President, Two Crows Data Mining Consultancy


Richard DeVeaux, Williams College

MARS is in many cases both more accurate and much faster than neural nets.

Richard DeVeaux, Williams College


Sadi Eserce, Senior Analyst at Chadwick Martin Bailey

I recommend this product.

 Sadi Eserce, Senior Analyst at Chadwick Martin Bailey 
Greater Boston Area


Thomas Brauch, Marketing Manager, Data Driven Marketing Department, Fireman's Fund Insurance

MARS is an essential tool for any data miner. It finds significant effects in complex data structures where other methods simply fail. I use it as both a stand alone solution and as a transformation tool for simpler modeling techniques.

Thomas Brauch, Marketing Manager, Data Driven Marketing Department, Fireman's Fund Insurance


Wayne Danter, University of Western Ontario

The MARS interface is smooth, intuitive and worked well. I think you have hit another home run with this data mining and modeling tool. I look forward to using it in a number of medical research projects. Also, I very much appreciate the outstanding customer support I have received.

Wayne Danter, University of Western Ontario


Broadband propensity project: comparing TreeNet with Enterprise Miner using logistic regression.

We’re seeing these benefits:
1. TreeNet's method (stochastic gradient boosting) injects randomization into the selection of candidate predictors and training data, making it much more robust than traditional statistical models, especially in dealing with messy data. For example, part of the information in our dataset is missing, such as customers’ portfolio and usage data. Although we impute the missing values for the statistical models, the gaps still affect the final results because those models use the whole dataset for training. With TreeNet, only a subset of the data and predictors is used each time, and this process is repeated hundreds of times. This greatly reduces the influence of messy data and improves the robustness of the final model. In terms of the nature of the modeling, growing a large number of small trees instead of using a single complex tree has proved to be more accurate and robust.
Our modeling datasets are always large. In this example, the data size is above 500,000 and the initial predictor set contains about 160 predictors. TreeNet is computationally efficient and scalable for large datasets, and it is much faster than Enterprise Miner. In the result analysis, the detailed relationships between predictors and the target are much easier to visualize. Battery automates the process of running multiple experiments, which greatly reduces the effort of predictor selection. In this example, 16 predictors were finally selected after 5 cycles and 2 battery processes.
2. Insights: TreeNet can dig out very granular information. For example, it helps to find the impact of a specific sector of predictors. In this example, predictors relating to product holding and usage are emphasized in the TreeNet model, while they are not prominent in the traditional logistic regression model. Information in these sectors contributed a lot to improving the prediction of customers’ propensity to buy our fixed broadband products.
3. We’re seeing various levels of performance gains over traditional statistical models. In the best case we’ve seen, the improvement in lift is consistently about 40%, helping to capture more than 30% of customers who are willing to buy our product.

Predictive Analytics Manager at Leading Telco in Singapore.
**Her team works on scientific marketing initiatives using statistics, data science and optimization methodologies.
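
The randomization described in this testimonial (each small tree sees only a random subset of the records and of the predictors, and hundreds of such trees are added together) can be illustrated with a minimal sketch. This is not TreeNet's implementation: the function names `fit_stump`, `boost` and `predict`, the parameter names, the one-split trees and the toy data are all assumptions made for illustration, and the loss is plain squared error.

```python
import random

def fit_stump(X, residuals, rows, features):
    """Best one-split tree on the sampled rows, searching only the sampled features."""
    best = None
    for j in features:
        values = sorted({X[i][j] for i in rows})
        for lo, hi in zip(values, values[1:]):
            threshold = (lo + hi) / 2
            left = [residuals[i] for i in rows if X[i][j] <= threshold]
            right = [residuals[i] for i in rows if X[i][j] > threshold]
            if not left or not right:
                continue
            left_mean = sum(left) / len(left)
            right_mean = sum(right) / len(right)
            sse = (sum((r - left_mean) ** 2 for r in left)
                   + sum((r - right_mean) ** 2 for r in right))
            if best is None or sse < best[0]:
                best = (sse, j, threshold, left_mean, right_mean)
    return best[1:] if best else None

def boost(X, y, n_trees=100, learn_rate=0.1, row_fraction=0.5, n_features=1, seed=0):
    """Stochastic gradient boosting for squared error (illustrative defaults)."""
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    base = sum(y) / n                      # start from the mean response
    pred = [base] * n
    stumps = []
    for _ in range(n_trees):
        residuals = [y[i] - pred[i] for i in range(n)]
        rows = rng.sample(range(n), max(2, int(row_fraction * n)))  # random records
        features = rng.sample(range(p), n_features)                 # random predictors
        stump = fit_stump(X, residuals, rows, features)
        if stump is None:
            continue
        j, t, left_mean, right_mean = stump
        stumps.append(stump)
        for i in range(n):                 # small, shrunken update to every prediction
            pred[i] += learn_rate * (left_mean if X[i][j] <= t else right_mean)
    return base, learn_rate, stumps

def predict(model, row):
    base, learn_rate, stumps = model
    return base + sum(learn_rate * (lm if row[j] <= t else rm)
                      for j, t, lm, rm in stumps)

# Toy data (made up): the response depends on predictor 0 only; predictor 1 is noise.
data_rng = random.Random(1)
X = [[data_rng.random(), data_rng.random()] for _ in range(120)]
y = [3.0 if x[0] > 0.5 else 1.0 for x in X]
model = boost(X, y)
print(predict(model, [0.9, 0.2]), predict(model, [0.1, 0.2]))
```

Because every tree is fit to a different sample and contributes only a small step, no single messy record or predictor dominates the final model, which is the robustness argument made above.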


David Vogel, CEO Voloridge Investment Management and Captain of the winning Heritage Health prize team

I have tried multiple versions of gradient boosting I could find, including popular open source versions, and TreeNet outperforms them all in predictive accuracy (consistently across many different kinds of data sets) while maintaining the ability to train models quickly.

David Vogel, CEO Voloridge Investment Management and Captain of the winning Heritage Health prize team
Florida, USA


 

Brad Turner, Vice President of Marketing and Business Development, Inkiru

Every day, the Inkiru product predicts sales for 2000 items in an e-commerce context. In addition, the product generates a customized confidence interval for each prediction. The input is dynamic and consists of 1 year of historical data. Each record contains approximately 150 features with information about sales, products, customers, and promotions.

The problem was very challenging from a modeling point of view. Important parts of the data were continuous, categorical, highly non-linear, sparse, missing, and noisy. We found Salford Systems adequate to deal with these characteristics of the data.

Precision was an important goal in this project. A validation with real data reports 90% of the predictions lying within 7 units of the actual sales and 50% within 2 units. Salford Systems was definitely an important tool to reach this degree of accuracy in the product.

 Brad Turner, Vice President of Marketing and Business Development, Inkiru
California, USA


Andrew Russo, Vice President, Modeling and Analytics at AccuData Integrated

As a traditional modeler, I had been primarily using regression and logistic regressions. I began to test TreeNet last fall. Since then I have built several models that are now market-tested and are performing as predicted by TreeNet. The real value of TreeNet has been the speed in which it builds data, the accuracy of its predictions and the incremental lift it is experiencing in side-by-side tests of regressions. It has also proven to be a tremendous data prep time saver in its ability to deal with outliers, missing data as well as doing a decent job distinguishing between scale and categorical data. Importantly, the ability for less-hands-on model builds has enabled us to offer new modeling products to our clients that otherwise would not have had the budget to do a modeling project. In short this new, advanced capability is giving my company a competitive advantage.

 Andrew Russo, Vice President, Modeling and Analytics at AccuData Integrated Marketing
Florida, USA


Tom Osborn, Adjunct Professor at University of Technology

I've used TreeNet on commercial projects since '04. For customer and prospect targeting, it outperforms logistic family regression, neural nets and other methods in my kitbag. Key strengths: handling of missing values, robustness, general non-linearity, variable interactions. Clients like feedback on variable importance (more general than Shapley or PMVD). They also like seeing how the variables contribute to predictions. Fast and easy to use. Best of all, it is developed on Jerry Friedman's great maths.

 Tom Osborn, Adjunct Professor (analytics/data mining) at University of Technology, Sydney
Sydney, Australia


Fred Hazelton, Master Statistician

Predicting Crowds at Walt Disney World Theme Parks
Since 1986, the Unofficial Guide to Walt Disney World has been helping visitors to Orlando’s theme parks get the most out of their time and money. Market research shows that the two most important factors affecting a visitor’s satisfaction with a Disney trip are: 1) how long did I have to wait in line, and 2) how much did I get to see. The Unofficial Guide and its website, TouringPlans.com, have become the best source for solving these two problems.
The most effective way to reduce the amount of time you wait in line and to increase the number of attractions you get to experience is to visit at a time of year when the crowds are lower and to use an optimal, computer-designed touring plan. Touring plans are great! They tell you the optimal order in which to experience the attractions with minimal wait, a classic instance of the travelling-salesman problem. However, an optimal touring plan requires that we can predict, with reasonable accuracy, the wait time at an attraction at any given time of day, for any given day of the year.
We at TouringPlans.com have been using traditional linear regression methods to predict wait times for several years. But the limitations of regression become more apparent as we gather more and more data. Subscribers to our mobile application “Lines” can see our estimates for wait times and submit updates when they are in the park. The sporadic nature of the wait times that we gather makes them difficult to use in a traditional regression environment. The ups and downs of wait times throughout the day are difficult to model using regression but perfect for a data mining tool.

Using TreeNet
Some auxiliary variables such as Park Hours, Parade Schedules, Historic Wait Times and School Schedules are available for each wait time record in advance. These can be used in a traditional regression model to analyse the past and predict the future. But the true value of the data we gather is in its dynamic nature. Variables like current weather, attraction status (open or broken down), recent wait times and recent wait times for other attractions have a great impact on how long you will wait in line. These variables are not available for predictions in advance, and their values are not available for all records in the database. For example, not every wait time record in the database will have a recent wait time submission. TreeNet can easily handle missing data, whereas regression cannot.
In a traditional regression model, the burden of determining variable interactions is placed on the statistician, and they are usually discovered by trial and error. It is easy to see that wait time data must contain plenty of interactions with a great impact: relationships between wait times at other attractions, relationships between park hours and parade schedules, and so on. With dozens of variables, the process of identifying interactions (and transformations) is prohibitive in a traditional regression environment. In TreeNet, the search for interactions and transformations is inherent, exhaustive and automatic, a refreshing saver of time and energy that frees more resources for other tasks.

Fred Hazelton
Master Statistician


Constance Jiang, Data Analyst, Tencent, Inc.

As a data analyst in the risk management field, it is important for me to distinguish quality consumers so as to recognize and limit low-ROI transactions. We use TreeNet to build classification models and to work on regression problems. This software not only provides us with a great choice of powerful algorithms for model training, but also shows outstanding accuracy (10% better under the same circumstances) and the ability to process large datasets, such as over 100,000 records with 50 complicated variables. TreeNet is also highly productive and user-friendly; several minutes are quite enough for model training. We can now spend more time on results analysis and decision making.
TreeNet's performance is impressive and satisfying, and it can really be adapted to real scenarios to reduce the related risks.

Constance Jiang, Data Analyst, Tencent, Inc.


Salford LOGIT™

LOGIT is a comprehensive package for logistic regression analysis, providing tools for model building, model evaluation, prediction, simulation, hypothesis testing and regression diagnostics. A fast, full-featured software package, LOGIT is capable of handling an unlimited number of cases and includes special tools for discrete choice models.

View the presentation "The Hybrid CART®-Logit Model in Classification and Data Mining" by Dr. Dan Steinberg, which explains the benefits of using Hybrid CART® with Logit.

MARS - Multivariate Adaptive Regression Splines®

MARS

Automated Non-Linear Regression
MARS software is ideal for users who prefer results in a form similar to traditional regression while capturing essential nonlinearities and interactions. The MARS approach to regression modeling effectively uncovers important data patterns and relationships that are difficult, if not impossible, for other regression methods to reveal. MARS builds its model by piecing together a series of straight lines with each allowed its own slope. This permits MARS to trace out any pattern detected in the data.
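
The idea of piecing together straight lines, each with its own slope, can be sketched with hinge basis functions of the form max(0, x - knot), which is how MARS models are usually written. The sketch below is only an illustration and not Salford's implementation: the helper names `hinge` and `piecewise_linear`, the knots and the coefficients are hypothetical, and a real MARS run would choose the knots and coefficients from the data.

```python
def hinge(x, knot, direction=1):
    """Hinge basis function: max(0, x - knot) if direction is 1, max(0, knot - x) if -1."""
    return max(0.0, direction * (x - knot))

def piecewise_linear(x, intercept, terms):
    """Intercept plus a weighted sum of hinges; terms holds (coefficient, knot, direction)."""
    return intercept + sum(c * hinge(x, k, d) for c, k, d in terms)

# Hypothetical monthly-bill model: flat up to 100 minutes of usage,
# then a slope of 0.10 per minute, steepening by a further 0.15 beyond 500 minutes.
terms = [(0.10, 100.0, 1), (0.15, 500.0, 1)]
for minutes in (50, 300, 600):
    print(minutes, piecewise_linear(minutes, 20.0, terms))
```

Each hinge is zero on one side of its knot and linear on the other, so adding a term changes the slope only from that knot onward. Products of hinges in different variables give the interaction terms mentioned above.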
High-Quality Probability
The MARS model is designed to predict continuous numeric outcomes such as the average monthly bill of a mobile phone customer or the amount that a shopper is expected to spend in a web site visit. MARS is also capable of producing high quality probability models for a yes/no outcome. MARS performs variable selection, variable transformation, interaction detection, and self-testing, all automatically and at high speed.
High-Performance Results
Areas where MARS has exhibited very high-performance results include forecasting electricity demand for power generating companies, relating customer satisfaction scores to the engineering specifications of products, and presence/absence modeling in geographical information systems (GIS).

 


Product Versions

SPM® 8 Product Versions

Ultra
The best of the best. For the modeler who must have access to the most leading-edge technology available and the fastest run times, including major advances in ensemble modeling, interaction detection and automation. ULTRA also provides advance access to new features as they become available in frequent upgrades.
ProEx
For the modeler who needs cutting-edge data mining technology, including extensive automation of workflows typical for experienced data analysts and dozens of extensions to the Salford data mining engines.
Pro
A true predictive modeling workbench designed for the professional data miner. Variety of supporting conventional statistical modeling tools, programming language, reporting services, and a modest selection of workflow automation options.
Basic
Literally the basics. Salford Systems' award-winning data mining engines without extensions or automation or surrounding statistical services, programming language, and sophisticated reporting. Designed for small budgets while still delivering our world-famous engines.

Get In Touch With Us

Contact Us

9685 Via Excelencia, Suite 208, San Diego, CA 92126
Ph: 619-543-8880
Fax: 619-543-8888
info (at) salford-systems (dot) com