
| Scalability |
MARS requires that all training data reside in RAM, so the larger the data set to be analyzed, the larger the RAM needed to analyze it. The exact amount of RAM required will vary from problem to problem. The table below is intended as a guide for the maximum number of candidate predictor variables that can be specified in a MARS analysis for the given sample size and amount of RAM workspace:
Rule of Thumb for Calculating Required RAMA rule of thumb that you can also use for calculating the needed RAM for your data set is to multiply the data set size by a factor of 3 to 4. For example, if your data set is 10 megabytes, MARS potentially requires 40 megabytes of RAM for the analysis. Increasing the Number of Variables MARS Can HandleIf you have a very large list of potential predictors, CART can be used first to extract the most important variables. MARS can then focus on the top variables from the CART model, enabling you to fit larger problem sizes into smaller workspaces and resulting in faster analyses and more accurate and robust models. |