In an embodiment, the tree-based model 418 is a random forests model. Random forests is a class of ensemble methods used for classification problems. Random forests models work by fitting an ensemble of decision tree classifiers on sub samples of the data. Each tree only sees a portion of the data, drawing samples of equal size with replacement. Each tree can use only a limited number of features. By averaging the output of classification across the ensemble, the random forests model can limit over-fitting that might otherwise occur in a decision tree model.
In an embodiment, the tree-based model 418 uses the random forests model in Python's scikit-learn. In an illustrative embodiment, the tree-based model 418 uses the following parameters in the scikit-learn random forests model: