b8639e91b78c51dd.tex
1: \begin{abstract}
2: Gradient boosted decision trees are a popular machine learning technique,
3: in part because of their ability to give good accuracy with small models.
4: We describe two extensions to the standard tree boosting algorithm designed
5: to increase this advantage. The first improvement extends the boosting
6: formalism from scalar-valued trees to vector-valued trees. This allows
7: individual trees to be used as multiclass classifiers, rather than requiring
8: one tree per class, and drastically reduces the model size required for multiclass
9: problems. We also show that some other popular vector-valued gradient boosted
10: trees modifications fit into this formulation and can be easily obtained in our implementation. The second extension, layer-by-layer boosting,
11: takes smaller steps in function space, which is empirically shown to lead to a faster convergence and to a
12: more compact ensemble. We have added both improvements to the open-source TensorFlow Boosted
13: trees (TFBT) package, and we demonstrate their efficacy on a variety of multiclass datasets. We expect these extensions will be of particular interest
14: to boosted tree applications that require small models, such as embedded devices,
15: applications requiring fast inference, or applications desiring more interpretable
16: models.
17: \end{abstract}
18: