b521a5b9fa97598f.tex
1: \begin{abstract}
2: In many applications of supervised learning, multiple classification
3: or regression outputs have to be predicted jointly. We consider
4: several extensions of gradient boosting to address such problems. We
5: first propose a straightforward adaptation of gradient boosting
6: exploiting multiple output regression trees as base learners. We
7: then argue that this method is only expected to be optimal when the
8: outputs are fully correlated, as it forces the partitioning induced
9: by the tree base learners to be shared by all outputs. We then
10: propose a novel extension of gradient tree boosting to specifically
11: address this issue. At each iteration of this new method, a
12: regression tree structure is grown to fit a single random projection
13: of the current residuals and the predictions of this tree are fitted
14: linearly to the current residuals of all the outputs,
15: independently. Because of this linear fit, the method can adapt
16: automatically to any output correlation structure. Extensive
17: experiments are conducted with this method, as well as other
18: algorithmic variants, on several artificial and real
19: problems. Randomly projecting the output space is shown to provide a
20: better adaptation to different output correlation patterns and is
21: therefore competitive with the best of the other methods in most
22: settings. Thanks to model sharing, the convergence speed is also
23: improved, reducing the computing times (or the complexity of the
24: model) to reach a specific accuracy.
25: 
26: % \keywords{Gradient boosting \and Multi-label learning \and Multi-output regression
27: % \and Random projection} % disqble
28: 
29: \end{abstract}