abstract:b521a5b9fa97598f.tex

1: \begin{abstract}

2: In many applications of supervised learning, multiple classification

3: or regression outputs have to be predicted jointly. We consider

4: several extensions of gradient boosting to address such problems. We

5: first propose a straightforward adaptation of gradient boosting

6: exploiting multiple output regression trees as base learners. We

7: then argue that this method is only expected to be optimal when the

8: outputs are fully correlated, as it forces the partitioning induced

9: by the tree base learners to be shared by all outputs. We then

10: propose a novel extension of gradient tree boosting to specifically

11: address this issue. At each iteration of this new method, a

12: regression tree structure is grown to fit a single random projection

13: of the current residuals and the predictions of this tree are fitted

14: linearly to the current residuals of all the outputs,

15: independently. Because of this linear fit, the method can adapt

16: automatically to any output correlation structure. Extensive

17: experiments are conducted with this method, as well as other

18: algorithmic variants, on several artificial and real

19: problems. Randomly projecting the output space is shown to provide a

20: better adaptation to different output correlation patterns and is

21: therefore competitive with the best of the other methods in most

22: settings. Thanks to model sharing, the convergence speed is also

23: improved, reducing the computing times (or the complexity of the

24: model) to reach a specific accuracy.

25:

26: % \keywords{Gradient boosting \and Multi-label learning \and Multi-output regression

27: % \and Random projection} % disqble

28:

29: \end{abstract}