abstract:73e6b1d001f751c5.tex

1: \begin{abstract}

2: 		We introduce a novel interpretable and tree-based algorithm for prediction in a regression setting in which each tree in a classical random forest is replaced by a family of planted trees that grow simultaneously.

3: 		The motivation for our algorithm is to estimate the unknown regression function from a functional ANOVA decomposition perspective, where

4: 		each tree corresponds to a function within that decomposition.

5: 		Therefore, planted trees are limited in the number of interaction terms.

6: 		The maximal order of approximation in the ANOVA decomposition can be specified or left unlimited. If a first order approximation is chosen, the result is an additive model. In the other extreme case, if the order of approximation is not limited, the resulting model puts no restrictions on the form of the regression function.

7: 		In a simulation study we find encouraging prediction and visualisation properties of our  random planted forest method.

8: 		%; this is  especially the case in sparse settings and in cases where the regression functions have jumps.

9: 		We also develop theory for an idealised version of random planted forests in the case of an underlying additive model.

10: 		We show that in the additive case, the idealised version achieves up to a logarithmic factor asymptotically optimal one-dimensional convergence rates of order $n^{-2/5}$. \footnote{Code is available on on GitHub: \url{https://github.com/PlantedML/Planted_Forest}.}

11: 	\end{abstract}

12: