1: \begin{abstract}
2: %In this paper we propose a new framework to study the generalization property of classifier chains trained over observations associated with multiple interdependent class labels. The results are based on large deviation inequalities for Lipschitz functions of weakly dependent sequences proposed by \cite{RIO2000905}. The resulting generalization error bound involves a Rademacher type of complexity term defined over training samples where examples are not independently distributed, as well as two dependency coefficients; where the first one measures the variational distance between the probability measure of the current class in the chain conditionally to the whole training examples and their previous class labels, and the distribution of the current class. The second coefficient estimates the variational distance between the same probability measures over any subset of the training samples of different sizes. Further, the bound exhibits convergence rates that extend those proposed in the literature for the binary case.
3: In this paper, we propose a new framework to study the generalization properties of classifier chains trained on observations associated with multiple interdependent class labels. The results are based on large deviation inequalities for Lipschitz functions of weakly dependent sequences proposed by \cite{RIO2000905}. We believe that the resulting generalization error bound offers several advantages and could be adapted to other frameworks that consider interdependent outputs. First, it makes the dependencies between class labels explicit. Second, it provides insight into the effect of the chain order on the generalization performance of the algorithm. Finally, the two dependency coefficients that appear in the bound could also be used to design new strategies for choosing the order of the chain.
4: \end{abstract}
5: