ebd54d085b67333a.tex
1: \begin{abstract}
2: 	We consider a problem of multiclass classification, where the training sample 
3: 	\break $S_n =  \{(X_i, Y_i)\}_{i=1}^n$ is generated from the model $\p(Y = m | 
4: 	X = x) = \eta_m(x)$, $1 \leq m \leq M$, and $\eta_1(x), \dots, \eta_M(x)$ are 
5: 	unknown $\alpha$-Holder continuous functions.
6: 	Given a test point $X$, our goal is to predict its label.
7: 	A widely used $\mathsf k$-nearest-neighbors classifier constructs estimates of 
8: 	$\eta_1(X), \dots, \eta_M(X)$ and uses a plug-in rule for the prediction.
9: 	However, it requires a proper choice of the smoothing parameter $\mathsf k$, which 
10: 	may become tricky in some situations.
11: 	We fix several integers $n_1, \dots, n_K$, compute corresponding 
12: 	$n_k$-nearest-neighbor estimates for each $m$ and each $n_k$ and apply an 
13: 	aggregation procedure.
14: 	We study an algorithm, which constructs a convex combination of these estimates 
15: 	such that the aggregated estimate behaves approximately as well as an oracle 
16: 	choice.
17: 	We also provide a non-asymptotic analysis of the procedure, prove its adaptation to 
18: 	the unknown smoothness parameter $\alpha$ and to the margin and establish rates 
19: 	of convergence under mild assumptions.	
20: \end{abstract}
21: