abstract:7a9b0e7cf7fba5f2.tex

1: \begin{abstract}

2: 	We analyze the clustering problem through a flexible probabilistic model that aims to identify an optimal partition on the sample $X_1,...,X_n$. We perform exact clustering with high probability using a convex semidefinite estimator that interprets as a corrected, relaxed version of $K$-means. The estimator is analyzed through a non-asymptotic framework and showed to be optimal or near-optimal in recovering the partition. Furthermore, its performances are shown to be adaptive to the problem's effective dimension, as well as to $K$ the unknown number of groups in this partition. We illustrate the method's performances in comparison to other classical clustering algorithms with numerical experiments on simulated data.

3: \end{abstract}

4: