15911892dd52265d.tex
1: \begin{abstract}
2: We consider point clouds obtained as random samples of a measure on a Euclidean domain. A graph representing the point cloud is obtained by assigning weights to edges based on the distance between the points they connect.
3: Our goal is to develop mathematical tools needed to study the consistency, as the number of available data points increases, of graph-based machine learning algorithms for tasks such as clustering.
4: %We develop mathematical tools for answering when do the minimizers of graph-based functionals describing tasks such as clustering  converge, as the number of data points increases, to a minimizer of a limiting functional. 
5: In particular, we study when the cut capacity, and more generally the total variation, on these graphs is a good approximation of the perimeter (total variation) in the continuum setting.
6: %, as the number of data points tends to infinity.  
7: We address this question in the setting of $\Gamma$-convergence.
8: We obtain almost optimal conditions on how the size of the neighborhood over which points are connected by an edge must scale, as the number of points increases, for the $\Gamma$-convergence to hold.
9: Taking the limit is enabled by a transportation-based metric which allows one to suitably compare functionals defined on different point clouds.
10: \end{abstract}
11: