db11b87baf253a23.tex
1: \begin{abstract}
2: Graph-based representations of point-cloud data are widely used in data science and machine learning, including $\epsilon$-graphs that contain edges between pairs of data points that are nearer than $\epsilon$ and kNN-graphs that connect each point to its $k$ nearest neighbors.
3: %
4: Recently, topological data analysis has emerged as a family of mathematical and computational techniques to investigate topological features of data using simplicial complexes. These are a higher-order generalization of graphs and many techniques such as Vietoris-Rips (VR) filtrations are  also parameterized by a distance $\epsilon$.
5: %
6: Here, we develop kNN complexes as a generalization of kNN graphs, leading to kNN-based  persistent homology techniques for which we develop stability and convergence results.
7: We apply this technique to characterize the convergence properties  PageRank, highlighting how the perspective of discrete topology complements traditional geometrical-based analyses of convergence. Specifically, we show that   convergence of relative positions (i.e., ranks) is captured by kNN persistent homology, whereas  persistent homology with VR filtrations   coincides with vector-norm convergence.
8: %In doing so, our work embarks on a new interface between TDA and the analysis of data-science algorithms.
9: Beyond PageRank, kNN-based persistent homology is expected to be useful to other data-science applications in which the relative positioning of data points is more important than their precise locations.
10: 
11: 
12: \end{abstract}
13: