abstract:6c4b8ab21bf4e7d6.tex

1: \begin{abstract}

2: %We describe a new framework for graph-based semi-supervised and active learning.

3: %Motivated by the need to address the degeneracy of canonical Laplace learning algorithms in low label rates, we reformulate graph-based semi-supervised learning as a generalization of a \emph{Trust-Region Subproblem} (TRS) in which one is asked to optimize a nonconvex quadratic over a Euclidean sphere.

4: Motivated by the need to address the degeneracy of canonical Laplace learning algorithms in low label rates, we propose to reformulate graph-based semi-supervised learning as a nonconvex generalization of a \emph{Trust-Region Subproblem} (TRS).

5: This reformulation is motivated by the well-posedness of Laplacian eigenvectors in the limit of infinite unlabeled data.

6: %, and we propose approximate and iterative algorithms that enjoy global convergence guarantees to solve it.

7: To solve this problem, we first show that a first-order condition implies the solution of a manifold alignment problem and that solutions to the classical \emph{Orthogonal Procrustes} problem can be used to efficiently find good classifiers that are amenable to further refinement. Next, we address the criticality of selecting supervised samples at low-label rates. We characterize informative samples with a novel measure of centrality derived from the principal eigenvectors of a certain submatrix of the graph Laplacian. We demonstrate that our framework achieves lower classification error compared to recent state-of-the-art and classical semi-supervised learning methods at extremely low, medium, and high label rates. Our code is available on github\footnote{anonymized for submission}.

8: \end{abstract}

9: