5df53d66524f6f2d.tex
1: \begin{abstract}
2: Classifying large-scale image data into object categories is an important problem
3: that has received increasing research attention.
4: Given the huge amount of data, non-parametric approaches such as nearest neighbor classifiers
5: have shown promising results, especially when they are underpinned by a learned distance or similarity measurement.
6: Although metric learning has been well studied in the past decades, most existing algorithms are impractical to handle
7: large-scale data sets.
8: In this paper, we present an image similarity learning method that can scale well
9: in both the number of images and the dimensionality of image descriptors.
10: To this end, similarity comparison is restricted to each sample's local neighbors and
11: a discriminative similarity measure is induced from large margin neighborhood embedding.
12: We also exploit the ensemble of projections so that high-dimensional features
13: can be processed in a set of lower-dimensional subspaces in parallel without much performance compromise.
14: The similarity function is learned online using a stochastic gradient descent algorithm in which
15: the triplet sampling strategy is customized for quick convergence of classification performance.
16: The effectiveness of our proposed model is validated on several data sets with scales varying
17: from tens of thousands to one million images. Recognition accuracies competitive with
18: the state-of-the-art performance are achieved with much higher efficiency and scalability.
19: \end{abstract}
20: