abstract:780a29c5938048aa.tex

1: \begin{abstract}

2: Perceptron is a classic online algorithm for learning a classification function. In this paper, we provide a novel extension of the perceptron algorithm to the learning to rank problem in information retrieval. We consider popular listwise performance measures such as Normalized Discounted Cumulative Gain (NDCG) and Average Precision (AP). We propose a novel family of listwise, large margin ranking surrogates, which are adaptable to NDCG and AP measures and derive a perceptron-like algorithm using these surrogates. Exploiting a self-bounding property of the proposed surrogates, we provide a guarantee on the cumulative NDCG (or AP) induced loss incurred by our perceptron-like algorithm. We show that, if there exists a perfect oracle ranker which can correctly rank, with some margin, each instance in an online sequence, the cumulative NDCG (or AP) induced loss of perceptron algorithm on that sequence is bounded by a constant, irrespective of the length of the sequence. This result is a learning to rank analogue of Novikoff's convergence theorem for the classification perceptron. However, our perceptron like algorithm for learning to rank has two drawbacks. First, unlike classification perceptron, the prediction at each round depends on a learning rate parameter. Second, the perceptron loss bound does not match our established lower bound on the cumulative loss achievable by any deterministic online algorithm. We propose a second perceptron like algorithm which achieves the lower bound and is independent of the learning rate parameter. However, our second algorithm does not adapt to different ranking measures, does not possess the listwise property and does not perform well on real world datasets. Experiments on simulated datasets corroborate our theoretical results and demonstrate competitive performance on large industrial benchmark datasets.

3: \end{abstract}

4: