37349fa8d54614f5.tex
1: \begin{abstract}
2: We present the \textit{Structured Weighted Violations Perceptron (SWVP)} algorithm, 
3: a new structured prediction algorithm 
4: that generalizes the Collins Structured Perceptron (CSP, \cite{CollinsPerceptron}).
5: Unlike CSP, the update rule of SWVP explicitly 
6: exploits the internal structure of the predicted labels. 
7: We prove the convergence of SWVP for linearly separable training sets, 
8: %SWVP converges to a weight 
9: %vector that separates the data, under certain conditions on the parameters of the algorithm. 
10: provide mistake and generalization bounds, 
11: %on: (a) the number of updates in the separable case; 
12: %(b) mistakes in the non-separable case; and (c) the probability to misclassify 
13: %an unseen example (generalization), 
14: and show that in the general case these bounds are tighter than those of the CSP special case.
15: In synthetic data experiments with data drawn from an HMM, various variants of SWVP 
16: substantially outperform its CSP special case. 
17: SWVP also provides encouraging initial dependency parsing results.
18: \end{abstract}
19: