1: \begin{abstract} It has been widely recognized that the $0/1$-loss function is one of the most natural choices for modelling classification errors,
2: and it has a wide range of applications including support vector machines and $1$-bit compressed sensing.
3: Due to the combinatorial nature of the $0/1$ loss function, methods based on convex relaxations or smoothing approximations have dominated the existing research and are often able to provide approximate solutions of good quality.
4: However, those methods are not optimizing the $0/1$ loss function directly and hence no optimality has been established for the original problem.
5: This paper aims to study the optimality conditions of the $0/1$ function minimization, and for the first time to develop Newton's method that directly optimizes the $0/1$ function with a local quadratic convergence under reasonable conditions. Extensive numerical experiments demonstrate its superior performance as one would expect from Newton-type methods.
6:
7: \vspace{3mm}
8: \noindent{\bf \textbf{Keywords}:}
9: $0/1$ loss function, P-stationarity, Newton's method, locally quadratic convergence, superior numerical performance
10:
11: \vspace{3mm}
12: \noindent{\bf \textbf{Mathematical Subject Classification}:} 49M05 $\cdot$ 90C26 $\cdot$ 90C30 $\cdot$ 65K05
13: \end{abstract}
14: