6e2e357206d82f87.tex
1: \begin{abstract}Inspired by the remarkable success of large neural networks, there has been significant interest in understanding the generalization performance of over-parameterized models. Substantial efforts have been invested in characterizing how optimization algorithms impact generalization through their ``preferred'' solutions, a phenomenon commonly referred to as \emph{implicit regularization}.
2:     In particular, it has been argued that gradient descent (GD) induces an implicit $\ell_2$-norm regularization in regression and classification problems.
3:     However, the implicit regularization of different algorithms is confined to either a specific geometry or a particular class of learning problems, indicating the lack of a general approach for controlling implicit regularization.
4:     To address this, we present a unified approach using mirror descent (MD), a notable generalization of GD, to control implicit regularization in both regression and classification settings.
5:     More specifically, we show that MD with the general class of homogeneous potential functions converges in direction to a \textit{generalized maximum-margin} solution for linear classification problems, thereby answering a long-standing question in the classification setting.
6:     Further, we show that MD can be implemented efficiently and enjoys fast convergence under suitable conditions. Through comprehensive experiments, we demonstrate that MD is a versatile method for producing learned models with different regularizers, which in turn exhibit different generalization performance.
7: \end{abstract}
8: