1: \begin{abstract}Inspired by the remarkable success of large neural networks, there has been significant interest in understanding the generalization performance of over-parameterized models. Substantial efforts have been invested in characterizing how optimization algorithms impact generalization through their ``preferred'' solutions, a phenomenon commonly referred to as \emph{implicit regularization}.
2: In particular, it has been argued that gradient descent (GD) induces an implicit $\ell_2$-norm regularization in regression and classification problems.
3: However, the implicit regularization of different algorithms are confined to either a specific geometry or a particular class of learning problems, indicating a gap in a general approach for controlling the implicit regularization.
4: To address this, we present a unified approach using mirror descent (MD), a notable generalization of GD, to control implicit regularization in both regression and classification settings.
5: More specifically, we show that MD with the general class of homogeneous potential functions converges in direction to a \textit{generalized maximum-margin} solution for linear classification problems, thereby answering a long-standing question in the classification setting.
6: Further, we show that MD can be implemented efficiently and enjoys fast convergence under suitable conditions. Through comprehensive experiments, we demonstrate that MD is a versatile method to produce learned models with different regularizers, which in turn have different generalization performances.
7: \end{abstract}
8: