abstract:488fef09a36a9f1c.tex

1: \begin{abstract}

2: With an eye

3: toward understanding complexity control in deep learning, we study how infinitesimal regularization or gradient descent optimization lead to margin maximizing solutions in both homogeneous and {\em non homogeneous} models, extending previous work that focused on infinitesimal regularization only in homogeneous models.  To this end we study the limit of loss minimization with a diverging norm constraint (the ``constrained path''), relate it to the limit of a ``margin path'' and characterize the resulting solution.  For non-homogeneous ensemble models, which output is a sum of homogeneous sub-models, we show that this solution discards the shallowest sub-models if they are unnecessary. For homogeneous models, we show convergence to a ``lexicographic max-margin solution'', and provide conditions under which max-margin solutions are also attained as the limit of unconstrained gradient descent.

4: \end{abstract}

5: