1: \begin{abstract}
2:
3: Developing a differentially private deep learning algorithm is challenging, due to the difficulty in analyzing the {\it{sensitivity}} of objective functions that are typically used to train deep neural networks.
4: %
5: Many existing methods resort to the stochastic gradient descent (SGD) algorithm and apply
6: a {\it{pre-defined}} sensitivity to the gradients to privatize the weights.
7: %
8: However, their slow convergence
9: typically yields a high cumulative privacy loss.
10: Here, we take a different route by employing the {\it{method of auxiliary coordinates}}, which allows us to independently update the weights per layer by optimizing a {\it{per-layer}} objective function.
11: %
12: This objective function can be well approximated by a low-order Taylor expansion, for which sensitivity analysis becomes tractable.
13: %
14: We perturb the coefficients of the expansion for privacy and then optimize the perturbed objective using optimization routines more advanced than SGD, which converge faster.
15: We empirically show that our algorithm yields a trained model of decent quality under a modest privacy budget.\footnote{This version of the manuscript fixes an implementation error in the code that produced the results we presented at the PPML2018 workshop. For details on the changes we made, see the Appendix.}
16: \end{abstract}
17: