1: \begin{abstract}
2: While significant progress has been made separately on analytics systems
3: for scalable stochastic gradient descent (SGD) and private SGD,
4: none of the major scalable analytics frameworks have incorporated
5: differentially private SGD.
6: There are two inter-related issues for this disconnect between research
7: and practice: (1) low model accuracy due to added noise to guarantee privacy,
8: and (2) high development and runtime overhead of the private algorithms.
9: This paper takes a first step to remedy this disconnect and
10: proposes a private SGD algorithm to address {\em both} issues
11: in an integrated manner.
12: In contrast to the white-box approach adopted by previous work,
13: we revisit and use the classical technique of {\em output perturbation} to
14: devise a novel ``bolt-on'' approach to private SGD.
15: While our approach trivially addresses (2), it makes (1) even more challenging.
16: We address this challenge by providing a novel analysis of the $L_2$-sensitivity of SGD,
17: which allows, under the same privacy guarantees, better convergence of SGD
18: when only a constant number of passes can be made over the data.
19: We integrate our algorithm, as well as other state-of-the-art differentially private SGD,
20: into \Bismarck{}, a popular scalable SGD-based analytics system on top of an RDBMS.
21: Extensive experiments show that our algorithm can be easily integrated, incurs virtually
22: no overhead, scales well, and most importantly, yields substantially better (up to 4X)
23: test accuracy than the state-of-the-art algorithms on many real datasets.
24: \end{abstract}
25: