\begin{abstract}
%Alternating direction method of multipliers~(\mbox{ADMM}) has been widely used in many applications due to its promising performance to solve complex regularization problems and large-scale distributed optimization problems. Stochastic ADMM, which visits only one sample or a mini-batch of samples each time, has recently been proved to achieve better performance than batch ADMM.
Most stochastic variants of the alternating direction method of multipliers~(\mbox{ADMM}) achieve a convergence rate slower than $O(1/T)$ on general convex problems, where $T$ is the number of iterations. Hence, these methods are not scalable in terms of convergence rate~(computation cost). Only one existing stochastic method, called \mbox{SA-ADMM}, achieves a convergence rate of $O(1/T)$ on general convex problems. However, \mbox{SA-ADMM} needs extra memory to store the historical gradients on all samples, and thus it is not scalable in terms of storage cost. In this paper, we propose a novel method, called \underline{sca}lable \underline{s}tochastic \mbox{ADMM}~(\mbox{SCAS-ADMM}), for large-scale optimization and learning problems. Without storing the historical gradients on all samples, \mbox{SCAS-ADMM} achieves the same $O(1/T)$ convergence rate on general convex problems as \mbox{SA-ADMM}, the best existing stochastic method, and as batch \mbox{ADMM}. Experiments on graph-guided fused lasso show that \mbox{SCAS-ADMM} can achieve state-of-the-art performance in real applications.
\end{abstract}