\begin{abstract}
%Novel scalable and robust algorithms are required to successfully address modern statistical challenges that arise when it is necessary to process large volumes of data contaminated by outliers. 
This paper presents new algorithms for distributed statistical estimation that can take advantage of the divide-and-conquer approach.
We show that one of the key benefits attained by an appropriate divide-and-conquer strategy is robustness, an important characteristic of large distributed systems.
We introduce a class of algorithms based on the properties of the geometric median, establish connections between the performance of these distributed algorithms and rates of convergence in normal approximation, and provide tight deviation guarantees for the resulting estimators in the form of exponential concentration inequalities.

Our techniques are illustrated through several examples: in particular, we obtain new results for the median-of-means estimator and provide performance guarantees for robust distributed maximum likelihood estimation.

%We establish  that are based on the properties of the geometric median , In particular, our results imply that it is often possible to preserve optimal estimation rates desp
%Our techniques apply to a variety of popular methods, including regression and Maximum Likelihood Estimation.

\end{abstract}