8805704d5486fbd8.tex
1: \begin{abstract}
2: Distributed optimization algorithms are widely used in many industrial machine learning
3: applications. 
4: However choosing the appropriate algorithm and cluster size is often difficult for users
5: as the performance and convergence rate of optimization algorithms vary with the size of the cluster.
6: In this
7: paper we make the case for an ML-optimizer that can select the appropriate algorithm and cluster size to use
8: for a given problem. To do this we propose building two models: one that captures the system level
9: characteristics of how computation, communication change as we increase cluster sizes and another
10: that captures how convergence rates change with cluster sizes. We present preliminary results
11: from our prototype implementation called Hemingway and discuss some of the challenges involved in
12: developing such a system.
13: \end{abstract}