e9d9a1cbdbbe04ae.tex
1: \begin{abstract}
2: Function-as-a-Service (FaaS) has raised a growing interest in how to 
3: ``tame'' serverless computing to enable domain-specific use cases such as data-intensive applications and machine learning (ML), to name a few. Recently, several systems have been implemented for 
4: training ML models. Certainly, these research articles are significant steps 
5: in the correct direction. However, they do not completely answer the nagging question of when
6: serverless ML training can be more cost-effective compared to traditional ``serverful''
7: computing. To help in this endeavor, we 
8: propose \mlless, a FaaS-based ML training prototype built atop
9: IBM Cloud Functions. To boost cost-efficiency, \mlless\phantom{} 
10: implements two innovative optimizations tailored to the traits of serverless computing: on one hand, a significance filter, to make indirect
11: communication more effective,  and on the other hand, a scale-in auto-tuner, to reduce cost by benefiting from the FaaS
12: sub-second billing model (often per $100$ms). Our results certify that \mlless~can be $15$X faster than 
13:  serverful ML systems~\cite{pytorch} at a lower cost for sparse ML models that exhibit fast convergence such as sparse logistic regression and matrix
14: factorization. Furthermore, our results show that \mlless~can easily scale out to increasingly large fleets~of serverless workers.
15: \end{abstract}
16: