abstract:e9d9a1cbdbbe04ae.tex

1: \begin{abstract}

2: Function-as-a-Service (FaaS) has raised a growing interest in how to

3: ``tame'' serverless computing to enable domain-specific use cases such as data-intensive applications and machine learning (ML), to name a few. Recently, several systems have been implemented for

4: training ML models. Certainly, these research articles are significant steps

5: in the correct direction. However, they do not completely answer the nagging question of when

6: serverless ML training can be more cost-effective compared to traditional ``serverful''

7: computing. To help in this endeavor, we

8: propose \mlless, a FaaS-based ML training prototype built atop

9: IBM Cloud Functions. To boost cost-efficiency, \mlless\phantom{}

10: implements two innovative optimizations tailored to the traits of serverless computing: on one hand, a significance filter, to make indirect

11: communication more effective,  and on the other hand, a scale-in auto-tuner, to reduce cost by benefiting from the FaaS

12: sub-second billing model (often per $100$ms). Our results certify that \mlless~can be $15$X faster than

13:  serverful ML systems~\cite{pytorch} at a lower cost for sparse ML models that exhibit fast convergence such as sparse logistic regression and matrix

14: factorization. Furthermore, our results show that \mlless~can easily scale out to increasingly large fleets~of serverless workers.

15: \end{abstract}

16: