1: \begin{abstract}
2: Function-as-a-Service (FaaS) has raised a growing interest in how to
3: ``tame'' serverless computing to enable domain-specific use cases such as data-intensive applications and machine learning (ML), to name a few. Recently, several systems have been implemented for
4: training ML models. Certainly, these research articles are significant steps
5: in the correct direction. However, they do not completely answer the nagging question of when
6: serverless ML training can be more cost-effective compared to traditional ``serverful''
7: computing. To help in this endeavor, we
8: propose \mlless, a FaaS-based ML training prototype built atop
9: IBM Cloud Functions. To boost cost-efficiency, \mlless\phantom{}
10: implements two innovative optimizations tailored to the traits of serverless computing: on one hand, a significance filter, to make indirect
11: communication more effective, and on the other hand, a scale-in auto-tuner, to reduce cost by benefiting from the FaaS
12: sub-second billing model (often per $100$ms). Our results certify that \mlless~can be $15$X faster than
13: serverful ML systems~\cite{pytorch} at a lower cost for sparse ML models that exhibit fast convergence such as sparse logistic regression and matrix
14: factorization. Furthermore, our results show that \mlless~can easily scale out to increasingly large fleets~of serverless workers.
15: \end{abstract}
16: