\begin{abstract}
The stochastic gradient descent (SGD) algorithm is the method of choice in many machine learning tasks thanks to its scalability and efficiency in dealing with large-scale problems.
In this paper, we focus on the shuffling version of SGD, which matches the mainstream practical heuristics.
We show that shuffling SGD converges to a global solution for a class of non-convex functions under over-parameterized settings.
Our analysis employs more relaxed non-convex assumptions than those in the previous literature.
Nevertheless, we retain the desired computational complexity that shuffling SGD achieves in the general convex setting.
\end{abstract}
