835cb66f4cfdfce8.tex
1: \begin{abstract}
2:   This paper proposes \textit{Bayesian optimization augmented factoring self-scheduling} (BO FSS), a new parallel loop scheduling strategy.
3:   BO FSS is an automatic tuning variant of the factoring self-scheduling (FSS) algorithm and is based on Bayesian optimization (BO), a black-box optimization algorithm.
4:   Its core idea is to automatically tune the internal parameter of FSS by solving an optimization problem using BO.
5:   The tuning procedure only requires online execution time measurement of the target loop.
6:   In order to apply BO, we model the execution time using two Gaussian process (GP) probabilistic machine learning models.
7:   Notably, we propose a \textit{locality-aware GP} model, which assumes that the temporal locality effect resembles an exponentially decreasing function.
8:   By accurately modeling the temporal locality effect, our locality-aware GP model accelerates the convergence of BO.
9:   We implemented BO FSS on the GCC implementation of the \code{OpenMP} standard and evaluated its performance against other scheduling algorithms. 
10:   Also, to quantify our method's performance variation on different workloads, or \textit{workload-robustness} in our terms, we measure the \textit{minimax regret}. 
11:   According to the minimax regret, BO FSS shows more consistent performance than other algorithms.
12:   Within the considered workloads, BO FSS improves the execution time of FSS by as much as 22\% and 5\% on average. 
13: \end{abstract}
14: