0ddb4377bae575ae.tex
1: \begin{abstract}
2: Automating physical database design has remained a long-term interest in database research due to substantial performance gains afforded by optimised structures. Despite significant progress, a majority of today's commercial solutions are highly manual, requiring offline invocation by database administrators \linebreak[4] (DBAs) who are expected to identify and supply representative training workloads. Even the latest advancements like query stores provide only limited support for dynamic environments. This status quo is untenable: identifying \emph{representative} static workloads is no longer realistic; and physical design tools remain susceptible to the query optimiser's cost misestimates. \malinga{Furthermore, modern application environments such as hybrid transactional and analytical processing (HTAP) systems  render analytical modelling next to impossible.}
3: 
4: We propose a self-driving approach to online index selection that eschews the DBA and query optimiser, and instead \emph{learns} the benefits of viable structures through strategic exploration and direct performance observation. We view the problem as one of sequential decision making under uncertainty, specifically within the bandit learning setting. 
5: Multi-armed bandits balance exploration and exploitation to \emph{provably} guarantee average performance that converges to policies that are optimal with perfect hindsight.   \malinga{Our comprehensive empirical evaluation against a state-of-the-art commercial tuning tool demonstrates up to 75\% speed-up on shifting and ad-hoc workloads and up to 28\% speed-up on static workloads in analytical processing environments. In HTAP environments, our solution provides up to 59\% speed-up on shifting and 51\% speed-up on static workloads. Furthermore, 
6: our bandit framework outperforms deep reinforcement learning (RL) in terms of convergence speed and performance volatility (providing up to 58\% speed-up).}
7: \end{abstract}
8: