53539d5d0768b2cf.tex
1: \begin{abstract}
2: 
3: Edge Computing (EC) consists in deploying computational resources (CPU, memory, storage) into the edge of the network (base stations, central offices, access points), thus constituting a “cloud at the edge”. Most of the related work examines the performance of a single generic service running in the Edge. In our vision, instead, EC is a multi-tenant environment: the resource owner, i.e., the Network Operator (NO), virtualizes the resources and let third party Service Providers (SPs - tenants) run their services, which can be diverse and with heterogeneous requirements. The NO allocates the available limited resources to SPs, in order to satisfy their requirements and to optimize a global utility function. Due to confidentiality guarantees, the NO cannot observe the traffic and the data of SPs. Therefore, the NO takes allocation decisions just based on observed monitoring information. The goal of this paper is to optimize one ressource (e.g. cache) allocation using purely data-driven strategie, leveraging state-of-the-art AI technique e.g. Reinforcement Learning (RL). We analytically study its convergence properties and evaluate it in simulation. Starting from a simple Q-Learning algorithm, we highlight the impact of several learning techniques e.g. $\epsilon$ Stretched Exponential Decay and Experience Replay. 
4: \end{abstract}
5: