7cf2259727540f01.tex
1: \begin{abstract}
2: 
3: We consider a service system model primarily
4: motivated by the problem of efficient assignment of virtual machines to physical host machines in 
5: a network cloud, so that the number of occupied hosts is minimized.
6: 
7: There are multiple  types of  arriving 
8: customers,  where a customer's mean service time  depends 
9: on its type. 
10: There is an infinite number of servers. 
11: Multiple customers can be placed for service into 
12: one server, subject to general ``packing'' constraints.
13: Service times of different customers are independent, even if served simultaneously
14: by the same server.  
15: Each new arriving customer is placed for service immediately, either
16:  into a server already serving other customers 
17: (as long as packing constraints are not violated)
18: or into an idle server. 
19: After a service completion, each customer leaves its server and the system.
20: 
21: We propose an extremely simple and easily implementable
22: customer placement algorithm, called {\em Greedy-Random} (GRAND). 
23: It places each arriving customer uniformly at random into either one of the already occupied servers (subject to packing constraints)
24: or one of the so-called {\em zero-servers}, which are empty servers designated to be available to new arrivals.
25: One instance of GRAND, called GRAND($aZ$), where $a\ge 0$ is a parameter, is such that the number of zero-servers
26: at any given time $t$ is $aZ(t)$, where $Z(t)$ is the current total number of customers in the system.
27: We prove that GRAND($aZ$) with $a>0$ 
28: is asymptotically optimal, as the  customer arrival rates grow to infinity and $a\to 0$,
29: in the sense of minimizing the total number of occupied servers in steady state. 
30: In addition, we study by simulations various versions of GRAND 
31: and observe the dependence of convergence speed and steady-state performance 
32: on the number of zero-servers.
33: 
34: \end{abstract}
35: