section 4. Section 5 contains a summary and conclu-
sions.
2 THE MODEL
The system consists of N identical servers, which may
be used to serve jobs belonging to m different types.
However, once allocated to a type of service, a server
remains dedicated to jobs of that type only. In other
words, a static and non-sharing server allocation pol-
icy is employed: n
i
servers are assigned to jobs of type
i (n
1
+n
2
+... +n
m
= N). Such a policy may deliber-
ately take the decision to deny service to one or more
job types (this will certainly happen if the number of
services exceeds the number of servers).
Jobs of type i arrive according to an indepen-
dent Poisson process with rate λ
i
, and join a separate
queue. Their required service times are distributed
exponentially with mean 1/µ
i
. An admission policy
controlled by a set of thresholds is in operation: if
there are K
i
jobs of type i present in the system (wait-
ing and in service), then incoming type i jobs are not
accepted and are lost (i = 1,2, ...,m).
For the purposes of this model, the quality of
service experienced by an accepted job is measured
either in terms of its response time, W (the inter-
val between the job’s arrival and completion), or in
terms of its waiting time, w (excluding the service
time). Whichever the chosen measure, it is mentioned
explicitly in a service level agreement between the
provider and the users. We assume that each such
contract would include the following three clauses:
1. Charge: For each accepted and completed job of
type i a user shall pay a charge of c
i
(in practice
this may be proportional to the average length of
type i jobs).
2. Obligation: The response time, W
i
(or waiting
time, w
i
), of an accepted job of type i shall not
exceed q
i
.
3. Penalty: For each accepted job of type i whose
response time (or waiting time) exceeds q
i
, the
provider shall pay to the user a penalty of r
i
.
Thus, in this model, service type i is characterized
by its ‘demand parameters’ (λ
i
,µ
i
), and its ‘economic
parameters’, namely the triple
(c
i
,q
i
,r
i
) = (charge,obligation, penalty) (1)
Within the control of the provider are the server
allocations, n
i
, and the admission thresholds, K
i
. The
objective is to choose those allocations and thresholds
so as to maximize the total average revenue earned per
unit time in the steady state. A stationary regime al-
ways exists for a bounded queue, but if K
i
= ∞ for
some i, then the corresponding demand parameters
must satisfy λ
i
< n
i
µ
i
in order that the queue be stable.
Note that, although we make no assumptions
about the relative magnitudes of the charge and
penalty parameters, the more interesting case is where
the latter is at least as large as the former: c
i
≤ r
i
.
Otherwise one could guarantee a positive revenue by
accepting all jobs of type i, regardless of the load and
of the obligation made.
3 REVENUE EVALUATION
We concentrate first on the subsystem associated with
service i, for a given set of demand and economic
parameters, and fixed allocation n
i
and threshold K
i
.
That subsystem behaves like an M/M/n
i
/K
i
queue
(see, for example, (Mitrani, 1998)).
Denote by V
i
the average revenue earned from
type i jobs per unit time in the steady state. If the
QoS measure is the response time, then V
i
is given by
V
i
= λ
i
K
i
−1
∑
j=0
p
i, j
[c
i
− r
i
P(W
i, j
> q
i
)] , (2)
where p
i, j
is the stationary probability that there are j
jobs of type i in the M/M/n
i
/K
i
queue, andW
i, j
is the
response time of a type i job which finds, on arrival, j
other type i jobs present.
If the QoS measure is the waiting time, then V
i
is
given by a similar expression, with P(W
i, j
> q
i
) be-
ing replaced by P(w
i, j
> q
i
) (where w
i, j
is the waiting
time of a type i job which finds, on arrival, j other
type i jobs present).
The stationary distribution of the number of type
i jobs present is found by solving the balance and
normalizing equations. Similarly, the probabilities
P(W
i, j
> q
i
) can be evaluated by computing the dis-
tribution function of a convolution of the right num-
ber of exponential distributions. This can be done in
closed form.
When the computation of V
i
is done for different
sets of parameter values, it becomes clear that it is
a unimodal function of K
i
. That is, it has a single
maximum, which may be at K
i
= ∞ for lightly loaded
systems. We do not have a mathematical proof of
this proposition, but have verified it in numerous nu-
merical experiments. That observation implies that
one can search for the optimal admission threshold
by evaluating V
i
for consecutive values of K
i
, stop-
ping either when V
i
starts decreasing or, if that does
not happen, when the increase becomes smaller than
some ε. Such searches are typically very fast.
ICE-B 2007 - International Conference on e-Business
188