Optimization of Queueing Model with Server Heating and Cooling

Dudina, Olga; Dudin, Alexander

doi:10.3390/math7090768

Open AccessFeature PaperArticle

Optimization of Queueing Model with Server Heating and Cooling

by

Olga Dudina

^1,2 and

Alexander Dudin

^1,2,*

¹

Department of Applied Mathematics and Computer Science, Belarusian State University, 4, Nezavisimosti Ave., 220030 Minsk, Belarus

²

Applied Mathematics and Comminications Technology Institute, Peoples’ Friendship University of Russia (RUDN University), 6 Miklukho-Maklaya St, Moscow 117198, Russia

^*

Author to whom correspondence should be addressed.

Mathematics 2019, 7(9), 768; https://doi.org/10.3390/math7090768

Submission received: 29 July 2019 / Revised: 17 August 2019 / Accepted: 19 August 2019 / Published: 21 August 2019

(This article belongs to the Special Issue Stochastic Processes: Theory and Applications)

Download

Browse Figures

Versions Notes

Abstract

The operation of many real-world systems, e.g., servers of data centers, is accompanied by the heating of a server. Correspondingly, certain cooling mechanisms are used. If the server becomes overheated, it interrupts processing of customers and needs to be cooled. A customer is lost when its service is interrupted. To prevent overheating and reduce the customer loss probability, we suggest temporal termination of service of new customers when the temperature of the server reaches the predefined threshold value. Service is resumed after the temperature drops below another threshold value. The problem of optimal choice of the thresholds (with respect to the chosen economical criterion) is numerically solved under quite general assumptions about the parameters of the system (Markovian arrival process, phase-type distribution of service time, and accounting for customers impatience). Numerical examples are presented.

Keywords:

processor heating and cooling; markovian arrival process; phase-type service time distribution; impatience

1. Introduction

The goal of operation of many real-world systems is to obtain profit via providing service to some customers. For example, in data centers, the profit is obtained via storing and retrieving the information for users on demand. The operation of such systems is possible only under fulfillment of various limitations. An important problem in organization of the operation of data centers is the effective cooling of servers. High performance servers generate a lot of heat and it is necessary to effectively cool the central processing unit, memory modules, power supplies, graphics processing units and other devices to avoid system overheating and premature failures (see, e.g., [1]).

It is clear that, to maximize the profit, it is necessary to use the power of the available server to a maximum extent. However, this may cause overheating of the server, the loss of a customer who was using the service during the overheating moment and a temporal termination of the service for cooling the server. To prevent overheating of the server during service, it sounds reasonable to stop new services if the temperature of the server reaches some level (threshold). Definitely, this threshold should be less than the critical level but more or less close to this level. Otherwise, a certain part of the server capacity is not utilized and this may lead to the loss of some profit. If service can be postponed or interrupted, it is necessary to specify when service will be resumed. This can be done by means of introducing one more threshold. Service is resumed when the temperature of the server is dropped to this threshold. It is obvious that this threshold should be less than the first threshold. The difference between the thresholds should not be too large. Otherwise, again, the server capacity is under-utilized. However, if the difference is too small, the bans and permissions to start new services can occur too frequently and this can be charged by a decision-maker. Therefore, the problem of optimal choice of the two thresholds is not trivial and challenging.

In this paper, we numerically solve this problem in the following way. Under any fixed pair of thresholds, the behavior of the system is described by a Markov chain. This Markov chain is multi-dimensional because it has to include the components defining the number of customers in the system, the current state of the server (idle, operating, or cooling) and its temperature, underlying processes of customer arrival and service processes. Due to the existence of periods when service is not provided, in our model, we account for the possible impatience of the customers waiting in the queue. Due to considering impatience, this Markov chain does not belong to the class of level-independent quasi-birth-and-death processes and its analysis is non-trivial. We use results from [2,3] for computation of the stationary probabilities of the states of the chain. Having the stationary probabilities computed, we derive formulas for computation of the main performance indicators of the system and the cost criterion for any fixed pair of the thresholds defining behavior of the system. This allows us to numerically solve the problem of choosing the optimal values of the thresholds.

The considered model is very close to the models in which some additional resource is required to provide service to a customer. These models include, in particular, so-called queueing/inventory models (see, e.g., [4]), queueing systems with energy harvesting (see, e.g., [5]), queueing models with paired customers (see [6]), assembly-like queue (see [7]), passenger-taxi or double-ended queues (see [8]), coupled queues (see [9]), etc. In our model, the role of the additional resource is played by the lag between the critical and current value of the server’s temperature. The essential difference between our model and the above mentioned models consists of the following. Usually, the additional resource has the influence on the behavior of the queueing system only at the potential service beginning moments. If the resource is available, the known required for service amount of the resource is reserved. Service starts and, then, successfully finishes. Otherwise, if the resource is not available at the potential service beginning moment, service is cancelled or postponed until the required amount of the additional resource becomes available. In our model, we have a more complicated situation: the “additional resource” has a permanent influence on the behavior of the queueing system. Required for a customer service amount of resource is uncertain. It cannot be reserved and, as a consequence, the started service (with available resource at the service beginning moment) can be terminated ahead of the schedule and the customer is lost if the resource becomes unavailable already during service of this customer. Namely, due to uncertainty of the amount of the required resource, it is necessary to ban starting new service when the resource is still available but the number of available units of the resource is less than the threshold value.

The structure of the paper is the following. Section 2 contains the description of the mathematical model of the considered system. The stationary distribution of the multi-dimensional Markov chain describing the number of customers, current operation mode, excess of heating and underlying process of the

M A P

arrival processes of customers and

P H

distributed service time is analyzed in Section 3. The generator of this Markov chain is derived. Formulas for computing the key performance measures of the system, including the probabilities of a customer loss (due to the server overheating and due to the customer impatience) are given in Section 4. The results of numerical experiments that illustrate the dependence of the key performance measures of the system on the thresholds are described in Section 5. An optimization problem is considered in brief. Section 6 contains the conclusion of the paper.

2. Description of the Model

We consider a single-server queueing system that has an input buffer of an infinite capacity. Figure 1 illustrates the structure of the system under study.

Customer arrival is defined as the Markovian Arrival Process (

M A P

). Arrivals are governed by the underlying Markov chain

ν_{t}, t \geq 0,

with the finite state space

\{0, 1, \dots, W} .

The residence time of this chain in the state

ν

has an exponential distribution with the parameter

λ_{ν}, ν = \bar{0, W} .

Here and in what follows, the notation

ν = \bar{0, W}

means that the integer parameter

ν

takes values from the set

{0, 1, \dots, W} .

When the residence time in the state

ν

expires, with probability

p_{ν, ν^{'}} (0)

, the process

ν_{t}

makes a transition to the state

ν^{'}

without generation of a customer,

ν^{'} = \bar{0, W} ν \neq ν^{'},

and, with probability

p_{ν, ν^{'}} (1)

, the process

ν_{t}

makes a transition to the state

ν^{'}

with a generation of a customer,

ν, ν^{'} = \bar{0, W} .

The behavior of this arrival process is completely defined by the matrices

D_{0}

and

D_{1}

consisting of the entries

{(D_{1})}_{ν, ν^{'}} = λ_{ν} p_{ν, ν^{'}} (1), ν, ν^{'} = \bar{0, W,}

and

{(D_{0})}_{ν, ν} = - λ_{ν}, ν = \bar{0, W}, {(D_{0})}_{ν, ν^{'}} = λ_{ν} p_{ν, ν^{'}} (0), ν, ν^{'} = \bar{0, W}, ν \neq ν^{'} .

The matrix

D (1) = D_{0} + D_{1}

is assumed to be irreducible and is the generator of the process

ν_{t}, t \geq 0

.

The mean arrival rate

λ

is computes as

λ = θ D_{1} e

where

θ

is the unique solution to the equations

θ D (1) = 0, θ e = 1 .

Hereinafter,

e

denotes a column vector consisting of 1’s, and

0

is a zero row vector.

For more information about the

M A P

and its properties, see [10,11,12].

The service time of a customer has a

P H

distribution defined by the stochastic row vector

β

and sub-generator

S .

This time has the following interpretation. Let

m_{t}, t \geq 0,

be the continuous-time Markov process having a finite state space

{1, \dots, M, M + 1} .

The states

{1, \dots, M}

are transient and

M + 1

is the absorbing state. The initial state of this process at the moment of beginning of

P H

distributed time is randomly selected among the transient states

{1, \dots, M}

according to the distribution defined by the entries of the row vector

β = (β_{1}, \dots, β_{M})

. Then, the process

m_{t}

makes transitions within the set

{1, \dots, M}

of the transient states with intensities defined by the entries of the sub-generator S or to the absorbing state. The intensities of the transition to the absorbing state are given by the entries of the column vector

S_{0} = - S e .

Transition to the absorbing state implies the end of

P H

distributed time.

The Laplace–Stieltjes transform of the

P H

distribution is defined as

β {(s I - S)}^{- 1} S_{0}, R e s > 0 .

The mean service time is equal to

b_{1} = β {(- S)}^{- 1} e .

For more detailed information about the

P H

distribution, see [13]. Its applicability for good approximation of an arbitrary distribution is mentioned, e.g., in [14]. When the server becomes idle, the underlying process

m_{t}, t \geq 0,

does not make any transitions.

The problem of constructing the vector

β

and the matrices

D_{0}

and

D_{1}, S

based on available statistics regarding the real arrival and service processes is extensively addressed in the existing literature and may be solved following the results from, e.g., the papers [15,16,17].

During service, the server generates the heat and the temperature of the server is permanently monitored. Without essential loss of generality, we suppose that the temperature of the server is graded in some discrete units, e.g. degrees Celsius. The server can operate when this temperature is in the interval from

K^{'}

to

K^{″}

. According to the 2011 version of recommendations of the American Society of Heating, Refrigerating and Air Conditioning Engineers (ASHRAE), for class A1 systems the temperature of the server has to be in the range from 15

^{\circ}

C to 32

^{\circ}

C. To simplify notations, we do not keep track of the absolute temperature of the server, but the excess of the temperature over the lower temperature level. This means that we assume that the (relative) temperature of the server has to be in the range from 0 to K where

K = K^{″} - K^{'}

. When the temperature of the server reaches the upper level K, service of customers becomes impossible. We say that this server becomes overheated. The server temporarily stops its work, is considered blocked and has to be cooled. A customer using the service when overheating occurs is assumed to be lost. We suggest that the server does not generate the heat when it does not work (is idle or blocked). When the server is working, the rate of the server heating is assumed to be equal to

α

degrees during unit of time,

α > 0 .

In parallel to heating of the server, it is permanently cooled. We assume that the cooling rate is equal to

γ_{k}, γ_{k} \geq 0,

when the current temperature of the server is equal to

k, k = \bar{0, K} .

When the server becomes overheated, it stops generation of the heat and only is cooled. We assume that the server remains blocked until its temperature drops to the level (threshold)

K_{1}, K_{1} < K .

After that, the server becomes unblocked and can start service. We assume that the customers residing in the buffer are impatient. Each of these customers departs from the buffer without receiving service (is lost) independently of other waiting customers after a “patience time” expires. This time is exponentially distributed with the parameter

ϕ .

The overheating of the server implies the loss of the potential profit gained by customers’ service. This loss is related to the loss of customers, during service of which the overheating occurs, and the loss of a capacity (throughput) of the server spent on service of such customers. The overheating may require server recovery, not only cooling. Therefore, it is desirable to avoid the overheating. To prevent the overheating occurrence, it is reasonable to stop new services when the temperature of the server becomes pretty high. We assume that the threshold

K_{2}

is fixed such that

K_{1} < K_{2} \leq K .

The server cannot start new services if its temperature is equal or greater than

K_{2} .

However, the ongoing service continues. It cannot be interrupted unless the server becomes overheated, i.e., its temperature becomes equal to

K .

If this service is successfully finished while the server does not become overheated, the server remains blocked and does not start new services until its temperature drops to

K_{1} .

It is obvious that the values of performance indicators of the system depend on the choice of the pair of thresholds

(K_{1}, K_{2})

and our first goal is to provide a way for computing the values of these measures for any fixed values of thresholds. To this end, we elaborate the algorithm for computation of the stationary distribution of the system states.

3. Process of System States and Its Analysis

Let the critical temperature K and thresholds

K_{1}

and

K_{2}

be fixed,

0 \leq K_{1} < K_{2} \leq K .

It is easy to see that the behavior of the considered system can be described by the following regular irreducible continuous-time Markov chain

ξ_{t} = {n_{t}, r_{t}, k_{t}, ν_{t}, m_{t}}, t \geq 0,

where, during the epoch

t, t \geq 0,

$n_{t}$ is the number of customers in the buffer, $n_{t} \geq 0 .$
$r_{t}, r_{t} = \bar{0, 2},$ is the server state: $r_{t} = 0$ if the server is idle, $r_{t} = 1$ if the server is busy, and $r_{t} = 2$ if the server is blocked.
$k_{t}$ is the temperature of the server, $k_{t} = \bar{0, K} .$
$ν_{t}$ is the state of the underlying process of the $M A P$ , $ν_{t} = \bar{0, W} .$
$m_{t}$ is the state of the underlying process of the $P H$ service process, $m_{t} = \bar{1, M} .$

The Markov chain

ξ_{t}, t \geq 0,

has the following state space:

\begin{matrix} ({0, 0, k, ν}, k = \bar{0, K_{2} - 1}) ⋃ ({n, 1, k, ν, m}, n \geq 0, k = \bar{0, K - 1}, m = \bar{1, M}) ⋃ \\ ({n, 2, k, ν}, n \geq 0, k = \bar{K_{1} + 1, K}), ν = \bar{0, W} . \end{matrix}

To formally define the Markov chain

ξ_{t},

we need to specify its transition rates within this state space. Since this chain has five components when the server is not idle or blocked and four components when the server is idle or blocked, to avoid operations with multi-dimensional arrays, it is necessary to enumerate the states in some order. We assume the lexicographic ordering. This means that firstly the states of the Markov chain

ξ_{t}

are numbered in the increasing order of the component

n_{t} .

Within the set of the states having the same value, say n,

n \geq 0,

of this component, the states are numbered in the increasing order of the component

r_{t} .

Within the set of the states having the same values, say

(n, r)

,

n \geq 0, r = 0, 1, 2,

of these components, the states are numbered in the increasing order of the component

k_{t} .

Within the set of the states having the same values, say

(n, r, k),

n \geq 0, r = 0, 1, 2, k = \bar{0, K},

of the three components, the states are numbered in the increasing order of the component

ν_{t}, ν_{t} = \bar{0, W} .

Finally, the states from the sets

(n, 1, k, ν)

are numbered in the increasing order of the component

m_{t}, m_{t} = \bar{1, M} .

Let us denote by G the generator of the Markov chain

ξ_{t} .

It follows from the introduced enumeration of the components of the chain that G is the matrix consisting of the blocks

G_{n, n^{'}}, n, n^{'} \geq 0,

| n - n^{'} | \leq 1,

defining the intensities of transitions from the states having the value n of the component

n_{t}

to the states having the value

n^{'}

of this component.

Theorem 1.

The infinitesimal generator G of the Markov chain

ξ_{t}, t \geq 0,

has the following block-tridiagonal structure:

G = (\begin{matrix} G_{0, 0} & G_{0, 1} & O & O & O & \dots \\ G_{1, 0} & G_{1, 1} & G_{1, 2} & O & O & \dots \\ O & G_{2, 1} & G_{2, 2} & G_{2, 3} & O & \dots \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋱ \end{matrix}) .

Here, the blocks

{(G_{0, 0}^{r, r^{'}})}_{r, r^{'} = \bar{0, 2}}

of the matrix

G_{0, 0},

whose diagonal entries are negative and define, up to the sign, the intensities of the exit of the Markov chain

ξ_{t}

from the corresponding states and the non-diagonal entries define the intensities of transitions that do not imply customers appearance in the empty buffer, have the following form:

G_{0, 0}^{0, 0} = I_{K_{2}} \otimes D_{0} + (E_{K_{2}}^{-} - C_{K_{2}}) \otimes I_{\bar{W}},

G_{0, 0}^{0, 1} = I_{K_{2}, K} \otimes D_{1} \otimes β,

G_{0, 0}^{0, 2} = O_{K_{2} \bar{W}, (K - K_{1}) \bar{W}},

G_{0, 0}^{1, 0} = I_{K, K_{2}} \otimes I_{\bar{W}} \otimes S_{0},

G_{0, 0}^{1, 1} = I_{K} \otimes D_{0} \oplus S + (E_{K}^{-} - C_{K} + α (E^{+} - I_{K})) \otimes I_{\bar{W} M},

G_{0, 0}^{1, 2} = {\bar{I}}_{K, K - K_{1}} \otimes I_{\bar{W}} \otimes S_{0} + α \hat{I} \otimes I_{\bar{W}} \otimes e_{M},

G_{0, 0}^{2, 0} = γ_{K_{1} + 1} {\tilde{I}}_{K - K_{1}, K_{2}} \otimes I_{\bar{W}},

G_{0, 0}^{2, 1} = O_{(K - K_{1}) \bar{W}, K \bar{W} M},

G_{0, 0}^{2, 2} = I_{K - K_{1}} \otimes D_{0} + (\tilde{E} - \tilde{C}) \otimes I_{\bar{W}} .

The matrix

G_{0, 1},

whose entries define the intensities of transitions when a customer arrives to the empty buffer, has the form

G_{0, 1} = (\begin{matrix} O_{K_{2} \bar{W}, K \bar{W} M} & O_{K_{2} \bar{W}, (K - K_{1}) \bar{W}} \\ G_{0, 1}^{1, 1} & O_{K \bar{W} M, (K - K_{1}) \bar{W}} \\ O_{(K - K_{1}) \bar{W}, K \bar{W} M} & G_{0, 1}^{2, 2} \end{matrix})

where

G_{0, 1}^{1, 1} = I_{K} \otimes D_{1} \otimes I_{M},

G_{0, 1}^{2, 2} = I_{K - K_{1}} \otimes D_{1} .

The matrix

G_{1, 0},

whose entries define the intensities of transitions when the single customer staying in the buffer, departs from the buffer (due to the impatience or service beginning), has the form

G_{1, 0} = (\begin{matrix} O_{K \bar{W} M, K_{2} \bar{W}} & G_{1, 0}^{1, 1} & O_{(K - K_{1}) \bar{W}, K_{2} \bar{W}} \\ O_{(K - K_{1}) \bar{W}, K_{2} \bar{W}} & G_{1, 0}^{2, 1} & G_{1, 0}^{2, 2} \end{matrix}),

where

G_{1, 0}^{1, 1} = ϕ I_{K \bar{W} M} + B \otimes I_{\bar{W}} \otimes S_{0} β,

G_{1, 0}^{2, 1} = γ_{K_{1} + 1} {\tilde{I}}_{K - K_{1}, K} \otimes I_{\bar{W}} \otimes β,

G_{1, 0}^{2, 2} = ϕ I_{(K - K_{1}) \bar{W}} .

The blocks

{(G_{n, n}^{r, r^{'}})}_{r, r^{'} = \bar{1, 2}}

of the matrix

G_{n, n}, n \geq 1,

whose diagonal entries are negative and define, up to the sign, the intensities of the exit of the Markov chain

ξ_{t}

from the corresponding states when the number of customers in the buffer is equal to

n, n \geq 1,

and the non-diagonal entries define the intensities of transitions that do not imply the change of the number of customers in the buffer, have the following form:

G_{n, n}^{1, 1} = I_{K} \otimes D_{0} \oplus S + (E_{K}^{-} - C_{K} + α (E^{+} - I_{K}) - n ϕ I_{K}) \otimes I_{\bar{W} M},

G_{n, n}^{1, 2} = {\bar{I}}_{K, K - K_{1}} \otimes I_{\bar{W}} \otimes S_{0} + α \hat{I} \otimes I_{\bar{W}} \otimes e_{M},

G_{n, n}^{2, 1} = O_{(K - K_{1}) \bar{W}, K \bar{W} M},

G_{n, n}^{2, 2} = I_{K - K_{1}} \otimes D_{0} + (\tilde{E} - \tilde{C}) \otimes I_{\bar{W}} - n ϕ I_{(K - K_{1}) \bar{W}}, n \geq 1 .

The blocks

{(G_{n, n + 1}^{r, r^{'}})}_{r, r^{'} = \bar{1, 2}}

of the matrix

G_{n, n + 1}, n \geq 1,

whose entries define the intensities of increasing the number of customers in the buffer from n to

n + 1

, have the following form:

G_{n, n + 1}^{1, 1} = I_{K} \otimes D_{1} \otimes I_{M},

G_{n, n + 1}^{1, 2} = O_{K \bar{W} M, (K - K_{1}) \bar{W}},

G_{n, n + 1}^{2, 1} = O_{(K - K_{1}) \bar{W}, K \bar{W} M},

G_{n, n + 1}^{2, 2} = I_{K - K_{1}} \otimes D_{1}, n \geq 1 .

The non-zero blocks

{(G_{n, n - 1}^{r, r^{'}})}_{r, r^{'} = \bar{1, 2}}

of the matrix

G_{n, n - 1}, n \geq 2,

whose entries define the intensities of decreasing the number of customers in the buffer from n to

n - 1

, have the following form:

G_{n, n - 1}^{1, 1} = n ϕ I_{K \bar{W} M} + B \otimes I_{\bar{W}} \otimes S_{0} β,

G_{n, n - 1}^{2, 1} = γ_{K_{1} + 1} {\tilde{I}}_{K - K_{1}, K} \otimes I_{\bar{W}} \otimes β,

G_{n, n - 1}^{2, 2} = n ϕ I_{(K - K_{1}) \bar{W}}, n \geq 2 .

Here,

I is the identity matrix, and O is a zero matrix of an appropriate dimension.
$\bar{W} = W + 1;$
⊗ and ⊕ are the symbols of the Kronecker product and the sum of matrices, respectively.
$E_{l}^{-}$ is a square matrix of size l with all zero entries except the entries ${(E_{l}^{-})}_{k, k - 1} = γ_{k}, k = \bar{1, l - 1}$ .
$C_{l}$ is a square matrix of size l with all zero entries except the entries ${(C_{l})}_{k, k} = γ_{k}, k = \bar{1, l - 1}$ .
$I_{K_{2}, K}$ is a matrix of size $K_{2} \times K$ with all zero entries except the entries ${(I_{K_{2}, K})}_{n, n}, n = \bar{0, K_{2} - 1},$ which are equal to 1.
$I_{K, K_{2}}$ is a matrix of size $K \times K_{2}$ with all zero entries except the entries ${(I_{K, K_{2}})}_{n, n}, n = \bar{0, K_{2} - 1},$ which are equal to 1.
$E^{+}$ is a square matrix of size K with all zero entries except the entries ${(E^{+})}_{k, k + 1}, k = \bar{0, K - 2},$ which are equal to 1.
${\bar{I}}_{K, K - K_{1}}$ is a matrix of size $K \times (K - K_{1})$ with all zero entries except the entries ${({\bar{I}}_{K, K - K_{1}})}_{n, n - K_{1} - 1}, n = \bar{K_{2}, K - 1},$ which are equal to 1.
$\hat{I}$ is a matrix of size $K \times (K - K_{1})$ with all zero entries except the entry ${(\hat{I})}_{K - 1, K - K_{1} - 1},$ which is equal to 1.
${\tilde{I}}_{K - K_{1}, K_{2}}$ is a matrix of size $(K - K_{1}) \times K_{2}$ with all zero entries except the entry ${({\tilde{I}}_{K - K_{1}, K_{2}})}_{0, K_{1}},$ which is equal to 1.
$\tilde{E}$ is a square matrix of size $K - K_{1}$ with all zero entries except the entries ${(\tilde{E})}_{k, k - 1} = γ_{K_{1} + k + 1}, k = \bar{1, K - K_{1} - 1}$ .
$\tilde{C}$ is a square matrix of size $K - K_{1}$ with all zero entries except the entries ${(\tilde{C})}_{k, k} = γ_{K_{1} + k + 1}, k = \bar{0, K - K_{1} - 1}$ .
B is a square matrix of size K with all zero entries except the entries ${(B)}_{k, k} = 1, k = \bar{0, K_{2} - 1}$ .
${\tilde{I}}_{K - K_{1}, K}$ is a matrix of size $(K - K_{1}) \times K$ with all zero entries except the entry ${({\tilde{I}}_{K - K_{1}, K})}_{0, K_{1}} = 1$ .

The proof of the theorem is implemented via the careful analysis of various scenarios of the system behavior at the moments of changing the states of the underlying processes of arrivals and service, changing the temperature of the server due to heating and cooling, customers departure due to impatience. The symbols of Kronecker product and sum of matrices are very helpful for description of transition intensities of several independent Markov processes.

It can be easily shown that the Markov chain

ξ_{t}

belongs to the class of Asymptotically Quasi–Toeplitz Markov Chains (

A Q T M C

) (see [2]).

Theorem 2.

The stationary distribution of the Markov chain

ξ_{t}

exists for any values of the system parameters.

The assertion of the theorem stems from the fact that the customers staying in the buffer are assumed to be impatient (

ϕ > 0

). The strict proof of Theorem 2 can be done by using the results from [2]. This proof is straightforward and rather routine, thus it is omitted here.

Let us denote by

π (n, r, k)

the row vector of stationary probabilities of the states of the chain having the value

(n, r, k)

of the first three components listed in the described above order.

In addition, denote

π (0, 0) = (π (0, 0, 0), \dots, π (0, 0, K_{2} - 1)),

π (n, 1) = (π (n, 1, 0), \dots, π (n, 1, K - 1)),

π (n, 2) = (π (n, 2, K_{1} + 1), \dots, π (n, 2, K)), n \geq 0,

π (0) = (π (0, 0), π (0, 1), π (0, 2)), π (n) = (π (n, 1), π (n, 2)), n \geq 1 .

Because the state space of the Markov chain

ξ_{t}

is infinite and the generator G of this chain does not have Toeplitz-like structure, the problem of computation of the vectors

π (n), n \geq 0,

is not easy. Fortunately, the chains with the generator of such a type were analysed in [2,3] and the algorithms developed in those papers allow computing these vectors.

4. Performance Indicators

Once the vectors

π (n), n \geq 0,

have been computed, we can calculate various performance indicators of the system.

The mean number N of customers in the buffer is computed by

N = \sum_{n = 1}^{\infty} n π (n) e .

The average temperature T of the server is computed by

T = \sum_{k = 1}^{K_{2} - 1} k π (0, 0, k) e + \sum_{n = 0}^{\infty} \sum_{k = 1}^{K - 1} k π (n, 1, k) e + \sum_{n = 0}^{\infty} \sum_{k = K_{1} + 1}^{K} k π (n, 2, k) e .

The variance of the temperature of the server is equal to

\sum_{k = 1}^{K_{2} - 1} k^{2} π (0, 0, k) e + \sum_{n = 0}^{\infty} \sum_{k = 1}^{K - 1} k^{2} π (n, 1, k) e + \sum_{n = 0}^{\infty} \sum_{k = K_{1} + 1}^{K} k^{2} π (n, 2, k) e - T^{2} .

The probability

P_{i d l e}

that the server is idle at an arbitrary moment is

P_{i d l e} = π (0, 0) e .

The probability

P_{i m m}

that the server is idle at the moment of an arbitrary customer arrival (and this customer immediately starts service) is

P_{i m m} = \frac{1}{λ} π (0, 0) (I_{K_{2}} \otimes D_{1}) e .

The probability

P_{b u s y}

that the server is busy at an arbitrary moment is

P_{b u s y} = \sum_{n = 0}^{\infty} π (n, 1) e .

The average number

N_{s y s t e m}

of customers in the system is computed by

N_{s y s t e m} = N + P_{b u s y} .

The probability

P_{b l o c k}

that the server is blocked at an arbitrary moment is

P_{b l o c k} = \sum_{n = 0}^{\infty} π (n, 2) e .

The probability

P_{i m p}

of an arbitrary customer loss due to impatience is

P_{i m p} = \frac{ϕ N}{λ} .

The intensity

λ_{o u t}

of the flow of served customers is

λ_{o u t} = \sum_{n = 0}^{\infty} π (n, 1) (e_{K \bar{W}} \otimes S_{0}) .

The probability

P_{o v e r h e a t i n g}

of an arbitrary customer loss due to the server overheating is

P_{o v e r h e a t i n g} = α λ^{- 1} \sum_{n = 0}^{\infty} π (n, 1, K - 1) e .

The intensity

l_{v a c a t i o n}

of the transition after the service completion to the vacation regime (the server is overheated or is preventively switched-off for cooling) is

l_{v a c a t i o n} = \sum_{n = 0}^{\infty} π (n, 1) ({\bar{I}}_{K, K - K_{1}} \otimes I_{\bar{W}} \otimes S_{0}) e .

The probability

P_{l o s s}

of an arbitrary customer loss is computed as

P_{l o s s} = P_{i m p} + P_{o v e r h e a t i n g} = 1 - \frac{λ_{o u t}}{λ} .

5. Numerical Example

Let the

M A P

arrival flow be defined by the matrices

D_{0} = (\begin{matrix} - 0.3379101412 & 0 \\ 0 & - 0.0109675577 \end{matrix}), D_{1} = (\begin{matrix} 0.3356635104 & 0.0022466308 \\ 0.0061087109 & 0.0048588468 \end{matrix}) .

The mean arrival rate is

λ = 5,

the coefficient of correlation of two successive intervals between arrivals

c_{c o r} = 0.2,

and the squared coefficient of variation of these intervals

c_{v a r} = 12.4 .

Let the maximum value of the temperature be

K = 50,

and the rate of the heat generation be

α = 1 .

The rate of the server cooling when its temperature is equal to k is

γ_{k} = \frac{0.3 k}{K}, k = \bar{1, K - 1},

γ_{K} = 0.01 .

The rate of a customer departure from the buffer due to impatience is

ϕ = 0.0015 .

The service time of a customer has the

P H

distribution with the irreducible representation

(β, S)

where

β = (0.9, 0.1),

S = (\begin{matrix} - 6 & 0 \\ 0 & - 0.2 \end{matrix}) .

The average service time is equal to 0.65 and the squared coefficient of variation of the service time is equal to 10.95.

Let us vary the threshold

K_{1}

over the interval

[0, K)

and the parameter

K_{2}

over the interval

[K_{1} + 1, K] .

Figure 2, Figure 3 and Figure 4 illustrate the dependence of the average number N of customers in the buffer, the average intensity

λ_{o u t}

of the flow of served customers and the average temperature T of the server on the values of

K_{1}

and

K_{2} .

It can be observed in Figure 2 that the average number N of customers in the buffer increases when the threshold

K_{2}

grows. This is easily explained by the fact that the probability of overheating occurrence increases when the threshold

K_{2}

becomes close to the critical temperature. It is assumed that the rate of the server cooling is small after overheating occurrence, the blocking period of the server becomes large and a lot of customers stay in the buffer. It should be noted that, for any fixed value of

K_{2}

there exists a value of

K_{1}

, which minimizes the average number N. The average intensity

λ_{o u t}

of the flow of served customers is small when both thresholds

K_{1}

and

K_{2}

are small (low temperature of the server is guaranteed at expense of managing too long blocking periods) and when both thresholds

K_{1}

and

K_{2}

are large (the server is quite often overheated, which causes the corresponding loss of customers). This intensity

λ_{o u t}

is much higher for intermediate values of the thresholds

K_{1}

and

K_{2} .

Figure 4 well matches to the just given explanation of the surface in Figure 3.

Figure 5, Figure 6 and Figure 7 illustrate the dependence of the probability

P_{i d l e}

that the server is idle, the probability

P_{b u s y}

that the server is busy, and the probability

P_{b l o c k}

that the server is blocked at an arbitrary moment on the values of

K_{1}

and

K_{2} .

The probability

P_{i d l e}

that the server is idle and the probability

P_{b u s y}

that the server is busy also are maximal for intermediate values of thresholds

K_{1}

and

K_{2} .

As expected, the probability

P_{b l o c k}

that the server is blocked decreases when the threshold

K_{1}

grows.

Figure 8, Figure 9 and Figure 10 illustrate the dependence of the probability

P_{i m p}

that an arbitrary customer is lost due to impatience, the loss probability

P_{o v e r h e a t i n g}

that an arbitrary customer is lost due to server overheating, and the probability

P_{l o s s}

that an arbitrary customer is lost on the values of

K_{1}

and

K_{2} .

Figure 9 evidently shows that the loss probability

P_{o v e r h e a t i n g}

that an arbitrary customer is lost due to server overheating sharply increases when the thresholds

K_{1}

and

K_{2}

grow. Therefore, the proposed mechanism for preventing overheating is highly effective. It can be observed in Figure 8, Figure 9 and Figure 10 that there exists a pair of the thresholds that minimizes the loss probability

P_{l o s s} .

The minimal value of the probability

P_{l o s s}

in this example is equal to 0.065255 and is achieved under the following values of the thresholds:

K_{1} = 36

and

K_{2} = 37 .

Customers loss in the considered system occurs due to overheating of the server during ongoing service and due to impatience of customers. The charges paid for these types of losses may be different. The charge paid for the loss due to overheating can be much higher because the loss due to impatience is just the loss of potential profit, while the loss due to overheating means the real loss of a customer, violation of service level agreement and possible expenditures to return the overheated server to the operable mode. Therefore, various other optimization problems can be formulated.

In this paper, we consider the following economical criterion of the quality of the system operation:

E = a λ P_{i m p} + (b + c) λ P_{o v e r h e a t i n g} + c l_{v a c a t i o n} .

This economical criterion indicates the charge paid by the system per unit of time, where a is the charge paid by the system for each customer loss due to impatience, b is the charge paid by the system for customer loss due to overheating, and c is the charge paid by the system for managing operation of the system via each transition to the blocking regime.

Let us fix the following values of the cost coefficients:

a = 1, b = 10, c = 2 .

Figure 11 illustrates the dependence of the economical criterion E on the values of

K_{1}

and

K_{2} .

The minimal value of the economical criterion E here is equal to

E^{*} = 0.0308509

and is achieved when

K_{1} = 29

and

K_{2} = 42 .

Note that, for the same system but without control, when no prevention of overheating is assumed (i.e.,

K_{2} = 50

, indicating the server is switched-off only when it becomes overheated, and

K_{1} = 49

), the value of the economical criterion is more than ten times higher:

E (49, 50) = 0.309677 .

6. Conclusions

In this paper, a novel in the literature queueing model is considered. This model considers the possible heating of a server during the service process that causes the necessity of its permanent cooling. Such a model can be applied, e.g., for optimization of operation of servers of data centers that generate a lot of heat during their operation and the proper cooling mechanisms have to be used to avoid a collapse of the server. We offer the discipline for control by the system operation aiming to prevent premature overheating of a server and the loss of customers. This discipline is defined by two thresholds. One threshold is used to define the temperature of a server that when exceeded causes the stop of new services and to block the server when its temperature becomes close to the critical temperature. One more threshold is used to define the temperature when the blocking of the server can be finished and service can be resumed. The system is analyzed under quite general assumptions about the arrival and service processes. The generator of the multi-dimensional Markov chain, which describes the behavior of the system under any values of thresholds, is derived. This allows computing the stationary distribution of the states of the Markov chain and the key performance indicators of the system. Usefulness of the proposed strategy of preventive control is demonstrated via numerical experiments.

The obtained results can be used for managerial goals. In fact, the results can be used for the choice of the proper equipment for service provisioning (accounting for the different speeds of operation and heat generation by the different servers), its cooling and optimal management by periodical switching-off the server via the optimal choice of the thresholds.

As directions for future research, systems with the Batch Markov Arrival Process, phase-type distribution of heating and cooling times, several possible modes of the server operation (with various service and heating rates), etc., can be considered.

Author Contributions

Conceptualization O.D. and A.D.; methodology A.D. and O.D.; software O.D.; validation O.D.; formal analysis O.D. and A.D.; investigation A.D.; writing—original draft preparation A.D.; writing—review and editing A.D.; supervision A.D.; and project administration O.D. and A.D.

Funding

The publication was prepared with the support of the “RUDN University Program 5-100”.

Conflicts of Interest

The authors declare no conflict of interest.

References

Available online: https://searchdatacenter.techtarget.com/answer/Whats-the-highest-server-temperature-you-can-handle (accessed on 25 July 2019).
Klimenok, V.I.; Dudin, A.N. Multi-dimensional asymptotically quasi-Toeplitz Markov chains and their application in queueing theory. Queueing Syst. 2006, 54, 245–259. [Google Scholar] [CrossRef]
Dudin, S.; Dudina, O. Retrial multi-server queueing system with PHF service time distribution as a model of a channel with unreliable transmission of information. Appl. Math. Model. 2019, 65, 676–695. [Google Scholar] [CrossRef]
Krishnamoorthy, A.; Manikandan, R.; Lakshmy, B. A revisit to queueing-inventory system with positive service time. Ann. Oper. Res. 2015, 233, 221–236. [Google Scholar] [CrossRef]
Kim, C.; Dudin, S.; Dudin, A.; Samouylov, K. Multi-threshold control by a single-server queuing model with a service rate depending on the amount of harvested energy. Perform. Eval. 2018, 127–128, 1–20. [Google Scholar] [CrossRef]
Latouche, G. Queues with paired customers. J. Appl. Probab. 1981, 18, 684–696. [Google Scholar] [CrossRef]
Harrison, J.M. Assembly-like queues. J. Appl. Probab. 1973, 10, 354–367. [Google Scholar] [CrossRef]
Kendall, D.G. Some problems in the theory of queues. J. R. Stat. Soc. Ser. Methodol. 1951, 13, 151–173. [Google Scholar] [CrossRef]
Evdokimova, E.; De Turck, K.; Fiems, D. Coupled queues with customer impatience. Perform. Eval. 2018, 118, 33–47. [Google Scholar] [CrossRef]
Chakravarthy, S.R. The batch Markovian arrival process: A review and future work. In Advances in Probability Theory and Stochastic Processes; Krishnamoorthy, A., Raju, N., Ramaswami, V., Eds.; Notable Publications Inc.: Branchburg, NJ, USA, 2001; pp. 21–29. [Google Scholar]
Lucantoni, D. New results on the single server queue with a batch Markovian arrival process. Commun. Stat. Stoch. Models 1991, 7, 1–46. [Google Scholar] [CrossRef]
Vishnevski, V.M.; Dudin, A.N. Queueing systems with correlated arrival flows and their applications to modeling telecommunication networks. Autom. Remote Control 2017, 78, 1361–1403. [Google Scholar] [CrossRef]
Neuts, M. Matrix-Geometric Solutions in Stochastic Models: An Algorithmic Approach; John Hopkins University Press: Baltimore, MD, USA, 1981. [Google Scholar]
Asmussen, S. Applied Probability and Queues; Springer: New York, NY, USA, 2003. [Google Scholar]
Buchholz, P.; Kemper, P.; Kriege, J. Multi-class Markovian arrival processes and their parameter fitting. Perform. Eval. 2010, 67, 1092–1106. [Google Scholar] [CrossRef]
Buchholz, P.; Kriege, J. Fitting correlated arrival and service times and related queueing performance. Queueing Syst. 2017, 85, 337–359. [Google Scholar] [CrossRef]
Okamura, H.; Dohi, T. Mapfit: An R-Based Tool for PH/MAP Parameter Estimation. Lect. Notes Comput. Sci. 2015, 9259, 105–112. [Google Scholar]

Figure 1. Structure of the system.

Figure 2. Dependence of the average number N of customers in the buffer on the values of

K_{1}

and

K_{2}

.

Figure 2. Dependence of the average number N of customers in the buffer on the values of

K_{1}

and

K_{2}

.

Figure 3. Dependence of the average average intensity

λ_{o u t}

of the flow of served customers on the values of

K_{1}

and

K_{2}

.

Figure 3. Dependence of the average average intensity

λ_{o u t}

of the flow of served customers on the values of

K_{1}

and

K_{2}

.

Figure 4. Dependence of the average temperature T of the server on the values of

K_{1}

and

K_{2}

.

Figure 4. Dependence of the average temperature T of the server on the values of

K_{1}

and

K_{2}

.

Figure 5. Dependence of the probability

P_{i d l e}

that the server is idle on the values of

K_{1}

and

K_{2}

.

Figure 5. Dependence of the probability

P_{i d l e}

that the server is idle on the values of

K_{1}

and

K_{2}

.

Figure 6. Dependence of the probability

P_{b u s y}

that the server is busy on the values of

K_{1}

and

K_{2}

.

Figure 6. Dependence of the probability

P_{b u s y}

that the server is busy on the values of

K_{1}

and

K_{2}

.

Figure 7. Dependence of the probability

P_{b l o c k}

that the server is blocked on the values of

K_{1}

and

K_{2}

.

Figure 7. Dependence of the probability

P_{b l o c k}

that the server is blocked on the values of

K_{1}

and

K_{2}

.

Figure 8. Dependence of the probability

P_{i m p}

that an arbitrary customer is lost due to impatience on the values of

K_{1}

and

K_{2}

.

Figure 8. Dependence of the probability

P_{i m p}

that an arbitrary customer is lost due to impatience on the values of

K_{1}

and

K_{2}

.

Figure 9. Dependence of the probability

P_{o v e r h e a t i n g}

that an arbitrary customer is lost due to server overheating on the values of

K_{1}

and

K_{2}

.

Figure 9. Dependence of the probability

P_{o v e r h e a t i n g}

that an arbitrary customer is lost due to server overheating on the values of

K_{1}

and

K_{2}

.

Figure 10. Dependence of the probability

P_{l o s s}

of an arbitrary customer loss on the values of

K_{1}

and

K_{2}

.

Figure 10. Dependence of the probability

P_{l o s s}

of an arbitrary customer loss on the values of

K_{1}

and

K_{2}

.

Figure 11. Dependence of the economical criterion E on the values of

K_{1}

and

K_{2}

.

Figure 11. Dependence of the economical criterion E on the values of

K_{1}

and

K_{2}

.

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Dudina, O.; Dudin, A. Optimization of Queueing Model with Server Heating and Cooling. Mathematics 2019, 7, 768. https://doi.org/10.3390/math7090768

AMA Style

Dudina O, Dudin A. Optimization of Queueing Model with Server Heating and Cooling. Mathematics. 2019; 7(9):768. https://doi.org/10.3390/math7090768

Chicago/Turabian Style

Dudina, Olga, and Alexander Dudin. 2019. "Optimization of Queueing Model with Server Heating and Cooling" Mathematics 7, no. 9: 768. https://doi.org/10.3390/math7090768

APA Style

Dudina, O., & Dudin, A. (2019). Optimization of Queueing Model with Server Heating and Cooling. Mathematics, 7(9), 768. https://doi.org/10.3390/math7090768

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Optimization of Queueing Model with Server Heating and Cooling

Abstract

1. Introduction

2. Description of the Model

3. Process of System States and Its Analysis

4. Performance Indicators

5. Numerical Example

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI