Next Article in Journal
Progressive Iterative Approximation with Preconditioners
Next Article in Special Issue
Analysis of Single-Server Multi-Class Queue with Unreliable Service, Batch Correlated Arrivals, Customers Impatience, and Dynamical Change of Priorities
Previous Article in Journal
New Fusion Algorithm-Reinforced Pilot Control for an Agricultural Tricopter UAV
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Priority Multi-Server Queueing System with Heterogeneous Customers

by
Valentina Klimenok
1,
Alexander Dudin
1 and
Vladimir Vishnevsky
2,*
1
Department of Applied Mathematics and Computer Science, Belarusian State University, 220030 Minsk, Belarus
2
Sciences and Closed Corporation Information and Networking Technologies, Institute of Control Sciences of Russian Academy, 119991 Moscow, Russia
*
Author to whom correspondence should be addressed.
Mathematics 2020, 8(9), 1501; https://doi.org/10.3390/math8091501
Submission received: 2 August 2020 / Revised: 20 August 2020 / Accepted: 2 September 2020 / Published: 4 September 2020
(This article belongs to the Special Issue Applications of Mathematical Analysis in Telecommunications)

Abstract

:
In this paper, we analyze a multi-server queueing system with heterogeneous customers that arrive according to a marked Markovian arrival process. Customers of two types differ in priorities and parameters of phase type distribution of their service time. The queue under consideration can be used to model the processes of information transmission in telecommunication networks in which often the flow of information is the superposition of several types of flows with correlation of inter-arrival times within each flow and cross-correlation. We define the process of information transmission as the multi-dimensional Markov chain, derive the generator of this chain and compute its stationary distribution. Expressions for computation of various performance measures of the system, including the probabilities of loss of customers of different types, are presented. Output flow from the system is characterized. The presented numerical results confirm the high importance of account of correlation in the arrival process. The values of important performance measures for the systems with the correlated arrival process are essentially different from the corresponding values for the systems with the stationary Poisson arrival process. Measurements in many real world systems show poor approximation of real flows by such an arrival process. However, this process is still popular among the telecommunication engineers due to the evident existing gap between the needs of adequately modeling the real life systems and the current state of the theory of algorithmic methods of queueing theory.

1. Introduction

One of the essential sections of the queuing theory is the theory of priority systems. In such systems, customers of different classes are assigned different categories of importance and service is carried out in accordance with a priority scheme. More important customers have an advantage in access to service compared to less important ones. Priority queueing models arise in many real world applications. In particular, the considered in this paper priority queueing model can be effectively used in various applications in telecommunication networks, where traffic prioritization is required, e.g., when using the IEEE 1588 synchronization protocol in cellular networks, DVB-T2 video transmission for synchronizing TV transmitters, in unmanned vehicle systems, in telemedicine applications, etc., it is necessary to ensure the guaranteed delivering of the highest priority packets. Similar models are used for voice and data transmission in multiprocessor switching nodes to ensure priority of voice traffic. The model has applications in other areas, including scheduling computations in multiprocessor systems, operation of medical institutions, etc. Various priority schemes are used in hospital emergency departments during sorting incoming patients according to the severity o, e.g.f the injury or disease, see, e.g., [1]. In the case of unreliable systems, a priority customer may be considered as equipment failure. Priority can be also set to maximize a company’s profit or increase system utilization. For instance, an online store the manager can set a high priority for customers of big spenders in order to prevent their departure to other online resources, see [2]. In some telecommunication networks, the priority of a customer is determined by its owner through a Service Level Agreement (SLA), whereby certain customers have chosen to pay more so as to get high-priority access to some high-demand resource.
Another set of potential applications of the priority queueing model like considered in our paper is described in [3] as applications in customer service centers, see [4], airport security checkpoints [5], hospital emergency rooms [6], cloud computing systems [7] or processor management in certain computer operating systems [8].
There is an extensive literature on priority queues. An overview of early research works on priority queueing models can be found in monographs [9,10,11,12,13] and references therein. Mostly of these works focused on the queueing models with stationary Poisson inputs. However, the area of application of these models is currently being greatly narrowed, since flows in modern telecommunication networks do not possess memoryless property of stationary Poisson flow. They are, as a rule, correlated and heterogeneous. In the case of homogeneous customers, a good mathematical model of such flows well known in the literature as a Markovian arrival process ( M A P ), see, e.g., [14,15,16]. A M A P is the significant generalization of the stationary Poisson process to the case of correlated bursty traffic. Queuing systems with a M A P and priorities are discussed in the papers [17,18,19,20,21,22,23,24,25,26].
Markovian arrival processes are a suitable class of stochastic processes to represent correlated traffic in case when all customers are of the same type and only the sequence of inter-arrival times is of interest. However, systems with correlated flows of heterogeneous customers are also of great interest to applications. Such flows are well modeled by a marked Markovian arrival process ( M M A P ), see [27]. Using of a M M A P allows to represent flows where inter-arrival times are correlated across customer classes and to achieve models of greater accuracy. However, very few results are known about priority queue with M M A P . We can only refer to the papers [1,25,26,28]. In [28] M M A P / M A P / 1 queue with preemptive priorities is analysed, the moments of queue size is derived. The article [1] deals with a multi-server queue with a M M A P , preemptive priorities and priority upgrade. For this system, the authors found a condition for the existence of a stationary regime and the bounds for the lengths of the queues. The problem of concrete computation of the stationary distribution of the system states is not touched in that paper. This problem is considered in the recent papers [25,26]. However, those papers deal with single server priority queues. In [25], possibility of increase of the non-preemptive priority during customer stay in the buffer is analysed. In [26], the discipline of flexible providing of non-preemptive priorities is under study.
In this paper, we consider a priority M M A P / P H 1 , 2 / N / N queueing system with two types of customers. As it is pointed out above, although analysis of such multi-server systems is very important for practice, these systems did not get a proper portion of attention of queueing theorists. e.g., the authors of [3] note that “Despite the large number of systems that can be viewed as instances of a priority queue with multiple servers, the literature devoted to their theoretical analysis appears rather moderate as the inherent complexity of these queues hinders their analysis”. It is also mentioned in [3] that “To the best of our knowledge, few exact results exist in the case of priority queues with multiple servers even under the simplest assumption of exponentially distributed service times”. In paper [3], the authors analyze the model similar to the considered in our paper with two essential differences: (i) Several priority classes are suggested in [3] while we consider the case of two classes, (ii) arrival flow is assumed be the renewal with phase type distribution of mutually independent inter-arrival times while we assume that the inter-arrival times can be dependent. One more advantages of our paper consists of the fact that we provide exact analysis of the model while only the accurate approximate solution is proposed in [3].
In this paper, we make some step to provide the analysis of a multi-server priority system. We assume that customers of one of the types have the preemptive priority. The service times of customers of both types have phase type distribution ( P H ) with different parameters. Exact description of the model is given in Section 2. The most important performance measures of the considered system are probabilities of loss of customers of different types. To calculate these probabilities, we, first, described the process of the system operation by a multi-dimensional Markov chain. This description is given in Section 3. The explicit form of the generator of this Markov is given in that section along with the short explanation of the form of the blocks of this generator. In Section 4, we briefly note how to compute stationary distribution of this Markov chain. We do not go into details of computations and give just an advice how to compute this distribution. It is worth to mention that although the state space of the Markov chain is finite, in situation when the number of servers N and (or) state spaces of the underlying processes of arrival and service process are large, computation of this distribution is far from the trivial. Using the stationary distribution been calculated, in Section 4 we also derive loss probabilities associated with the system and a number of other characteristics. Numerical results are provided in Section 5.

2. Model Description

We consider an N -server queueing system without a buffer. Customers of two types arrive to the system according to a M M A P . The arrivals in the M M A P is directed by the underlying process ν t , t 0 , which is an irreducible continuous time Markov chain with the state space { 0 , 1 , , W } . In case of two types of customers, the M M A P is completely defined by the state space of underlying process and ( W + 1 ) × ( W + 1 ) matrices D k , k = 0 , 1 , 2 , or their generating function D ( z ) = D 0 + D 1 z + D 2 z 2 . The entries of the matrix D k give the rate of transitions of the process ν t , t 0 , which are accompanied by generating a customer of type k , k = 1 , 2 . Non-diagonal entries of the matrix D 0 describe the rate of transitions of the process ν t , t 0 , which are not accompanied by generating a customer. Diagonal entries of the matrix D 0 are negative and such that the matrix D ( 1 ) = D 0 + D 1 + D 2 is an infinitesimal generator of the chain ν t , t 0 . The fundamental rate of arrivals of type-k customers is calculated as λ k = θ D k e , k = 1 , 2 , where θ is a row vector of the steady state probabilities of the underlying process ν t . The vector θ is the unique solution to the system θ D ( 1 ) = 0 , θ e = 1 . Here and in the sequal e is a column vector consisting of 1’s, and 0 is a row vector consisting of 0’s. The total fundamental rate of arrivals is λ = λ 1 + λ 2 . The variance of inter-arrival times of customers of type k is calculated as
v ( k ) = 2 θ ( D 0 D k ¯ ) 1 e λ k 1 λ k 2 , k ¯ k , k , k ¯ = 1 , 2 .
The coefficient of correlation of lengths of two successive inter-arrival times of k-type of customers is calculated as
c k = θ ( D 0 + D k ¯ ) 1 λ k D k ( D 0 + D k ¯ ) 1 e 1 λ k 2 ( v ( k ) ) 1 , k ¯ k , k , k ¯ = 1 , 2 .
More details about a M M A P can be found in [27].
The service time of the k-type customer has P H type distribution with irreducible representation ( β k , S k ) . Here β k is a row vector of size M k , and S k is a square matrix of size M k . Thus, the specified service time is interpreted as the time during which an underlying Markov chain m t ( k ) , t 0 , with state space { 1 , , M k , M k + 1 } will reach the only absorbing state M k + 1 . The transition rates of the chain m t ( k ) , t 0 , within the space of transient states { 1 , , M k } are defined by the sub-generator S k , and the rates of transitions to the absorbing state are defined by the vector S 0 ( k ) = S k e . At the time the service starts, the state of the process m t ( k ) , t 0 , is selected from the state space { 1 , , M k } according to the probability row vector β k . The service rate are calculated as μ k = ( β k S k 1 e ) 1 . More information about the P H type distribution can be found, e.g., in [17,29].
We assume that customers of type 1 have the preemptive priority. If a priority customer arrives to the system when all servers are busy and there are servers occupied with non-priority customers, then the arriving priority customer crowds out one of these customers (which is lost) and takes his/her place on the server. If all servers are occupied with priority customers, then an arriving priority customer is lost. If a non-priority customer arrives at the system when all server are busy, he/she leaves the system forever.

3. Process of the System States

Let at the time t ,
  • n t be the number of busy servers, n t = 0 , N ¯ ;
  • r t be the number of servers serving type 1 customers, r t = 0 , n t ¯ ;
  • ν t be the state of the underlying process ν t = 0 , W ¯ ;
  • m t ( j , k ) be the state of underlying process of service on jth server servicing type k customer, m t ( j , 1 ) = 1 , r t ¯ , m t ( j , 2 ) = 1 , n t r t ¯ . We assume the following dynamical enumeration of the busy servers. The servers, which provide service to customers of type 2 , are located after servers serving customers of type 1. In addition, we assume that the servers serving customers of the k-th type are numbered in the order of their occupation, i.e., the server that starts the service is numbered by the maximum number among all servers engaged in servicing customers of this type. When some server finishes the service, the corresponding renumbering of servers occurs.
The operation of the queue under consideration is described by the Markov chain
ξ t = { n t , r t , ν t , m t ( 1 , 1 ) , m t ( 2 , 1 ) , , m t ( r t , 1 ) , m t ( 1 , 2 ) , m t ( 2 , 2 ) , , m t ( n t r t , 2 ) }
with the state space
Ω = { ( n , r , ν , m ( j , 1 ) , m ( l , 2 ) ) , n = 0 , N ¯ , r = 0 , n ¯ , ν = 0 , W ¯ , m ( j , 1 ) = 1 , M 1 ¯ , j = 1 , r ¯ ,
m ( l , 1 ) = 1 , M 2 ¯ , l = 1 , n r ¯ } .
It can be calculated that cardinality of the set Ω is equal to K = ( W + 1 ) ( 1 + n = 1 N l = 0 n M 1 l M 2 n l ) .
Introduce the following notation:
e n is a column vector of size n, consisting of 1’s;
I (O) is an identity (zero) matrix of appropriate dimension. When needed we will identify the dimension of this matrix with suffix;
d i a g { A l , l = 1 , L ¯ } is a diagonal matrix with diagonal blocks A l ;
d i a g { A l , l = 0 , L ¯ } is a sub-diagonal matrix with the sub-diagonal blocks A l ;
d i a g + { A l , l = 0 , L ¯ } is an over-diagonal matrix with the over-diagonal blocks A l ;
⊗ and ⊕ are the symbols of the Kronecker product and sum of matrices, see [30];
A l = A A l , l 1 , A 0 = 1 ;
A l = m = 0 l 1 I n m A I n l m 1 , l 1 , for the matrix A having n rows;
W ¯ = W + 1 ;
Let us arrange the states of the chain ξ t in the lexicographic order and form the matrices Q n , n , n , n = 0 , N ¯ , consisting of the rates of the chain transition from the states corresponding to the value n of the first component to the states corresponding to the value n of this component. Then the infinitesimal generator of the chain is defined by the following theorem.
Theorem 1.
The infinitesimal generator of the Markov chain ξ t has the following block structure:
Q = Q 0 , 0 Q 0 , 1 O O O Q 1 , 0 Q 1 , 1 Q 1 , 2 O O O Q 2 , 1 Q 2 , 2 O O O O O Q N 1 , N 1 Q N 1 , N O O O Q N , N 1 Q N , N ,
where
Q n , n = d i a g { D 0 S 1 r S 2 n r , r = 0 , n ¯ } , n = 0 , N 1 ¯ ,
Q N , N = d i a g { ( D 0 + D 2 ) ) S 1 r S 2 N r , r = 0 , N 1 ¯ , D ( 1 ) S 1 N }
+ d i a g + { D 1 I M 1 r e M 2 β 1 I M 2 N r 1 , r = 0 , N 1 ¯ } ,
Q n , n 1 = d i a g { I W ¯ I M 1 r ( S 0 ( 2 ) ) n r , r = 0 , n 1 ¯ } O W ¯ M 1 n × W ¯ r = 0 n 2 M 1 r M 2 n r 1 | I W ¯ ( S 0 ( 1 ) ) n
+ d i a g { I W ¯ ( S 0 ( 1 ) ) r I M 2 n r r = 1 , n ¯ } , n = 1 , N ¯ ,
Q n , n + 1 = d i a g { D 2 I M 1 r I M 2 n r β 2 , r = 0 , n ¯ } | O W ¯ r = 0 n M 1 r M 2 n r × W ¯ M 1 n + 1
+ O W ¯ r = 0 n 1 M 1 r M 2 n r × W ¯ r = 0 n + 1 M 1 r M 2 n r + 1 O W ¯ M 1 n × W ¯ r = 0 n M 1 r M 2 n r + 1 | D 1 I M 1 n β 1
+ d i a g + { D 1 I M 1 r β 1 I M 2 n r , r = 0 , n 1 ¯ } | O W ¯ r = 0 n M 1 r M 2 n r × W ¯ M 1 n + 1 ,
n = 1 , N 1 ¯ .
Proof. 
The tri-block diagonal form of the generator is easily explained by the evident fact that the customers arrive to the considered system and depart from this system only one-by-one. Therefore, all blocks Q n , n of the generator are equal to zero matrices if | n n | > 1 . Before to immediately prove the expressions for the nonzero blocks of the generator, let us rewrite some blocks in the less compact but more transparent for explanations form:
Q N , N =
D ¯ S 2 N D 1 e M 2 β 1 I M 2 N 1 O O O D ¯ S 1 S 2 N 1 D 1 I M 1 e M 2 β 1 I M 2 N 2 O O O D ¯ S 1 2 S 2 N 2 O O O O D 1 I M 1 N 1 e M 2 β 1 O O O D ( 1 ) S 1 N ,
where D ¯ = D 0 + D 1 ,
Q n , n 1 =
I W ¯ ( S 0 ( 2 ) ) n O O O I W ¯ S 0 ( 1 ) I M 2 n 1 I W ¯ M 1 ( S 0 ( 2 ) ) n 1 O O O I W ¯ ( S 0 ( 1 ) ) 2 I M 2 n 2 I W ¯ M 1 2 ( S 0 ( 2 ) ) n 2 O O O O I W ¯ M 1 n 1 S 0 ( 2 ) O O O I W ¯ ( S 0 ( 1 ) ) n ,
n = 1 , N ¯ ,
Q n , n + 1 =
D 2 I M 2 n β 2 D 1 β 1 I M 2 n O O O D 2 I M 1 M 2 n 1 β 2 D 1 I M 1 β 1 I M 2 n 1 O O O D 2 I M 1 2 M 2 n 2 β 2 O O O O D 1 I M 1 n β 1 ,
n = 0 , N 1 ¯ .
Now, to prove the presented forms of the non-zero blocks Q n , n of the generator, we analyze transitions of the Markov chain ξ t , t 0 , during the infinitesimal length interval.
These blocks for various values of n and n have the following meaning:
  • The non-diagonal entries of the blocks Q n , n , n = 0 , N ¯ , define the transition rates of the chain ξ t , t 0 , which do not lead to the change in the number n of busy servers. The diagonal entries of the blocks Q n , n define the departure rates of the chain ξ t , t 0 , from the corresponding states. If n < N and the number of priority customers in the service is r, transitions, which do not lead to the change of the number n of busy servers, occur either when the underlying process of the M M A P makes an idle transition (i.e., a transition without generation of any customers) or a phase of service time of one of r priority customers is changed or a phase of service time of one of n r non-priority customers is changed. The corresponding transition rates are described by the matrix D 0 S 1 r S 2 n r , r = 0 , n ¯ , n = 0 , N 1 ¯ . It is worth to mention here that the operations of the Kronecker product and sum of matrices are very useful for description of transition rates or transition probabilities of the multi-dimensional random processes with independent Markovian components.
In the case n = N and r < N the number of busy servers and the number of priority customers in the service do not change when M M A P makes an idle transition or a non-priority customer arrives (this customer is lost). The corresponding transition rates are described by the matrix ( D 0 + D 2 ) S 1 r S 2 N r . If in this case a priority customer arrives, he/she pushes a non-priority customer from the service (which is lost) and takes his/her place on the server. In this case the number of priority customers in the service becomes equal to r + 1 . The corresponding transition rates are described by the matrix D 1 I M 1 r e M 2 β 1 I M 2 N r . If at an arrival time all servers are occupied by priority customers, i.e., r = N , an arriving customer, regardless of its priority, is lost. The corresponding transition rates are described by the matrix D ( 1 ) S 1 N .
  • The blocks Q n , n 1 , n = 1 , N ¯ , define the transition rates of the chain ξ t , t 0 , which lead to a decrease in the number of busy servers from n to n 1 . If the number of priority customers in the service is equal to r, such a transition occurs when the service of one of priority customers ends (the corresponding transition rates of the chain ξ t , t 0 , are described by the matrix I W ¯ ( S 0 ( 1 ) ) r I M 2 n r ) or when the service of one of non- priority customers ends (the corresponding transition rates are described by the matrix I W ¯ I M 1 r ( S 0 ( 2 ) ) n r ).
  • The blocks Q n , n + 1 , n = 0 , N 1 ¯ , define the transition rates of the chain ξ t , t 0 , which lead to an increase in the number of busy servers from n to n + 1 . Such an increase occurs if a non-priority customer arrives (the corresponding transition rates are defined by the matrix D 2 I M 1 r I M 2 n r β 2 or a priority customer arrives (the corresponding transition rates are defined by the matrix D 1 I M 1 r β 1 I M 2 n r ).
This completes the proof of the theorem. □

4. Stationary Distribution. Performance Measures

The Markov chain ξ t is irreducible and admits the values in a finite state space. Therefore, a unique stationary distribution of this chain exists for any values of the system parameters. Let p be the row vector of the steady state (stationary) probabilities of the states of the chain enumerated in the lexicographic order. It is well known that the vector p is defined as the unique solution of the Chapman-Kolmogorov (equilibrium or balance) equations
p Q = 0 , p e = 1 .
This system can be solved by any of the well-known methods for solving the finite system of linear algebraic equations. However, in the case of a large dimension of this system, the solution of this system can be not trivial due to existing restrictions on the computer memory and computation speed. Therefore, for solution of this system it is advisable to use a special stable algorithm proposed in [31] and based on the idea of substituting this system of equations by an alternative system derived via consideration of a sequence of specially constructed so called censored Markov chains.
As the result of computations, we obtain the partitioned vector p = ( p 0 , p 1 , , p N ) where p n is a row vector of steady state probabilities corresponding to the value n of the first component of the Markov chain, n = 0 , N ¯ . Having the vectors p n be calculated, we are able to calculate a number of stationary performance measures of the queue under consideration.
  • The distribution of the number of busy servers at an arbitrary time p n = p n e , n = 0 , N ¯ .
  • The average number of busy servers N b u s y = n = 1 N n p n .
  • The distribution of the number of servers providing service to priority customers
    q r ( 1 ) = n = r N p n u ( r , n r ) , r = 0 , N ¯ ,
    where u ( r , n r ) = 0 W ¯ l = 0 r 1 M 1 l M 2 n l T e W ¯ M 1 r M 2 n r 0 W ¯ l = r + 1 n M 1 l M 2 n l T .
    To make clear Formula (1), we note that, multiplying the vector p n by the vector u ( r , n r ) , we select and sum up the entries of the probability vector p n which correspond to the states with n busy servers of which r servers are busy with priority customers. Summing the results over n, we obtain the probability q r ( 1 ) .
  • The average number of servers providing service to priority customers N b u s y ( 1 ) = r = 1 N r q r ( 1 ) .
  • The distribution of the number of servers providing service to non-priority customers
    q m ( 2 ) = n = m N p n u ( n m , m ) , m = 0 , N ¯ .
    Formula (2) is explained similarly to Formula (1).
  • The average number of servers providing service to non-priority customers N b u s y ( 2 ) = m = 1 N m q m ( 2 ) . Evidently, N b u s y = N b u s y ( 1 ) + N b u s y ( 2 ) and this relation can be used for control of accuracy of computations.
  • The probability that a priority customer will be lost
    P l o s s , 1 = 1 λ 1 p N O W ¯ r = 0 N 1 M 1 r M 2 N r × W ¯ I W ¯ e M 1 N D 1 e .
    The brief explanation of Formula (3) can be done as follows. A priority customer will be lost if at the moment of his/her arrival all servers are occupied with priority customers. The ν th entry of the vector p N O W ¯ r = 0 N 1 M 1 r M 2 N r × W ¯ I W ¯ e M 1 N gives the probability that an arbitrary time all server are busy with priority customers and the underlying process of the M M A P is in the state ν , ν = 0 , W ¯ . Multiplying this vector by D 1 e λ 1 , we obtain the probability that at the moment of the priority customer arrival all servers are busy. In this case, the priority customers is lost.
  • The probability that a non-priority customer will be lost due to lack of free servers at the moment of his/her arrival
    P l o s s , 2 i n p u t = 1 λ 2 p N I W ¯ e M 1 0 M 2 N I W ¯ e M 1 1 M 2 N 1 I W ¯ e M 1 N M 2 0 D 2 e .
    Formula (4) is explained similarly to Formula (3).
  • The probability that an arriving priority customer pushes out a non-priority customer from the server
    P l o s s , 2 s e r v = 1 λ 1 p N I W ¯ e M 1 0 M 2 N I W ¯ e M 1 1 M 2 N 1 I W ¯ e M 1 N 1 M 2 1 I W ¯ 0 M 1 N M 2 0 D 1 e .
    The brief explanation of Formula (5) is as follows. The probability P l o s s , 2 s e r v is calculated by considering the situation at the moment of an arrival of a priority customer which meets all servers busy and at least one server occupied with a non-priority customer.
    The ν th entry of the vector p N I W ¯ e M 1 0 M 2 N I W ¯ e M 1 1 M 2 N 1 I W ¯ e M 1 N 1 M 2 1 I W ¯ 0 M 1 N M 2 0 gives the probability that at an arbitrary time all servers are busy, at least one server is occupied with a non-priority customer and the underlying process of the M M A P is in the state ν , ν = 0 , W ¯ . Multiplying this vector by D 1 e λ 1 , we obtain the probability that at the moment of a priority customer arrival all servers are busy and at least one server is occupied with a non-priority customer. In this case the priority customer pushes out a non-priority customer from the server and takes his/her place.
  • The probability that an arbitrary customer arriving to the system will be lost due to the lack of free servers
    P l o s s = λ 1 P l o s s , 1 + λ 2 P l o s s , 2 i n p u t λ .
    The numerator in the right hand side of Formula (6) is the rate of lost customers of two types and the denominator is the total input rate. The probability P l o s s is calculated as the ratio of these rates.

5. Numerical Experiments

In this section, we present the results of numerical experiments that allow us to estimate the effect of the input rate λ and correlation in the M M A P on the system performance measures.
We consider three M M A P s with the same arrival rates of customers of both types but different coefficients of correlation. These M M A P s are defined by the matrix D 0 , D 1 , D 2 as follows. For each M M A P , to get the matrices D 1 , D 2 , we first define a certain matrix D and then split it into the matrices D 1 , D 2 in the proportion D 1 = 0.7 D , D 2 = 0.3 D .
The first M M A P is the superposition of two stationary Poisson flows. In this case the matrices D 0 , D are defined as follows:
D 0 = 6.124137 , D = 6.124137 .
For this M M A P , the coefficients of variation of inter-arrival times are c v a r ( 1 ) = c v a r ( 2 ) = 1 , and the coefficients of correlation of inter-arrival times are c 1 = c 2 = 0 .
The second M M A P is defined by the matrices
D 0 = 8.281261 0 0 0.268743 , D = 8.226134 0.055127 0.149638 0.119104 .
For this M M A P , the coefficients of variation of inter-arrival times are c v a r ( 1 ) = 1.693996 , c v a r ( 2 ) = 3.417903 , and the coefficients of correlation of inter-arrival times are c 1 = 0.023423 , c 2 = 0.187824 .
The third M M A P is defined by the matrices
D 0 = 29.668039 0.003450 0.006900 0.952137 , D = 29.323061 0.341527 0.068995 0.876242 .
For this M M A P , the coefficients of variation of inter-arrival times are c v a r ( 1 ) = 2.394561 , c v a r ( 2 ) = 3.087863 , and the coefficients of correlation of inter-arrival times are c 1 = 0.205982 , c 2 = 0.402641 .
The number of servers N = 8 .
The service time of a priority customer has Erlang distribution defined by the vector β 1 = 1 0 and the matrix S 1 = 40 40 0 40 . The service time of a non-priority customer has Erlang distribution defined by the vector β 1 = 1 0 and the matrix S 1 = 10 10 0 10 .
Experiment 1. In the experiment, we investigate the behavior of loss probabilities associated with the system under consideration. We consider the probabilities P l o s s , 1 , P l o s s , 2 i n p u t and P l o s s , 2 s e r v as functions of the input rate λ and coefficients of correlation c 1 and c 2 .
Figure 1 depicts the dependence of the probability that a priority customer will be lost, P l o s s , 1 , on the input rate λ and coefficients of correlation c 1 and c 2 .
As expected, the value of P l o s s , 1 increases with increasing λ . Under the same value of λ , this probability is essentially greater for the larger coefficients of correlation in the M M A P . This effect is easily explained intuitively. The positive correlation in the arrival process causes fluctuation of the instantaneous arrival rate. Periods of time when customers arrive frequently (and likely a lot of customers is lost due to the business of all servers) alternate with the periods when customers arrive rarely (and likely starvation of the servers occurs).
Figure 2 shows the dependence of the probability of losing non-priority customers due to occupancy of all servers, P l o s s , 2 i n p u t , on the input rate λ and coefficients of correlation c 1 and c 2 .
Comparing the Figure 1 and Figure 2, we see that the behavior of the curves is similar, but with the same λ , the probability P l o s s , 2 i n p u t is greater than the probability P l o s s , 1 . This is because priority customers are blocked only when all servers are occupied with priority customers, while non-priority customers are blocked if all servers are busy with any customers, priority and non-priority.
Figure 3 shows the dependence of the probability that an arriving priority customer will push out a non-priority customer from the server, P l o s s , 2 s e r v , on the input rate λ and coefficients of correlation c 1 and c 2 .
It is seen from the figure that the curves for different M M A P s first increase, and then decrease. The reason for this behavior of the curves lies in the following. When the input rate is relatively small, an arriving priority customer often finds at least one free server and it does not have a need to push out the non-priority customer from the service. As the input rate increases, the system becomes more crowded, and priority customers are forced to push out a non-priority customer to get service. Therefore, the probability P l o s s , 2 s e r v increases and reaches maximum at some point λ m a x . The further decreasing of this probability is explained by the fact that with λ increasing the most servers become busy with priority customers. Then the number of non-priority customers in the service decreases (they are mostly lost upon arrival to the system) and the probability that a priority customer removes a non-priority customer from the service decreases.
Experiment 2. In this experiment, we investigate the mean number of servers providing service to the priority and non-priority customers as functions of the input rate λ and coefficients of correlation c 1 and c 2 .
Figure 4 shows the dependence of the mean number of servers providing service to priority customers, N b u s y , 1 , on the input rate λ and coefficients of correlation c 1 and c 2 .
It is expectable, that the value of N b u s y , 1 increases with λ increasing. More interesting is that, for the same values of λ , the mean number of servers servicing priority customers decreases with increasing correlation in the input flow. Such a relation between the values of N b u s y , 1 for M M A P s with different correlations is explained by the mentioned above fact that, with less correlation, customers arrive more uniformly, which guarantees more uniform occupation of servers and an increase in the mean number of occupied servers.
Figure 5 depicts the mean number of servers providing service to non-priority customers, N b u s y , 2 , as a function of the input rate λ and coefficients of correlation c 1 and c 2 .
It is seen from the figure, that the curves for different M M A P s first increase, and then decrease. Such a behavior of the curves can be explained by the mechanism of occupation of servers by non-priority customers: when the input rate is relatively small, an arriving non-priority customer often finds at least one free server and occupies it. This customer has a good chance of not being pushed out of service by a priority customer. Therefore, in a certain area, the mean number of servers servicing non-priority customers increases with λ increasing. When the input rate increases further, the system becomes more loaded, and priority customers push out non-priority customers from the servers. Thus, most servers become busy with priority customers and the mean number of servers servicing non-priority customer decreases.

6. Output Flow

Quite often, service to customers is provided not by one set of servers but by the series of such sets. This implies the necessity to consider not a separate queueing system, but a tandem or a network of queues. In such a case, it is very important to investigate the output flow from each system. The output flow from the considered queueing system is characterized as follows.
The output flow from the system under consideration is a M M A P . The underlying process of this M M A P is the Markov chain ξ t which describes the operation of the system, i.e.,
ξ t = { n t , r t , ν t , m t ( 1 , 1 ) , m t ( 2 , 1 ) , , m t ( r t , 1 ) , m t ( 1 , 2 ) , m t ( 2 , 2 ) , , m t ( n t r t , 2 ) } , t 0 .
Let us enumerate the states of the chain ξ t in the lexicographic order. Then the output flow is defined by the following theorem.
Theorem 2.
The output flow from the system under study is a M M A P that is defined by the matrix D 0 ( o u t p u t ) , D 1 ( o u t p u t ) , D 2 ( o u t p u t ) , which are calculated by the formulas
D 0 ( o u t p u t ) = Q 0 , 0 Q 0 , 1 O O O O Q 1 , 1 Q 1 , 2 O O O O Q 2 , 2 O O O O O Q N 1 , N 1 Q N 1 , N O O O O Q N , N ,
where the matrices Q n , n , n = 0 , N ¯ , Q n , n + 1 , n = 0 , N 1 ¯ , are defined in Theorem 1,
D 1 ( o u t p u t ) = d i a g { S 0 , 1 ( n ) , n = 1 , N ¯ } ,
where
S 0 , 1 ( n ) = O O O I W ¯ S 0 ( 1 ) I M 2 n 1 O O O I W ¯ ( S 0 ( 1 ) ) 2 I M 2 n 2 O O O I W ¯ ( S 0 ( 1 ) ) n ,
D 2 ( o u t p u t ) = d i a g { S 0 , 2 ( n ) , n = 1 , N ¯ } ,
where
S 0 , 2 ( n ) =
I W ¯ ( S 0 ( 2 ) ) n O O O O I W ¯ M 1 ( S 0 ( 2 ) ) n 1 O O O O I W ¯ M 1 2 ( S 0 ( 2 ) ) n 2 O O O O I W ¯ M 1 n 1 S 0 ( 2 ) O O O O .
Proof. 
The underlying process ξ t of the output M M A P makes idle transitions when:
(i) The underlying process ν t of the input M M A P makes transitions without a customer arrival or the phase of service processes m t ( k ) , k = 1 , 2 , changes in the set of transient states. If the number of busy servers is n, the rates of corresponding transitions are described by the block Q n , n of the matrix D 0 ( o u t p u t ) .
(ii) The process ν t makes a transition accompanied by arrival a customer of type 1 or 2. If the number of busy servers is n, the rates of corresponding transition are described by the block Q n , n + 1 of the matrix D 0 ( o u t p u t ) .
The underlying process ξ t of the output M M A P makes transitions, which are accompanied by generation of type k customer, when one of servers servicing type k customer finishes the service, k = 1 , 2 . If the number of servers servicing type k customer is n, the rates of corresponding transitions are described by the block S 0 , k ( n ) of the matrix D k ( o u t p u t ) , k = 1 , 2 .  □
Corollary 1.
The output rate of type-k customers is computed by
μ k = p D k ( o u t p u t ) e , k = 1 , 2 ,
where p is the solution of the Chapman-Kolmogorov equations derived above.
The loss probability of type-k customers is computed by
P l o s s , k = 1 μ k λ k , k = 1 , 2 .
The loss probability of type-2 customer loss due pushing out of service is computed by
P l o s s , 2 p u s h = P l o s s , 2 P l o s s , 2 i n p u t .

7. Conclusions

In this paper, we investigated the multi-server priority queueing system with correlated flow of two types of customers. Such kind of systems practically is not investigated in the existing literature. The type of a customer defines its priority and distribution of the required service time. The system is analysed under quite general assumptions about the arrival and service processes. We calculated the stationary distribution of system states and the main performance measures including the probability of losses due to the lack of free servers at an arrival moment and due to forcing out of service a non-priority customer with priority ones. We conducted numerical experiments that showed the influence of the mean arrival rate and the effect of correlation in the input flow on the system performance measures. It is evidently seen that the use of the superposition of the stationary Poisson processes as a model of arrival flow leads to the significantly redundant optimism in prediction of the values of the main performance indicators of the system. This is unacceptable in real life applications and justifies the necessity of the provided mathematical analysis of the system.
The used methodology of analysis (primarily the methodology for constructing the structured generator of multi-dimensional Markov chain) looks to be suitable for extension to the systems with more than two priority classes. The results seem to be also extendable to the systems with finite or infinite buffers. The results can be used to model modern telecommunication networks where the flows of information may be essentially heterogeneous with respect to the required bandwidth, importance for the system and tolerance to the losses and (or) delay and jitter.

Author Contributions

Conceptualization, V.K., A.D. and V.V.; methodology, V.K. and A.D.; software, A.D. and V.V.; validation, A.D. and V.V.; formal analysis, V.K.; investigation, A.D.; writing, original draft preparation, V.K. and V.V.; writing, review and editing V.K. and A.D.; supervision A.D. and V.V.; project administration, V.K. and A.D. All authors have read and agreed to the published version of the manuscript.

Funding

This work has been supported by grant No 19-29-06043 of the Russian Foundation for Fundamental Research.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. He, Q.M.; Xie, J.; Zhao, X. Priority queue with customer upgrades. Nav. Res. Logist. 2012, 59, 362–375. [Google Scholar] [CrossRef] [Green Version]
  2. McWherter, D.; Schroeder, B.; Ailamaki, N.; Harchol-Balter, M. Priority mechanisms for OLTP and transactional web applications. In Proceedings of the 20th International Conference on Data Engineering (ICDE 2004), Boston, MA, USA, 2 April 2004; pp. 535–546. [Google Scholar]
  3. Brandwajn, A.; Begin, T. Multi-server preemptive priority queue with general arrivals and service times. Perform. Eval. 2017, 115, 150–164. [Google Scholar]
  4. Gans, N.; Koole, G.; Mandelbaum, A. Telephone call centers: A tutorial and literature review. Manuf. Serv. Oper. Manag. 2002, 5, 79–141. [Google Scholar] [CrossRef] [Green Version]
  5. De Lange, R.; Samoilovich, I.; van der Rhee, B. Virtual queuing at airport security lanes. Eur. J. Oper. Res. 2013, 225, 153–165. [Google Scholar] [CrossRef]
  6. Lin, D.; Patrick, J.; Labeau, F. Estimating the waiting time of multi-priority emergency patients with downstream blocking. Healthc. Manag. Sci. 2014, 17, 88–99. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Ellens, W.; Akkerboom, J.; Litjens, R.; van den Berg, H. Performance of cloud computing centers with multiple priority classes. In Proceedings of the 2012 IEEE Fifth International Conference on Cloud Computing, Honolulu, HI, USA, 24–29 June 2012; pp. 245–252. [Google Scholar]
  8. Stallings, W. Operating Systems Internals and Design Principles, 4th ed.; Prentice Hall: Englewood Cliffs, NJ, USA, 2004. [Google Scholar]
  9. Gnedenko, B.V.; Danielyan, E.A.; Dimitrov, B.N.; Klimov, G.P.; Matvejev, V.F. Priority Queueing Systems; Moscow State University: Moscow, Russian, 1973. (In Russian) [Google Scholar]
  10. Matveev, V.F.; Ushakov, V.G. Queueing Systems; Moscow State University: Moscow, Russian, 1984. (In Russian) [Google Scholar]
  11. Miller, R. Priority queues. Ann. Math. Stat. 1960, 31, 86–1032. [Google Scholar] [CrossRef]
  12. Kleinrock, L. Queueing Systems Volume II: Computer Applications; John Wiley and Sons: New York, NY, USA, 1976. [Google Scholar]
  13. Takagi, H. Queueing analysis: A Foundation of Performance Evaluation, Volume 1: Vacation and Priority Systems, Part 1; North-Holland: Amsterdam, The Netherlands, 1991. [Google Scholar]
  14. Lucantoni, D. New results on the single server queue with a batch Markovian arrival process. Commun. Stat. Stoch. Model. 1991, 7, 1–46. [Google Scholar]
  15. Chakravarthy, S.R. The batch Markovian arrival process: A review and future work. In Advances in Probability Theory and Stochastic Processes; AIP Conference Proceedings: Melville, NY, USA, 2001; pp. 21–49. [Google Scholar]
  16. Dudin, A.N.; Klimenok, V.I.; Vishnevsky, V.M. The Theory of Queuing Systems with Correlated Flows; Springer: Berlin/Heidelberg, Germany, 2019. [Google Scholar]
  17. Bocharov, P.P.; D’Apice, C.; Pechinkin, A.V.; Salerno, S. Queueing Theory; Utrecht-Boston, VSP: Boston, MA, USA, 2004. [Google Scholar]
  18. Choi, B.D.; Hwang, G.U. The MAP, M/G1, G2/1 queue with preemptive priority. J. Appl. Math. Stoch. Anal. 1997, 10, 407–421. [Google Scholar] [CrossRef] [Green Version]
  19. Machihara, F. A bridge between preemptive and nonpreemptive queueing models. Perform. Eval. 1995, 23, 93–106. [Google Scholar] [CrossRef]
  20. Takine, T.; Sengupta, B. A single server queue with service interruptions. Queueing Syst. 1997, 26, 285–300. [Google Scholar] [CrossRef]
  21. Choi, B.D.; Shin, B.C.; Choi, K.B.; Han, D.H.; Jang, J.S. Priority queue with two state Markov modulated arrivals. In Proceedings of the ICC/SUPERCOMM ’96-International Conference on Communications, Dallas, TX, USA, 23–27 June 1996; pp. 1055–1059. [Google Scholar]
  22. Krishnamoorthy, A.; Divya, V. (M, MAP)/(PH, PH)/1 queue with Nonpremptive Priority, Working Interruption and Protection. Reliab. Theory Appl. 2018, 13, 14–34. [Google Scholar]
  23. Sun, B.; Lee, M.H.; Dudin, A.N.; Dudin, S.A. MAP + MAP/M2/N/ queueing system with absolute priority and reservation of servers. Math. Probl. Eng. 2014, 2014, 813150. [Google Scholar] [CrossRef] [Green Version]
  24. Sun, B.; Lee, M.H.; Dudin, S.A.; Dudin, A.N. Analysis of multiserver queueing system with opportunistic occupation and reservation of servers. Math. Probl. Eng. 2014, 2014, 178108. [Google Scholar] [CrossRef]
  25. Klimenok, V.; Dudin, A.; Dudina, O.; Kochetkova, I. Queuing System with Two Types of Customers and Dynamic Change of a Priority. Mathematics 2020, 8, 824. [Google Scholar] [CrossRef]
  26. Dudin, S.; Dudina, O.; Samouylov, K.; Dudin, A. Improvement of fairness of non-preemptive priorities in transmission of heterogeneous traffic. Mathematics 2020, 8, 929. [Google Scholar] [CrossRef]
  27. He, Q.M. Queues with marked calls. Adv. Appl. Probab. 1996, 28, 567–587. [Google Scholar] [CrossRef] [Green Version]
  28. Horvath, G. Efficient analysis of the queue length moments of the MMAP/MAP/1 preemptive priority queue. Perform. Eval. 2012, 69, 684–700. [Google Scholar] [CrossRef] [Green Version]
  29. Neuts, M. Matrix-Geometric Solutions in Stochastic Models; The Johns Hopkins University Press: Baltimore, MD, USA, 1981. [Google Scholar]
  30. Graham, A. Kronecker Products and Matrix Calculus with Applications; Ellis Horwood: Chichester, UK, 1981. [Google Scholar]
  31. Klimenok, V.I.; Kim, C.S.; Orlovsky, D.S.; Dudin, A.N. Lack of invariant property of Erlang BMAP/PH/N/0 model. Queueing Syst. 2005, 49, 187–213. [Google Scholar] [CrossRef]
Figure 1. The dependence of the probability P l o s s , 1 on the input rate λ for the marked Markovian arrival processes ( M M A P s) with different coefficients of correlation.
Figure 1. The dependence of the probability P l o s s , 1 on the input rate λ for the marked Markovian arrival processes ( M M A P s) with different coefficients of correlation.
Mathematics 08 01501 g001
Figure 2. The dependence of the probability P l o s s , 2 i n p u t on the input rate λ for the M M A P s with different coefficients of correlation.
Figure 2. The dependence of the probability P l o s s , 2 i n p u t on the input rate λ for the M M A P s with different coefficients of correlation.
Mathematics 08 01501 g002
Figure 3. The dependence of the probability P l o s s , 2 s e r v on the input rate λ for the M M A P s with different coefficients of correlation.
Figure 3. The dependence of the probability P l o s s , 2 s e r v on the input rate λ for the M M A P s with different coefficients of correlation.
Mathematics 08 01501 g003
Figure 4. Dependence of the mean number N b u s y , 1 of servers providing service to priority customers on the input rate λ for the M M A P s with different coefficients of correlation.
Figure 4. Dependence of the mean number N b u s y , 1 of servers providing service to priority customers on the input rate λ for the M M A P s with different coefficients of correlation.
Mathematics 08 01501 g004
Figure 5. Dependence of the mean number N b u s y , 2 of servers providing service to non-priority customers on the input rate λ for the M M A P s with different coefficients of correlation.
Figure 5. Dependence of the mean number N b u s y , 2 of servers providing service to non-priority customers on the input rate λ for the M M A P s with different coefficients of correlation.
Mathematics 08 01501 g005

Share and Cite

MDPI and ACS Style

Klimenok, V.; Dudin, A.; Vishnevsky, V. Priority Multi-Server Queueing System with Heterogeneous Customers. Mathematics 2020, 8, 1501. https://doi.org/10.3390/math8091501

AMA Style

Klimenok V, Dudin A, Vishnevsky V. Priority Multi-Server Queueing System with Heterogeneous Customers. Mathematics. 2020; 8(9):1501. https://doi.org/10.3390/math8091501

Chicago/Turabian Style

Klimenok, Valentina, Alexander Dudin, and Vladimir Vishnevsky. 2020. "Priority Multi-Server Queueing System with Heterogeneous Customers" Mathematics 8, no. 9: 1501. https://doi.org/10.3390/math8091501

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop