The k-out-of-n:G System Viewed as a Multi-Server Queue

Achyutha Krishnamoorthy; Anu Nuthan Joshua; Ambily P. Mathew

doi:10.3390/math12020210

Abstract

This paper extends the k-out-of-n:G reliability system to a multi-server queue. We study a multi-server reliability-queuing model with the N-policy of repair. The queuing system considered here has n servers, each of which has identically and exponentially distributed service times with parameter

μ

. Servers are subject to breakdown at an exponential rate

γ

. The repair process follows the N-policy of repair. Although these servers work independently of each other, service can be provided only when k functional servers are available in the system. We study the model in the steady state, using the matrix analytic method. We evaluate some associated performance measures and provide graphical/numerical illustrations. We consider an optimization problem, and the results of the study are presented.

Keywords:

reliability; k-out of n system; N-policy; HAP system; breakdown

MSC:

60K25; 60K10; 60K20; 90B22; 90B25

1. Introduction

This paper extends the reliability system k-out-of-n:G to a multi-server queue (n parallel server, with server failure and N-policy of repair). As in the k-out-of-n:G system, at least k servers should be operational in order for the whole system to work (system can provide service). At the moment that the number of operational servers is reduced to

k - 1

, the service is disrupted. As a result, the system can start operation only after one working server is added to the system. This model can be applied directly to the high-altitude platform (HAP) system used for smooth telecommunication when all other facilities fail, due to natural calamities. The system collapses when the number of operational servers is reduced to

k - 1

, irrespective of the number of customers in the system. As with parallel (at least one operational component should be UP for the system to function)/serial (all components should be operational for the system to be functional) systems in the reliability context, we encounter a similar situation in the queuing model discussed in this paper. Thus, this paper extends the k-out-of-n reliability system to the queuing system and simultaneously to the classical multi-server queuing system. In the proposed queuing model, an element of dependency on the number of operational servers arises. Thus, it differs from the classical multi-server queuing models in that at least k (

k = 1,

for the classical multi-server system with server failures and their repair) servers should be operational to provide a service. This phenomenon can be interpreted as the system becoming overloaded when the number of operational servers reduces to

k - 1 .

The mathematical foundation of reliability was laid by the pioneering work of Barlow and Proschan (1965) [1]. The element of interest is the reliability of the machine in a given interval of time or failure-free operation up to time t. Naturally, one investigates the distribution of the number of failed components at any random time when the machine is in operation. The k-out-of-n system is classified as COLD, WARM or HOT, depending on whether the operational components are also subject to deterioration when the system is down. In the COLD system, no operational components fail when the system is DOWN (not operational); for the WARM system, the operational components fail at a reduced rate when the system is DOWN compared to that when it is UP (operational); and in the case of a HOT system, the failure rate of components is the same, irrespective of whether the system is UP or DOWN. As such, for k-out-of-n systems, the notion of “providing service” was not in vogue until Krishnamoorthy, Sathian and Viswanath (2016) [2] introduced it. They considered separately two distinct cases of providing service. In one of them, the system serves (repairs) failed machines in an organization. The k-out-of-n system is considered as a single server subject to failure. Therefore, at most, one failed machine undergoes repair by the server at a time. The repair of failed components is according to some specific policy. In this case, the system state remains finite because it is assumed that there are only a limited number of machines in the organization. In the second model, the authors assumed that when the server (the k-out-of-n system) is idle, it provides a service to external customers. However, priority is given to the repair of internal customers (for example, failed machines in the organization). For these two cases, the authors investigated continuous-time server reliability.

The N-policy was introduced by Yadin and Naor in [3]. The N-policy of repair was introduced by Krishnamoorthy et al. (2002) [4] and is as follows: when the number of operational components comes down to level

N (k \leq N \leq n)

, a repair facility starts repairing failed units, one at a time. This repair process continues until all failed components are brought back to an operational state. It is assumed that the components repaired are as good as new. This assumption is essential for mathematical tractability. There is another means of keeping the system reliability high—placing an order for

n - k + 1

components when the number of operational components goes down to N. It takes a certain amount of time, deterministic or random, for the order to materialize. Sometimes, this may take place only after the system becomes non-operational. The higher the N value, the higher the probability of the system remaining UP until the order materialization. We can further improve the system reliability by combining the N-policy of repair and the order placement for new components. The order can be placed above the level at which the number of operating components drops to N or we can place the order at this level or even below this level. This policy can be further modified for the cancellation of a placed order if the number of operational components reaches the maximum n through the repair process before the materialization of the order placed. Thus, various extensions are possible. N-policy queuing systems with server breakdowns (both working and non-working breakdowns) are studied extensively in the queuing literature (for example, [5,6,7]).

Other than the N-policy of repair, for policies such as D (the accumulated work load reaching or crossing the level D (D is a continuous random variable)) and T, the time until the repair facility is activated after the system completely returns to a fully functional state (all components are in an operational state due to repair), are used. Additional solutions are different combinations of these policies. In the queuing literature, the N-policy of repair is the best- and the T-policy is the worst-performing. Similarly, comparisons among the combinations of policies can also be performed.

Highlights of This Work

The present work is the first to consider the k-out-of-n system as a multi-server queue.
Unlike the classical multi-server queue, in the present work, at least k servers should be operational to provide a service. The multi-server case can be deduced from the present work by assuming that $k = 1$ .
Two birth and death processes encountered in the analysis are (i) the customer’s arrival and service processes and (ii) the accumulation of servers that break while providing a service; until this number reaches $n - N$ (pure death process), the repair process starts from this point of time and, with this, we have a birth and death process. Note that we assume all random variables (inter-arrival times, service time, repair time of failed servers) involved to be exponentially distributed. The combined process turns out to be a birth and death process.
The system under consideration reduces to the classical $M / M / \infty$ queue if we assume that the servers (system components) do not fail and that $k = 1$ . Then, we take the limit as $n \to \infty .$

The remaining part of this paper is as follows. In Section 2, the description of the problem is given. In Section 3, the mathematical modeling of the problem and its analysis are presented. The stability condition is derived in Section 4 and the steady state probabilities found. Section 5 outlines certain distributions and performance measures of the system’s behavior. Numerical and graphical illustrations providing insights into the working of the system are included in Section 6. An optimization problem is considered in Section 7. Concluding remarks are provided in Section 8.

2. Model Description

The model considered here is one that extends the k-out-of-n:G reliability system to a multi-server queue, especially to a multi-server queue with n servers/units working in parallel.

We consider a multi-server queueing system where customers arrive according to a Poisson process with parameter

λ

. The service facility consists of n servers/units, each of which can provide service to individual customers. We are considering a k-out-of-n:G system All the n parallel servers have identically and exponentially distributed service times with parameter

μ

. Although these servers work independently, service will take place only if at least k servers are operational or in working condition. These servers are susceptible to breakdown. Breakdown occurs at exponentially distributed time intervals with rate

γ

. Repair will take place according to the N-policy of repair as described in the previous section. When the number of operational components comes down to the level

N (k \leq N < n)

, a repair facility starts repairing failed units one at a time. The repair process continues until all failed components are restored to an operational state. It is assumed that the time taken to repair each unit is exponentially distributed with parameter

δ

. The system considered here is COLD, i.e., when the system fails due to the lack of at least k operational units, the units that are operational do not deteriorate until the system restarts again, with the failed units replaced with new ones. If we are considering a k-out-of-

n : F

COLD system, system failure occurs when any k of the n units fail. Here, the model under consideration is a k-out-of-n:G COLD system.

We formulate the problem mathematically as follows.

3. Mathematical Formulation

Define, for

t \geq 0

,

$N_{1} (t) :$ the number of customers in the system at time t;
$N_{2} (t) :$ the number of operational units in the system at time t;
$R (t) :$ the status of the server that repairs the failed components at time t.

$R (t) = \{\begin{matrix} 0 & i f t h e r e p a i r i s O F F \\ 1 & i f t h e r e p a i r i s O N \end{matrix}$

Then,

{(N_{1} (t), N_{2} (t), R (t)) | t \geq 0}

is a regular irreducible

C T M C

on state space

Ω = {(n_{1}, n_{2}, 1) : n_{1} \geq 0; k - 1 \leq n_{2} \leq n - 1} ⋃ {(n_{1}, n_{2}, 0) : n_{1} \geq 0; N + 1 \leq n_{2} \leq n} .

The generator matrix for this process when the states are arranged lexicographically is of the form

Q = [\begin{matrix} A_{1}^{0} & A_{0} \\ A_{2}^{1} & A_{1}^{1} & A_{0} \\ A_{2}^{2} & A_{1}^{2} & A_{0} \\ ⋱ & ⋱ \\ A_{2}^{n - 1} & A_{1}^{n - 1} & A_{0} \\ A_{2} & A_{1} & A_{0} \\ ⋱ & ⋱ & ⋱ \end{matrix}]

A_{1}^{i}

contains transitions within level i for

0 \leq i \leq n - 1

.

A_{2}^{i}

(which are diagonal matrices) contain transitions from level i to i − 1 for

1 \leq i \leq n - 1

.

A_{2}

(diagonal matrix) contains transitions from i to i − 1 and

A_{1}

within level i.

A_{0}

(diagonal matrix) contains transitions from i to i + 1,

\forall i

. All the matrices are square matrices of dimension d = 2n − (N + k) + 1.

A_{0} = [\begin{matrix} 0 \\ λ I_{d - 1} \end{matrix}]

A_{0_{(n_{1}, n_{2}, n_{3})}}^{(n_{1} + 1, n_{2}, n_{3})} = λ

For

1 \leq i \leq k,

A_{2}^{i} = [\begin{matrix} 0 \\ i μ I_{d - 1} \end{matrix}]

{A_{2}^{i}}_{(n_{1}, n_{2}, n_{3})}^{(n_{1} - 1, n_{2}, n_{3})} = n_{1} μ

For

k + 1 \leq i \leq N,

A_{2}^{i} = [\begin{matrix} 0 \\ k μ \\ (k + 1) μ \\ ⋱ \\ (i - 1) μ \\ i μ I \end{matrix}]

I is an identity matrix of order

d - (i - k + 1) .

{A_{2}^{i}}_{(n_{1}, n_{2}, n_{3})}^{(n_{1} - 1, n_{2}, n_{3})} = \{\begin{matrix} n_{2} μ & i f k \leq n_{2} \leq i - 1 \\ i μ & i f i \leq n_{2} \leq n \end{matrix}

For

N + 1 \leq i \leq n - 1

,

A_{2}^{i} = [\begin{matrix} 0 \\ k μ \\ (k + 1) μ \\ ⋱ \\ N μ \\ I_{2} \otimes (N + 1) μ \\ ⋱ \\ I_{2} \otimes (i - 1) μ \\ i μ I \end{matrix}]

{A_{2}^{i}}_{(n_{1}, n_{2}, n_{3})}^{(n_{1} - 1, n_{2}, n_{3})} = \{\begin{matrix} n_{2} μ & i f k \leq n_{2} \leq i - 1 \\ i μ & i f i \leq n_{2} \leq n \end{matrix}

A_{2} = [\begin{matrix} 0 \\ k μ \\ (k + 1) μ \\ ⋱ \\ N μ \\ I_{2} \otimes (N + 1) μ \\ ⋱ \\ I_{2} \otimes (n - 1) μ \\ n μ \end{matrix}]

{A_{2}}_{(n_{1}, n_{2}, n_{3})}^{(n_{1} - 1, n_{2}, n_{3})} = n_{2} μ

To easily represent matrix

A_{1}^{i}

, the states

{(n_{1}, n_{2}, 1) : k - 1 \leq n_{2} \leq N}

of order

N - k + 2

are grouped together and given subscript 1, the states

{(n_{1}, n_{2}, r) : N + 1 \leq n_{2} \leq n - 1, r = 0, 1}

of order

2 (n - 1 - N)

are grouped together and given subscript 2, and the state

{(n_{1}, n_{2}, 0) : n_{2} = n}

is given subscript 3.

A_{1}^{0} = L

A_{1}^{i} = L - A_{2}^{i}

L = [\begin{matrix} L_{11} & L_{12} & 0 \\ L_{21} & L_{22} & L_{23} \\ 0 & L_{32} & L_{33} \end{matrix}]

L_{11} = γ E^{-} + δ E^{+} - δ I_{N - k + 2} - (λ + γ) [\begin{matrix} 0 \\ I_{N - k + 1} \end{matrix}]

where

E^{-}

and

E^{+}

(refer to page 19) are square matrices of dimension

N - k + 2

.

L_{12} = E_{N - k + 21} \otimes [0 δ]

L_{21} = E_{N - k + 21}^{'} \otimes {[γ γ]}^{'}

E_{N - k + 21}

is a matrix (refer to page 19) of order

(N - k + 2) \times (n - 1) - N

.

L_{22} = E^{+} \otimes [\begin{matrix} 0 & 0 \\ 0 & δ \end{matrix}] + E^{-} \otimes [\begin{matrix} γ & 0 \\ 0 & γ \end{matrix}] - I_{n - 1 - N} \otimes [\begin{matrix} 0 & 0 \\ 0 & δ \end{matrix}] - (λ + γ) I_{2 (n - 1 - N)}

where

E^{-}

and

E^{+}

(refer to page 19) are square matrices of dimension

(n - 1) - N

.

L_{23} = e_{n - 1 - N} \otimes {[0 δ]}^{'}

L_{32} = e_{n - 1 - N}^{'} \otimes [γ 0],

e_{n - 1 - N}

is a column vector with 1 in the

(n - 1 - N)

position.

L_{33} = - (λ + γ)

A_{1} = L - A_{2} .

The transitions in

A_{1}^{0} = L

excluding diagonal entries are given as

{A_{1}^{0}}_{(n_{1}, n_{2}, n_{3})}^{(n_{1}, m_{2}, m_{3})} = \{\begin{matrix} δ & i f n_{3} = 1, m_{3} = 1, m_{2} = n_{2} + 1, k - 1 \leq n_{2} \leq n - 2 \\ δ & i f n_{3} = 1, m_{3} = 0, n_{2} \leq n - 1, m_{2} = n \\ γ & i f n_{3} = m_{3}, m_{2} = n_{2} - 1, n_{2} \neq N + 1 \\ γ & i f n_{3} = 0, m_{3} = 1, m_{2} = N, n_{2} = N + 1 \end{matrix}

The transitions from one state to another are given below:

Transitions due to the arrival of a customer to the system:
$(n_{1}, n_{2}, r) \to (n_{1} + 1, n_{2}, r)$ with rate λ when $n_{2} \neq k - 1 .$
Transitions due to the service completion of a customer:
$(n_{1}, n_{2}, r) \to (n_{1} - 1, n_{2}, r)$ with rate
$n_{1} μ$ when $1 \leq n_{1} \leq k;$
$n_{2} μ$ when $k + 1 \leq n_{1} \leq n, n_{2} \leq n_{1};$
$n_{1} μ$ when $k + 1 \leq n_{1} \leq n, n_{2} > n_{1};$
Transitions due to the breakdown of an operational unit in the k-out-of-N system:
$(n_{1}, n_{2}, r) \to (n_{1}, n_{2} - 1, r)$ with rate γ when $n_{2} \geq k, n_{2} \neq N + 1, r = 0, 1 .$
$(n_{1}, N + 1, 0) \to (n_{1}, N, 1)$ with rate γ.
Transitions due to the repair of an operational unit in the k-out-of-N system:
$(n_{1}, n_{2}, 1) \to (n_{1}, n_{2} + 1, 1)$ with rate δ when $k - 1 \leq n_{2} \leq n - 1 .$
$(n_{1}, n - 1, 1) \to (n_{1}, n, 0)$ with rate δ.

4. Stability Analysis

4.1. Stability Condition

Let

A = A_{2} + A_{1} + A_{0}

.

A = [\begin{matrix} - δ & δ \\ γ & - γ - δ & δ \\ γ & - γ - δ & δ \\ ⋱ & ⋱ \\ γ & - γ - δ & δ \\ γ & - γ - δ & B_{1} \\ B_{2} & B_{3} & B_{4} \\ B_{5} & B_{3} & B_{4} \\ ⋱ & ⋱ & ⋱ \\ B_{5} & B_{3} & B_{4} \\ B_{5} & B_{3} & B_{6} \\ B_{7} & - γ \end{matrix}]

where

B_{1} = [\begin{matrix} 0 & δ \end{matrix}], B_{2} = [\begin{matrix} γ \\ γ \end{matrix}], B_{3} = [\begin{matrix} - γ & 0 \\ 0 & - γ - δ \end{matrix}], B_{4} = [\begin{matrix} 0 & 0 \\ 0 & δ \end{matrix}],

B_{5} = [\begin{matrix} γ & 0 \\ 0 & γ \end{matrix}], B_{6} = [\begin{matrix} 0 \\ δ \end{matrix}], B_{7} = [\begin{matrix} γ & 0 \end{matrix}]

Let

π = (π_{k - 1}, π_{k}, π_{k + 1} \dots π_{N - 1}, π_{N}, π_{N + 1}, \dots π_{n - 1}, π_{n})

be the steady-state probability vector of the infinitesimal generator matrix

A

.

In other words,

π A = 0; π e = 1 .

(1)

Define

U_{n - 1} = \frac{1}{γ} B_{6}

U_{i} = \{\begin{matrix} {(B_{3} + B_{7} U_{n - 1})}^{- 1} (- B_{4}) & f o r i = n - 2 \\ {(B_{3} + B_{5} U_{i + 1})}^{- 1} (- B_{4}) & f o r N - 1 \leq i \leq n - 3 \\ {(B_{3} + B_{5} U_{N + 1})}^{- 1} (- B_{1}) & f o r i = N \end{matrix}

For

k \leq i \leq N

,

π_{i} = {(\frac{δ}{γ})}^{i - (k - 1)} π_{k - 1}

For

N + 1 \leq i \leq n - 1

,

π_{i} = π_{i - 1} U_{i - 1}

π_{n} = π_{n - 1} U_{n - 1}

From the normalizing condition

π e = 1

, we have

π_{k - 1} [\sum_{i = 0}^{k} {(\frac{δ}{γ})}^{i} + {(\frac{δ}{γ})}^{k} \sum_{l = N}^{n - 1} \prod_{j = N}^{l} U_{j} e] = 1 .

(2)

The infinitesimal generator of this Markov chain indicates that it is a level-independent quasi-birth–death process. Thus, this queuing system is stable if and only if

π A_{0} e < π A_{2} e

; see Neuts [8]. The stability condition is given as follows:

Theorem 1.

The given system is stable if and only if

λ < \sum_{i = 0}^{N - k} (k + i) μ {(\frac{δ}{γ})}^{i + 1} π_{k - 1} + \sum_{l = N + 1}^{n} l μ \prod_{j = N}^{l} U_{j} e {(\frac{δ}{γ})}^{k} π_{k - 1}

(3)

4.2. Steady-State Probability Vector

Assuming the stability of the system, we proceed to find the steady-state probability of the system states.

Let

x

be the steady-state probability vector of

Q

, i.e.,

x

satisfies

x Q = 0

and

x e = 1 .

We partition this vector as

x = (x_{0}, x_{1}, x_{2} \dots),

where

x_{i} = x_{(i, n_{2}, r)}

are of dimension

d = 2 n - (N + k) + 1

.

x_{n + i} = x_{n} R^{i}, i \geq 1

where the matrix R is the minimal non-negative solution to the matrix quadratic equation

R^{2} A_{2} + R A_{1} + A_{0} = 0

and the vectors

x_{0}, x_{1}, \dots, x_{n} \dots

are obtained by solving the equations

\begin{matrix} x_{0} A_{1}^{0} + x_{1} A_{2}^{1} = 0 \end{matrix}

(4)

\begin{matrix} x_{i - 1} A_{0} + x_{i} A_{1}^{i} + x_{2} A_{2}^{i} = 0, for i \leq i \leq n - 2 \end{matrix}

(5)

\begin{matrix} x_{n - 2} A_{0} + x_{n - 1} A_{1}^{n - 1} + x_{n} A_{2} = 0 \end{matrix}

(6)

\begin{matrix} x_{n - 1} A_{0} + x_{n} (A_{1} + R A_{2}) = 0 \end{matrix}

(7)

subject to the normalizing condition

\sum_{i = 0}^{n - 1} x_{i} e + x_{n} {(I - R)}^{- 1} e = 1

(8)

4.3. Special Case

If we consider a system in which

k = 1

and assume that the servers do not break down, the above queuing model reduces to the classical

M / M / n

queuing system. In this case,

{N_{1} (t); t \geq 0}

forms a

C T M C

on state space

{0, 1, 2, 3, . . .}

The infinitesimal generator matrix reduces to the form

Q^{'} = [\begin{matrix} - λ & λ \\ μ & - (λ + μ) & λ \\ 2 μ & - (λ + 2 μ) & λ \\ ⋱ & ⋱ & ⋱ \\ n μ & - (λ + n μ) & λ \\ ⋱ & ⋱ & ⋱ \end{matrix}]

The above system is stable if and only if

λ < n μ

. For further details, refer to [9].

5. System Characteristics

5.1. Distribution of Server Idle Times

Some servers are not operational when the number of customers in the system is less than the number of working servers. Here, we compute the distribution of server idle times.

Consider the Markov Chain

{(N_{1} (t), N_{2} (t), R (t)) | t \geq 0}

on state space,

Ω_{1} = {(n_{1}, n_{2}, 1) : 0 \leq n_{1} \leq n - 1; n_{1} < n_{2} : k - 1 \leq n_{2} \leq n - 1} ⋃ {(n_{1}, n_{2}, 0) : 0 \leq n_{1} \leq n - 1; n_{1} < n_{2} : N + 1 \leq n_{2} \leq n} ⋃ {*} .

Here,

{*}

denotes the absorbing state indicating that all servers are busy. The time to absorption of this

C T M C

to

{*}

is the time for which the

n_{2} - n_{1}

operational units are idle. The infinitesimal generator matrix of this Markov chain is of the form

Q_{1} = [\begin{matrix} T_{1} & T_{1}^{0} \\ 0 & 0 \end{matrix}]

Q_{1} = [\begin{matrix} A_{1}^{0} & A_{0} \\ A_{2}^{1} & A_{1}^{1} & A_{0} \\ ⋱ & ⋱ \\ A_{2}^{k - 2} & A_{1}^{k - 2} & A_{0} \\ C_{2}^{k - 1} & C_{1}^{k - 1} & C_{0}^{k - 1} \\ ⋱ & ⋱ & ⋱ \\ C_{2}^{n - 2} & C_{1}^{n - 2} & C_{0}^{n - 2} \\ C_{2}^{n - 1} & C_{1}^{n - 1} \end{matrix}]

For

k - 1 \leq i \leq N - 1

,

C_{2}^{i} = [\begin{matrix} 0 & i μ I_{2 n - i - 1 - N} \end{matrix}]

C_{0}^{i} = [\begin{matrix} 0 \\ λ I_{2 n - i - 2 - N} \end{matrix}]

For

N \leq i \leq n - 2

C_{2}^{i} = [\begin{matrix} 0 & i μ I_{2 (n - i) - 1} \end{matrix}]

C_{0}^{i} = [\begin{matrix} 0 \\ λ I_{2 (n - i) - 3} \end{matrix}]

C_{2}^{n - 1} = [\begin{matrix} 0 & 0 & (n - 1) μ \end{matrix}]

C_{1}^{n - 1} = [\begin{matrix} - (n - 1) μ \end{matrix}]

The matrices

C_{1}^{i}; k - 1 \leq i \leq N - 1

are square matrices of order

2 n - (i + 1 + N)

obtained by deleting

i - 1

rows and columns from

A_{1}^{i} : k - 1 \leq i \leq N - 1

.

C_{1}^{i}; N \leq i \leq n - 1

are square matrices of order

2 n - 2 - i; N \leq i \leq n - 1

obtained by deleting

N - 1 + (i - N)

rows and columns from

A_{1}^{i}; N \leq i \leq n - 1

.

The matrix

T_{1}^{0}

is a column matrix

[0, . . 0, T_{1}^{k - 1}, . . T_{1}^{N - 1}, T_{1}^{N}, . . T_{1}^{n - 1}]

.

T_{1}^{i}; k - 1 \leq i \leq N - 1

is a column matrix with the first entry

λ + γ

.

T_{1}^{i}; N \leq i \leq n - 2

is a column matrix with the first two entries

λ + γ

.

T_{1}^{n - 1} = [λ + γ]

Theorem 2.

Let U be the random variable designating the server idle time. Then,

P (U > t) = α . e^{Q_{1} t} e

. This distribution is also of the

P H

type with representation

P H (α, Q_{1})

, where

α = \frac{1}{d} (x_{(0, k - 1, 1)}, . . ., x_{(0, N, 1)} . . . x_{(0, n, 0)} . . ., x_{(k - 1, k, 1)} . . x_{(k - 1, n, 0)}, . . . x_{(n - 2, n - 1, 0)}, . ., x_{(n - 2, n, 0)}, x_{(n - 1, n, 0)})

, and d is the normalizing constant.

5.2. Distribution of First Passage Time from an Inoperative State to n-Operational Server State

Under the assumption that a sufficiently large number of customers are present in the system, we compute the distribution of time taken for the system to pass from a state in which there are

k - 1

operational servers to a state in which there are n operational servers.

Consider the Markov chain

{(N_{2} (t), 1) | t \geq 0}

on state space

Ω_{2} = {(n_{2}, 1) : k - 1 \leq n_{2} \leq n - 1} ⋃ {*} .

Here,

{*}

denotes the absorbing state indicating that all servers are working. The time to absorption of this

C T M C

to

{*}

is the first passage time from state

(k - 1, 1)

to the state

{*}

. The infinitesimal generator matrix of this Markov chain, when the states are arranged in ascending order of

n_{2}

, is of the form

Q_{2} = [\begin{matrix} - δ & δ & 0 \\ γ & - (δ + γ) & δ & 0 \\ ⋱ & ⋱ & ⋱ \\ γ & - (δ + γ) & δ \\ γ & - (δ + γ) & δ \\ 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}]

Theorem 3.

Let V be the random variable denoting the first passage time from an inoperative state to an n-operational server state, under the assumption of a large number of customers in the system. Then, V has a

P H

distribution with representation

P H (β, Q_{2})

, where

β = (1, 0, . . ., 0)

.

5.3. Distribution of First Passage Time from an n-Operational Server State to an Inoperative State

Here, we also compute the distribution of time taken for the system to pass from a state in which there are n operational servers to a state in which there are

k - 1

operational servers, under the assumption that a sufficiently large number of customers are present in the system.

Consider the Markov chain

{(N_{2} (t), r) | t \geq 0}

on state space

Ω_{3} = {(n_{2}, 1) : k \leq n_{2} \leq n - 1} ⋃ {(n_{2}, 0) : N + 1 \leq n_{2} \leq n} ⋃ {*} .

Here,

{*}

denotes the absorbing state indicating that only

k - 1

servers are working/operational. The time to absorption of this

C T M C

to

{*}

is the first passage time from state

(n, 0)

to the state

{*}

. When the states are arranged in decreasing order of

n_{2}

, the infinitesimal generator matrix of this Markov chain is of the form

Q_{3} = [\begin{matrix} S_{1} & S_{1}^{0} \\ 0 & 0 \end{matrix}]

where

S_{1} = [\begin{matrix} S_{11} & S_{12} & 0 \\ S_{21} & S_{22} & S_{23} \\ 0 & S_{32} & S_{33} \end{matrix}]

S_{11} = - γ

S_{12} = γ

S_{21} = e_{1} \otimes {[0, δ]}^{'},

is

e_{1}

is a column matrix of order

n - 1 - N .

S_{22} = I_{(n - 1 - N)} \otimes [\begin{matrix} - γ \\ - (γ + δ) \end{matrix}] + E^{-} \otimes [\begin{matrix} 0 & 0 \\ 0 & δ \end{matrix}] + E^{+} \otimes [\begin{matrix} γ & 0 \\ 0 & γ \end{matrix}]

where

E^{-}

and

E^{+}

are square matrices (refer to page 19) of dimension

n - 1 - N

.

S_{23} = E_{(n - 1) - N 1} \otimes {[γ γ]}^{'}

E_{(n - 1) - N 1}

is a matrix (refer to page 19) of order

(n - 1) - N \times (N - k) .

S_{32} = E_{1 (n - 1) - N} \otimes [0 δ]

E_{1 (n - 1) - N}

is a matrix of order

(N - k) \times (n - 1) - N .

S_{33} = - (λ + γ) I_{N - k} + δ E^{-} + γ E^{+}

where

E^{-}

and

E^{+}

are square matrices (refer to page 19) of dimension

N - k

.

S_{1}^{0} = γ e_{2 n - (N + k)}

Theorem 4.

Let W be the random variable denoting the first passage time from the n-operational server state to an inoperative state, under the assumption of a large number of customers in the system. Then, W has a

P H

distribution with representation

P H (χ, Q_{3})

, where

χ = (1, 0, . . ., 0)

.

5.4. Distribution of Number of Times That the System Becomes Inoperative before Reaching the n Server State

Under the assumption of a large number of customers in the system, we compute the distribution of the number of times that the system hits the

k - 1

server state before reaching the n server state. Let

Δ

denote the absorbing state indicating the number of working servers hitting n. Then,

(M (t), N_{2} (t)),

(where

M (t)

is the number of times that the system hits the

k - 1

server state before reaching the n server state) is the Markov chain on state space

Ω_{4} = {Δ} ⋃ {(m, n_{2}) : m \geq 0, k - 1 \leq n_{2} \leq n - 1} .

The infinitesimal generator is of the form

Q_{4} = [\begin{matrix} 0 & 0 & 0 & 0 & 0 & . . . \\ δ e_{n - k} & M_{0} & M_{1} & 0 & 0 & . . . \\ δ e_{n - k} & 0 & M_{0} & M_{1} & 0 & . . . \\ δ e_{n - k} & 0 & 0 & M_{0} & M_{1} \\ ⋮ & ⋱ & ⋱ \end{matrix}]

where

M_{0} = [\begin{matrix} - δ & δ e_{n - k}^{'} \\ 0 & γ E^{-} + δ E^{+} - (γ + δ) I_{n - k} \end{matrix}]

M_{1} = γ E_{21}

E_{21}

is a matrix of order

n - k - 1

with 1 in the

(2, 1)

position and

E^{-}, E^{+}

are matrices (refer to page 19) of order

n - k

.

Let

y_{m}

denote the probability that the system hits the

k - 1

server state m times before reaching the n server state.

y_{0} = - δ ψ M_{0}^{- 1} e

y_{m} = {(- 1)}^{m} δ ψ {(M_{1} M_{0}^{- 1})}^{m} M_{0}^{- 1} e

where

ψ

is the initial probability vector

ψ = \frac{1}{d_{1}} (π_{k - 1}, . . π_{N}, . . . π_{(N + 1, 1)} . . . π_{(n - 1, 1)})

, and

d_{1}

is the normalizing constant.

Theorem 5.

The expected number of times that the system hits the

k - 1

server state before reaching the n server state is

\sum m y_{m}

.

5.5. Other Performance Measures

Fraction of time for which the system is under repair:
$F_{R e p a i r} = \sum_{n_{1} = 0}^{\infty} \sum_{n_{2} = k - 1}^{n - 1} x_{(n_{1}, n_{2}, 1)}$
Fraction of time for which the servers are idle:
$F_{I d l e} = \sum_{n_{1} = 0}^{\infty} x_{(n_{1}, k - 1, 1)} + \sum_{n_{1} = 0, n_{1} \leq n_{2}}^{\infty} \sum_{n_{2} = k - 1}^{n} \sum_{r = 0, 1} x_{(n_{1}, n_{2}, r)}$
System reliability, i.e., the probability that at least k servers are operational:
$S = \sum_{n_{1} = 0}^{\infty} \sum_{n_{2} \geq k}^{n} \sum_{r = 0, 1} x_{(n_{1}, n_{2}, r)} = 1 - \sum_{n_{1} = 0}^{\infty} x_{(n_{1}, k - 1, 1)}$
Average number of customers in the system:
$E_{S y s t e m} = \sum_{n_{1} = 0}^{\infty} n_{1} x_{n_{1}}$
Average number of failed servers in the system:
$E_{F S} = \sum_{n_{2} = k - 1}^{n - 1} (n - n_{2}) \sum_{n_{1} = 0}^{\infty} \sum_{r = 0, 1} x_{(n_{1}, n_{2}, r)}$
Average number of servers that are idle:
$E_{I S} = \sum_{n_{1} = 0}^{\infty} (k - 1) x_{(n_{1}, k - 1, 1)} + \sum_{n_{1} = 0, n_{1} \leq n_{2}}^{\infty} \sum_{n_{2} = k - 1}^{n} \sum_{r = 0, 1} (n_{2} - n_{1}) x_{(n_{1}, n_{2}, r)}$

6. Numerical Illustrations

In this section, we give some numerical examples that show the effect of the level N of the N-policy and the repair rate

δ

on certain performance measures. For this, we consider a 5-out-of-

20 : G

system.

6.1. Effect of Level N on the Performance of the System

In this numerical example, the following parameters are kept fixed with values as given below:

λ = 10; μ = 4; γ = 2; δ = 1

From Table 1 and Figure 1, Figure 2, Figure 3 and Figure 4, the following conclusions can be made:

Table 1. Effect of N on

F_{R e p a i r}

,

F_{I d l e}

, S and

E_{r e p a i r}

.

Figure 1. Effect of parameter N on the fraction of time for which the servers are under repair.

Figure 2. Effect of the level N on the fraction of time for which the servers are idle.

Figure 3. Effect of the level N on the reliability of the system.

Figure 4. Effect of the level N on

E_{S y s t e m}

.

The fraction of time for which the servers are under repair is the minimum for a value $N = 7$ . To minimize $F_{R e p a i r}$ , we need to initiate the repair process when the number of non-operational servers is 13.
The fraction of time for which the servers are idle can be minimized if we choose $N = 8$ . It plays a crucial role in enhancing the cost-effectiveness of the model.
The system reliability is maximum when $N = 7$ . We can achieve a more reliable system if we take into consideration the results in Figure 3.
The expected number of customers found waiting in the system is the minimum if we choose $N = 8$ . The value of the expected number of customers is low as the servers are working in parallel.

The above example illustrates that the analysis can guide us in properly managing the repair policy with specific objectives.

6.2. Effect of the Repair Rate $δ$ on the Performance of the System

In this section, we study the effect of the repair rate

δ

on the performance of the system. In this example, the following parameters are kept fixed with values as given below:

λ = 35; μ = 26; γ = 3; N = 6

From Table 2 and Figure 5, Figure 6, Figure 7 and Figure 8, the following conclusions can be made:

Table 2. Effect of repair rate

δ

on

F_{R e p a i r}

,

F_{I d l e}

, S and

E_{r e p a i r}

.

Figure 5. Effect of the repair rate

δ

on the fraction of time for which the servers are under repair.

Figure 6. Effect of the repair rate

δ

on the fraction of time for which the servers are idle.

Figure 7. Effect of the repair rate

δ

on the reliability of the system.

Figure 8. Effect of

δ

on

E_{S y s t e m}

.

The fraction of time for which the servers are under repair is the minimum for a value $δ = 4$ . We can decide to what extent we need to enhance the repair facilities or resources based on the data in Figure 5.
The fraction of time for which the servers are idle is maximum for $δ = 2$ . The data in Figure 6 play a crucial role in specifically designing the facilities so that the idle time of the servers can be effectively utilized and the entire system can be managed accordingly.
The system reliability increases with increasing values of $δ$ as expected. The rate at which this increase occurs helps us to decide on the optimum value to be considered when compared to the resources or efforts involved in enhancing the repair facilities; see Figure 7.
The expected number of customers found waiting in the system is maximum for $δ = 2$ .

7. Cost Analysis and Optimization Problem

Cost analysis plays an important role in decision making or in formulating policies related to the working of any system that we encounter in everyday life. In this section, we propose a cost function related to the system under consideration. We consider an optimization problem to find the value of the level N of the N-policy of the repair. With the help of numerical examples as well as graphical illustrations, we show that an optimal value N exists so that the total expected cost is the minimum. We also study the effect of the repair rate

δ

on the the total expected cost.

To determine the optimal level of N at which the repair facility can start working and to determine the effectiveness of enhancing the repair rate, we proceed as follows. For the cost analysis, we define the following costs.

$C_{1}$ : Establishment cost or setup cost.
$C_{2}$ : Holding cost per customer per unit time.
$C_{3}$ : Unit time cost to run the repair mechanism.
$C_{4}$ : Unit time cost incurred due to the idleness of the servers.
$C_{5}$ : unit time revenue received from the busy servers.
$C_{6}$ : Unit time revenue received when at least k servers are operational.

The expected total cost is

T C = C_{1} + C_{2} * E_{S y s t e m} + C_{3} * F_{R e p a i r} + C_{4} * F_{I d l e} - C_{5} * (1 - F_{I d l e}) - C_{6} * S

We fix the following values:

C_{1} = 1000, C_{2} = 75, C_{3} = 100, C_{4} = 200, C_{5} = 500, C_{6} = 160

The values of other parameters are the same as in Section 6.

From Table 3 and Figure 9, we see that the optimal value of the level N is $N = 8$ , for which the total expected cost is the minimum. Thus, it is optimal to wait until 12 servers are non-operational before starting the repair.

Table 3. Effect of N and $δ$ on expected total cost.

Figure 9. Effect of the changes in level N on the expected total cost.
From Table 3 and Figure 10, it can be seen that the expected total cost is the maximum when the repair rate is $δ = 2$ . This means that if we spend more to increase the rate at which the repair is performed from 1 to 2 or more, it will not be reflected in the total expected cost. Thus, a decision regarding the facilities to be arranged to ensure a specific repair rate can be made using this cost analysis. In this specific numerical example, it is enough to ensure a repair rate of at most $δ = 1$ .

Figure 10. Effect of the repair rate $δ$ on the expected total cost.
By analyzing the problem numerically, we can decide on the optimal values of the level N of the $N$ -policy and also the optimal repair rate $δ$ to be maintained.

8. Conclusions

This paper studies a multi-server queuing system with the N-policy of repair, by viewing it as a k-out-of-n:G system. The steady-state distributions of various system states are computed. The distribution of server idle times is analyzed. Under the assumption of a sufficiently large number of customers in the system, the distribution of the first passage times from an inoperative state to an n server state and vice versa has been found. The assumption of a sufficiently large number of customers in the system is made to avoid complications that may arise due to future arrivals. Other system performance measures such as system reliability, the expected number of failed servers, etc., are computed.

The effect of an increase in the level N and the repair rate

δ

on various performance measures, when other parameters are kept fixed, is studied numerically and graphically. An increase in the repair rate significantly increases the system reliability and decreases the expected number of customers in the system. It reduces the idleness of servers and reduces the fraction of time for which the servers are under repair. On the other hand, an increase in N increases the fraction of time for which the servers are under repair. However, as N increases, the idleness of servers and the expected number in the system first decrease and then increase, while the system reliability first increases and then decreases. The optimal value of N depends on the parameters of the system. The examples illustrate how an optimal policy could be derived for a multi-server queuing system, keeping in mind certain specific objectives. A cost function has been constructed and the results of the cost analysis are presented.

There can be several extensions to the problem considered in this paper. An extension of the present work to one in which consecutive k-out-of-n systems provide a service to customers, either linearly or in a circular fashion, is proposed for future research. A similar study of the k-out-of-

n : F

system is underway. Moreover, the k-out-of-n system could be analyzed under both HOT and WARM conditions.

Author Contributions

Conceptualization, A.K.; Methodology, A.N.J.; Validation, A.P.M.; Formal analysis, A.N.J.; Data curation, A.P.M.; Writing—original draft, A.K., A.N.J. and A.P.M.; Writing—review & editing, A.K., A.N.J. and A.P.M.; Supervision, A.K. All authors have read and agreed to the published version of the manuscript.

Funding

Dr. Anu Nuthan Joshua and Dr. Ambily P. Mathew received support from DST-RSF research project number 22-49-02023 (RSF) and research project number 64800 (DST) for the preparation of this publication. https://search.crossref.org/funding.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Notations and Abbreviations

The following abbreviations are used in this manuscript:

$C T M C$	Continuous-time Markov chain
$Q B D$ process	Quasi-birth–death process
e	All one vector with appropriate dimension
$I_{n}$	Identity matrix of order $n \times n$
$E^{-}$	Square matrix of appropriate size with all zero entries except the entries ${(E^{-})}_{(j, j - 1)}$ , which are equal to 1
$E^{+}$	Square matrix of appropriate size with all zero entries except the entries ${(E^{+})}_{(j, j + 1)}$ , which are equal to 1
$E_{i j}$	Matrix of appropriate size with all zero entries except the entry $(i, j)$ , which is equal to 1
$e_{i}$	Column vector of appropriate dimension with all zero entries except the entry i, which is equal to 1
$A^{'}$	Transpose of matrix A
$0$	Matrix whose entries are 0, of appropriate size
$A \otimes B$	Kronecker product; if $A = [a_{i j}]$ is a matrix of order $m \times n$ and if B is a matrix of order $p \times q$ , then $A \otimes B = [a_{i j} B]$ will denote a matrix of order $m p \times n q$

References

Barlow, R.E.; Proschan, F. Mathematical Theory of Reliability; John Wiley and Sons: New York, NY, USA, 1965. [Google Scholar]
Krishnamoorthy, A.; Sathian, M.K.; Narayanan Viswanath, C. Reliability of a k-out-of-n System with Repair by a Single Server Extending Service to External Customers with Pre-emption. Electron. J. Reliab. Theory Appl. 2016, 11, 61–93. [Google Scholar]
Yadin, M.; Naor, P. Queueing system with a removable service station. Oper. Res. Q. 1963, 14, 393–405. [Google Scholar] [CrossRef]
Krishnamoorthy, A.; Ushakumari, P.V.; Lakshmy, B. k-out-of-n-system with repair: The N-policy. Asia-Pac. J. Oper. Res. 2002, 19, 47–61. [Google Scholar]
Yen, T.-C.; Wang, K.-H.; Chen, J.-Y. Optimization Analysis of the N-Policy M/G/1 Queue with Working Breakdowns. Symmetry 2020, 12, 583. [Google Scholar] [CrossRef]
Vemuri, V.K.; Boppana, V.S.N.H.P.; Kotagiri, C.; Bethapudi, R.T. Optimal strategy analysis of an N-policy two-phase M^X/M/1 queueing system with server startup and breakdowns. Opsearch 2011, 48, 109–122. [Google Scholar] [CrossRef]
Singh, C.J.; Jain, M.; Kumar, B. Analysis of queue with two phases of service and m phases of repair for server breakdown under N-policy. Int. J. Serv. Oper. Manag. 2013, 16, 373–406. [Google Scholar] [CrossRef]
Neuts, M.F. Matrix—Geometric Solutions in Stochastic Models: An Algorithmic Approach; The Johns Hopkins University Press: Baltimore, MD, USA, 1981. [Google Scholar]
Gross, D.; Harris, C. Fundamentals of Queuing Theory, 3rd ed.; John Wiley: Chichester, UK, 1988. [Google Scholar]

Figure 1. Effect of parameter N on the fraction of time for which the servers are under repair.

Figure 2. Effect of the level N on the fraction of time for which the servers are idle.

Figure 3. Effect of the level N on the reliability of the system.

Figure 4. Effect of the level N on

E_{S y s t e m}

.

Figure 4. Effect of the level N on

E_{S y s t e m}

.

Figure 5. Effect of the repair rate

δ

on the fraction of time for which the servers are under repair.

Figure 5. Effect of the repair rate

δ

on the fraction of time for which the servers are under repair.

Figure 6. Effect of the repair rate

δ

on the fraction of time for which the servers are idle.

Figure 6. Effect of the repair rate

δ

on the fraction of time for which the servers are idle.

Figure 7. Effect of the repair rate

δ

on the reliability of the system.

Figure 7. Effect of the repair rate

δ

on the reliability of the system.

Figure 8. Effect of

δ

on

E_{S y s t e m}

.

Figure 8. Effect of

δ

on

E_{S y s t e m}

.

Figure 9. Effect of the changes in level N on the expected total cost.

Figure 10. Effect of the repair rate

δ

on the expected total cost.

Figure 10. Effect of the repair rate

δ

on the expected total cost.

Table 1. Effect of N on

F_{R e p a i r}

,

F_{I d l e}

, S and

E_{r e p a i r}

.

Table 1. Effect of N on

F_{R e p a i r}

,

F_{I d l e}

, S and

E_{r e p a i r}

.

N	$F_{Repair}$	$F_{Idle}$	S	$E_{System}$
6	0.0507	0.0332	0.9691	0.0651
7	0.0499	0.0265	0.9726	0.0530
8	0.0528	0.0246	0.9722	0.0505
9	0.0562	0.0245	0.9708	0.0510
10	0.0588	0.0250	0.9696	0.0523
11	0.0608	0.0255	0.9685	0.0537
12	0.0626	0.0261	0.9676	0.0551
13	0.0642	0.0268	0.9667	0.0564
14	0.0658	0.0274	0.9658	0.0578
15	0.0676	0.0281	0.9649	0.0594
16	0.0700	0.0291	0.9636	0.0615
17	0.0753	0.0314	0.9608	0.0662
18	0.0909	0.0379	0.9527	0.0799

Table 2. Effect of repair rate

δ

on

F_{R e p a i r}

,

F_{I d l e}

, S and

E_{r e p a i r}

.

Table 2. Effect of repair rate

δ

on

F_{R e p a i r}

,

F_{I d l e}

, S and

E_{r e p a i r}

.

$δ$	$F_{Repair}$	$F_{Idle}$	S	$E_{System}$
1	0.0507	0.0332	0.9691	0.0651
2	0.0418	0.0469	0.9850	0.0775
3	0.0272	0.0418	0.9950	0.0624
4	0.0232	0.0383	0.9979	0.0544
5	0.0223	0.0364	0.9989	0.0504
6	0.0222	0.0352	0.9993	0.0481
7	0.0225	0.0345	0.9995	0.0466
8	0.0228	0.0340	0.9997	0.0456
9	0.0232	0.0336	0.9998	0.0448
10	0.0235	0.0333	0.9998	0.0442
11	0.0237	0.0331	0.9999	0.0438
12	0.0240	0.0329	0.9999	0.0434

Table 3. Effect of N and

δ

on expected total cost.

Table 3. Effect of N and

δ

on expected total cost.

N	TC	$δ$	TC
6	528.2877	1	528.2877
7	523.1107	2	540.4254
8	521.8479	3	535.9006
9	521.9576	4	532.8554
10	522.4002	5	531.2721
11	522.9301	6	530.3771
12	523.4913	7	529.8193
13	524.0646	8	529.4412
14	524.6497	9	529.1666
15	525.2904	10	528.9562
16	526.1948	11	528.7882
17	528.1975	12	528.6496
18	534.0302

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.

The k-out-of-n:G System Viewed as a Multi-Server Queue

Abstract

1. Introduction

Highlights of This Work

2. Model Description

3. Mathematical Formulation

4. Stability Analysis

4.1. Stability Condition

4.2. Steady-State Probability Vector

4.3. Special Case

5. System Characteristics

5.1. Distribution of Server Idle Times

5.2. Distribution of First Passage Time from an Inoperative State to n-Operational Server State

5.3. Distribution of First Passage Time from an n-Operational Server State to an Inoperative State

5.4. Distribution of Number of Times That the System Becomes Inoperative before Reaching the n Server State

5.5. Other Performance Measures

6. Numerical Illustrations

6.1. Effect of Level N on the Performance of the System

6.2. Effect of the Repair Rate δ on the Performance of the System

7. Cost Analysis and Optimization Problem

8. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Notations and Abbreviations

References

Article Metrics

Article Access Statistics

6.2. Effect of the Repair Rate $δ$ on the Performance of the System