Using Timeliness in Tracking Infections

Bastopcu, Melih; Ulukus, Sennur

doi:10.3390/e24060779

Using Timeliness in Tracking Infections^†

by

Melih Bastopcu

¹

and

Sennur Ulukus

^2,*

¹

Coordinated Science Laboratory, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA

²

Department of Electrical and Computer Engineering, University of Maryland, College Park, MD 20742, USA

^*

Author to whom correspondence should be addressed.

^†

This paper is an extended version of our paper published in IEEE INFOCOM 2021, the IEEE International Conference on Computer Communications (online), 10–13 May 2021.

Entropy 2022, 24(6), 779; https://doi.org/10.3390/e24060779

Submission received: 8 March 2022 / Revised: 17 May 2022 / Accepted: 27 May 2022 / Published: 31 May 2022

(This article belongs to the Special Issue Age of Information: Concept, Metric and Tool for Network Control)

Abstract:

We consider real-time timely tracking of infection status (e.g., COVID-19) of individuals in a population. In this work, a health care provider wants to detect both infected people and people who have recovered from the disease as quickly as possible. In order to measure the timeliness of the tracking process, we use the long-term average difference between the actual infection status of the people and their real-time estimate by the health care provider based on the most recent test results. We first find an analytical expression for this average difference for given test rates, infection rates and recovery rates of people. Next, we propose an alternating minimization-based algorithm to find the test rates that minimize the average difference. We observe that if the total test rate is limited, instead of testing all members of the population equally, only a portion of the population may be tested in unequal rates calculated based on their infection and recovery rates. Next, we characterize the average difference when the test measurements are erroneous (i.e., noisy). Further, we consider the case where the infection status of individuals may be dependent, which occurs when an infected person spreads the disease to another person if they are not detected and isolated by the health care provider. In addition, we consider an age of incorrect information-based error metric where the staleness metric increases linearly over time as long as the health care provider does not detect the changes in the infection status of the people. Through extensive numerical results, we observe that increasing the total test rate helps track the infection status better. In addition, an increased population size increases diversity of people with different infection and recovery rates, which may be exploited to spend testing capacity more efficiently, thereby improving the system performance. 
Depending on the health care provider’s preferences, test rate allocation can be adjusted to detect either the infected people or the recovered people more quickly. In order to combat any errors in the test, it may be more advantageous for the health care provider to not test everyone, and instead, apply additional tests to a selected portion of the population. In the case of people with dependent infection status, as we increase the total test rate, the health care provider detects the infected people more quickly, and thus, the average time that a person stays infected decreases. Finally, the error metric needs to be chosen carefully to meet the priorities of the health care provider, as the error metric used greatly influences who will be tested and at what test rate.

Keywords:

timely infection tracking; age of information; timely tracking of multiple processes; Markovian infection spread model

1. Introduction

We consider the problem of timely tracking of an infectious disease, e.g., COVID-19, in a population of n people. In this problem, a health care provider wants to detect infected people as quickly as possible in order to take precautions such as isolating them from the rest of the population. The health care provider also wants to detect people who have recovered from the disease as soon as possible, since these people need to return to work, which is especially critical in sectors such as education, food retail, and public transportation. Ideally, the health care provider would test all people all the time. However, as the total test rate is limited, the question is how frequently the health care provider should test each person when the infection and recovery rates are known. In a broader sense, this problem is related to timely tracking of multiple processes in a resource-constrained setting where each process takes binary values of 0 and 1 with different change rates.

Recent studies have shown that people who have recovered from infectious diseases such as COVID-19 can be reinfected. Furthermore, the recovery times of individuals may vary significantly. For these reasons, in this problem, the ith person becomes infected with rate

λ_{i}

which is independent of the others. Similarly, the ith person recovers from the disease with rate

μ_{i}

. We note that the index i may represent a specific individual or a group of individuals that share common features such as age, gender, and profession. Depending on the demographics, coefficients

λ_{i}

and

μ_{i}

may be statistically known by the health care provider. We denote the infection status of the ith person as

x_{i} (t)

(shown with the black curves on the left in Figure 1) which takes the value 1 when the person is infected and the value 0 when the person is healthy. The health care provider applies tests to people marked as healthy with rate

s_{i}

and to people marked as infected with rate

c_{i}

. Based on the test results, the health care provider forms an estimate for the infection status of the ith person denoted by

{\hat{x}}_{i} (t)

(shown with the blue curves on the right in Figure 1) which takes the value 1 when the most recent test result is positive and the value 0 when it is negative.

We measure the timeliness of the tracking process by the difference between the actual infection status of people and the real-time estimate of the health care provider which is based on the most recent test results. The difference can occur in two different cases: (i) when the person is sick (

x_{i} (t) = 1

) and the health care provider marks this person as healthy (

{\hat{x}}_{i} (t) = 0

), and (ii) when the person recovers from the disease (

x_{i} (t) = 0

) but the health care provider still considers this person as infected (

{\hat{x}}_{i} (t) = 1

). The former case represents the error due to late detection of infected people, while the latter case represents the error due to late detection of recovered people. Depending on the health care provider’s preferences, detecting infected people may be more important than detecting recovered people (controlling infection), or the other way around (returning people to the workforce).

The age of information was proposed to measure timeliness of information in communication systems, and has been studied in the context of queueing systems [1,2,3,4,5,6,7,8], multi-hop and multi-cast networks [9,10,11,12,13,14,15,16,17], social networks [18], timely remote estimation of random processes [19,20,21,22,23,24,25], energy harvesting systems [26,27,28,29,30,31,32,33,34,35,36,37,38,39,40], wireless fading channels [41,42], scheduling in networks [43,44,45,46,47,48,49,50,51,52,53,54,55], lossless and lossy source and channel coding [56,57,58,59,60,61,62,63,64,65,66], vehicular, IoT and UAV systems [67,68,69,70], caching systems [71,72,73,74,75,76,77,78,79,80,81,82], computation-intensive systems [83,84,85,86,87,88,89,90], learning systems [91,92,93], gossip networks [94,95,96,97] and so forth. A more detailed review of the age of information literature can be found in references [98,99,100]. Most relevant to our work, the real-time timely estimation of single and multiple counting processes [19,25], a Wiener process [20], a random walk process [101], and binary and multi-state Markov sources [23,51,102] have been studied. The study that is closest to our work is reference [23], where the remote estimation of a symmetric binary Markov source is studied in a time-slotted system by finding the optimal sampling policies via formulating a Markov Decision Process (MDP) for the real-time error, age of information (AoI), and age of incorrect information (AoII) metrics. Different from [23], in our work, we consider real-time timely estimation of multiple non-symmetric binary sources for a continuous time system. In our work, the sampler (health care provider) does not know the states of the sources (infection status of people), and thus takes the samples (applies medical tests) at random times, with exponentially distributed inter-test times and fixed rates. We therefore optimize the test rates of people to minimize the real-time estimation error.

In this paper, we consider the real-time timely tracking of infection status of n people. We first find an analytical expression for the long-term average difference between the actual infection status of people and the estimate of the health care provider based on test results. Then, we propose an alternating minimization-based algorithm to identify the test rates

s_{i}

and

c_{i}

for all people. We observe that if the total test rate is limited, we may not apply tests on all people equally. Next, we provide an alternative method to characterize the average difference, by finding the steady state of a Markov chain defined by

(x_{i} (t), {\hat{x}}_{i} (t))

. By using this alternative method, we determine the average estimation error when there are errors in the test measurements expressed by a false positive rate p and a false negative rate q. Next, we consider the infection status of two people where an infected person may spread the disease to another person if the infection has not been detected by the health care provider to consequently isolate the infected person. Finally, we consider an age of incorrect information-based error metric where the estimation error increases linearly over time when the health care provider has not detected the changes in the infection status of the people.

Through extensive numerical results, we observe that increasing the total test rate helps track the infection status of people better, and increasing the size of the population increases diversity which may be exploited to improve the performance. Depending on the health care provider’s priorities, we can allocate additional tests to people marked as healthy to detect the infections faster or to people marked as infected to detect the recoveries more quickly. In order to combat the test errors, the health care provider may prefer to apply tests to only a selected portion of the population with higher test rates. When the infection status of a person depends on that of another person, the average time that a person remains infected can be reduced by increasing the total test rate as it helps to detect the infected people more quickly. Finally, we observe that depending on the error metric used, the test rate distribution among the population differs greatly, and thus, we should choose an error metric that aligns with the priorities of the health care provider.

2. System Model

We consider a population of n people. We denote the infection status of the ith person at time t as

x_{i} (t)

(black curve in Figure 2a) which takes binary values 0 or 1 as follows,

\begin{matrix} x_{i} (t) = \begin{cases} 1, & \text{if the } i \text{th person is infected at time } t, \\ 0, & \text{otherwise} . \end{cases} \end{matrix}

(1)

In this paper, we consider a model where each person can be infected multiple times after recovering from the disease. We denote the time interval that the ith person stays healthy for the jth time as

W_{i} (j)

which is exponentially distributed with rate

λ_{i}

. We denote the recovery time for the ith person after being infected with the virus for the jth time as

R_{i} (j)

which is exponentially distributed with rate

μ_{i}

.

A health care provider wants to track the infection status of each person. Based on the test results at times

t_{i, ℓ}

, the health care provider generates an estimate for the status of the ith person denoted as

{\hat{x}}_{i} (t)

(blue curve in Figure 2a) by

\begin{matrix} {\hat{x}}_{i} (t) = x_{i} (t_{i, ℓ}), t_{i, ℓ} \leq t < t_{i, ℓ + 1} . \end{matrix}

(2)

When

{\hat{x}}_{i} (t)

is 1, the health care provider applies the next test to the ith person after an exponentially distributed time with rate

c_{i}

. When

{\hat{x}}_{i} (t)

is 0, the next test is applied to the ith person after an exponentially distributed time with rate

s_{i}

.

An estimation error happens when the actual infection status of the ith person,

x_{i} (t)

, differs from the estimate of the health care provider,

{\hat{x}}_{i} (t)

, at time t. This could happen in two ways: when

x_{i} (t) = 1

and

{\hat{x}}_{i} (t) = 0

, i.e., when the ith person is sick, but remains undetected by the health care provider, and when

x_{i} (t) = 0

and

{\hat{x}}_{i} (t) = 1

, i.e., when the ith person has recovered, but the health care provider is unaware that the ith person has recovered.

We denote the error caused by the former case, i.e., when

x_{i} (t) = 1

and

{\hat{x}}_{i} (t) = 0

, by

Δ_{i 1} (t)

(green areas in Figure 2b),

\begin{matrix} Δ_{i 1} (t) = \max \{x_{i} (t) - {\hat{x}}_{i} (t), 0\}, \end{matrix}

(3)

and we denote the error caused by the latter case, i.e., when

x_{i} (t) = 0

and

{\hat{x}}_{i} (t) = 1

, by

Δ_{i 2} (t)

(orange areas in Figure 2b),

\begin{matrix} Δ_{i 2} (t) = \max \{{\hat{x}}_{i} (t) - x_{i} (t), 0\} . \end{matrix}

(4)

Then, the total estimation error for the ith person

Δ_{i} (t)

is

\begin{matrix} Δ_{i} (t) = θ Δ_{i 1} (t) + (1 - θ) Δ_{i 2} (t), \end{matrix}

(5)

where

θ

is the importance factor in

[0, 1]

. A large

θ

gives more importance to the detection of infected people, and a small

θ

gives more importance to the detection of recovered people.
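The error definitions in (3)–(5) can be evaluated pointwise. The following is a minimal sketch in Python; the function name `instantaneous_error` and its argument names are illustrative choices and do not appear in the original.

```python
def instantaneous_error(x, x_hat, theta):
    """Weighted estimation error of Eq. (5) for one person at one time instant.

    x      -- actual infection status (0 or 1)
    x_hat  -- estimate held by the health care provider (0 or 1)
    theta  -- importance factor in [0, 1]
    """
    if x not in (0, 1) or x_hat not in (0, 1):
        raise ValueError("x and x_hat must be 0 or 1")
    if not 0.0 <= theta <= 1.0:
        raise ValueError("theta must lie in [0, 1]")
    delta_1 = max(x - x_hat, 0)  # Eq. (3): infected but still marked healthy
    delta_2 = max(x_hat - x, 0)  # Eq. (4): recovered but still marked infected
    return theta * delta_1 + (1.0 - theta) * delta_2
```

With a large theta, an undetected infection (x = 1, x_hat = 0) is penalized more heavily than an undetected recovery (x = 0, x_hat = 1), which matches the role of the importance factor described above.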

We define the long-term weighted average difference between

x_{i} (t)

and

{\hat{x}}_{i} (t)

as

\begin{matrix} Δ_{i} = \lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} Δ_{i} (t) d t . \end{matrix}

(6)

Then, the overall average difference of all people

Δ

is

\begin{matrix} Δ = \frac{1}{n} \sum_{i = 1}^{n} Δ_{i} . \end{matrix}

(7)

Our aim is to track the infection status of all people. Due to limited resources, there is a total test rate constraint

\sum_{i = 1}^{n} s_{i} + \sum_{i = 1}^{n} c_{i} \leq C

. Thus, our aim is to find the optimal test rates

s_{i}

and

c_{i}

to minimize

Δ

in (7) while satisfying this total test rate constraint. We formulate the following problem,

\begin{matrix} \min_{\{s_{i}, c_{i}\}} & Δ \\ \text{s.t.} & \sum_{i = 1}^{n} s_{i} + \sum_{i = 1}^{n} c_{i} \leq C \\ & s_{i} \geq 0, c_{i} \geq 0, i = 1, \dots, n . \end{matrix}

(8)

Table 1 summarizes the variables used in this work. In the next section, we find the total average difference

Δ

.

3. Average Difference Analysis

In this section, we provide a probabilistic analysis to characterize the average difference

Δ

. In Section 5.1, we give an alternative method to find

Δ

by analyzing the steady-state distribution of the Markov chain induced by the states

(x_{i} (t), {\hat{x}}_{i} (t))

. Here, we first find analytical expressions for

Δ_{i 1} (t)

in (3) and

Δ_{i 2} (t)

in (4) when

s_{i} > 0

and

c_{i} > 0

. We note that

Δ_{i 1} (t)

can be equal to 1 when

{\hat{x}}_{i} (t) = 0

and is always equal to 0 when

{\hat{x}}_{i} (t) = 1

. Assume that at time 0, both

x_{i} (0)

and

{\hat{x}}_{i} (0)

are 0. After an exponentially distributed time with rate

λ_{i}

, which is denoted by

W_{i}

, the ith person is infected, and thus

x_{i} (t)

becomes 1. At that time, since

{\hat{x}}_{i} (t) = 0

,

Δ_{i 1} (t)

becomes 1. Further,

Δ_{i 1} (t)

will be equal to 0 again either when the ith person recovers from the disease which happens after

R_{i}

which is exponentially distributed with rate

μ_{i}

or when the health care provider performs a test on the ith person after

D_{i}

, which is exponentially distributed with rate

s_{i}

. We define

T_{m} (i)

as the earliest time at which one of these two cases happens, i.e.,

T_{m} (i) = \min \{R_{i}, D_{i}\}

(which is shown by the green areas in Figure 3a). We note that

T_{m} (i)

is also exponentially distributed with rate

μ_{i} + s_{i}

, and we have

P (T_{m} (i) = R_{i}) = \frac{μ_{i}}{μ_{i} + s_{i}}

and

P (T_{m} (i) = D_{i}) = \frac{s_{i}}{μ_{i} + s_{i}}

. If the ith person recovers from the disease before testing, we return to the initial case where both

x_{i} (t)

and

{\hat{x}}_{i} (t)

are equal to 0 again. In this case, the cycle repeats itself, i.e., the ith person becomes sick again after

W_{i}

and

Δ_{i 1} (t)

remains equal to 1 until either the person recovers or the health care provider performs a test, which takes another

T_{m} (i)

duration. If the health care provider performs a test before the person recovers, then

{\hat{x}}_{i} (t)

becomes 1. We denote the time interval for which

{\hat{x}}_{i} (t)

stays at 0 as

I_{i 1}

which is given by

\begin{matrix} I_{i 1} = \sum_{ℓ = 1}^{K_{1}} \left(T_{m} (i, ℓ) + W_{i} (ℓ)\right), \end{matrix}

(9)

where

K_{1}

is geometrically distributed with success probability

P (T_{m} (i) = D_{i}) = \frac{s_{i}}{μ_{i} + s_{i}}

. Due to [103] (Prob. 9.4.1),

\sum_{ℓ = 1}^{K_{1}} T_{m} (i, ℓ)

and

\sum_{ℓ = 1}^{K_{1}} W_{i} (ℓ)

are exponentially distributed with rates

s_{i}

and

\frac{λ_{i} s_{i}}{μ_{i} + s_{i}}

, respectively. As

E [I_{i 1}] = E [\sum_{ℓ = 1}^{K_{1}} T_{m} (i, ℓ)] + E [\sum_{ℓ = 1}^{K_{1}} W_{i} (ℓ)]

, we have

\begin{matrix} E [I_{i 1}] = \frac{1}{s_{i}} + \frac{s_{i} + μ_{i}}{s_{i} λ_{i}} . \end{matrix}

(10)
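The exponential form of the two random sums used above can also be checked directly with Laplace transforms. The following is a standard argument written with generic symbols (S, X_ℓ, K, p, r, u) that are not part of the original notation. It assumes that K is independent of the summands, which holds here because the identity of the minimum of two independent exponential random variables is independent of the value of the minimum.

```latex
% S = \sum_{\ell=1}^{K} X_\ell, with X_\ell i.i.d. exponential with rate r,
% and K geometric with success probability p, independent of the X_\ell.
\begin{align}
E\left[e^{-uS}\right]
  &= \sum_{k=1}^{\infty} p (1-p)^{k-1} \left(\frac{r}{r+u}\right)^{k}
   = \frac{p \frac{r}{r+u}}{1 - (1-p) \frac{r}{r+u}}
   = \frac{p r}{p r + u},
\end{align}
% which is the Laplace transform of an exponential random variable with rate p r.
```

Taking r = μ_i + s_i and p = s_i / (μ_i + s_i) gives rate s_i for the sum of the T_m(i, ℓ) terms, and taking r = λ_i with the same p gives rate λ_i s_i / (μ_i + s_i) for the sum of the W_i(ℓ) terms, in agreement with the rates quoted above.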

When

{\hat{x}}_{i} (t) = 1

, the health care provider marks the ith person as infected. The ith person recovers from the virus after

R_{i}

. After the ith person recovers, either the health care provider performs a test after

Z_{i}

which is exponentially distributed with rate

c_{i}

or the ith person is reinfected with the virus which takes

W_{i}

time. We define

T_{u} (i)

as the earliest time at which one of these two cases happens, i.e.,

T_{u} (i) = \min \{W_{i}, Z_{i}\}

(which is shown by the orange areas in Figure 3b). Similarly, we note that

T_{u} (i)

is exponentially distributed with rate

λ_{i} + c_{i}

, and we have

P (T_{u} (i) = W_{i}) = \frac{λ_{i}}{λ_{i} + c_{i}}

and

P (T_{u} (i) = Z_{i}) = \frac{c_{i}}{λ_{i} + c_{i}}

. If the person is reinfected with the virus before a test is applied, this cycle repeats itself, i.e., the ith person recovers after another

R_{i}

, and then either a test is applied to the ith person, or the person is infected again which takes another

T_{u} (i)

. If the health care provider tests the ith person before the person is reinfected, the health care provider marks the ith person as healthy again, i.e.,

{\hat{x}}_{i} (t)

becomes 0. We denote the time interval that

{\hat{x}}_{i} (t)

is equal to 1 as

I_{i 2}

which is given by

\begin{matrix} I_{i 2} = \sum_{ℓ = 1}^{K_{2}} \left(T_{u} (i, ℓ) + R_{i} (ℓ)\right), \end{matrix}

(11)

where

K_{2}

is geometrically distributed with success probability

P (T_{u} (i) = Z_{i}) = \frac{c_{i}}{λ_{i} + c_{i}}

. Similarly,

\sum_{ℓ = 1}^{K_{2}} T_{u} (i, ℓ)

and

\sum_{ℓ = 1}^{K_{2}} R_{i} (ℓ)

are exponentially distributed with rates

c_{i}

and

\frac{c_{i} μ_{i}}{λ_{i} + c_{i}}

, respectively. As

E [I_{i 2}] = E [\sum_{ℓ = 1}^{K_{2}} T_{u} (i, ℓ)] + E [\sum_{ℓ = 1}^{K_{2}} R_{i} (ℓ)]

, we have

\begin{matrix} E [I_{i 2}] = \frac{1}{c_{i}} + \frac{c_{i} + λ_{i}}{c_{i} μ_{i}} . \end{matrix}

(12)

We denote the time interval between the jth and

(j + 1)

th times that

{\hat{x}}_{i} (t)

changes from 1 to 0 as the jth cycle

I_{i} (j)

where

I_{i} (j) = I_{i 1} (j) + I_{i 2} (j)

. We note that

Δ_{i 1} (t)

is always equal to 0 during

I_{i 2} (j)

, i.e.,

{\hat{x}}_{i} (t) = 1

, and

Δ_{i 1} (t)

is equal to 1 when

x_{i} (t) = 1

in

I_{i 1} (j)

. We denote the total time duration when

Δ_{i 1} (t)

is equal to 1 as

T_{e, 1} (i, j)

during the jth cycle where

T_{e, 1} (i, j) = \sum_{ℓ = 1}^{K_{1}} T_{m} (i, ℓ)

. Thus, we have

E [T_{e, 1} (i)] = \frac{1}{s_{i}}

. Then, using ergodicity, similar to [80],

Δ_{i 1}

is equal to

\begin{matrix} Δ_{i 1} = \frac{E [T_{e, 1} (i)]}{E [I_{i}]} = \frac{E [T_{e, 1} (i)]}{E [I_{i 1}] + E [I_{i 2}]} . \end{matrix}

(13)

Thus, we have

\begin{matrix} Δ_{i 1} = \frac{μ_{i} λ_{i}}{μ_{i} + λ_{i}} \frac{c_{i}}{μ_{i} c_{i} + λ_{i} s_{i} + c_{i} s_{i}} . \end{matrix}

(14)
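The step from (13) to (14) is a short algebraic simplification that is easy to get wrong by hand. As a sanity check, the sketch below evaluates both sides with exact rational arithmetic; the function names and the sample rates are illustrative assumptions, not values from the original.

```python
from fractions import Fraction


def cycle_means(lam, mu, s, c):
    """Expected durations E[I_i1] and E[I_i2] from Eqs. (10) and (12)."""
    e_i1 = 1 / s + (s + mu) / (s * lam)
    e_i2 = 1 / c + (c + lam) / (c * mu)
    return e_i1, e_i2


def delta_i1_renewal(lam, mu, s, c):
    """Eq. (13): expected error time per cycle divided by expected cycle length."""
    e_i1, e_i2 = cycle_means(lam, mu, s, c)
    return (1 / s) / (e_i1 + e_i2)


def delta_i1_closed_form(lam, mu, s, c):
    """Eq. (14): simplified closed form."""
    return (mu * lam) / (mu + lam) * c / (mu * c + lam * s + c * s)


# Arbitrary positive rates, chosen only for illustration.
lam, mu, s, c = Fraction(1, 5), Fraction(1, 2), Fraction(3, 4), Fraction(2, 3)
print(delta_i1_renewal(lam, mu, s, c), delta_i1_closed_form(lam, mu, s, c))
```

Because `Fraction` arithmetic is exact, the two printed values coincide whenever the simplification is correct, without any floating-point tolerance.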

Next, we find

Δ_{i 2}

. We note that

Δ_{i 2} (t)

is equal to 1 when

x_{i} (t) = 0

in

I_{i 2} (j)

and is always equal to 0 during

I_{i 1} (j)

. Similarly, we denote the total time duration where

Δ_{i 2} (t)

is equal to 1 in the jth cycle

I_{i} (j)

as

T_{e, 2} (i, j)

which is equal to

T_{e, 2} (i, j) = \sum_{ℓ = 1}^{K_{2}} T_{u} (i, ℓ)

. Thus, we have

E [T_{e, 2} (i)] = \frac{1}{c_{i}}

. Then, similar to

Δ_{i 1}

in (13),

Δ_{i 2}

is equal to

\begin{matrix} Δ_{i 2} = \frac{μ_{i} λ_{i}}{μ_{i} + λ_{i}} \frac{s_{i}}{μ_{i} c_{i} + λ_{i} s_{i} + c_{i} s_{i}} . \end{matrix}

(15)

By using (5), (14), and (15), we obtain

Δ_{i}

as

\begin{matrix} Δ_{i} = \frac{μ_{i} λ_{i}}{μ_{i} + λ_{i}} \frac{θ c_{i} + (1 - θ) s_{i}}{μ_{i} c_{i} + λ_{i} s_{i} + c_{i} s_{i}} . \end{matrix}

(16)

Then, by inserting (16) in (7), we obtain

Δ

. In the next section, we solve the optimization problem in (8).
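The closed form in (16) can be compared against a direct simulation of the model in Section 2. The sketch below is one possible event-driven simulation for a single person, assuming error-free tests and the exponential timers described above; the function names, the horizon, and the seed are illustrative choices.

```python
import random


def average_difference(lam, mu, s, c, theta):
    """Closed-form weighted average difference of Eq. (16) for one person."""
    return ((mu * lam) / (mu + lam)
            * (theta * c + (1 - theta) * s) / (mu * c + lam * s + c * s))


def simulate_average_difference(lam, mu, s, c, theta, horizon, seed=0):
    """Time-average of Eq. (5) over [0, horizon] for one simulated sample path."""
    rng = random.Random(seed)
    x, x_hat = 0, 0          # start healthy and marked healthy
    t, err_1, err_2 = 0.0, 0.0, 0.0
    while t < horizon:
        status_rate = mu if x == 1 else lam      # recovery or infection
        test_rate = c if x_hat == 1 else s       # test rate depends on the mark
        total_rate = status_rate + test_rate
        dt = min(rng.expovariate(total_rate), horizon - t)
        if x == 1 and x_hat == 0:
            err_1 += dt                          # undetected infection
        elif x == 0 and x_hat == 1:
            err_2 += dt                          # undetected recovery
        t += dt
        if rng.random() < status_rate / total_rate:
            x = 1 - x                            # status changes
        else:
            x_hat = x                            # test reveals the true status
    return (theta * err_1 + (1 - theta) * err_2) / horizon


exact = average_difference(0.2, 0.5, 0.75, 0.6, 0.5)
estimate = simulate_average_difference(0.2, 0.5, 0.75, 0.6, 0.5, horizon=50000.0, seed=1)
print(exact, estimate)
```

For a long horizon the simulated time-average should be close to the closed form, which is the ergodicity argument used to obtain (13).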

4. Optimization of Average Difference

In this section, we solve the optimization problem in (8). Using

Δ_{i}

in (16) in (7), we rewrite (8) as

\begin{matrix} \min_{\{s_{i}, c_{i}\}} & \sum_{i = 1}^{n} \frac{μ_{i} λ_{i}}{μ_{i} + λ_{i}} \frac{θ c_{i} + (1 - θ) s_{i}}{μ_{i} c_{i} + λ_{i} s_{i} + c_{i} s_{i}} \\ \text{s.t.} & \sum_{i = 1}^{n} s_{i} + \sum_{i = 1}^{n} c_{i} \leq C \\ & s_{i} \geq 0, c_{i} \geq 0, i = 1, \dots, n . \end{matrix}

(17)

We define the Lagrangian function [104] for (17) as

\begin{matrix} L = & \sum_{i = 1}^{n} \frac{μ_{i} λ_{i}}{μ_{i} + λ_{i}} \frac{θ c_{i} + (1 - θ) s_{i}}{μ_{i} c_{i} + λ_{i} s_{i} + c_{i} s_{i}} + β \left(\sum_{i = 1}^{n} (s_{i} + c_{i}) - C\right) - \sum_{i = 1}^{n} ν_{i} s_{i} - \sum_{i = 1}^{n} η_{i} c_{i}, \end{matrix}

(18)

where

β \geq 0

,

ν_{i} \geq 0

, and

η_{i} \geq 0

. The KKT conditions are

\begin{matrix} \frac{\partial L}{\partial s_{i}} = & \frac{μ_{i} λ_{i} c_{i}}{μ_{i} + λ_{i}} \frac{(1 - θ) μ_{i} - θ (c_{i} + λ_{i})}{{(μ_{i} c_{i} + λ_{i} s_{i} + s_{i} c_{i})}^{2}} + β - ν_{i} = 0, \end{matrix}

(19)

\begin{matrix} \frac{\partial L}{\partial c_{i}} = & \frac{μ_{i} λ_{i} s_{i}}{μ_{i} + λ_{i}} \frac{θ λ_{i} - (1 - θ) (μ_{i} + s_{i})}{{(μ_{i} c_{i} + λ_{i} s_{i} + s_{i} c_{i})}^{2}} + β - η_{i} = 0, \end{matrix}

(20)

for all i. The complementary slackness conditions are

\begin{matrix} β \left(\sum_{i = 1}^{n} (s_{i} + c_{i}) - C\right) = 0, ν_{i} s_{i} = 0, η_{i} c_{i} = 0 . \end{matrix}

(21)

First, we find

s_{i}

. From (19), we have

\begin{matrix} {(μ_{i} c_{i} + λ_{i} s_{i} + s_{i} c_{i})}^{2} = \frac{μ_{i} λ_{i} c_{i}}{μ_{i} + λ_{i}} \frac{θ (c_{i} + λ_{i}) - (1 - θ) μ_{i}}{β - ν_{i}} . \end{matrix}

(22)

When

θ (c_{i} + λ_{i}) \geq (1 - θ) μ_{i}

, we solve (22) for

s_{i}

as

\begin{matrix} s_{i} = \frac{μ_{i} c_{i}}{λ_{i} + c_{i}} {(\sqrt{\frac{1}{μ_{i} c_{i}} \frac{λ_{i}}{μ_{i} + λ_{i}} \frac{θ (c_{i} + λ_{i}) - (1 - θ) μ_{i}}{β}} - 1)}^{+}, \end{matrix}

(23)

where we used the fact that we either have

s_{i} > 0

and

ν_{i} = 0

, or

s_{i} = 0

and

ν_{i} \geq 0

, due to (21). Here,

{(\cdot)}^{+} = \max (\cdot, 0)

. On the other hand, when

θ (c_{i} + λ_{i}) < (1 - θ) μ_{i}

, we have

\frac{\partial Δ_{i}}{\partial s_{i}} > 0

, and thus it is optimal to choose

s_{i} = 0

as our aim is to minimize

Δ

in (7). In this case, when

s_{i} = 0

, we have

Δ_{i} = \frac{θ λ_{i}}{μ_{i} + λ_{i}}

which is independent of the value of

c_{i}

. As we obtain the same

Δ_{i}

for all values of

c_{i}

, and the total test rate is limited, i.e.,

\sum_{i = 1}^{n} (s_{i} + c_{i}) \leq C

, in this case, it is optimal to choose

c_{i} = 0

as well (i.e., when

s_{i} = 0

).

Next, we find

c_{i}

. From (20), we have

\begin{matrix} {(μ_{i} c_{i} + λ_{i} s_{i} + s_{i} c_{i})}^{2} = \frac{μ_{i} λ_{i} s_{i}}{μ_{i} + λ_{i}} \frac{(1 - θ) (μ_{i} + s_{i}) - θ λ_{i}}{β - η_{i}} . \end{matrix}

(24)

When

(1 - θ) (μ_{i} + s_{i}) \geq θ λ_{i}

, we solve (24) for

c_{i}

as

\begin{matrix} c_{i} = \frac{λ_{i} s_{i}}{μ_{i} + s_{i}} {(\sqrt{\frac{1}{λ_{i} s_{i}} \frac{μ_{i}}{μ_{i} + λ_{i}} \frac{(1 - θ) (s_{i} + μ_{i}) - θ λ_{i}}{β}} - 1)}^{+}, \end{matrix}

(25)

where we used the fact that we either have

c_{i} > 0

and

η_{i} = 0

, or

c_{i} = 0

and

η_{i} \geq 0

, due to (21). Similarly, when

(1 - θ) (s_{i} + μ_{i}) < θ λ_{i}

, we have

\frac{\partial Δ_{i}}{\partial c_{i}} > 0

. Thus, in this case, it is optimal to choose

c_{i} = 0

. When

c_{i} = 0

, we have

Δ_{i} = \frac{(1 - θ) μ_{i}}{μ_{i} + λ_{i}}

which is independent of the value of

s_{i}

. Thus, it is optimal to choose

s_{i} = 0

when

c_{i} = 0

.
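The per-person updates in (23) and (25) can be sketched as two small functions of the other rate and the multiplier β. The code below is an illustration under the assumption β > 0; the function names are hypothetical. The final lines check numerically that the value returned by (23) is a local minimizer of Δ_i + β s_i for a fixed c_i, which is what the stationarity condition (19) with ν_i = 0 expresses.

```python
import math


def s_update(lam, mu, c, theta, beta):
    """Eq. (23): rate for a person marked healthy, given c and beta > 0."""
    gap = theta * (c + lam) - (1 - theta) * mu
    if c <= 0 or gap <= 0:
        return 0.0                      # testing would not reduce the error
    phi = gap * lam / ((mu + lam) * mu * c)
    return mu * c / (lam + c) * max(math.sqrt(phi / beta) - 1.0, 0.0)


def c_update(lam, mu, s, theta, beta):
    """Eq. (25): rate for a person marked infected, given s and beta > 0."""
    gap = (1 - theta) * (s + mu) - theta * lam
    if s <= 0 or gap <= 0:
        return 0.0
    phi = gap * mu / ((mu + lam) * lam * s)
    return lam * s / (mu + s) * max(math.sqrt(phi / beta) - 1.0, 0.0)


def penalized_error(lam, mu, s, c, theta, beta):
    """Per-person term of Eq. (16) plus the linear price beta * (s + c)."""
    delta = ((mu * lam) / (mu + lam)
             * (theta * c + (1 - theta) * s) / (mu * c + lam * s + c * s))
    return delta + beta * (s + c)


# Illustrative values only: lam = mu = c = 1, theta = 0.5, beta = 0.01.
s_star = s_update(1.0, 1.0, 1.0, 0.5, 0.01)
f = lambda s: penalized_error(1.0, 1.0, s, 1.0, 0.5, 0.01)
print(s_star, f(s_star) <= f(s_star - 0.1), f(s_star) <= f(s_star + 0.1))
```

The early return for a non-positive gap mirrors the case distinction in the text: when the bracketed term is negative, the error is increasing in the rate and the rate is set to zero.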

From (23), if

\frac{1}{μ_{i} c_{i}} \frac{λ_{i}}{μ_{i} + λ_{i}} (θ (c_{i} + λ_{i}) - (1 - θ) μ_{i}) \leq β

, we must have

s_{i} = 0

. Thus, for a given

c_{i}

, the optimal test rate allocation policy for

s_{i}

is a threshold policy where

s_{i}

’s with small

\frac{1}{μ_{i} c_{i}} \frac{λ_{i}}{μ_{i} + λ_{i}} (θ (c_{i} + λ_{i}) - (1 - θ) μ_{i})

are equal to zero. Similarly, from (25), if

\frac{1}{λ_{i} s_{i}} \frac{μ_{i}}{μ_{i} + λ_{i}} ((1 - θ) (s_{i} + μ_{i}) - θ λ_{i}) \leq β

, we must have

c_{i} = 0

. Thus, for a given

s_{i}

, the optimal policy to determine

c_{i}

is a threshold policy where

c_{i}

’s with small

\frac{1}{λ_{i} s_{i}} \frac{μ_{i}}{μ_{i} + λ_{i}} ((1 - θ) (s_{i} + μ_{i}) - θ λ_{i})

are equal to zero.

Next, we show that in the optimal policy, if

s_{i} > 0

and

c_{i} > 0

for some i, then the total test rate constraint must be satisfied with equality, i.e.,

\sum_{i = 1}^{n} (s_{i} + c_{i}) = C

.

Lemma 1.

In the optimal policy, if

s_{i} > 0

and

c_{i} > 0

for some i, then we have

\sum_{i = 1}^{n} (s_{i} + c_{i}) = C

.

Proof of Lemma 1.

The derivatives of

Δ_{i}

with respect to

s_{i}

and

c_{i}

are

\begin{matrix} \frac{\partial Δ_{i}}{\partial s_{i}} = \frac{μ_{i} λ_{i} c_{i}}{μ_{i} + λ_{i}} \frac{(1 - θ) μ_{i} - θ (c_{i} + λ_{i})}{{(c_{i} μ_{i} + s_{i} c_{i} + λ_{i} s_{i})}^{2}}, \end{matrix}

(26)

\begin{matrix} \frac{\partial Δ_{i}}{\partial c_{i}} = \frac{μ_{i} λ_{i} s_{i}}{μ_{i} + λ_{i}} \frac{θ λ_{i} - (1 - θ) (s_{i} + μ_{i})}{{(c_{i} μ_{i} + s_{i} c_{i} + λ_{i} s_{i})}^{2}} . \end{matrix}

(27)

We note that

s_{i} > 0

in (23) implies that

θ (c_{i} + λ_{i}) > (1 - θ) μ_{i}

. In this case, we have

\frac{\partial Δ_{i}}{\partial s_{i}} < 0

. Similarly,

c_{i} > 0

in (25) implies that

(1 - θ) (s_{i} + μ_{i}) > θ λ_{i}

. Thus, we have

\frac{\partial Δ_{i}}{\partial c_{i}} < 0

. Therefore, in the optimal policy, if we have

s_{i} > 0

and

c_{i} > 0

for some i, then we must have

\sum_{i = 1}^{n} (s_{i} + c_{i}) = C

. Otherwise, we can further decrease

Δ

in (7) by increasing

c_{i}

or

s_{i}

. □

Next, we propose an alternating minimization-based algorithm for finding

s_{i}

and

c_{i}

. For this purpose, for given initial

(s_{i}, c_{i})

pairs, we define

ϕ_{i}

as

\begin{matrix} ϕ_{i} = \begin{cases} \frac{1}{μ_{i} c_{i}} \frac{λ_{i}}{μ_{i} + λ_{i}} (θ (c_{i} + λ_{i}) - (1 - θ) μ_{i}), & i = 1, \dots, n, \\ \frac{1}{λ_{i - n} s_{i - n}} \frac{μ_{i - n}}{μ_{i - n} + λ_{i - n}} ((1 - θ) (s_{i - n} + μ_{i - n}) - θ λ_{i - n}), & i = n + 1, \dots, 2 n . \end{cases} \end{matrix}

(28)

Then, we define

u_{i}

as

\begin{matrix} u_{i} = \begin{cases} \frac{μ_{i} c_{i}}{λ_{i} + c_{i}} {(\sqrt{\frac{ϕ_{i}}{β}} - 1)}^{+}, & i = 1, \dots, n, \\ \frac{λ_{i - n} s_{i - n}}{μ_{i - n} + s_{i - n}} {(\sqrt{\frac{ϕ_{i}}{β}} - 1)}^{+}, & i = n + 1, \dots, 2 n . \end{cases} \end{matrix}

(29)

From (23) and (25),

s_{i} = u_{i}

and

c_{i} = u_{n + i}

, for

i = 1, \dots, n

.

Next, we find

s_{i}

and

c_{i}

by determining

β

in (29). First, assume that, in the optimal policy, there is an i such that

s_{i} > 0

and

c_{i} > 0

. Thus, by Lemma 1, we must have

\sum_{i = 1}^{n} (s_{i} + c_{i}) = C

. We initially take random

(s_{i}, c_{i})

pairs such that

\sum_{i = 1}^{n} (s_{i} + c_{i}) = C

. Then, given the initial

(s_{i}, c_{i})

pairs, we immediately choose

u_{i} = 0

for

ϕ_{i} < 0

. For the remaining

u_{i}

with

ϕ_{i} \geq 0

, we apply a solution method similar to that in [80]. By assuming

ϕ_{i} \geq β

, i.e., by disregarding

{(\cdot)}^{+}

in (29), we solve

\sum_{i = 1}^{2 n} u_{i} = C

for

β

. Then, we compare the smallest

ϕ_{i}

which is larger than zero in (28) with

β

. If we have

ϕ_{i} \geq β

, then it implies that

u_{i} \geq 0

for all remaining i. Thus, we have obtained

u_{i}

values for given initial (

s_{i}, c_{i}

) pairs. If the smallest

ϕ_{i}

which is larger than zero is smaller than

β

, then the corresponding

u_{i}

is negative and we should choose

u_{i} = 0

for the smallest non-negative

ϕ_{i}

. Then, we repeat this procedure until the smallest non-negative

ϕ_{i}

is larger than

β

. After determining all

u_{i}

, we obtain

s_{i} = u_{i}

and

c_{i} = u_{n + i}

for

i = 1, \dots, n

. Then, with the updated values of

(s_{i}, c_{i})

pairs, we keep finding

u_{i}

’s until the KKT conditions in (19) and (20) are satisfied.
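One pass of the procedure described above can be sketched as follows. This is an illustrative reading of the text, not the authors' implementation: all rates are updated simultaneously from the current iterate, β is obtained in closed form from the budget equation over the active set, and the entry with the smallest ϕ is dropped whenever it would receive a negative rate. The function name `water_filling_step` is hypothetical.

```python
import math


def water_filling_step(lam, mu, s, c, theta, C):
    """One simultaneous update of all (s_i, c_i) via Eqs. (28) and (29).

    Returns (s_new, c_new, beta); beta is None if no rate can be positive.
    """
    n = len(lam)
    a = [0.0] * (2 * n)      # prefactors multiplying (sqrt(phi/beta) - 1) in Eq. (29)
    phi = [0.0] * (2 * n)    # Eq. (28)
    for i in range(n):
        if c[i] > 0:         # entry i updates s_i from the current c_i
            a[i] = mu[i] * c[i] / (lam[i] + c[i])
            phi[i] = (lam[i] / (mu[i] + lam[i])
                      * (theta * (c[i] + lam[i]) - (1 - theta) * mu[i])
                      / (mu[i] * c[i]))
        if s[i] > 0:         # entry n + i updates c_i from the current s_i
            a[n + i] = lam[i] * s[i] / (mu[i] + s[i])
            phi[n + i] = (mu[i] / (mu[i] + lam[i])
                          * ((1 - theta) * (s[i] + mu[i]) - theta * lam[i])
                          / (lam[i] * s[i]))
    # Entries with phi <= 0 are fixed at zero; the rest are sorted by phi.
    active = sorted((k for k in range(2 * n) if phi[k] > 0 and a[k] > 0),
                    key=lambda k: phi[k])
    u = [0.0] * (2 * n)
    beta = None
    while active:
        # Solve sum_k a_k (sqrt(phi_k / beta) - 1) = C for sqrt(beta).
        root_beta = (sum(a[k] * math.sqrt(phi[k]) for k in active)
                     / (C + sum(a[k] for k in active)))
        if math.sqrt(phi[active[0]]) >= root_beta:
            beta = root_beta ** 2
            for k in active:
                u[k] = a[k] * (math.sqrt(phi[k]) / root_beta - 1.0)
            break
        active.pop(0)        # smallest phi would give a negative rate
    return u[:n], u[n:], beta


# Illustrative population of three people with a uniform starting point.
lam, mu = [0.2, 0.5, 1.0], [0.5, 0.5, 0.5]
s1, c1, beta = water_filling_step(lam, mu, [0.5] * 3, [0.5] * 3, theta=0.5, C=3.0)
print(s1, c1, beta)
```

Repeating this step from several random feasible starting points and keeping the iterate with the smallest objective mirrors the procedure in the text. Convergence of this simplified loop is not guaranteed, so an iteration cap would be a sensible addition in practice.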

We note that for indices (persons) i for which

(s_{i}, c_{i})

are zero, the health care provider does not perform any tests, and marks these people as either always infected, i.e.,

{\hat{x}}_{i} (t) = 1

for all t, or always healthy, i.e.,

{\hat{x}}_{i} (t) = 0

. If

{\hat{x}}_{i} (t) = 0

for all t,

Δ_{i} = \frac{θ λ_{i}}{μ_{i} + λ_{i}}

, and if

{\hat{x}}_{i} (t) = 1

for all t,

Δ_{i} = \frac{(1 - θ) μ_{i}}{μ_{i} + λ_{i}}

. Thus, for such i, the health care provider should choose

{\hat{x}}_{i} (t) = 0

for all t, if

\frac{θ λ_{i}}{μ_{i} + λ_{i}} < \frac{(1 - θ) μ_{i}}{μ_{i} + λ_{i}}

, and should choose

{\hat{x}}_{i} (t) = 1

for all t, otherwise, without performing any tests.
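The rule for untested people amounts to comparing two constants. A minimal sketch follows, with the hypothetical function name `default_estimate`:

```python
def default_estimate(lam, mu, theta):
    """Fixed estimate for a person who is never tested (s_i = c_i = 0).

    Returns (x_hat, error): the constant estimate with the smaller
    long-term weighted error, and that error.
    """
    err_if_healthy = theta * lam / (mu + lam)          # x_hat(t) = 0 for all t
    err_if_infected = (1 - theta) * mu / (mu + lam)    # x_hat(t) = 1 for all t
    if err_if_healthy < err_if_infected:
        return 0, err_if_healthy
    return 1, err_if_infected


print(default_estimate(0.2, 0.5, 0.5))   # rarely infected: marked healthy
print(default_estimate(1.0, 0.5, 0.5))   # frequently infected: marked infected
```

Since both errors share the denominator μ_i + λ_i, the comparison reduces to θ λ_i versus (1 − θ) μ_i.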

Finally, we note that the problem in (17) is not a convex optimization problem as the objective function is not jointly convex in

s_{i}

and

c_{i}

. Therefore, the solutions obtained via the proposed method may not be globally optimal. For this reason, we select different initial starting points and apply the proposed alternating minimization-based algorithm and choose the solution that achieves the smallest

Δ

in (7).

In the next section, we first provide an alternative method to find the average difference

Δ

in (7) and then characterize the average difference when the test measurements are erroneous.

5. Average Difference for the Case with Erroneous Test Measurements

We note that the infection status of the ith person and its estimate at the health care provider form a continuous time Markov chain (Section 7.5 of [105]) with the states

(x_{i} (t), {\hat{x}}_{i} (t)) \in \{(0, 0), (0, 1), (1, 0), (1, 1)\}

. In this section, by finding the steady-state distribution for

(x_{i} (t), {\hat{x}}_{i} (t))

, we provide an alternative method to find

Δ

in (7). Then, we consider the case with erroneous test measurements. For this case, we characterize the long-term average difference for the ith person denoted by

Δ_{i}^{e}

.

5.1. An Alternative Method to Characterize Average Difference

When there is no error in the tests, the state transition graph is shown in Figure 4a. Assuming that

s_{i} > 0

,

c_{i} > 0

, every state is accessible from any other state, and thus, the Markov chain induced by the system is irreducible. Recall from Section 4 that the test rates for some people can be equal to 0, i.e.,

s_{i} = 0

and

c_{i} = 0

. For these people, we choose

{\hat{x}}_{i} (t)

to be either always 0 or 1, i.e., consider them as always healthy or sick all the time. Depending on the choice of

{\hat{x}}_{i} (t)

, when

s_{i} = 0

and

c_{i} = 0

, either the states

(0, 0)

and

(1, 0)

, or the states

(0, 1)

and

(1, 1)

will be transient and thus have probability 0 in the steady state. By using a small time-step approximation to a discrete time Markov chain, one can show that the self-transition probabilities are non-zero, and thus, the Markov chain induced by the system is also aperiodic (Section 7.5 of [105]). Therefore, the Markov chain shown in Figure 4a admits a unique stationary distribution given by

π = {π_{00}, π_{01}, π_{10}, π_{11}}

. We find the stationary distribution by writing the local-balance equations which are given as

\begin{matrix} π_{00} λ_{i} = & π_{10} μ_{i} + π_{01} c_{i}, \end{matrix}

(30)

\begin{matrix} π_{10} (μ_{i} + s_{i}) = & π_{00} λ_{i}, \end{matrix}

(31)

\begin{matrix} π_{01} (c_{i} + λ_{i}) = & π_{11} μ_{i}, \end{matrix}

(32)

\begin{matrix} π_{11} μ_{i} = & π_{10} s_{i} + π_{01} λ_{i} . \end{matrix}

(33)

By using (30)–(33) and

\sum_{k = 0}^{1} \sum_{ℓ = 0}^{1} π_{k ℓ} = 1

, we find the steady-state distribution

π

as

\begin{matrix} π_{01} = & \frac{μ_{i} λ_{i}}{μ_{i} + λ_{i}} \frac{s_{i}}{μ_{i} c_{i} + λ_{i} s_{i} + c_{i} s_{i}}, \end{matrix}

(34)

\begin{matrix} π_{10} = & \frac{μ_{i} λ_{i}}{μ_{i} + λ_{i}} \frac{c_{i}}{μ_{i} c_{i} + λ_{i} s_{i} + c_{i} s_{i}}, \end{matrix}

(35)

and

π_{00} = \frac{μ_{i} + s_{i}}{λ_{i}} π_{10}

, and

π_{11} = \frac{c_{i} + λ_{i}}{μ_{i}} π_{01}

. We note that

Δ_{i 1}

in (14) is also equal to

π_{10}

in (35), i.e., we have

Δ_{i 1} = π_{10}

. Similarly,

Δ_{i 2}

in (15) is equal to

π_{01}

in (34). Thus, by observing that the states

(x_{i} (t), {\hat{x}}_{i} (t))

form a continuous time Markov chain, we can find the average difference

Δ

in (6) by finding the steady-state distribution for

π

. This method will be particularly useful in the following section where we consider the case with erroneous test measurements.
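
As a sanity check of (34) and (35), the stationary distribution can also be computed numerically from the transition rates in (30)–(33). The sketch below is a minimal illustration with NumPy; the function names and the numerical rate values are assumptions chosen for the example, and it assumes s > 0 and c > 0.

```python
import numpy as np

def stationary_no_error(lam, mu, s, c):
    """Stationary distribution over (x, x_hat) in the order (0,0), (0,1), (1,0), (1,1)."""
    # Generator matrix Q: Q[a, b] is the transition rate from state a to state b.
    Q = np.zeros((4, 4))
    Q[0, 2] = lam   # (0,0) -> (1,0): infection, not yet detected
    Q[1, 0] = c     # (0,1) -> (0,0): test reveals the recovery
    Q[1, 3] = lam   # (0,1) -> (1,1): infected again while still marked infected
    Q[2, 0] = mu    # (1,0) -> (0,0): recovery before detection
    Q[2, 3] = s     # (1,0) -> (1,1): test reveals the infection
    Q[3, 1] = mu    # (1,1) -> (0,1): recovery, not yet detected
    np.fill_diagonal(Q, -Q.sum(axis=1))
    # Solve pi Q = 0 together with sum(pi) = 1 as one linear system.
    A = np.vstack([Q.T, np.ones(4)])
    b = np.array([0.0, 0.0, 0.0, 0.0, 1.0])
    pi, *_ = np.linalg.lstsq(A, b, rcond=None)
    return pi

def closed_form(lam, mu, s, c):
    """pi_01 and pi_10 as given in (34) and (35)."""
    common = mu * lam / ((mu + lam) * (mu * c + lam * s + c * s))
    return common * s, common * c

pi = stationary_no_error(lam=0.6, mu=0.4, s=2.0, c=1.0)
pi01, pi10 = closed_form(lam=0.6, mu=0.4, s=2.0, c=1.0)
print(pi[1], pi01)  # numerical vs. closed-form pi_01
print(pi[2], pi10)  # numerical vs. closed-form pi_10
```

The same construction carries over to the erroneous-test chain of the next subsection by adding the two error transitions.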

5.2. Average Difference with Erroneous Test Measurements

In this section, we consider the case where the test measurements can be erroneous. When a test is applied to an infected person, i.e., when

x_{i} (t) = 1

, the test result will be 0 with probability q and 1 with probability

1 - q

, where

0 \leq q < \frac{1}{2}

. In other words, the false-negative probability is equal to q. Similarly, when a test is applied to a healthy person, i.e., when

x_{i} (t) = 0

, the test result will be 1 with probability p and 0 with probability

1 - p

, where

0 \leq p < \frac{1}{2}

. Thus, the false-positive probability is equal to p. The probability distribution for the test measurements is provided in Table 2.

In this section, we consider the case where the health care provider applies only one test rate

v_{i}

to the ith person, whether the person is currently marked as healthy or infected. That is, we do not consider separate testing rates of

s_{i}

and

c_{i}

for healthy and infected people as we did before; instead, here both

s_{i}

and

c_{i}

are equal to

v_{i}

. Since the health care provider applies the same test rate for the ith person, here we do not consider the importance factor

θ

either. Then, we define the long-term average difference for the ith person with the error on the test measurements as follows, where the superscript e stands for “erroneous”.

\begin{matrix} Δ_{i}^{e} = Δ_{i 1}^{e} + Δ_{i 2}^{e}, \end{matrix}

(36)

and the definitions of

Δ_{i 1}^{e}

and

Δ_{i 2}^{e}

follow similarly from (13). We note that with the test rates

v_{i}

and errors on the test measurements, the states

(x_{i} (t), {\hat{x}}_{i} (t))

form a continuous time Markov chain, and the corresponding state transition graph is shown in Figure 4b. Assuming that

v_{i} > 0

, one can show that there is a unique steady-state distribution

π^{e} = {π_{00}^{e}, π_{01}^{e}, π_{10}^{e}, π_{11}^{e}}

which can be found by solving the local balance equations which are given as follows

\begin{matrix} π_{00}^{e} (v_{i} p + λ_{i}) = & π_{01}^{e} v_{i} (1 - p) + π_{10}^{e} μ_{i}, \end{matrix}

(37)

\begin{matrix} π_{10}^{e} (v_{i} (1 - q) + μ_{i}) = & π_{00}^{e} λ_{i} + π_{11}^{e} v_{i} q, \end{matrix}

(38)

\begin{matrix} π_{01}^{e} (v_{i} (1 - p) + λ_{i}) = & π_{00}^{e} v_{i} p + π_{11}^{e} μ_{i}, \end{matrix}

(39)

\begin{matrix} π_{11}^{e} (v_{i} q + μ_{i}) = & π_{10}^{e} v_{i} (1 - q) + π_{01}^{e} λ_{i} . \end{matrix}

(40)

Then, by using (37)–(40) and

\sum_{k = 0}^{1} \sum_{ℓ = 0}^{1} π_{k ℓ}^{e} = 1

, we find the steady-state distribution

π^{e}

as

\begin{matrix} π_{00}^{e} = & \frac{μ_{i} λ_{i} q + (1 - p) μ_{i} (v_{i} + μ_{i})}{(λ_{i} + μ_{i}) (λ_{i} + μ_{i} + v_{i})}, \end{matrix}

(41)

\begin{matrix} π_{01}^{e} = & \frac{μ_{i} λ_{i} (1 - q) + p μ_{i} (v_{i} + μ_{i})}{(λ_{i} + μ_{i}) (λ_{i} + μ_{i} + v_{i})}, \end{matrix}

(42)

\begin{matrix} π_{10}^{e} = & \frac{μ_{i} λ_{i} (1 - p) + q λ_{i} (v_{i} + λ_{i})}{(λ_{i} + μ_{i}) (λ_{i} + μ_{i} + v_{i})}, \end{matrix}

(43)

\begin{matrix} π_{11}^{e} = & \frac{μ_{i} λ_{i} p + (1 - q) λ_{i} (v_{i} + λ_{i})}{(λ_{i} + μ_{i}) (λ_{i} + μ_{i} + v_{i})} . \end{matrix}

(44)

We note that

Δ_{i 1}^{e}

, and

Δ_{i 2}^{e}

are equal to

π_{10}^{e}

in (43), and

π_{01}^{e}

in (42), respectively. Thus, if

v_{i} > 0

, then

Δ_{i}^{e}

in (36) becomes

\begin{matrix} Δ_{i}^{e} = \frac{p μ_{i}^{2} + q λ_{i}^{2} + (2 - p - q) μ_{i} λ_{i} + v_{i} (p μ_{i} + q λ_{i})}{(λ_{i} + μ_{i}) (λ_{i} + μ_{i} + v_{i})} . \end{matrix}

(45)

We immediately note that if false-positive test probability p and false-negative test probability q are equal to 0,

Δ_{i}^{e}

becomes

\frac{2 μ_{i} λ_{i}}{(λ_{i} + μ_{i}) (λ_{i} + μ_{i} + v_{i})}

which is equal to

Δ_{i 1} + Δ_{i 2}

provided in (14) and (15), respectively, when

v_{i} = s_{i} = c_{i}

. Then,

\frac{\partial Δ_{i}^{e}}{\partial p} \geq 0

is equivalent to

v_{i} + μ_{i} - λ_{i} \geq 0

and

\frac{\partial Δ_{i}^{e}}{\partial q} \geq 0

is equivalent to

v_{i} + λ_{i} - μ_{i} \geq 0

which means that depending on the values of

v_{i}

,

μ_{i}

, and

λ_{i}

, the long-term average difference

Δ_{i}^{e}

can be an increasing function of only p or only q, or both p and q, but

Δ_{i}^{e}

cannot be a decreasing function of both p and q. This is expected as false-negative and false-positive tests negatively affect the estimation process. One can also show that

\frac{\partial Δ_{i}^{e}}{\partial v_{i}} < 0

and

\frac{\partial^{2} Δ_{i}^{e}}{\partial v_{i}^{2}} > 0

which means that

Δ_{i}^{e}

decreases with

v_{i}

and is a convex function of the test rate

v_{i}

.
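
The closed form in (45) can be cross-checked against a direct numerical solution of the balance equations (37)–(40). The following is a minimal sketch under assumed parameter values; the function names are hypothetical, and the check assumes v > 0 and p, q < 1/2.

```python
import numpy as np

def stationary_with_errors(lam, mu, v, p, q):
    """Stationary distribution over (x, x_hat) in the order (0,0), (0,1), (1,0), (1,1)."""
    Q = np.zeros((4, 4))
    Q[0, 1] = v * p          # (0,0) -> (0,1): false positive
    Q[0, 2] = lam            # (0,0) -> (1,0): infection
    Q[1, 0] = v * (1 - p)    # (0,1) -> (0,0): correct negative result
    Q[1, 3] = lam            # (0,1) -> (1,1): infection
    Q[2, 0] = mu             # (1,0) -> (0,0): recovery
    Q[2, 3] = v * (1 - q)    # (1,0) -> (1,1): correct positive result
    Q[3, 1] = mu             # (1,1) -> (0,1): recovery
    Q[3, 2] = v * q          # (1,1) -> (1,0): false negative
    np.fill_diagonal(Q, -Q.sum(axis=1))
    A = np.vstack([Q.T, np.ones(4)])
    b = np.array([0.0, 0.0, 0.0, 0.0, 1.0])
    pi, *_ = np.linalg.lstsq(A, b, rcond=None)
    return pi

def delta_e(lam, mu, v, p, q):
    """Average difference with erroneous tests, as in (45)."""
    num = p * mu**2 + q * lam**2 + (2 - p - q) * mu * lam + v * (p * mu + q * lam)
    return num / ((lam + mu) * (lam + mu + v))

pi_e = stationary_with_errors(lam=0.6, mu=0.4, v=2.0, p=0.1, q=0.2)
print(pi_e[1] + pi_e[2], delta_e(0.6, 0.4, 2.0, 0.1, 0.2))  # pi_01 + pi_10 vs. (45)
```

Evaluating `delta_e` on a grid of test rates is also a quick way to observe the monotone decrease in the test rate stated above.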

Next, we consider the case when

v_{i} = 0

. Note that when

v_{i} = 0

, the health care provider marks each such person as either always sick or always healthy depending on that person's infection and recovery rates. Thus, when

v_{i} = 0

and depending on the estimate

{\hat{x}}_{i} (t)

, two of the states in Figure 4b will never be visited and thus, these states will have 0 steady-state probabilities. For this case, the steady states are given by

{\bar{π}}_{1, {\hat{x}}_{i}}^{e}

and

{\bar{π}}_{0, {\hat{x}}_{i}}^{e}

. The local balance equation is

λ_{i} {\bar{π}}_{0, {\hat{x}}_{i}}^{e} = μ_{i} {\bar{π}}_{1, {\hat{x}}_{i}}^{e}

. By using

{\bar{π}}_{0, {\hat{x}}_{i}}^{e} + {\bar{π}}_{1, {\hat{x}}_{i}}^{e} = 1

, we find the steady-state distribution as

{\bar{π}}_{0, {\hat{x}}_{i}}^{e} = \frac{μ_{i}}{μ_{i} + λ_{i}}

, and

{\bar{π}}_{1, {\hat{x}}_{i}}^{e} = \frac{λ_{i}}{μ_{i} + λ_{i}}

. Thus, if

μ_{i} < λ_{i}

, i.e., if people are infected more frequently, then the health care provider chooses its estimate as

{\hat{x}}_{i} (t) = 1

and,

Δ_{i}^{e} = \frac{μ_{i}}{μ_{i} + λ_{i}}

. If

μ_{i} \geq λ_{i}

, i.e., if people stay healthy more often, then we have

{\hat{x}}_{i} (t) = 0

, and

Δ_{i}^{e} = \frac{λ_{i}}{μ_{i} + λ_{i}}

. Therefore, when

v_{i} = 0

, we have

\begin{matrix} Δ_{i}^{e} = min \{\frac{μ_{i}}{μ_{i} + λ_{i}}, \frac{λ_{i}}{μ_{i} + λ_{i}}\} . \end{matrix}

(46)

In order to find the optimal test rates

v_{i}

in the case of errors on the test measurements, we formulate the following optimization problem

\begin{matrix} min_{{v_{i}}} & \sum_{i = 1}^{n} 𝟙 {v_{i} > 0} \frac{p μ_{i}^{2} + q λ_{i}^{2} + (2 - p - q) μ_{i} λ_{i} + v_{i} (p μ_{i} + q λ_{i})}{(λ_{i} + μ_{i}) (λ_{i} + μ_{i} + v_{i})} \\ + 𝟙 {v_{i} = 0} min \{\frac{μ_{i}}{μ_{i} + λ_{i}}, \frac{λ_{i}}{μ_{i} + λ_{i}}\} \\ s . t . & \sum_{i = 1}^{n} v_{i} \leq C \\ v_{i} \geq 0, i = 1, \dots, n, \end{matrix}

(47)

where the objective function is given by the summation of

Δ_{i}^{e}

in (45) when

v_{i} > 0

and

Δ_{i}^{e}

in (46) when

v_{i} = 0

over all people and

𝟙 {.}

is the indicator function taking value 1 when

{\cdot}

is true and 0, otherwise. In (47), we have a constraint on the total test rate, i.e.,

\sum_{i = 1}^{n} v_{i} \leq C

. We note that the optimization problem in (47) is in general not convex due to the indicator function in the objective function. However, for a given set of

𝟙 {v_{i} = 0}

, the optimization problem in (47) is convex and can be solved optimally. Thus, by solving the problem in (47) for all possible sets of

𝟙 {v_{i} = 0}

, we can determine the globally optimal solution. This requires solving

2^{n}

different optimization problems, which can be impractical for large n. For this reason, we next provide a greedy algorithm to solve the optimization problem in (47).

In the greedy solution, initially, assuming that

𝟙 {v_{i} > 0} = 1

for all i, we consider the following optimization problem

\begin{matrix} min_{{v_{i}}} & \sum_{i = 1}^{n} \frac{p μ_{i}^{2} + q λ_{i}^{2} + (2 - p - q) μ_{i} λ_{i} + v_{i} (p μ_{i} + q λ_{i})}{(λ_{i} + μ_{i}) (λ_{i} + μ_{i} + v_{i})} \\ s . t . & \sum_{i = 1}^{n} v_{i} \leq C \\ v_{i} \geq 0, i = 1, \dots, n, \end{matrix}

(48)

where the objective function in (48) is equal to the sum over all i of

Δ_{i}^{e}

in (45). For this optimization problem, we define the Lagrangian function for (48) as

\begin{matrix} L = & \sum_{i = 1}^{n} \frac{p μ_{i}^{2} + q λ_{i}^{2} + (2 - p - q) μ_{i} λ_{i} + v_{i} (p μ_{i} + q λ_{i})}{(λ_{i} + μ_{i}) (λ_{i} + μ_{i} + v_{i})} + \bar{β} (\sum_{i = 1}^{n} v_{i} - C) - \sum_{i = 1}^{n} {\bar{ν}}_{i} v_{i}, \end{matrix}

(49)

where

\bar{β} \geq 0

,

{\bar{ν}}_{i} \geq 0

. We note that the problem defined in (48) is a convex optimization problem, and thus we can find the optimal test rates

v_{i}

by analyzing the KKT and the complementary slackness conditions. The KKT conditions are given by

\begin{matrix} \frac{\partial L}{\partial v_{i}} = & \frac{- 2 (1 - p - q) μ_{i} λ_{i}}{(μ_{i} + λ_{i}) {(μ_{i} + λ_{i} + v_{i})}^{2}} + \bar{β} - {\bar{ν}}_{i} = 0, \end{matrix}

(50)

for all i. The complementary slackness conditions are

\begin{matrix} \bar{β} (\sum_{i = 1}^{n} v_{i} - C) = 0, {\bar{ν}}_{i} v_{i} = 0 . \end{matrix}

(51)

By using (50) and (51), we find the optimal

v_{i}

values for the problem in (48) as

\begin{matrix} v_{i} = (μ_{i} + λ_{i}) {(\sqrt{\frac{μ_{i} λ_{i}}{{(μ_{i} + λ_{i})}^{3}} \frac{2 (1 - p - q)}{\bar{β}}} - 1)}^{+} . \end{matrix}

(52)

With the test rates

v_{i}

in (52) we find the average differences

Δ_{i}^{e}

in (45) and then compare them with

Δ_{i}^{e}

in (46) when

v_{i} = 0

. Due to the errors in the tests,

Δ_{i}^{e}

in (46) with

v_{i} = 0

can be smaller than

Δ_{i}^{e}

in (45) with the test rates

v_{i}

found in (52). For these people, we choose index i where the difference between

Δ_{i}^{e}

in (45) with the

v_{i}

in (52) and

Δ_{i}^{e}

in (46) is the highest. Then, we take

v_{i} = 0

as applying no test to this person can further decrease

Δ_{i}^{e}

. For the remaining people, we solve the optimization problem in (48). After obtaining the test rates for the remaining people, we again compare average differences

Δ_{i}^{e}

with the test rates in (52) and with no test and we choose

v_{i} = 0

for the person where

Δ_{i}^{e}

can be further decreased. We repeat these steps until all

Δ_{i}^{e}

s with

v_{i} > 0

cannot be further decreased by choosing

v_{i} = 0

.
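
The greedy procedure described above can be sketched as follows. This is an illustrative implementation under stated assumptions, not the authors' code: the multiplier in (52) is found by bisection (a choice made here for simplicity), all function names are hypothetical, and the infection and recovery rates follow the geometric profiles used later in the numerical results.

```python
import numpy as np

def delta_tested(lam, mu, v, p, q):
    """Average difference (45), valid for v > 0."""
    num = p * mu**2 + q * lam**2 + (2 - p - q) * mu * lam + v * (p * mu + q * lam)
    return num / ((lam + mu) * (lam + mu + v))

def delta_untested(lam, mu):
    """Average difference (46) for v = 0."""
    return np.minimum(lam, mu) / (lam + mu)

def waterfill(lam, mu, p, q, C, iters=200):
    """Rates of the form (52); beta is found by bisection so that sum(v) is close to C."""
    S = lam + mu
    g = 2 * (1 - p - q) * mu * lam / S**3
    rates = lambda beta: S * np.maximum(np.sqrt(g / beta) - 1.0, 0.0)
    lo, hi = 1e-12, g.max()          # rates(hi) is all zeros, rates(lo) is very large
    for _ in range(iters):
        mid = np.sqrt(lo * hi)       # bisection on a logarithmic scale
        if rates(mid).sum() > C:
            lo = mid
        else:
            hi = mid
    return rates(hi)

def greedy_test_rates(lam, mu, p, q, C):
    active = np.ones(len(lam), dtype=bool)   # people still eligible for tests
    while True:
        v = np.zeros(len(lam))
        if active.any():
            v[active] = waterfill(lam[active], mu[active], p, q, C)
        # Positive gain: this tested person would do better with no tests at all.
        gain = np.where(v > 0,
                        delta_tested(lam, mu, v, p, q) - delta_untested(lam, mu),
                        0.0)
        worst = int(np.argmax(gain))
        if gain[worst] <= 0:
            return v
        active[worst] = False        # stop testing the person with the largest gain

# Assumed example: n = 10 people with geometric rate profiles, total test rate C = 20.
i = np.arange(1, 11)
lam = 6 * 0.9**i / np.sum(0.9**i)
mu = 4 * 1.1**i / np.sum(1.1**i)
for err in (0.1, 0.4):
    v = greedy_test_rates(lam, mu, p=err, q=err, C=20.0)
    print(err, np.round(v, 3))
```

Each pass either removes one tested person or stops, so the loop runs at most n + 1 times. As with the procedure in the text, the result is a heuristic and is not guaranteed to be globally optimal for (47).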

We note that the solution obtained in (52) has a threshold structure. As false-positive and -negative test rates increase, the term

\frac{2 (1 - p - q)}{\bar{β}}

in (52) becomes smaller. As a result, some people with higher

\sqrt{\frac{{(μ_{i} + λ_{i})}^{3}}{μ_{i} λ_{i}}}

may not be tested by the health care provider. Thus, when p and q are high, a smaller portion of the population is tested with higher test rates in order to combat the test errors.

6. Average Estimation Error with Dependent Infection Rates

In this section, we consider the case where we have two people whose infection rates depend on each other. When these two people are healthy, they can be individually infected with the virus after an exponential time with rate

λ

. When one of these two people is infected and this has not been detected by the health care provider, this person can infect the other healthy person after an exponential time with rate

λ_{12}

which has been illustrated in Figure 5. Thus, when both of the people are healthy, their individual infection rate is

λ

. However, when one of them is sick and this has not been detected by the health care provider, the healthy person’s total infection rate is equal to

λ + λ_{12}

. On the other hand, if only one person is infected, i.e.,

x_{i} (t) = 1

, which has also been detected by the health care provider,

{\hat{x}}_{i} (t) = 1

, then we assume that we isolate the infected person from the healthy one, and thus, the healthy person’s infection rate remains as

λ

instead of

λ + λ_{12}

. When the people are infected, they recover from the disease after an exponential time with rate

μ

.

When the health care provider believes that a person is healthy, i.e.,

{\hat{x}}_{i} (t) = 0

, the next test is applied to this person after an exponential time with rate s. When the health care provider believes that a person is sick, i.e.,

{\hat{x}}_{i} (t) = 1

, the next test is applied to this person after an exponential time with rate c. Here, we note that since the two people are identical in terms of their infection and recovery rates, the health care provider applies the same test rates s and c to both of them.

Similar to Section 5, we note that the states

{x_{1} (t), {\hat{x}}_{1} (t), x_{2} (t), {\hat{x}}_{2} (t)}

form a continuous time Markov chain where the unique stationary distribution is given by

π^{d} = {π_{0000}^{d}, π_{0001}^{d}, \dots, π_{1111}^{d}}

. In order to find the stationary distribution, we write the local balance equations as follows

\begin{matrix} 2 λ π_{0000}^{d} = & μ π_{1000}^{d} + c π_{0100}^{d} + μ π_{0010}^{d} + c π_{0001}^{d}, \end{matrix}

(53)

\begin{matrix} (2 λ + c) π_{0001}^{d} = & μ π_{0011}^{d} + c π_{0101}^{d} + μ π_{1001}^{d}, \end{matrix}

(54)

\begin{matrix} (λ + λ_{12} + μ + s) π_{0010}^{d} = & c π_{0110}^{d} + μ π_{1010}^{d} + λ π_{0000}^{d}, \end{matrix}

(55)

\begin{matrix} (λ + μ) π_{0011}^{d} = & c π_{0111}^{d} + μ π_{1011}^{d} + s π_{0010}^{d} + λ π_{0001}^{d}, \end{matrix}

(56)

\begin{matrix} (2 λ + c) π_{0100}^{d} = & c π_{0101}^{d} + μ π_{0110}^{d} + μ π_{1100}^{d}, \end{matrix}

(57)

\begin{matrix} (2 λ + 2 c) π_{0101}^{d} = & μ π_{0111}^{d} + μ π_{1101}^{d}, \end{matrix}

(58)

\begin{matrix} (λ + μ + s + c) π_{0110}^{d} = & λ π_{0100}^{d} + μ π_{1110}^{d}, \end{matrix}

(59)

\begin{matrix} (λ + μ + c) π_{0111}^{d} = & s π_{0110}^{d} + λ π_{0101}^{d} + μ π_{1111}^{d}, \end{matrix}

(60)

\begin{matrix} (λ + λ_{12} + μ + s) π_{1000}^{d} = & λ π_{0000}^{d} + c π_{1001}^{d} + μ π_{1010}^{d}, \end{matrix}

(61)

\begin{matrix} (λ + μ + s + c) π_{1001}^{d} = & μ π_{1011}^{d} + λ π_{0001}^{d}, \end{matrix}

(62)

\begin{matrix} (2 μ + 2 s) π_{1010}^{d} = & (λ + λ_{12}) π_{1000}^{d} + (λ + λ_{12}) π_{0010}^{d}, \end{matrix}

(63)

\begin{matrix} (2 μ + s) π_{1011}^{d} = & s π_{1010}^{d} + λ π_{1001}^{d} + λ π_{0011}^{d}, \end{matrix}

(64)

\begin{matrix} (λ + μ) π_{1100}^{d} = & s π_{1000}^{d} + λ π_{0100}^{d} + c π_{1101}^{d} + μ π_{1110}^{d}, \end{matrix}

(65)

\begin{matrix} (λ + μ + c) π_{1101}^{d} = & s π_{1001}^{d} + λ π_{0101}^{d} + μ π_{1111}^{d}, \end{matrix}

(66)

\begin{matrix} (2 μ + s) π_{1110}^{d} = & λ π_{1100}^{d} + s π_{1010}^{d} + λ π_{0110}^{d}, \end{matrix}

(67)

\begin{matrix} 2 μ π_{1111}^{d} = & s π_{1110}^{d} + λ π_{1101}^{d} + s π_{1011}^{d} + λ π_{0111}^{d} . \end{matrix}

(68)

By using (53)–(68) and

\sum_{j = 0}^{1} \sum_{ℓ = 0}^{1} \sum_{m = 0}^{1} \sum_{h = 0}^{1} π_{j ℓ m h}^{d} = 1

, we find the stationary distribution

π^{d}

. We denote the long-term average estimation error for person i as

Δ_{i}^{d}

for

i = 1, 2

, where the superscript d stands for “dependent”, which is given by

\begin{matrix} Δ_{i}^{d} = Δ_{i 1}^{d} + Δ_{i 2}^{d}, \end{matrix}

(69)

where

Δ_{i 1}^{d}

and

Δ_{i 2}^{d}

follow from (13). Then, we have

\begin{matrix} Δ_{11}^{d} = & π_{1000}^{d} + π_{1001}^{d} + π_{1010}^{d} + π_{1011}^{d}, \end{matrix}

(70)

\begin{matrix} Δ_{12}^{d} = & π_{0100}^{d} + π_{0101}^{d} + π_{0110}^{d} + π_{0111}^{d}, \end{matrix}

(71)

\begin{matrix} Δ_{21}^{d} = & π_{0010}^{d} + π_{0110}^{d} + π_{1010}^{d} + π_{1110}^{d}, \end{matrix}

(72)

\begin{matrix} Δ_{22}^{d} = & π_{0001}^{d} + π_{0101}^{d} + π_{1001}^{d} + π_{1101}^{d} . \end{matrix}

(73)

In Section 8, for given infection, recovery and test rates, we numerically evaluate the stationary distribution and find the average difference

Δ_{i}^{d}

.
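
A numerical evaluation of this kind can be sketched by building the 16-state generator matrix from the transition rules and solving for the stationary distribution. The code below is a minimal illustration with hypothetical function names. One assumption is read off from the leaving rates in (59) and (62): the extra infection rate applies only when the infected person is undetected and the healthy person is also marked as healthy.

```python
import itertools
import numpy as np

def dependent_stationary(lam, lam12, mu, s, c):
    """Stationary distribution over states (x1, xhat1, x2, xhat2)."""
    states = list(itertools.product([0, 1], repeat=4))
    index = {st: k for k, st in enumerate(states)}
    Q = np.zeros((16, 16))
    for st in states:
        for person in (0, 1):
            x, xh = st[2 * person], st[2 * person + 1]
            ox, oxh = st[2 * (1 - person)], st[2 * (1 - person) + 1]
            moves = []
            if x == 0:
                # Cross-infection only from an undetected infected person
                # to a person who is healthy and marked as healthy.
                spread = lam12 if (ox == 1 and oxh == 0 and xh == 0) else 0.0
                moves.append((lam + spread, (1, xh)))
            else:
                moves.append((mu, (0, xh)))
            if xh != x:
                # An error-free test corrects the estimate (rate s if marked healthy, c otherwise).
                moves.append((s if xh == 0 else c, (x, x)))
            for rate, (nx, nxh) in moves:
                new = list(st)
                new[2 * person], new[2 * person + 1] = nx, nxh
                Q[index[st], index[tuple(new)]] += rate
    np.fill_diagonal(Q, -Q.sum(axis=1))
    A = np.vstack([Q.T, np.ones(16)])
    b = np.zeros(17)
    b[-1] = 1.0
    pi, *_ = np.linalg.lstsq(A, b, rcond=None)
    return states, pi

def person1_errors(states, pi):
    """Delta_11 (infected, marked healthy) and Delta_12 (healthy, marked infected)."""
    d11 = sum(p for st, p in zip(states, pi) if st[0] == 1 and st[1] == 0)
    d12 = sum(p for st, p in zip(states, pi) if st[0] == 0 and st[1] == 1)
    return d11, d12

# Assumed example values: mu = 5, lam = 2.5, s = c = 5, lam12 = 10.
states, pi_d = dependent_stationary(lam=2.5, lam12=10.0, mu=5.0, s=5.0, c=5.0)
d11, d12 = person1_errors(states, pi_d)
sick1 = sum(p for st, p in zip(states, pi_d) if st[0] == 1)
print(d11 + d12, sick1)  # Delta_1^d and the fraction of time person 1 is sick
```

Setting `lam12=0` decouples the two people, in which case the per-person errors reduce to the independent-case expressions (34) and (35), which is a convenient consistency check.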

7. Age of Incorrect Information Based Error Metric

To date, we have considered an estimation error metric that takes the value 1 if the actual infection status of a person is different than the real-time estimation at the health care provider. Thus, the error metric takes values based on the information content. On the other hand, the traditional age metric introduced in [1] considers only the time passed since the most recently received status update packet is generated at the source. As a result, the traditional age metric does not consider the information content and age alone may not be a suitable performance metric for the problem considered in our work.

In the context of infection tracking, it is important to know how long the estimations at the health care provider have been different from the actual infection status of the people. However, the error metric that we have considered thus far does not have the time component, i.e., it only takes value 1 independent of the time duration that it has been off from the actual health status. Motivated by the AoII introduced in [51,102] which accounts for both the time and the information content, in this section, we consider the following error metric, where the superscript s stands for “synchronization” implied in AoII,

\begin{matrix} Δ_{i}^{s} = (t - V_{i} (t)) 𝟙 {{\hat{x}}_{i} (t) \neq x_{i} (t)}, \end{matrix}

(74)

where

V_{i} (t)

is the last time instant where the health care provider makes an accurate estimation of the health status for the ith person, i.e., the last time instant when

Δ_{i}^{s} = 0

. Similarly, we define

\begin{matrix} Δ_{i 1}^{s} = & (t - V_{i 1} (t)) max {x_{i} (t) - {\hat{x}}_{i} (t), 0}, \end{matrix}

(75)

\begin{matrix} Δ_{i 2}^{s} = & (t - V_{i 2} (t)) max {{\hat{x}}_{i} (t) - x_{i} (t), 0}, \end{matrix}

(76)

where

V_{i 1} (t)

and

V_{i 2} (t)

are equal to the last time instants when

Δ_{i 1}^{s}

and

Δ_{i 2}^{s}

are equal to 0, respectively. A sample evolution of

Δ_{i 1}^{s}

and

Δ_{i 2}^{s}

is shown in Figure 6 and we note that

Δ_{i}^{s} (t) = Δ_{i 1}^{s} (t) + Δ_{i 2}^{s} (t)

.

Similar to Section 3, the infection and the recovery rates of the ith person are

λ_{i}

and

μ_{i}

, respectively. In this section, the health care provider applies only one test rate for each person denoted by

w_{i}

. That is, we do not consider separate testing rates of

s_{i}

and

c_{i}

for healthy and infected people as we did previously; instead, here both

s_{i}

and

c_{i}

are equal to

w_{i}

. We first consider the case where

w_{i} > 0

. By following the steps in Section 3, one can show that

E [I_{i 1}] = \frac{1}{w_{i}} + \frac{w_{i} + μ_{i}}{w_{i} λ_{i}}

and

E [I_{i 2}] = \frac{1}{w_{i}} + \frac{w_{i} + λ_{i}}{w_{i} μ_{i}}

which can be obtained by substituting

w_{i}

instead of

s_{i}

and

c_{i}

in (10) and (12), respectively. Next, we denote the total area when

Δ_{i 1}^{s} (t) > 0

as

A_{e, 1} (i, j)

during the jth cycle where

A_{e, 1} (i, j) = \sum_{ℓ = 1}^{K_{1}} \frac{T_{m} {(i, ℓ)}^{2}}{2}

and

K_{1}

has a geometric distribution with success probability

\frac{w_{i}}{μ_{i} + w_{i}}

. Then, we have

E [A_{e, 1} (i)] = \frac{1}{w_{i} (w_{i} + μ_{i})}

. Similarly, we denote the total area when

Δ_{i 2}^{s} (t) > 0

as

A_{e, 2} (i, j)

during the jth cycle where

A_{e, 2} (i, j) = \sum_{ℓ = 1}^{K_{2}} \frac{T_{u} {(i, ℓ)}^{2}}{2}

and

K_{2}

has a geometric distribution with success probability

\frac{w_{i}}{λ_{i} + w_{i}}

. Then, we have

E [A_{e, 2} (i)] = \frac{1}{w_{i} (w_{i} + λ_{i})}

. By using ergodicity, the long-term average differences become

Δ_{i 1}^{s} = \frac{E [A_{e, 1} (i)]}{E [I_{i 1}] + E [I_{i 2}]}

and

Δ_{i 2}^{s} = \frac{E [A_{e, 2} (i)]}{E [I_{i 1}] + E [I_{i 2}]}

which gives

\begin{matrix} Δ_{i}^{s} = Δ_{i 1}^{s} + Δ_{i 2}^{s} = \frac{μ_{i} λ_{i}}{μ_{i} + λ_{i}} \frac{2 w_{i} + μ_{i} + λ_{i}}{(w_{i} + μ_{i} + λ_{i}) (w_{i} + μ_{i}) (w_{i} + λ_{i})}, \end{matrix}

(77)

when

w_{i} > 0

. One can show that

Δ_{i}^{s}

is a decreasing function of

w_{i}

, i.e.,

\frac{\partial Δ_{i}^{s}}{\partial w_{i}} < 0

, and

Δ_{i}^{s}

is a convex function of

w_{i}

, i.e.,

\frac{\partial^{2} Δ_{i}^{s}}{\partial w_{i}^{2}} > 0

.
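
The step from the expected areas and cycle lengths to (77) can be checked numerically. The sketch below uses hypothetical function names and assumed rate values; it recomputes the ratio of expectations and compares it with the closed form, then inspects monotonicity and convexity on a grid of test rates.

```python
import numpy as np

def delta_s_from_cycles(lam, mu, w):
    """Ratio of the expected areas to the expected cycle length, for w > 0."""
    EI1 = 1 / w + (w + mu) / (w * lam)
    EI2 = 1 / w + (w + lam) / (w * mu)
    EA1 = 1 / (w * (w + mu))
    EA2 = 1 / (w * (w + lam))
    return (EA1 + EA2) / (EI1 + EI2)

def delta_s_closed(lam, mu, w):
    """Closed form in (77)."""
    S = mu + lam
    return mu * lam / S * (2 * w + S) / ((w + S) * (w + mu) * (w + lam))

w_grid = np.linspace(0.1, 20.0, 400)
vals = delta_s_closed(0.6, 0.4, w_grid)
print(np.allclose(vals, delta_s_from_cycles(0.6, 0.4, w_grid)))
print(np.all(np.diff(vals) < 0), np.all(np.diff(vals, 2) > 0))  # decreasing, convex on the grid
```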

When

w_{i} = 0

, we have

E [I_{i}] = \frac{μ_{i} + λ_{i}}{μ_{i} λ_{i}}

, i.e.,

E [I_{i}]

is equal to the expected total duration of one healthy period and one sick period of a person. Since the health care provider applies no tests to this person, it estimates this person to be either always sick (

{\hat{x}}_{i} (t) = 1

) or always healthy (

{\hat{x}}_{i} (t) = 0

). When

w_{i} = 0

and

{\hat{x}}_{i} (t) = 0

, then

Δ_{i}^{s} = \frac{1}{μ_{i}} \frac{λ_{i}}{μ_{i} + λ_{i}}

. When

w_{i} = 0

and

{\hat{x}}_{i} (t) = 1

, we have

Δ_{i}^{s} = \frac{1}{λ_{i}} \frac{μ_{i}}{μ_{i} + λ_{i}}

. If

μ_{i} < λ_{i}

, then the health care provider chooses

{\hat{x}}_{i} (t) = 1

, and

{\hat{x}}_{i} (t) = 0

, otherwise. Thus, when

w_{i} = 0

, we have

Δ_{i}^{s} = min \{\frac{1}{μ_{i}} \frac{λ_{i}}{μ_{i} + λ_{i}}, \frac{1}{λ_{i}} \frac{μ_{i}}{μ_{i} + λ_{i}}\}

.

In order to find the optimal test rates, we formulate the following optimization problem

\begin{matrix} min_{{w_{i}}} & \sum_{i = 1}^{n} 𝟙 {w_{i} > 0} \frac{μ_{i} λ_{i}}{μ_{i} + λ_{i}} \frac{2 w_{i} + μ_{i} + λ_{i}}{(w_{i} + μ_{i} + λ_{i}) (w_{i} + μ_{i}) (w_{i} + λ_{i})} \\ + 𝟙 {w_{i} = 0} min \{\frac{1}{μ_{i}} \frac{λ_{i}}{μ_{i} + λ_{i}}, \frac{1}{λ_{i}} \frac{μ_{i}}{μ_{i} + λ_{i}}\} \\ s . t . & \sum_{i = 1}^{n} w_{i} \leq C \\ w_{i} \geq 0, i = 1, \dots, n, \end{matrix}

(78)

where the objective function in (78) is equal to the summation of

Δ_{i}^{s}

in (77) when

w_{i} > 0

and

Δ_{i}^{s}

when

w_{i} = 0

over all people. In order to solve the problem in (78), we follow the same greedy solution approach in Section 5. First, by assuming that

w_{i} > 0

, and thus, the average difference

Δ_{i}^{s}

is given in (77), we solve the following optimization problem

\begin{matrix} min_{{w_{i}}} & \sum_{i = 1}^{n} \frac{μ_{i} λ_{i}}{μ_{i} + λ_{i}} \frac{2 w_{i} + μ_{i} + λ_{i}}{(w_{i} + μ_{i} + λ_{i}) (w_{i} + μ_{i}) (w_{i} + λ_{i})} \\ s . t . & \sum_{i = 1}^{n} w_{i} \leq C \\ w_{i} \geq 0, i = 1, \dots, n . \end{matrix}

(79)

Since the problem in (79) is a convex optimization problem, by defining Lagrangian function and analyzing the KKT and the complementary slackness conditions, we can find the optimal

w_{i}

values. In order to avoid being repetitive, we skip these optimization steps. Then, we compare

Δ_{i}^{s}

in (77) with

w_{i}

values found in (79) with

min {\frac{1}{μ_{i}} \frac{λ_{i}}{μ_{i} + λ_{i}}, \frac{1}{λ_{i}} \frac{μ_{i}}{μ_{i} + λ_{i}}}

. If we can reduce

Δ_{i}^{s}

further, we choose

w_{i} = 0

for the person with the highest improvement. Then, we solve the optimization problem in (79) for the remaining people. We repeat these steps until there is no improvement in

Δ_{i}^{s}

by choosing

w_{i} = 0

.
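
Since the stationarity condition for (79) does not appear to yield a simple closed form for the test rates, one way to carry out the skipped convex step numerically is a nested bisection: for a given multiplier, each rate is found from its own stationarity condition, and the multiplier is then adjusted until the total rate meets C. The sketch below is an assumption-laden illustration of that idea rather than the authors' derivation; all names, bounds such as `w_max`, and the example rates are hypothetical. It covers only the convex subproblem; the greedy removal step is the same as in Section 5.

```python
import numpy as np

def delta_s(lam, mu, w):
    """Objective term from (77)."""
    S = lam + mu
    return mu * lam / S * (2 * w + S) / ((w + S) * (w + mu) * (w + lam))

def neg_slope(lam, mu, w):
    """Minus the derivative of delta_s in w (positive, decreasing in w)."""
    S = lam + mu
    h = 2 / (2 * w + S) - 1 / (w + S) - 1 / (w + mu) - 1 / (w + lam)
    return -delta_s(lam, mu, w) * h

def rates_for_beta(lam, mu, beta, w_max=1e6, iters=100):
    """Per person: w = 0 if neg_slope(0) <= beta, else the root of neg_slope(w) = beta."""
    lo = np.zeros_like(lam)
    hi = np.full_like(lam, w_max)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        above = neg_slope(lam, mu, mid) > beta
        lo = np.where(above, mid, lo)
        hi = np.where(above, hi, mid)
    return lo

def solve_convex_step(lam, mu, C, iters=200):
    """Bisection on the multiplier beta so that the rates sum to (approximately) C."""
    b_lo, b_hi = 1e-15, neg_slope(lam, mu, 0.0).max()
    for _ in range(iters):
        beta = np.sqrt(b_lo * b_hi)
        if rates_for_beta(lam, mu, beta).sum() > C:
            b_lo = beta
        else:
            b_hi = beta
    return rates_for_beta(lam, mu, b_hi)

# Assumed example: n = 10 people with geometric rate profiles, total test rate C = 16.
i = np.arange(1, 11)
lam = 6 * 0.9**i / np.sum(0.9**i)
mu = 4 * 1.1**i / np.sum(1.1**i)
w = solve_convex_step(lam, mu, C=16.0)
print(np.round(w, 3), w.sum())
```

At the returned rates, all people with a positive rate share (approximately) the same marginal reduction in the average difference per unit of test rate, which is the water-filling interpretation of the KKT conditions.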

In the next section, we provide extensive numerical results to evaluate optimal test rates in various settings considered in this paper.

8. Numerical Results

In this section, we provide seven numerical results. For these examples, we take

λ_{i}

as

\begin{matrix} λ_{i} = a r^{i}, i = 1, \dots, n, \end{matrix}

(80)

where

r = 0.9

and a is such that

\sum_{i = 1}^{n} λ_{i} = 6

. Furthermore, we take

μ_{i}

as

\begin{matrix} μ_{i} = b q^{i}, i = 1, \dots, n, \end{matrix}

(81)

where

q = 1.1

and b is such that

\sum_{i = 1}^{n} μ_{i} = 4

. Since

λ_{i}

in (80) decreases with i, people with lower indices become infected more quickly compared to people with higher indices. Since

μ_{i}

in (81) increases with i, people with higher indices recover more quickly compared to people with lower indices. Thus, a person with a low index becomes infected quickly and recovers slowly.
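
For reference, the rate profiles in (80) and (81) can be generated as follows. This is a small sketch with a hypothetical helper name; the normalization constants a and b are obtained implicitly by rescaling to the required totals.

```python
import numpy as np

def geometric_rates(n, ratio, total):
    """Rates proportional to ratio**i for i = 1, ..., n, scaled to sum to `total`."""
    i = np.arange(1, n + 1)
    raw = ratio ** i
    return total * raw / raw.sum()

n = 10
lam = geometric_rates(n, ratio=0.9, total=6.0)  # infection rates, decreasing in i
mu = geometric_rates(n, ratio=1.1, total=4.0)   # recovery rates, increasing in i
print(np.round(lam, 3))
print(np.round(mu, 3))
```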

In the first example, we take the total number of people as

n = 10

, the total test rate as

C = 16

, and

θ = 0.5

. We start with randomly chosen

s_{i}

and

c_{i}

such that

\sum_{i = 1}^{n} (s_{i} + c_{i}) = 16

, and apply the alternating minimization-based method proposed in Section 4. We repeat this process for 30 different initial

(s_{i}, c_{i})

pairs and choose the solution that gives the smallest

Δ

. In Figure 7a, we observe that the first three people are never tested by the health care provider. We note that

s_{i}

, which is the test rate when

{\hat{x}}_{i} (t) = 0

, initially increases with i but then decreases with i. This means that people who become infected rarely are tested less frequently when they are marked as healthy. Similarly, we observe in Figure 7a that

c_{i}

, which is the test rate when

{\hat{x}}_{i} (t) = 1

, monotonically increases with i. In other words, people who recover from the virus quickly are tested more frequently when they are marked as infected.

In Figure 7b, we plot

Δ_{i}

resulting from the solution found from the proposed algorithm,

Δ_{i}

when the health care provider applies tests to everyone in the population uniformly, i.e.,

s_{i} = c_{i} = \frac{C}{2 n}

for all i, and

Δ_{i}

when the health care provider applies no tests, i.e.,

s_{i} = c_{i} = 0

for all i. In the case of no tests, we have

Δ_{i} = min {\frac{θ λ_{i}}{μ_{i} + λ_{i}}, \frac{(1 - θ) μ_{i}}{μ_{i} + λ_{i}}}

. We observe in Figure 7b that the health care provider applies tests on people whose

Δ_{i}

can be reduced the most as opposed to uniform testing where everyone is tested equally. Thus, the first three people who have the smallest

Δ_{i}

are not tested by the health care provider. With the proposed solution, by not testing the first three people,

Δ_{i}

are further reduced for the remaining people compared to uniform testing. For the people who are not tested, the health care provider chooses

{\hat{x}}_{i} (t) = 1

all the time, i.e., marks these people always sick as

\frac{θ λ_{i}}{μ_{i} + λ_{i}} > \frac{(1 - θ) μ_{i}}{μ_{i} + λ_{i}}

. This is expected as these people have high

λ_{i}

and low

μ_{i}

, i.e., they are infected easily and they stay sick for a long time.

In the second example, we use the same set of variables except for the total test rate C. We vary the total test rate C between 5 and 20. We plot

Δ

with respect to C in Figure 8. We observe that

Δ

decreases with C. Thus, with higher total test rates, the health care provider can track the infection status of the population better as expected.

In the third example, we use the same set of variables except for the total number of people n. In addition, we also use uniform infection and healing rates, i.e.,

λ_{i} = \frac{6}{n}

and

μ_{i} = \frac{4}{n}

for all i, for comparison with

λ_{i}

in (80) and

μ_{i}

in (81), while keeping the total infection and healing rates the same, i.e.,

\sum_{i = 1}^{n} λ_{i} = 6

and

\sum_{i = 1}^{n} μ_{i} = 4

, for both cases. We vary the number of people n from 2 to 30. We observe in Figure 9 that when the infection and healing rates are uniform in the population, the health care provider can track the infection status with the same efficiency, even though the population size increases (while keeping the total infection and healing rates fixed). For the case of

λ_{i}

in (80) and

μ_{i}

in (81), when we increase the population size, we increase the number of people who rarely become sick, i.e., people with high i indices, and also people who rarely heal from the disease, i.e., people with small i indices. Thus, it becomes easier for the health care provider to track the infection status of the people. This is why when we use

λ_{i}

in (80) and

μ_{i}

in (81), we observe in Figure 9 that the health care provider can track the infection status of the people better, even though the population size increases.

In the fourth example, we employ the same set of variables as the first example except for the importance factor

θ

. Here, we vary

θ

between

0.2

and

0.7

. We plot

Δ

in (7),

{\bar{Δ}}_{1}

which is

{\bar{Δ}}_{1} = \frac{1}{n} \sum_{i = 1}^{n} Δ_{i 1}

, and

{\bar{Δ}}_{2}

which is

{\bar{Δ}}_{2} = \frac{1}{n} \sum_{i = 1}^{n} Δ_{i 2}

in Figure 10a. Note that

{\bar{Δ}}_{1}

represents the average difference when people are infected, but have not been detected by the health care provider, and

{\bar{Δ}}_{2}

represents the average difference when people have recovered, but the health care provider still marks them as infected. Note that when

θ

is high, we assign importance to minimization of

{\bar{Δ}}_{1}

, i.e., the early detection of people with infection, and when

θ

is low, we give importance to minimization of

{\bar{Δ}}_{2}

, i.e., the early detection of people who recovered from the disease. This is why we observe in Figure 10a that

{\bar{Δ}}_{1}

decreases with

θ

while

{\bar{Δ}}_{2}

increases with

θ

.

We plot the total test rates

\sum_{i = 1}^{n} s_{i}

and

\sum_{i = 1}^{n} c_{i}

in Figure 10b. We observe in Figure 10b that if it is more important to detect the infected people, i.e., if

θ

is high, then the health care provider should apply higher test rates to people who are marked as healthy. In other words,

\sum_{i = 1}^{n} s_{i}

increases with

θ

. Similarly, if it is more important to detect people who recovered from the disease, then the health care provider should apply high test rates to people who are marked as infected. That is,

\sum_{i = 1}^{n} c_{i}

is high when

θ

is low. Therefore, depending on the priorities of the health care provider, a suitable

θ

needs to be chosen.

In the fifth numerical result, we consider the case where there are errors in the test measurements, i.e., the model in Section 5. We take the total test rate as

C = 20

, and vary error rates in the test

p = q = {0.1, 0.2, 0.4}

. In Figure 11a, we provide the test rates

v_{i}

that we found as a result of our greedy policy in Section 5. When the error rates p and q are low, i.e., when

p = q = 0.1

, we see that the health care provider applies tests to everyone in the population and the corresponding

Δ_{i}^{e}

is lower than that achieved with no tests, as shown in Figure 11b. As we increase the error rates, we observe that some people in the population are no longer tested by the health care provider; see Figure 11a for

p = q = {0.2, 0.4}

. In this case, the health care provider applies more tests to the remaining people to combat the test errors. However, although it applies more tests to the remaining people, we observe in Figure 11b that the achieved average difference

Δ_{i}^{e}

becomes higher as error rates increase.

In the sixth numerical result, we consider the case where the infection statuses of the people depend on each other. In other words, when one person is infected, they can infect the other person with rate

λ_{12}

when they are not detected by the health care provider, i.e., the infection model in Section 6. For this example, first, we take

μ = 5

,

λ = 2.5

,

s = c = \frac{C}{4}

and vary

λ_{12} = {2, \dots, 200}

and

C = {20, 40, 60}

. If

λ_{12} = 0

, i.e., if the infection statuses of the two people are independent of each other, then the average fraction of time that person 1 or 2 is sick is equal to

\frac{λ}{λ + μ} = \frac{1}{3}

. As we increase the infection rate

λ_{12}

between persons 1 and 2, we see in Figure 12a that the average time that person 1 is sick increases. However, as we increase the total test rate, the health care provider can detect a sick person more quickly, which explains why the average infected time in Figure 12a is lower when the test rate is higher. Then, we consider

λ_{12} = {5, 10, 15}

and vary the total test rate

C = {2, \dots, 200}

. We plot the average time that both persons 1 and 2 are sick in Figure 12b. As we increase the total test rate, the health care provider detects the infected person more quickly and thus prevents the infection from spreading. As a result, we observe in Figure 12b that the average time that both people are infected decreases with C. Since both people can still be infected independently of each other with rate

λ

, the plots in Figure 12b do not drop to 0.
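As a quick sanity check on the independent case discussed above (λ_{12} = 0), the following minimal Python sketch simulates one person's two-state infection process and compares the simulated fraction of infected time with λ/(λ + μ) = 1/3. The function name `infected_fraction`, the simulation horizon, and the random seed are illustrative assumptions and are not taken from the paper; the dependent infection model of Section 6 is not reproduced here.

```python
import random


def infected_fraction(lam, mu, horizon, seed=0):
    """Simulate a healthy/infected continuous-time Markov chain.

    lam: infection rate (healthy -> infected)
    mu: recovery rate (infected -> healthy)
    Returns the fraction of [0, horizon] spent in the infected state.
    All names and defaults here are hypothetical, for illustration only.
    """
    rng = random.Random(seed)
    t = 0.0
    infected = False
    infected_time = 0.0
    while t < horizon:
        rate = mu if infected else lam       # rate of leaving the current state
        end = min(t + rng.expovariate(rate), horizon)
        if infected:
            infected_time += end - t
        t = end
        infected = not infected              # switch to the other state
    return infected_time / horizon


lam, mu = 2.5, 5.0                           # values used in the sixth numerical result
analytic = lam / (lam + mu)
simulated = infected_fraction(lam, mu, horizon=20000.0)
print(f"analytic = {analytic:.4f}, simulated = {simulated:.4f}")
```

With a long horizon, the simulated value should be close to the analytic one; the gap between the two reflects only simulation noise.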

In the last numerical result, we consider the age of incorrect information-based error metric in Section 7. Here, the estimation error increases with the time that the health care provider does not detect the changes in the infection status of the people. As a result, the average difference expression given by

Δ_{i}^{s}

in (77) is different from

Δ_{i}^{e}

in (45) when

p = q = 0

. For this example, we consider the total test rate

C = 4

and compare the normalized average differences given by

\frac{Δ_{i}^{s}}{\sum_{i = 1}^{n} Δ_{i}^{s}}

, and

\frac{Δ_{i}^{e}}{\sum_{i = 1}^{n} Δ_{i}^{e}}

and the corresponding test rates

w_{i}

and

v_{i}

. In Figure 13b, we see that both the set of people tested by the health care provider and their test rates vary considerably with the error metric. For example, with the error metric

Δ_{i}^{s}

in (77), we apply tests to every third person while the same person is not tested with the error metric

Δ_{i}^{e}

in (45). In Figure 13a, we provide the normalized average difference values. Here, the normalized average error for the tested people exhibits similar values, whereas the normalized difference may vary for the untested people. Thus, we should choose an error metric that matches the priorities of the health care provider, as it greatly affects who is tested and at which test rates.
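To illustrate why the choice of error metric matters, the sketch below compares a binary (mismatch-indicator) time average with a linearly growing, AoII-style penalty on two toy sample paths. It is only a hedged illustration: the function `average_errors`, the unit time step, and the sample paths are assumptions made here, and the code does not implement the exact expressions in (45) or (77).

```python
def average_errors(x, x_hat, dt):
    """Return (binary, linear) time-averaged errors for sampled 0/1 paths.

    binary: fraction of time the estimate differs from the true status.
    linear: time average of a penalty that grows linearly while the
            estimate stays wrong and resets to zero once it is correct.
    The paths are treated as piecewise constant over slots of length dt.
    """
    assert len(x) == len(x_hat)
    mismatch_time = 0.0   # total time with x != x_hat
    penalty_area = 0.0    # integral of the linearly growing penalty
    age = 0.0             # time since the current mismatch began
    for xi, xhi in zip(x, x_hat):
        if xi != xhi:
            mismatch_time += dt
            # area under the line rising from age to age + dt over one slot
            penalty_area += age * dt + 0.5 * dt * dt
            age += dt
        else:
            age = 0.0
    total = len(x) * dt
    return mismatch_time / total, penalty_area / total


estimate = [0] * 8                      # provider believes "healthy" throughout
one_long = [1, 1, 1, 1, 0, 0, 0, 0]     # one undetected infection lasting 4 slots
four_short = [1, 0, 1, 0, 1, 0, 1, 0]   # four undetected infections of 1 slot each

binary_long, linear_long = average_errors(one_long, estimate, 1.0)
binary_short, linear_short = average_errors(four_short, estimate, 1.0)
print(binary_long, linear_long)
print(binary_short, linear_short)
```

Both toy paths are wrong for half of the time, so the binary average is 0.5 in each case. The linear penalty is four times larger for the single long undetected episode (1.0 versus 0.25), which is the kind of difference that can change who is worth testing.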

9. Conclusions and Discussion

We considered the timely tracking of the infection status of individuals in a population. For exponential infection and healing processes with given rates, we determined the rates of exponential testing processes. We considered errors in the test measurements and observed that, in order to combat the test errors, a limited portion of the population may be tested with higher test rates. Then, we studied a dependent infection spread model for two people, where an infected person can spread the virus to the other if the infection has not been detected by the health care provider. Finally, we studied an AoII-based error metric where the error function increases linearly over time as long as the changes in the infection status have not been detected by the health care provider. We observed in numerical results that the test rates depend on the individuals’ infection and recovery rates, the individuals’ last known state of being healthy or infected, as well as the health care provider’s priorities of detecting infected people versus detecting recovered people more quickly.

In the literature, in order to model epidemics, the population is partitioned into groups called compartments. One such example is the SIR model used in [106], with the compartments susceptible (S), infected (I), and recovered (R), which has been further developed in [107] by adding the states hospitalized (H) and death (D). In these epidemic models, the transitions between the compartments are assumed to be Markovian. In [107], with epidemiological data, the delay distributions for infected (I) to hospitalized (H) and infected (I) to death (D) are well approximated by exponential and gamma distributions, respectively. However, due to the lack of available data, the delay distribution for infected (I) to recovered (R) is modeled with a gamma distribution with higher tolerance. In our work, we modeled infection and recovery times, i.e., the delays from recovered (R) to infected (I) and from infected (I) to recovered (R), with exponential distributions. Therefore, more realistic infection tracking models can be developed by considering gamma distributions as observed in [107]. This more realistic model corresponds to the problem of real-time timely tracking of a binary Markov source in a serially connected network. The serially connected network model was studied in [8] with the traditional age of information metric. We note that considering the same networking model with the AoII-based error metric to track information dissemination of a binary Markov source represents a promising research direction and has direct applications to the real-time tracking of epidemic spread models. One can also study the extension of the dependent infection spread model in Section 6 to

n > 2

people as a future research direction.
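For reference, the two delay distributions mentioned above can be written side by side. The block below is a brief reminder of standard facts, not a result of this paper; the symbols k (shape) and β (rate) are notation introduced here for illustration. One way to read the correspondence with a serially connected network, assuming an integer shape parameter, is that a gamma delay is then a sum of exponential stages traversed one after another.

```latex
% Exponential delay with rate \mu (the model used in this work):
f_{\mathrm{exp}}(t) = \mu e^{-\mu t}, \qquad t \ge 0.

% Gamma delay with shape k > 0 and rate \beta > 0:
f_{\Gamma}(t) = \frac{\beta^{k} t^{k-1} e^{-\beta t}}{\Gamma(k)}, \qquad t \ge 0.

% For integer k (the Erlang case), the delay is a sum of k serial exponential stages:
T = T_{1} + \dots + T_{k}, \qquad T_{j} \ \text{i.i.d. exponential with rate } \beta,
\qquad \mathbb{E}[T] = \frac{k}{\beta}.

% Setting k = 1 and \beta = \mu recovers the exponential model.
```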

Another interesting research direction could be to consider different kinds of tests with different false-positive and false-negative rates. For this problem, instead of having a total test rate capacity C, we may consider a total test budget K. Assuming that each test type bears a different cost, the goal might be to identify how many tests of each type the health care provider should obtain. Here, one can study a trade-off between applying fewer tests that have a small probability of error and applying more tests that have a high probability of error. Moreover, one can consider a scenario where the health care provider may prefer to apply different test types to individuals depending on their infection and recovery rates.

Author Contributions

Conceptualization, M.B. and S.U.; methodology, M.B. and S.U.; software, M.B.; validation, M.B. and S.U.; formal analysis, M.B. and S.U.; investigation, M.B. and S.U.; resources, M.B. and S.U.; data curation, M.B. and S.U.; writing—original draft preparation, M.B. and S.U.; writing—review and editing, M.B. and S.U.; visualization, M.B. and S.U.; supervision, S.U.; project administration, S.U.; funding acquisition, S.U. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by NSF Grants CCF 17-13977 and ECCS 18-07348.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Kaul, S.K.; Yates, R.D.; Gruteser, M. Real-time status: How often should one update? In Proceedings of the 2012 Proceedings IEEE INFOCOM, Orlando, FL, USA, 25–30 March 2012.
Kadota, I.; Sinha, A.; Uysal-Biyikoglu, E.; Singh, R.; Modiano, E. Scheduling policies for minimizing age of information in broadcast wireless networks. IEEE/ACM Trans. Netw. 2018, 26, 2637–2650.
Kam, C.; Kompella, S.; Nguyen, G.D.; Wieselthier, J.E.; Ephremides, A. Age of information with a packet deadline. In Proceedings of the 2016 IEEE International Symposium on Information Theory (ISIT), Barcelona, Spain, 10–15 July 2016.
Sun, Y.; Uysal-Biyikoglu, E.; Yates, R.D.; Koksal, C.E.; Shroff, N.B. Update or wait: How to keep your data fresh. IEEE Trans. Inf. Theory 2017, 63, 7492–7508.
Najm, E.; Telatar, E. Status updates in a multi-stream M/G/1/1 preemptive queue. In Proceedings of the IEEE Infocom 2018-IEEE Conference On Computer Communications Workshops (Infocom Wkshps), Honolulu, HI, USA, 15–19 April 2018.
Soysal, A.; Ulukus, S. Age of information in G/G/1/1 systems: Age expressions, bounds, special cases, and optimization. IEEE Trans. Inf. Theory 2021, 67, 7477–7489.
Buyukates, B.; Ulukus, S. Age of information with Gilbert-Elliot servers and samplers. arXiv 2020, arXiv:2002.05711.
Yates, R.D. The age of information in networks: Moments, distributions, and sampling. IEEE Trans. Inf. Theory 2020, 66, 5712–5728.
Talak, R.; Karaman, S.; Modiano, E. Minimizing age-of-information in multi-hop wireless networks. In Proceedings of the 2017 55th Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, IL, USA, 3–6 October 2017.
Tripathi, V.; Moharir, S. Age of information in multi-source systems. In Proceedings of the GLOBECOM 2017-2017 IEEE Global Communications Conference, Centre, Singapore, 4–8 December 2017.
Bedewy, A.M.; Sun, Y.; Shroff, N.B. The age of information in multihop networks. IEEE/ACM Trans. Netw. 2019, 27, 1248–1257.
Zhong, J.; Yates, R.D.; Soljanin, E. Multicast with prioritized delivery: How fresh is your data? In Proceedings of the 2018 IEEE 19th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Kalamata, Greece, 25–28 June 2018.
Buyukates, B.; Soysal, A.; Ulukus, S. Age of information in two-hop multicast networks. In Proceedings of the 2018 52nd Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA, 28–31 October 2018.
Buyukates, B.; Soysal, A.; Ulukus, S. Age of information in multihop multicast networks. J. Commun. Netw. 2019, 21, 256–267.
Buyukates, B.; Soysal, A.; Ulukus, S. Age of information in multicast networks with multiple update streams. In Proceedings of the 2019 53rd Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA, 3–6 November 2019.
Krishnan, K.S.A.; Sharma, V. Minimizing age of information in a multihop wireless network. In Proceedings of the ICC 2020-2020 IEEE International Conference on Communications, Dublin, Ireland, 7–11 June 2020.
Farazi, S.; Klein, A.G.; Brown, D.R., III. Fundamental bounds on the age of information in multi-hop global status update networks. J. Commun. Netw. 2019, 21, 268–279.
Ioannidis, S.; Chaintreau, A.; Massoulie, L. Optimal and scalable distribution of content updates over a mobile social network. In Proceedings of the IEEE Infocom, Rio De Janeiro, Brazil, 24 April 2009.
Wang, M.; Chen, W.; Ephremides, A. Reconstruction of counting process in real-time: The freshness of information through queues. In Proceedings of the ICC 2019-2019 IEEE International Conference on Communications (ICC), Shanghai, China, 20–24 May 2019.
Sun, Y.; Polyanskiy, Y.; Uysal-Biyikoglu, E. Remote estimation of the Wiener process over a channel with random delay. In Proceedings of the 2017 IEEE International Symposium on Information Theory (ISIT), Aachen, Germany, 25–30 June 2017.
Sun, Y.; Cyr, B. Information aging through queues: A mutual information perspective. In Proceedings of the 2018 IEEE 19th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Kalamata, Greece, 25–28 June 2018.
Chakravorty, J.; Mahajan, A. Remote estimation over a packet-drop channel with Markovian state. IEEE Trans. Autom. Control 2020, 65, 2016–2031.
Kam, C.; Kompella, S.; Ephremides, A. Age of incorrect information for remote estimation of a binary Markov source. In Proceedings of the IEEE INFOCOM 2020-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Toronto, ON, Canada, 6–9 July 2020.
Arafa, A.; Banawan, K.; Seddik, K.G.; Poor, H.V. Sample, quantize and encode: Timely estimation over noisy channels. IEEE Trans. Commun. 2021, 69, 6485–6499.
Bastopcu, M.; Ulukus, S. Who should Google Scholar update more often? In Proceedings of the IEEE INFOCOM 2020-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Toronto, ON, Canada, 6–9 July 2020.
Bacinoglu, B.T.; Sun, Y.; Uysal-Biyikoglu, E.; Mutlu, V. Achieving the age-energy trade-off with a finite-battery energy harvesting source. In Proceedings of the 2018 IEEE International Symposium on Information Theory (ISIT), Vail, CO, USA, 17–22 June 2018.
Baknina, A.; Ozel, O.; Yang, J.; Ulukus, S.; Yener, A. Sending information through status updates. In Proceedings of the 2018 IEEE International Symposium on Information Theory (ISIT), Vail, CO, USA, 17–22 June 2018.
Baknina, A.; Ulukus, S. Coded status updates in an energy harvesting erasure channel. In Proceedings of the 2018 52nd Annual Conference on Information Sciences and Systems (CISS), Princeton, NJ, USA, 21–23 March 2018.
Wu, X.; Yang, J.; Wu, J. Optimal status update for age of information minimization with an energy harvesting source. IEEE Trans. Green Commun. Netw. 2018, 2, 193–204.
Feng, S.; Yang, J. Optimal status updating for an energy harvesting sensor with a noisy channel. In Proceedings of the IEEE INFOCOM 2018-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Honolulu, HI, USA, 15–19 April 2018.
Feng, S.; Yang, J. Minimizing age of information for an energy harvesting source with updating failures. In Proceedings of the 2018 IEEE International Symposium on Information Theory (ISIT), Vail, CO, USA, 17–22 June 2018.
Arafa, A.; Yang, J.; Ulukus, S.; Poor, H.V. Age-minimal online policies for energy harvesting sensors with incremental battery recharges. In Proceedings of the 2018 Information Theory and Applications Workshop (ITA), San Diego, CA, USA, 11–16 February 2018.
Arafa, A.; Yang, J.; Ulukus, S. Age-minimal online policies for energy harvesting sensors with random battery recharges. In Proceedings of the 2018 IEEE international conference on communications (ICC), Kansas City, MO, USA, 20–24 May 2018.
Arafa, A.; Yang, J.; Ulukus, S.; Poor, H.V. Age-minimal transmission for energy harvesting sensors with finite batteries: Online policies. IEEE Trans. Inf. Theory 2020, 66, 534–556.
Arafa, A.; Yang, J.; Ulukus, S.; Poor, H.V. Online timely status updates with erasures for energy harvesting sensors. In Proceedings of the 2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, IL, USA, 2–5 October 2018.
Arafa, A.; Yang, J.; Ulukus, S.; Poor, H.V. Using erasure feedback for online timely updating with an energy harvesting sensor. In Proceedings of the 2019 IEEE International Symposium on Information Theory (ISIT), Paris, France, 7–12 July 2019.
Arafa, A.; Yang, J.; Ulukus, S.; Poor, H.V. Timely status updating over erasure channels using an energy harvesting sensor: Single and multiple sources. IEEE Trans. Green Commun. Netw. 2022, 6, 6–19.
Farazi, S.; Klein, A.G.; Brown, D.R., III. Average age of information for status update systems with an energy harvesting server. In Proceedings of the IEEE INFOCOM 2018-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Honolulu, HI, USA, 15–19 April 2018.
Leng, S.; Yener, A. Age of information minimization for an energy harvesting cognitive radio. IEEE Trans. Cogn. Commun. Netw. 2019, 5, 427–439.
Chen, Z.; Pappas, N.; Bjornson, E.; Larsson, E.G. Age of information in a multiple access channel with heterogeneous traffic and an energy harvesting node. In Proceedings of the IEEE INFOCOM 2019-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Paris, France, 29 April–2 May 2019.
Bhat, R.V.; Vaze, R.; Motani, M. Throughput maximization with an average age of information constraint in fading channels. IEEE Trans. Wirel. Commun. 2021, 20, 481–494.
Ostman, J.; Devassy, R.; Durisi, G.; Uysal, E. Peak-age violation guarantees for the transmission of short packets over fading channels. In Proceedings of the IEEE INFOCOM 2019-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Paris, France, 29 April–2 May 2019.
Bastopcu, M.; Ulukus, S. Age of information with soft updates. In Proceedings of the 2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, IL, USA, 2–5 October 2018.
Bastopcu, M.; Ulukus, S. Minimizing age of information with soft updates. J. Commun. Netw. 2019, 21, 233–243.
Buyukates, B.; Soysal, A.; Ulukus, S. Age of information scaling in large networks. In Proceedings of the 2019 IEEE Global Communications Conference (GLOBECOM), Waikoloa, HI, USA, 9–13 December 2019.
Buyukates, B.; Soysal, A.; Ulukus, S. Age of information scaling in large networks with hierarchical cooperation. In Proceedings of the 2019 IEEE Global Communications Conference (GLOBECOM), Waikoloa, HI, USA, 9–13 December 2019.
Buyukates, B.; Soysal, A.; Ulukus, S. Scaling laws for age of information in wireless networks. IEEE Trans. Wirel. Commun. 2021, 20, 2413–2427.
Zhong, J.; Yates, R.D.; Soljanin, E. Minimizing content staleness in dynamo-style replicated storage systems. In Proceedings of the IEEE INFOCOM 2018-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Honolulu, HI, USA, 15–19 April 2018.
Rajaraman, N.; Vaze, R.; Reddy, G. Not just age but age and quality of information. IEEE J. Sel. Areas Commun. 2021, 39, 1325–1338.
Liu, Z.; Ji, B. Towards the tradeoff between service performance and information freshness. In Proceedings of the ICC 2019-2019 IEEE International Conference on Communications (ICC), Shanghai, China, 20–24 May 2019.
Maatouk, A.; Assaad, M.; Ephremides, A. The age of incorrect information: An enabler of semantics-empowered communication. arXiv 2020, arXiv:2012.13214.
Uysal, E.; Kaya, O.; Ephremides, A.; Gross, J.; Codreanu, M.; Popovski, P.; Johansson, K.H. Semantic communications in networked systems. arXiv 2021, arXiv:2103.05391.
Ayan, O.; Vilgelm, M.; Klügel, M.; Hirche, S.; Kellerer, W. Age-of-information vs. value-of-information scheduling for cellular networked control systems. In Proceedings of the 10th ACM/IEEE International Conference on Cyber-Physical Systems, Montreal, QC, Canada, 16–18 April 2019.
Bastopcu, M.; Ulukus, S. Timely group updating. In Proceedings of the 2021 55th Annual Conference on Information Sciences and Systems (CISS), Baltimore, MD, USA, 24–26 March 2021.
Banerjee, S.; Bhattacharjee, R.; Sinha, A. Fundamental limits of age-of-information in stationary and non-stationary environments. In Proceedings of the 2020 IEEE International Symposium on Information Theory (ISIT), Los Angeles, CA, USA, 21–26 June 2020.
Zhong, J.; Yates, R.D. Timeliness in lossless block coding. In Proceedings of the 2016 Data Compression Conference (DCC), Snowbird, UT, USA, 29 March–1 April 2016.
Zhong, J.; Yates, R.D.; Soljanin, E. Timely lossless source coding for randomly arriving symbols. In Proceedings of the 2018 IEEE Information Theory Workshop (ITW), Guangzhou, China, 25–29 November 2018.
Mayekar, P.; Parag, P.; Tyagi, H. Optimal lossless source codes for timely updates. In Proceedings of the 2018 IEEE International Symposium on Information Theory (ISIT), Vail, CO, USA, 17–22 June 2018.
Mayekar, P.; Parag, P.; Tyagi, H. Optimal source codes for timely updates. IEEE Trans. Inf. Theory 2020, 66, 3714–3731.
Bastopcu, M.; Buyukates, B.; Ulukus, S. Optimal selective encoding for timely updates. In Proceedings of the 2020 54th Annual Conference on Information Sciences and Systems (CISS), Princeton, NJ, USA, 18–20 March 2020.
Buyukates, B.; Bastopcu, M.; Ulukus, S. Optimal selective encoding for timely updates with empty symbol. In Proceedings of the 2020 IEEE International Symposium on Information Theory (ISIT), Los Angeles, CA, USA, 21–26 June 2020.
Bastopcu, M.; Buyukates, B.; Ulukus, S. Selective encoding policies for maximizing information freshness. IEEE Trans. Commun. 2021, 69, 5714–5726.
Ramirez, D.; Erkip, E.; Poor, H.V. Age of information with finite horizon and partial updates. In Proceedings of the ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 4–8 May 2020.
Arafa, A.; Banawan, K.; Seddik, K.G.; Poor, H.V. On timely channel coding with hybrid ARQ. In Proceedings of the 2019 IEEE Global Communications Conference (GLOBECOM), Big Island, HI, USA, 9–13 December 2019.
Arafa, A.; Wesel, R.D. Timely transmissions using optimized variable length coding. In Proceedings of the IEEE INFOCOM 2021-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Vancouver, BC, Canada, 10–13 May 2021.
Bastopcu, M.; Ulukus, S. Partial updates: Losing information for freshness. In Proceedings of the 2020 IEEE International Symposium on Information Theory (ISIT), Los Angeles, CA, USA, 21–26 June 2020.
Abd-Elmagid, M.A.; Dhillon, H.S. Average peak age-of-information minimization in UAV-assisted IoT networks. IEEE Trans. Veh. Technol. 2019, 68, 2003–2008.
Liu, J.; Wang, X.; Dai, H. Age-optimal trajectory planning for UAV-assisted data collection. In Proceedings of the IEEE INFOCOM 2018-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Honolulu, HI, USA, 15–19 April 2018.
Abd-Elmagid, M.A.; Pappas, N.; Dhillon, H.S. On the role of age of information in the internet of things. IEEE Commun. Mag. 2019, 57, 72–77.
Alabbasi, A.; Aggarwal, V. Joint information freshness and completion time optimization for vehicular networks. IEEE Trans. Serv. Comput. 2020, 15, 1–14.
Gao, W.; Cao, G.; Srivatsa, M.; Iyengar, A. Distributed maintenance of cache freshness in opportunistic mobile networks. In Proceedings of the 2012 IEEE 32nd International Conference on Distributed Computing Systems, Macau, China, 18–21 June 2012.
Yates, R.D.; Ciblat, P.; Yener, A.; Wigger, M. Age-optimal constrained cache updating. In Proceedings of the 2017 IEEE International Symposium on Information Theory (ISIT), Aachen, Germany, 25–30 June 2017.
Kam, C.; Kompella, S.; Nguyen, G.D.; Wieselthier, J.E.; Ephremides, A. Information freshness and popularity in mobile caching. In Proceedings of the 2017 IEEE International Symposium on Information Theory (ISIT), Aachen, Germany, 25–30 June 2017.
Zhang, S.; Li, J.; Luo, H.; Gao, J.; Zhao, L.; Shen, X.S. Towards fresh and low-latency content delivery in vehicular networks: An edge caching aspect. In Proceedings of the 2018 10th International Conference on Wireless Communications and Signal Processing (WCSP), Hangzhou, China, 18–20 October 2018.
Tang, H.; Ciblat, P.; Wang, J.; Wigger, M.; Yates, R. Age of information aware cache updating with file- and age-dependent update durations. In Proceedings of the 2020 18th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOPT), Volos, Greece, 15–19 June 2020.
Zhong, J.; Yates, R.D.; Soljanin, E. Two freshness metrics for local cache refresh. In Proceedings of the 2018 IEEE International Symposium on Information Theory (ISIT), Vail, CO, USA, 17–22 June 2018.
Yang, L.; Zhong, Y.; Zheng, F.; Jin, S. Edge caching with real-time guarantees. arXiv 2019, arXiv:1912.11847.
Bastopcu, M.; Ulukus, S. Maximizing information freshness in caching systems with limited cache storage capacity. In Proceedings of the 2020 54th Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA, 1–5 November 2020.
Bastopcu, M.; Ulukus, S. Cache freshness in information updating systems. In Proceedings of the 2021 55th Annual Conference on Information Sciences and Systems (CISS), Baltimore, MD, USA, 24–26 March 2021.
Bastopcu, M.; Ulukus, S. Information freshness in cache updating systems. IEEE Trans. Wirel. Commun. 2021, 20, 1861–1874.
Kaswan, P.; Bastopcu, M.; Ulukus, S. Freshness based cache updating in parallel relay networks. In Proceedings of the 2021 IEEE International Symposium on Information Theory (ISIT), Melbourne, Australia, 12–20 July 2021.
Gu, Y.; Wang, Q.; Chen, H.; Li, Y.; Vucetic, B. Optimizing information freshness in two-hop status update systems under a resource constraint. IEEE J. Sel. Areas Commun. 2021, 39, 1380–1392.
Kuang, Q.; Gong, J.; Chen, X.; Ma, X. Age-of-information for computation-intensive messages in mobile edge computing. arXiv 2019, arXiv:1901.01854.
Gong, J.; Kuang, Q.; Chen, X.; Ma, X. Reducing age-of-information for computation-intensive messages via packet replacement. In Proceedings of the 2019 11th International Conference on Wireless Communications and Signal Processing (WCSP), Xi’an, China, 23–25 October 2019.
Zou, P.; Ozel, O.; Subramaniam, S. Trading off computation with transmission in status update systems. In Proceedings of the 2019 IEEE 30th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), Istanbul, Turkey, 8–11 September 2019.
Bastopcu, M.; Ulukus, S. Age of information for updates with distortion. In Proceedings of the 2019 IEEE Information Theory Workshop (ITW), Gotland, Sweden, 25–28 August 2019.
Bastopcu, M.; Ulukus, S. Age of information for updates with distortion: Constant and age-dependent distortion constraints. IEEE/ACM Trans. Netw. 2021, 29, 2425–2438.
Buyukates, B.; Ulukus, S. Timely updates in distributed computation systems with stragglers. In Proceedings of the 2020 54th Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA, 1–5 November 2020.
Buyukates, B.; Ulukus, S. Timely distributed computation with stragglers. IEEE Trans. Commun. 2020, 68, 5273–5282.
Zou, P.; Ozel, O.; Subramaniam, S. Optimizing information freshness through computation–transmission tradeoff and queue management in edge computing. IEEE/ACM Trans. Netw. 2021, 29, 949–963.
Buyukates, B.; Ulukus, S. Timely communication in federated learning. In Proceedings of the IEEE INFOCOM 2021-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Vancouver, BC, Canada, 10–13 May 2021.
Ozfatura, E.; Buyukates, B.; Gündüz, D.; Ulukus, S. Age-based coded computation for bias reduction in distributed learning. In Proceedings of the GLOBECOM 2020-2020 IEEE Global Communications Conference, Taipei, Taiwan, 7–11 December 2020.
Ceran, E.T.; Gündüz, D.; György, A. A reinforcement learning approach to age of information in multi-user networks with HARQ. IEEE J. Sel. Areas Commun. 2021, 39, 1412–1426.
Yates, R.D. The age of gossip in networks. In Proceedings of the 2021 IEEE International Symposium on Information Theory (ISIT), Melbourne, Australia, 12–20 July 2021.
Buyukates, B.; Bastopcu, M.; Ulukus, S. Age of gossip in networks with community structure. In Proceedings of the 2021 IEEE 22nd International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Lucca, Italy, 27–30 September 2021.
Bastopcu, M.; Buyukates, B.; Ulukus, S. Gossiping with binary freshness metric. In Proceedings of the 2021 IEEE Globecom Workshops (GC Wkshps), Madrid, Spain, 7–11 December 2021.
Kaswan, P.; Ulukus, S. Timely gossiping with file slicing and network coding. arXiv 2022, arXiv:2202.00649.
Kosta, A.; Pappas, N.; Angelakis, V. Age of information: A new concept, metric, and tool. Found. Trends Netw. 2017, 12, 162–259.
Sun, Y.; Kadota, I.; Talak, R.; Modiano, E. Age of information: A new metric for information freshness. Synth. Lect. Commun. Netw. 2019, 12, 1–224.
Yates, R.D.; Sun, Y.; Brown, D.R., III; Kaul, S.K.; Modiano, E.; Ulukus, S. Age of information: An introduction and survey. IEEE J. Sel. Areas Commun. 2021, 39, 1183–1210.
Yun, J.; Joo, C.; Eryilmaz, A. Optimal real-time monitoring of an information source under communication costs. In Proceedings of the 2018 IEEE Conference on Decision and Control (CDC), Miami Beach, FL, USA, 17–19 December 2018.
Maatouk, A.; Kriouile, S.; Assaad, M.; Ephremides, A. The age of incorrect information: A new performance metric for status updates. IEEE/ACM Trans. Netw. 2020, 28, 2215–2228.
Yates, R.D.; Goodman, D.J. Probability and Stochastic Processes; Wiley: Hoboken, NJ, USA, 2014.
Boyd, S.P.; Vandenberghe, L. Convex Optimization; Cambridge University Press: Cambridge, UK, 2004.
Bertsekas, D.P.; Tsitsiklis, J.N. Introduction to Probability; Athena Scientific: Belmont, MA, USA, 2008.
Chen, Y.C.; Lu, P.E.; Chang, C.S.; Liu, T.H. A time-dependent SIR model for COVID-19 with undetectable infected persons. IEEE Trans. Netw. Sci. Eng. 2020, 7, 3279–3294.
Olmez, S.Y.; Mori, J.; Miehling, E.; Başar, T.; Smith, R.L.; West, M.; Mehta, P.G. A data-informed approach for analysis, validation, and identification of COVID-19 models. In Proceedings of the 2021 American Control Conference (ACC), New Orleans, LA, USA, 25–28 May 2021.

Figure 1. System model. There are n people whose infection statuses are given by

x_{i} (t)

. The health care provider applies tests to these people. Based on the test results, estimates of the infection status

{\hat{x}}_{i} (t)

are generated. Infected people are shown in red and healthy people are shown in green.

Figure 2. (a) A sample evolution of

x_{i} (t)

and

{\hat{x}}_{i} (t)

, and (b) the corresponding

Δ_{i} (t)

in (5). Green areas correspond to the error caused by

Δ_{i 1} (t)

in (3). Orange areas correspond to the error caused by

Δ_{i 2} (t)

in (4).

Figure 3. A sample evolution of (a) $Δ_{i 1} (t)$, and (b) $Δ_{i 2} (t)$ in a typical cycle.

Figure 4. Transition graphs of the states $(x_{i} (t), {\hat{x}}_{i} (t))$ (a) when there is no error in the tests, and (b) when there are errors in the tests.

Figure 5. The infection rates of two people where the individual infection rate is equal to $λ$. When the infection has not been detected, these two people can infect each other with rate $λ_{12}$.

Figure 6. A sample evolution of (a) $Δ_{i 1}^{s} (t)$, and (b) $Δ_{i 2}^{s} (t)$ in a typical update cycle.

Figure 7. (a) Test rates $s_{i}$ and $c_{i}$, (b) corresponding average difference $Δ_{i}$.

Figure 8. The average difference $Δ$ with respect to total test rate C.

Figure 9. The average difference $Δ$ with respect to number of people n. We use uniform infection and healing rates, i.e., $λ_{i} = \frac{6}{n}$ and $μ_{i} = \frac{4}{n}$ for all i, and also $λ_{i}$ in (80) and $μ_{i}$ in (81) with $\sum_{i = 1}^{n} λ_{i} = 6$ and $\sum_{i = 1}^{n} μ_{i} = 4$.

Figure 10. (a) $Δ$ in (7), ${\bar{Δ}}_{1}$ which is $\frac{1}{n} \sum_{i = 1}^{n} Δ_{i 1}$, and ${\bar{Δ}}_{2}$ which is $\frac{1}{n} \sum_{i = 1}^{n} Δ_{i 2}$, (b) corresponding total test rates $\sum_{i = 1}^{n} s_{i}$ and $\sum_{i = 1}^{n} c_{i}$.

Figure 11. (a) Test rates $v_{i}$, (b) corresponding average difference $Δ_{i}^{e}$ when there is error in the tests.

Figure 12. (a) The percentage of the time that person 1 stays infected as we increase $λ_{12}$, (b) the percentage of the time that both persons 1 and 2 stay infected as we increase the total test rate C.

Figure 13. (a) The normalized average differences $\frac{Δ_{i}^{s}}{\sum_{i = 1}^{n} Δ_{i}^{s}}$ and $\frac{Δ_{i}^{e}}{\sum_{i = 1}^{n} Δ_{i}^{e}}$, and (b) the corresponding test rates $w_{i}$ and $v_{i}$.

Table 1. List of variables used in this work.

Variables	Definition of the Variables
Sections 2, 3 and 4
n	number of people in the population
$x_{i} (t)$	infection status of the ith person at time t
${\hat{x}}_{i} (t)$	estimation of $x_{i} (t)$ at the health care provider
$λ_{i}$, $μ_{i}$	infection and recovery rates for the ith person
$c_{i}$, $s_{i}$	test rates applied to the ith person when ${\hat{x}}_{i} (t) = 1$, and ${\hat{x}}_{i} (t) = 0$
$Δ_{i} (t)$	total estimation error for the ith person at time t
$θ$	importance factor in $[0, 1]$
$Δ_{i}$	the long-time weighted average for the ith person
C	total test rate constraint
Section 5
$Δ_{i}^{e}$	the long-time average difference for the ith person with erroneous test measurements
q	false-negative testing probability with $0 \leq q < \frac{1}{2}$
p	false-positive testing probability with $0 \leq p < \frac{1}{2}$
$v_{i}$	test rate applied to the ith person with erroneous test measurements
Section 6
$λ$, $μ$	individual infection and recovery rate of a person
$λ_{12}$	the rate of spreading the virus from an undetected infected person to a healthy person
c, s	test rates applied to people when ${\hat{x}}_{i} (t) = 1$, and ${\hat{x}}_{i} (t) = 0$
$Δ_{i}^{d}$	the long-time average difference for the ith person with dependent infection rates
Section 7
$w_{i}$	test rate applied to the ith person for AoII-based error metric
$Δ_{i}^{s}$	the long-time average difference for the ith person with AoII-based error metric

Table 2. The probability distribution for successful and false test measurements.

$x_{i} (t)$ ∖ ${\hat{x}}_{i} (t)$	0	1
0	$1 - p$	p
1	q	$1 - q$
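
As a rough illustration of the error model in Table 2, the following Python sketch draws a noisy test outcome given the true status and checks the empirical frequencies against p and q. The function names `noisy_test` and `empirical_table`, the number of trials, and the seed are illustrative assumptions and are not part of the paper.

```python
import random


def noisy_test(x, p, q, rng=random):
    """Return a test outcome for true status x (0 = healthy, 1 = infected).

    p is the false-positive probability and q is the false-negative
    probability, following the rows of Table 2.
    """
    if x == 0:
        # Healthy person: the test reads 1 with probability p.
        return 1 if rng.random() < p else 0
    # Infected person: the test reads 0 with probability q.
    return 0 if rng.random() < q else 1


def empirical_table(p, q, trials=100_000, seed=0):
    """Estimate the entries of Table 2 by repeated sampling (illustrative)."""
    rng = random.Random(seed)
    table = {}
    for x in (0, 1):
        positives = sum(noisy_test(x, p, q, rng) for _ in range(trials))
        frac_pos = positives / trials
        # Each row is (fraction of outcomes 0, fraction of outcomes 1).
        table[x] = (1 - frac_pos, frac_pos)
    return table


if __name__ == "__main__":
    for x, row in empirical_table(p=0.1, q=0.2).items():
        print(x, row)
```

With p = 0.1 and q = 0.2, the row for status 0 should come out close to (0.9, 0.1) and the row for status 1 close to (0.2, 0.8), up to sampling noise.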

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

