Optimal Power Control in Wireless Powered Sensor Networks: A Dynamic Game-Based Approach

Xu, Haitao; Guo, Chao; Zhang, Long

doi:10.3390/s17030547

by Haitao Xu ¹,*, Chao Guo ² and Long Zhang ³

¹ School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China

² Communication Engineering Department, Beijing Electronics Science and Technology Institute, Beijing 100070, China

³ School of Information and Electrical Engineering, Hebei University of Engineering, Handan 056038, China

* Author to whom correspondence should be addressed.

Sensors 2017, 17(3), 547; https://doi.org/10.3390/s17030547

Submission received: 28 December 2016 / Revised: 16 February 2017 / Accepted: 8 March 2017 / Published: 9 March 2017

(This article belongs to the Special Issue Wireless Rechargeable Sensor Networks)


Abstract:

In wireless powered sensor networks (WPSN), uplink transmit power control is essential for balancing throughput performance and scheduling energy. Each sensor should operate at the transmit power level that maximizes its revenue. In this paper, we present a dynamic game-based algorithm for optimal power control in WPSN. The main idea is to use a non-cooperative differential game to control the uplink transmit power of wireless sensors in WPSN, to extend their working hours and to meet quality of service (QoS) requirements. The Nash equilibrium solutions are obtained through Bellman dynamic programming, and an uplink power control algorithm is proposed in a distributed manner. Through numerical simulations, we demonstrate that our algorithm can obtain optimal power control and reach convergence for an infinite horizon.

Keywords:

differential game; power control; WPSN

1. Introduction

Conventional wireless sensor networks (WSN) are typically disposable systems: because of random deployment, the sensors cannot be recharged, and the entire network fails when the batteries of the wireless sensors run out of energy [1]. As the use of a WSN is strictly limited by the life span of the sensors’ batteries, energy consumption has become one of the biggest constraints on the wireless sensor node and poses many challenges to WSNs [2]. Energy has become one of the scarcest resources in WSN [3]. Wireless power transfer (WPT) and other energy harvesting technologies provide solutions in such situations. Benefiting from microwave wireless power transfer, wireless powered sensor networks (WPSN) can reduce the operational cost, provide a stable energy supply, and achieve much longer operating lifetimes [4].

WPSN has been widely researched in the recent literature [5,6,7,8]. Wireless sensors in WPSN can be powered through WPT in the downlink which is radio frequency enabled, and can use the harvesting energy for information transmission in the uplink. Compared to other energy harvesting technologies, WPT can achieve long-distance energy transfer and constant energy supplementation [6]. However, in WPSN, the distance between the wireless sensors and energy nodes (ENs) may cause performance unfairness, because of the near-far effect. When wireless sensors are located far away from the ENs, they will receive less energy, because of power transmission attenuation. But they may need more energy for the uplink information transmission. Thus, communication and energy scheduling should be considered on this occasion. For the downlink energy transfer, energy beamforming technology is used for sensors which are located far away from energy sources, so that they can receive stronger energy beams [7]. In this case, it is essential to research the uplink transmit power control problem, to balance throughput and energy performance among different sensors [8].

Many studies have addressed power control problems in WPSN. In [9], the relay nodes worked as both energy beacons and information relays, so that energy and information could be transferred through them. The protocol was divided into three phases. By jointly optimizing the duration and power allocation for transmission, the network throughput was maximized. In [10], the authors studied energy harvesting-based wireless networks and proposed an iterative solution for sub-channel and power allocation. They defined a logarithmic utility function that considers both the aggregated rate and the harvested energy. The sub-channel and power allocation were obtained through biconvex optimization, and the convergence of the proposed algorithm was also proved.

Game theory addresses the resource allocation problem of a system with conflicting components, and it has recently received increasing interest in the context of wireless sensor networks [11,12]. It can be used to solve optimal energy management problems in wireless powered sensor networks while meeting the perceived quality of service (QoS) performance [13]. In [13], the wireless energy request policy was analyzed based on a constrained stochastic game model. A constrained Nash equilibrium solution was obtained that meets the QoS requirements while minimizing the energy request cost. In [14], a Nash bargaining-based optimal power control approach was proposed for WPSN to balance the information transmission efficiency. The whole game process was simplified into three parts, and the power control and time allocation algorithms were proved to be quasiconcave.

Nevertheless, to the best of our knowledge, the works above optimize the network performance but consider neither the dynamic characteristics of a sensor’s energy, which are exponential variables, nor the optimization over a given time period. The differential game, first proposed by Isaacs [15], is one of the most practical and complex branches of game theory and can be used to solve a class of resource allocation problems in which the evolution of the state is described by a differential equation and the players act throughout a time interval [16]. In this paper, we propose a new method for uplink transmit power control, based on a differential game. Each sensor node is supplied by a constant wireless energy transfer from a hybrid access point, which provides both an energy transfer function and an information transmission function. The energy dynamics of the sensors are considered as the state of the system, which is described by differential equations. We suppose that all sensors are rational players and that the combination of energy and throughput revenues is the optimization objective of each player. We obtain individual feedback Nash equilibria for the sensors in a finite time horizon and in an infinite time horizon, and present an iterative algorithm that achieves optimal solutions for uplink power control. Numerical results are given to demonstrate the correctness of the differential game analysis.

The remainder of the paper is organized as follows. Section 2 introduces the system model of WPSN and the uplink power control problem in a differential game. Section 3 provides feedback Nash equilibrium solutions for each wireless sensor and a differential game-based iterative algorithm. Numerical simulations are given in Section 4. Finally, we conclude the work in Section 5.

2. System Model and Problem Formulation

In this section, we propose a differential game model for uplink transmit power control in wireless powered sensor networks (WPSN). We consider a WPSN where there is one wireless energy transmitter serving several wireless sensors in its coverage area (as shown in Figure 1). The wireless energy transmitter can achieve a constant power supply in the downlink and can work as an access point for information transmission in the uplink, receiving signals from distributed wireless sensors. Thus, the wireless energy transmitter can be considered as a hybrid access point (H-AP) for energy and information transmission. All of the devices (including wireless energy transmitters and wireless sensors) in WPSN are assumed to work on orthogonal frequency bands, and work in the half-duplex mode. The wireless sensors use the energy harvested from H-AP for information transmission [17]. The energy harvested by each sensor is stored in a rechargeable battery and then used for wireless information transmission (WIT). Moreover, the wireless sensors control their uplink transmit power to extend the working hours, meanwhile improving their own QoS, which is a distributed optimization problem and leads to a dynamic game that can be modeled by a non-cooperative differential game.

In this paper, the “harvest-then-transmit” protocol [18] is considered. Transmission time is divided into blocks of normalized duration, and energy and information are transmitted from block to block. Each transmission block is divided into two phases (as shown in Figure 2). The first phase is the time duration of wireless energy transfer (WET), which is denoted by $τ_{i}$, and the second phase is the time duration of wireless information transmission (WIT), which is represented as $1 - τ_{i}$.

Our target is to control the uplink transmit power of sensors in WPSN, to maximize the sensors’ own economic revenue during the time period $t \in [0, T]$. A differential game-based model is constructed to describe the revenue maximization problem. Through the optimal power control, the wireless sensors in WPSN can achieve a balance between energy consumption and QoS improvement. In order to simplify the system, we consider a WPSN with one H-AP and $N$ wireless sensors, where, with a slight abuse of notation, $N$ also denotes the set of wireless sensors (players). During the first phase of wireless energy transfer, let $p_{T}^{i}$ denote the transfer power from H-AP to sensor $i$. It is assumed that $p_{T}^{i}$ satisfies a maximum power constraint $P_{T}^{\max}$ (i.e., $0 \leq p_{T}^{i} \leq P_{T}^{\max}$) [19]. The energy harvested by sensor $i$ is given by [20]:

E_{h} = η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i}, x_{i} (0) = 0

(1)

where $η_{i}$ is the energy conversion efficiency of player $i$, with $0 < η_{i} \leq 1$, and ${‖ g_{T}^{i} ‖}^{2}$ is the downlink channel gain. Let $x_{i} (t)$ denote the power level of player $i$, which can be interpreted as the state variable of the system. State variables are dynamic variables over different time periods that are influenced by the uplink transmit power, as well as by the existing levels of the state variables. Let $p_{i} (t)$ denote the uplink transmit power of player $i$, which is viewed as the control variable. The dynamics of the power level can be characterized by a linear differential equation, i.e.:

d x_{i} = [- μ_{i} x_{i} - (1 - τ_{i}) p_{i} + η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i}] d t, x_{i} (0) = 0

(2)

where $μ_{i}$ is the energy loss coefficient and $x_{i} (0) = 0$ is the initial state, which means that there is no energy transmission at the beginning of the game.
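
To make the dynamics in Equation (2) concrete, the following minimal Python sketch integrates the power level with a forward-Euler step for a constant uplink power. The function name, the argument names, the step size, and all numerical values are illustrative assumptions and are not taken from this paper.

```python
def simulate_power_level(p, mu, tau, eta, gain_dl, p_t, horizon, dt=0.01):
    """Forward-Euler integration of Equation (2) for a constant uplink power p.

    gain_dl stands for the downlink channel gain ||g_T^i||^2 and p_t for the
    H-AP transfer power p_T^i. All names here are hypothetical.
    """
    harvested = eta * tau * gain_dl * p_t      # right-hand side of Equation (1)
    x = 0.0                                    # initial state x_i(0) = 0
    trajectory = [x]
    for _ in range(int(round(horizon / dt))):
        x += (-mu * x - (1.0 - tau) * p + harvested) * dt
        trajectory.append(x)
    return trajectory


# Illustrative values only (assumed, not from the paper).
levels = simulate_power_level(p=0.3, mu=0.1, tau=0.5, eta=0.8,
                              gain_dl=0.5, p_t=2.0, horizon=100.0)
print(round(levels[-1], 3))
```

With a constant control, the level settles near the fixed point of Equation (2), that is, the harvested term minus $(1 - τ_{i}) p_{i}$, divided by $μ_{i}$.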

Now, we discuss how wireless sensors control their uplink transmit power to achieve revenue maximization, that is, to reach an equilibrium between energy consumption and achievable throughput. Subject to limited energy, each sensor would like to minimize its uplink transmit power to extend its working hours, but doing so may result in less information transmission and a low QoS. Therefore, each sensor needs to balance the conflict between energy consumption and achievable throughput. Generally speaking, each H-AP has a finite queue length or buffer size. When the buffer of the H-AP is full, it will refuse to serve any further uplink information transmission. In our game, we suppose that the buffer is large enough for information transmission and only consider how to control the uplink transmission power to achieve revenue maximization. The optimization model consists of energy revenue specifications and QoS revenue specifications.

Firstly, we give the energy revenue definition. The energy revenue depends on the energy stored in the sensors and the energy’s unit price. Assuming the unit price is $ε$, the instantaneous energy revenue is defined in a linear form, as follows:

U_{i}^{Eng} = ε x_{i}

(3)

In perfect competition, each sensor will use the lowest power possible, to reduce the energy consumption in Equation (2) and increase the energy revenue in Equation (3). However, less transmission power may cause a low transmission rate and low QoS. Thus, we introduce a QoS revenue to describe the conflict between energy consumption and QoS requirements. As the “harvest-then-transmit” protocol is considered, there is no interference from the energy transmission. Let the achievable rate of sensor $i$ determine the QoS revenue, which is obtained as:

U_{i}^{QoS} = ρ (1 - τ_{i}) \log (1 + \frac{g_{i} p_{i}}{σ_{i}^{2}}) = ρ (1 - τ_{i}) \log (1 + γ_{i} p_{i})

(4)

where $γ_{i} = g_{i} / σ_{i}^{2}$, $p_{i}$ is the uplink transmit power and the control variable of the game, $g_{i}$ is the uplink channel power gain, and $ρ$ is a constant parameter that denotes the unit rate revenue.

Based on the above assumptions, the total revenue of wireless sensor $i$ is denoted as follows:

U_{i}^{Revenue} = U_{i}^{QoS} + U_{i}^{Eng} = ρ (1 - τ_{i}) \log (1 + γ_{i} p_{i}) + ε x_{i}

(5)
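
As a small worked example of Equations (3) to (5), the sketch below evaluates the two revenue terms for a few power levels. It assumes that $\log$ denotes the natural logarithm, which is consistent with the first-order conditions used later in the paper; the function names and numerical values are hypothetical.

```python
import math


def qos_revenue(p, rho, tau, gamma):
    """Equation (4): rate revenue earned during the WIT phase."""
    return rho * (1.0 - tau) * math.log(1.0 + gamma * p)


def energy_revenue(x, eps):
    """Equation (3): stored energy valued at unit price eps."""
    return eps * x


def total_revenue(p, x, rho, tau, gamma, eps):
    """Equation (5): instantaneous revenue of one sensor."""
    return qos_revenue(p, rho, tau, gamma) + energy_revenue(x, eps)


# Illustrative values only (assumed, not from the paper).
for p in (0.1, 0.3, 0.9):
    value = total_revenue(p, x=2.5, rho=1.0, tau=0.5, gamma=10.0, eps=0.5)
    print(p, round(value, 4))
```

For a fixed power level $x_{i}$, the revenue grows with $p_{i}$; the trade-off only appears once the effect of $p_{i}$ on $x_{i}$ through Equation (2) is taken into account.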

In this paper, we use noncooperative differential game theory [21] to analyze the optimal uplink transmit power and to achieve revenue maximization for each sensor. Let the target QoS level for each sensor be denoted by $S_{i}$. We evaluate the balance between energy consumption and QoS over the time interval $[0, T]$ using the terminal term $α_{i} (x_{i} (T) - S_{i})$, where $α_{i}$ is a constant parameter and $T$ is the end of the control period. Let $r$ denote the discount rate. In the dynamic power control game, each sensor noncooperatively chooses its uplink transmit power to solve:

\begin{array}{ll} L_{i} & = \max_{p_{i}} \int_{0}^{T} U_{i}^{Revenue} (t) e^{- r t} d t + U_{i}^{Terminal} (T) \\ & = \max_{p_{i}} \int_{0}^{T} [ρ (1 - τ_{i}) \log (1 + γ_{i} p_{i}) + ε x_{i}] e^{- r t} d t + α_{i} (x_{i} (T) - S_{i}) \end{array}

(6)

Subject to the deterministic dynamics:

d x_{i} = [- μ_{i} x_{i} - (1 - τ_{i}) p_{i} + η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i}] d t

(7)

Now, we formulate the optimal power control for all sensors in WPSN as a differential game, as follows.

Players : All wireless sensors $i \in N$ in the WPSN.
Strategy space: All wireless sensors can noncooperatively choose their uplink transmit power ${p_{i}^{*} (t)}$ , to maximize the revenue.
State: The power level state is denoted by vector $x_{i} (t)$ , where the state is controlled by the dynamic constraint in Equation (2).
Objective function: All of the wireless sensors act to maximize their discounted revenues over a time interval $[0, T]$ , respectively.
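
The objective of a single player can be approximated numerically. Because the payoff in Equation (6) and the dynamics in Equation (7) involve only sensor $i$'s own state and control, the following sketch evaluates one player in isolation for a constant power. It is a rough illustration under assumed parameter values, using a left Riemann sum and a forward-Euler step; the function and argument names are hypothetical.

```python
import math


def discounted_payoff(p, T, rho, eps, r, mu, tau, gamma, harvested, alpha, S, dt=0.01):
    """Approximate the objective in Equation (6) for a constant power p.

    The integral uses a left Riemann sum and the state follows a forward-Euler
    step of Equation (7); harvested stands for eta * tau * ||g_T^i||^2 * p_T^i.
    """
    x, integral = 0.0, 0.0
    for k in range(int(round(T / dt))):
        t = k * dt
        instantaneous = rho * (1.0 - tau) * math.log(1.0 + gamma * p) + eps * x
        integral += instantaneous * math.exp(-r * t) * dt
        x += (-mu * x - (1.0 - tau) * p + harvested) * dt
    return integral + alpha * (x - S)      # terminal term of Equation (6)


# Compare a few constant strategies of one player (assumed values).
common = dict(T=100.0, rho=1.0, eps=0.5, r=0.1, mu=0.1, tau=0.5,
              gamma=10.0, harvested=0.4, alpha=1.0, S=1.0)
for p in (0.1, 0.3, 0.6):
    print(p, round(discounted_payoff(p, **common), 3))
```

Constant strategies are only a convenient test case here; the equilibrium strategies derived in Section 3 are in general functions of time.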

3. Game Analysis

In this section, we analyse the optimal uplink transmit power for each wireless sensor. In the following subsections, we first discuss the optimal uplink transmit power in a finite horizon. Then, the optimal strategy will be considered under an infinite horizon. An uplink power control algorithm based on the differential game will be given in the third subsection.

3.1. Analysis of Differential Game in Finite-Horizon

The finite horizon differential game will be solved, based on the dynamic optimization program technique, which was developed by Bellman [22,23]. According to Bellman’s dynamic programming principle, the uplink transmit power should be optimal for the given time duration.

Lemma 1.

For the optimization problem in Equations (6) and (7), an n-tuple of strategies $\{p_{i}^{*} (t, x), \text{ for } i \in N\}$ constitutes a feedback Nash equilibrium solution if there exists a functional $V^{i} (t, x)$, defined on the time interval $[0, T]$ and satisfying the following relations for each $i \in N$ [22,23]:

\begin{array}{l} V^{i} (t, x) = \int_{t}^{T} [ρ (1 - τ_{i}) \log (1 + γ_{i} {p_{i}}^{*}) + ε {x_{i}}^{*}] e^{- r s} d s + α_{i} (x_{i}^{*} (T) - S_{i}) \\ \geq \int_{t}^{T} [ρ (1 - τ_{i}) \log (1 + γ_{i} p_{i}) + ε x_{i}] e^{- r s} d s + α_{i} (x_{i} (T) - S_{i}) \end{array}

(8)

V^{i} (T, x) = α_{i} (x_{i} (T) - S_{i})

(9)

where, on the time interval $[0, T]$:

d x_{i}^{*} = [- μ_{i} x_{i}^{*} - (1 - τ_{i}) p_{i}^{*} + η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i}] d t

(10)

For all $t \in [0, T]$, if the strategies $\{p_{i}^{*} (s), \text{ for } i \in N\}$ provide a feedback Nash equilibrium to the differential game problem on the time interval $[0, T]$, they also provide a feedback Nash equilibrium for the same problem on the time interval $[t, T]$.

Lemma 2.

A feedback Nash equilibrium solution to the games (6) and (7) has to satisfy the following conditions:

\begin{array}{l} - V_{t}^{i} (t, x) = \max_{p_{i}} \{[ρ (1 - τ_{i}) \log (1 + γ_{i} p_{i}) + ε x_{i}] e^{- r t} \\ + V_{x}^{i} (t, x) (- μ_{i} x_{i} - (1 - τ_{i}) p_{i} + η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i})\} \end{array}

(11)

V^{i} (T, x) = e^{- r T} α_{i} (x_{i} (T) - S_{i})

(12)

Lemma 3.

In the wireless information transmission phase, the optimal uplink transmit power for each sensor $i \in N$ in WPSN satisfies:

p_{i}^{*} (t) = ρ [(α_{i} - \frac{ε}{r + μ_{i}}) e^{(r + μ_{i}) (t - T)} + \frac{ε}{r + μ_{i}}]^{- 1} - \frac{1}{γ_{i}}

(13)

Proof.

See Appendix A.

☐
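
As a hedged numerical illustration of Lemma 3, the sketch below evaluates the finite-horizon power by combining the first-order condition (A1) with the value function (A2), so that $p_{i}^{*} (t) = ρ / A_{i} (t) - 1 / γ_{i}$, where $A_{i} (t)$ is taken as the solution of (A3) that satisfies the terminal condition (A4). The function names and parameter values are assumptions chosen for illustration. No clipping to non-negative powers is applied, which a practical implementation would probably need.

```python
import math


def value_slope(t, T, alpha, eps, r, mu):
    """A_i(t): solution of dA/dt = (r + mu) A - eps with A(T) = alpha."""
    k = eps / (r + mu)                       # stationary point of (A3)
    return (alpha - k) * math.exp((r + mu) * (t - T)) + k


def finite_horizon_power(t, T, alpha, eps, r, mu, rho, gamma):
    """Feedback power from (A1) with V_x = A_i(t) * exp(-r t)."""
    return rho / value_slope(t, T, alpha, eps, r, mu) - 1.0 / gamma


# Illustrative values only (assumed, not from the paper).
args = dict(T=100.0, alpha=1.0, eps=0.5, r=0.1, mu=0.1, rho=1.0, gamma=10.0)
for t in (0.0, 90.0, 100.0):
    print(t, round(finite_horizon_power(t, **args), 4))
```

Far from the terminal time, the computed power approaches the time-independent value in Equation (17), which offers a quick sanity check on the parameters.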

3.2. Analysis of Infinite-Horizon Differential Game

Consider the infinite-horizon autonomous game problem with constant discounting, in which $T$ approaches infinity and where the objective functions and state dynamics are both autonomous. Now consider the alternative game to (6) and (7):

L_{i} = \max_{p_{i}} \int_{0}^{\infty} U_{i}^{Revenue} (t) e^{- r t} d t = \max_{p_{i}} \int_{0}^{\infty} [ρ (1 - τ_{i}) \log (1 + γ_{i} p_{i}) + ε x_{i}] e^{- r t} d t

(14)

Subject to the deterministic dynamics:

d x_{i} = [- μ_{i} x_{i} - (1 - τ_{i}) p_{i} + η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i}] d t, x_{i} (0) = 0

(15)

The infinite-horizon autonomous game is independent of the choice of t and only dependent upon the state at the starting time, which is 0. Then, a feedback Nash equilibrium solution for the infinite-horizon autonomous games (14) and (15) can be characterized as follows:

Lemma 4.

An n-tuple of strategies $\{q_{i}^{*} (x), \text{ for } i \in N\}$ constitutes a feedback Nash equilibrium solution if there exists a functional $W^{i} (x)$ satisfying the following set of partial differential equations for each $i \in N$:

r W^{i} (x) = \max_{q_{i}} \{ρ (1 - τ_{i}) \log (1 + γ_{i} q_{i}) + ε x_{i} + W_{x}^{i} (x) (- μ_{i} x_{i} - (1 - τ_{i}) q_{i} + η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i})\}

(16)

Lemma 5.

The optimal uplink power for each wireless sensor is independent of the time, which is the game equilibrium strategy and can be expressed as:

{q_{i}}^{*} = \frac{ρ}{ε} (r + μ_{i}) - \frac{1}{γ_{i}}

(17)

Proof.

See Appendix B.

☐
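
The closed form in Equation (17) can be cross-checked against a brute-force maximization of the control-dependent terms in Equation (16). The sketch below does this on a coarse grid, using the slope $W_{x}^{i} = ε / (r + μ_{i})$ from Appendix B. The grid, the function names, and the parameter values are illustrative assumptions.

```python
import math


def stationary_power(rho, eps, r, mu, gamma):
    """Equation (17): time-independent equilibrium power."""
    return rho * (r + mu) / eps - 1.0 / gamma


def control_terms(q, rho, tau, gamma, w_x):
    """Terms of the right-hand side of Equation (16) that depend on q."""
    return rho * (1.0 - tau) * math.log(1.0 + gamma * q) - w_x * (1.0 - tau) * q


rho, eps, r, mu, tau, gamma = 1.0, 0.5, 0.1, 0.1, 0.5, 10.0   # assumed values
w_x = eps / (r + mu)                    # slope C of W^i(x) from Appendix B
q_star = stationary_power(rho, eps, r, mu, gamma)

# Coarse grid search as an independent check of the maximizer.
grid = [k / 1000.0 for k in range(0, 2001)]
q_grid = max(grid, key=lambda q: control_terms(q, rho, tau, gamma, w_x))
print(q_star, q_grid)
```

The control-dependent terms are concave in $q_{i}$, so the grid maximizer and the closed form should agree up to the grid spacing.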

Lemma 6.

The optimal state trajectory for the infinite-horizon differential game satisfies:

\begin{array}{l} x_{i} = (\frac{ρ (1 - τ_{i}) (r + μ_{i})}{μ_{i} ε} - \frac{(1 - τ_{i})}{μ_{i} γ_{i}} - \frac{η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i}}{μ_{i}}) e^{- μ_{i} t} \\ - \frac{ρ (1 - τ_{i}) (r + μ_{i})}{μ_{i} ε} + \frac{(1 - τ_{i})}{μ_{i} γ_{i}} + \frac{η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i}}{μ_{i}} \end{array}

(18)

Proof.

Substituting the optimal uplink power obtained in Equation (17), which is also the game equilibrium strategy, into the state function, yields:

d x_{i} = [- μ_{i} x_{i} - (1 - τ_{i}) (\frac{ρ}{ε} (r + μ_{i}) - \frac{1}{γ_{i}}) + η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i}] d t, x_{i} (0) = 0

(19)

☐

The optimal state trajectory can be obtained through solving the above dynamics, and is denoted as:

\begin{array}{l} x_{i} = (\frac{ρ (1 - τ_{i}) (r + μ_{i})}{μ_{i} ε} - \frac{(1 - τ_{i})}{μ_{i} γ_{i}} - \frac{η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i}}{μ_{i}}) e^{- μ_{i} t} \\ - \frac{ρ (1 - τ_{i}) (r + μ_{i})}{μ_{i} ε} + \frac{(1 - τ_{i})}{μ_{i} γ_{i}} + \frac{η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i}}{μ_{i}} \end{array}

(20)
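
Equation (20) can be written more compactly as $x_{i} (t) = x_{i}^{ss} (1 - e^{- μ_{i} t})$, where $x_{i}^{ss}$ collects the three constant terms on its second line. The sketch below uses this form and checks numerically that it satisfies Equation (19). The symbol $x_{i}^{ss}$, the function names, and the parameter values are assumptions introduced for illustration.

```python
import math


def steady_state_level(rho, eps, r, mu, tau, gamma, harvested):
    """Limit of Equation (20) as t grows; harvested = eta*tau*||g_T^i||^2*p_T^i."""
    q_star = rho * (r + mu) / eps - 1.0 / gamma          # Equation (17)
    return (harvested - (1.0 - tau) * q_star) / mu


def power_level(t, rho, eps, r, mu, tau, gamma, harvested):
    """Equation (20) rewritten as x_ss * (1 - exp(-mu t))."""
    x_ss = steady_state_level(rho, eps, r, mu, tau, gamma, harvested)
    return x_ss * (1.0 - math.exp(-mu * t))


# Illustrative values only (assumed, not from the paper).
params = dict(rho=1.0, eps=0.5, r=0.1, mu=0.1, tau=0.5, gamma=10.0, harvested=0.4)
for t in (0.0, 10.0, 50.0, 100.0):
    print(t, round(power_level(t, **params), 4))

# Finite-difference check against the right-hand side of Equation (19).
t0, h = 3.0, 1e-5
slope = (power_level(t0 + h, **params) - power_level(t0 - h, **params)) / (2.0 * h)
q_star = params["rho"] * (params["r"] + params["mu"]) / params["eps"] - 1.0 / params["gamma"]
rhs = -params["mu"] * power_level(t0, **params) - (1.0 - params["tau"]) * q_star + params["harvested"]
print(abs(slope - rhs))
```

The compact form makes the convergence rate explicit: the gap to the steady level decays as $e^{- μ_{i} t}$.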

3.3. Uplink Power Control Algorithm

In this subsection, we present an uplink power control algorithm (Algorithm 1) in wireless powered sensor networks, based on the infinite-horizon solutions presented in Section 3.2, which is as follows:

Algorithm 1. The strategy for each sensor to determine the optimal uplink transmit power
1:	Initially, each sensor sets its power level $x$ to 0, as there is no energy transmission at the beginning of the game.
2:	for sensor $i \in N$ do
3:	Start the game and initialize the parameters $τ_{i}$ , $μ_{i}$ , $η_{i}$ ;
4:	Based on the QoS requirements, set the final rate revenue level as $S_{i}$ ;
5:	while $x_{i} > 0$ , do
6:	Calculate the optimal uplink power based on Equation (17);
7:	Calculate the optimal strategy of power level based on Equation (20);
8:	Calculate the maximized revenue for each sensor based on Equations (14), (17) and (20);
9:	Update the power level $x_{i}$ for each sensor;
10:	end while
11:	end for

In the above algorithm, each sensor continues to calculate the optimal uplink transmit power, until there is no energy left in the sensor’s batteries for information transmission.
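
One possible reading of Algorithm 1 is sketched below in Python. It is not the authors' implementation. Since the power level starts at zero, the sketch assumes that the depletion test is applied only after the first step, and it runs over a fixed horizon with a fixed time step. The function name, the dictionary keys, the horizon, and all parameter values are hypothetical.

```python
import math


def run_sensor(params, horizon=100.0, dt=1.0):
    """Hypothetical rendering of Algorithm 1 for a single sensor."""
    rho, eps, r, mu = params["rho"], params["eps"], params["r"], params["mu"]
    tau, gamma = params["tau"], params["gamma"]
    harvested = params["eta"] * tau * params["gain_dl"] * params["p_t"]

    q_star = rho * (r + mu) / eps - 1.0 / gamma           # step 6, Equation (17)
    x_ss = (harvested - (1.0 - tau) * q_star) / mu
    history, discounted = [], 0.0
    for k in range(int(round(horizon / dt)) + 1):
        t = k * dt
        x = x_ss * (1.0 - math.exp(-mu * t))              # step 7, Equation (20)
        if k > 0 and x <= 0.0:                            # battery depleted: stop
            break
        revenue = rho * (1.0 - tau) * math.log(1.0 + gamma * q_star) + eps * x
        discounted += revenue * math.exp(-r * t) * dt     # step 8, sum for Equation (14)
        history.append((t, q_star, x, revenue))           # step 9, record updated level
    return history, discounted


sensors = {  # assumed parameter sets for two sensors
    1: dict(rho=1.0, eps=0.5, r=0.1, mu=0.1, tau=0.5, gamma=10.0,
            eta=0.8, gain_dl=0.5, p_t=2.0),
    2: dict(rho=1.0, eps=0.5, r=0.1, mu=0.2, tau=0.4, gamma=8.0,
            eta=0.7, gain_dl=0.4, p_t=3.0),
}
results = {i: run_sensor(p) for i, p in sensors.items()}
for i, (history, discounted) in results.items():
    print(i, len(history), round(history[-1][2], 3), round(discounted, 3))
```

Each sensor is processed independently, which reflects the distributed nature of the algorithm: no sensor needs the parameters of another.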

4. Numerical Simulations

4.1. Optimal Power and Revenue

In this section, we evaluate the proposed differential game model by simulations. The simulation results of the finite-horizon and infinite-horizon differential games are both presented. We assume that the number of sensors in WPSN is $N = 20$, and consider the time horizon $T = 100$. Based on Equation (A1), the parameter $A_{i} (t)$ of the value function $V^{i} (t, x)$ directly impacts the variation of the optimal uplink power. Thus, Figure 3 shows how the key parameter $A_{i} (t)$ varies with time, plotted in seconds. We observe that $A_{i} (t)$ monotonically increases for sensor 1, sensor 2, and sensor 5, and monotonically decreases for sensor 3. Based on Equation (A7) in Appendix A, we can see that the variation of $A_{i} (t)$ is affected by the constant parameter $α_{i}$ and the energy loss coefficient $μ_{i}$. Therefore, different sensors exhibit different variation trends of $A_{i} (t)$. The optimal uplink transmit power of the sensors under a finite horizon is plotted in Figure 4. The optimal uplink transmit power varies over time with parameter $A_{i} (t)$. In Figure 5, we show the optimal uplink transmit power under an infinite horizon. The uplink transmit power is constant and independent of time. Figure 6 shows the optimal trajectories of the state, which are the power levels of each sensor. It can be observed that the power level exhibits an initial growth trend. However, as time increases, it converges to a steady value. In other words, the dynamics of the power level are convergent and the convergence is fast. Finally, the revenue variation with time and the maximized revenue of each sensor are evaluated and shown in Figure 7 and Figure 8.

4.2. Residual Energy

In this section, we compare the proposed differential game (DG) algorithm with the Nash bargaining game (NBG) algorithm in [14], which is also a game theory-based power control method in WPSN. We use the same information transmission power for the simulations, and both tests are configured with the same parameters. The residual energy of sensors one to four is shown in Figure 9. Each sensor should retain some residual energy in order to deal with information transmission tasks. As time increases, the residual energy of the sensors under our algorithm increases and rapidly converges to a stable level, whereas the residual energy of the sensors under the Nash bargaining game remains unchanged. Figure 9 also shows that the residual energy under our algorithm is higher than that under the NBG algorithm. Wireless sensors thus have more power for information transmission under our algorithm.

4.3. QoS Revenue

According to the QoS revenue function in Equation (4), the QoS revenue is simulated and the comparison between our DG algorithm and the NBG algorithm is shown in Figure 10. All sensors are tested in our simulations. As time increases, the QoS revenue under the DG algorithm increases, because the QoS revenue is directly proportional to the energy level. Although the revenue under the DG algorithm is lower than that of the NBG algorithm, the QoS revenue increases quickly, whereas the QoS revenue under the NBG algorithm maintains a constant value. In addition, our algorithm reveals a better performance than the NBG algorithm.

5. Conclusions

In this paper, we study the uplink transmit power control problem in wireless powered sensor networks. We propose a non-cooperative differential game model to analyze the optimal transmission power for the energy harvesting sensors. In the game, each sensor determines its uplink transmit power to maximize a utility that combines energy revenue and QoS revenue over a time horizon. Using Bellman dynamic programming, we obtain the Nash equilibrium (NE) solutions under a finite horizon and an infinite horizon, respectively. When all sensors achieve the NE, the optimal trajectory of the power level can be derived and the maximized revenue can be obtained. The correctness and convergence of the proposed algorithm are demonstrated through numerical simulations.

In future work, we will attempt to combine the power control problem with the time scheduling problem and to analyse the influence of the buffer size in our model, which is more practical for limited network resources. We can then ascertain how to achieve optimal power control under an appropriate MAC algorithm. Finally, the whole revenue can be maximized, based on this solution.

Acknowledgments

This work was supported by the National Science Foundation Project of China (No. 61501026, 61402147), Research and Development Program for Science and Technology of Handan of China (No. 1621203037), and Fundamental Research Funds for the Central Universities (No. FRF-TP-15-032A1).

Author Contributions

Haitao Xu conceived the main idea and the differential game theory model; all authors contributed to data analysis and wrote the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof of Lemma 3.

Based on dynamic programming, setting the first derivative of the right-hand side of Equation (11) with respect to $p_{i}$ to zero gives the maximum of the utility function. Thus we have:

p_{i}^{*} (t) = ρ [V_{x}^{i} (t, x)]^{- 1} e^{- r t} - \frac{1}{γ_{i}}

(A1)

Upon the incorporation of $p_{i}^{*} (t)$ into (11) and (12) and solving (11) and (12), we obtain the value function:

V^{i} (t, x) = [A_{i} (t) x_{i} + B_{i} (t)] e^{- r t}

(A2)

where $A_{i} (t)$ and $B_{i} (t)$ satisfy:

d A_{i} (t) = [(r + μ_{i}) A_{i} (t) - ε] d t

(A3)

A_{i} (T) = α_{i}

(A4)

\begin{array}{l} d B_{i} (t) = [r B_{i} (t) - ρ (1 - τ_{i}) \log (1 + γ_{i} {p_{i}}^{*}) \\ - A_{i} (t) [η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i} - (1 - τ_{i}) p_{i}^{*}]] d t \end{array}

(A5)

B_{i} (T) = - α_{i} S_{i}

(A6)

Solving Equations (A3) and (A4), we have:

A_{i} (t) = (α_{i} - \frac{ε}{r + μ_{i}}) e^{(r + μ_{i}) (t - T)} + \frac{ε}{r + μ_{i}}

(A7)

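$B_{i} (t)$ can be solved as:

B_{i} (t) = e^{r t} [\int_{0}^{t} F_{i} (s) e^{- r s} d s + B_{i}^{0}]

(A8)

With:

B_{i}^{0} = - α_{i} S_{i} e^{- r T} - \int_{0}^{T} F_{i} (s) e^{- r s} d s

(A9)

where $F_{i} (t) = - ρ (1 - τ_{i}) \log (1 + γ_{i} {p_{i}}^{*}) - A_{i} (t) [η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i} - (1 - τ_{i}) p_{i}^{*}]$ collects the terms of (A5) that do not depend on $B_{i} (t)$.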

Incorporating $A_{i} (t)$ into (A1), we obtain the optimal uplink transmit power for each sensor:

p_{i}^{*} (t) = ρ [(α_{i} - \frac{ε}{r + μ_{i}}) e^{(r + μ_{i}) (t - T)} + \frac{ε}{r + μ_{i}}]^{- 1} - \frac{1}{γ_{i}}

(A10)

Hence, Lemma 3 follows.

☐
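
A numerical consistency check of this proof is sketched below. It integrates (A5) backward from the terminal condition (A6) to obtain $B_{i} (0)$, which by (A2) equals the value $V^{i} (0, 0)$, and compares it with the discounted revenue accumulated along the trajectory driven by the feedback power from (A1). The terminal term is discounted as in Equation (12), and $A_{i} (t)$ is taken as the solution of (A3) that satisfies (A4). All names and parameter values are assumptions for illustration, and simple Euler steps are used, so the two numbers agree only approximately.

```python
import math

# Assumed illustrative parameters (not taken from the paper).
T, ALPHA, EPS, R, MU = 10.0, 1.0, 0.5, 0.1, 0.1
RHO, GAMMA, TAU, HARVESTED, S = 1.0, 10.0, 0.5, 0.4, 1.0


def slope_a(t):
    """A_i(t) solving (A3) with A_i(T) = alpha_i, as required by (A4)."""
    k = EPS / (R + MU)
    return (ALPHA - k) * math.exp((R + MU) * (t - T)) + k


def power(t):
    """(A1) with V_x = A_i(t) exp(-r t)."""
    return RHO / slope_a(t) - 1.0 / GAMMA


def intercept_b_at_zero(dt=1e-3):
    """Integrate (A5) backward in time from B_i(T) = -alpha_i S_i, see (A6)."""
    b = -ALPHA * S
    for k in range(int(round(T / dt)), 0, -1):
        t = k * dt
        p = power(t)
        db = (R * b - RHO * (1.0 - TAU) * math.log(1.0 + GAMMA * p)
              - slope_a(t) * (HARVESTED - (1.0 - TAU) * p))
        b -= db * dt                      # step from t to t - dt
    return b


def simulated_payoff(dt=1e-3):
    """Discounted revenue along the trajectory driven by power(t), from x = 0."""
    x, total = 0.0, 0.0
    for k in range(int(round(T / dt))):
        t = k * dt
        p = power(t)
        revenue = RHO * (1.0 - TAU) * math.log(1.0 + GAMMA * p) + EPS * x
        total += revenue * math.exp(-R * t) * dt
        x += (-MU * x - (1.0 - TAU) * p + HARVESTED) * dt
    return total + math.exp(-R * T) * ALPHA * (x - S)


print(intercept_b_at_zero(), simulated_payoff())
```

Agreement between the two printed numbers indicates that the linear value function (A2) is consistent with the dynamics and the payoff, which is the property the proof relies on.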

Appendix B

Proof of Lemma 5.

Performing the indicated maximization in (16), we obtain:

{q_{i}}^{*} = ρ [W_{x}^{i} (x)]^{- 1} - \frac{1}{γ_{i}}

(A11)

Incorporating ${q_{i}}^{*}$ into (16) and solving (16) yields:

W^{i} (x) = C x_{i} + D

(A12)

where:

C = \frac{ε}{r + μ_{i}}

(A13)

D = \frac{ρ}{r} (1 - τ_{i}) \log (1 + γ_{i} {q_{i}}^{*}) + \frac{ε}{r + μ_{i}} \frac{1}{r} (η_{i} τ_{i} {‖ g_{T}^{i} ‖}^{2} p_{T}^{i} - (1 - τ_{i}) {q_{i}}^{*})

(A14)

The game equilibrium strategy can then be expressed as:

{q_{i}}^{*} = \frac{ρ}{ε} (r + μ_{i}) - \frac{1}{γ_{i}}

(A15)

Hence, Lemma 5 follows.

☐
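
The coefficients (A13) and (A14) can be checked by substituting $W^{i} (x) = C x_{i} + D$ back into Equation (16) and confirming that both sides agree at $q_{i} = {q_{i}}^{*}$. The following sketch computes this residual for a few states. The function names and parameter values are hypothetical.

```python
import math


def value_coefficients(rho, eps, r, mu, tau, gamma, harvested):
    """Return (C, D, q_star) from (A13), (A14) and (A15)."""
    q_star = rho * (r + mu) / eps - 1.0 / gamma
    c = eps / (r + mu)
    d = ((rho / r) * (1.0 - tau) * math.log(1.0 + gamma * q_star)
         + c / r * (harvested - (1.0 - tau) * q_star))
    return c, d, q_star


def hjb_residual(x, rho, eps, r, mu, tau, gamma, harvested):
    """Left side minus right side of Equation (16) evaluated at q = q_star."""
    c, d, q = value_coefficients(rho, eps, r, mu, tau, gamma, harvested)
    lhs = r * (c * x + d)
    rhs = (rho * (1.0 - tau) * math.log(1.0 + gamma * q) + eps * x
           + c * (-mu * x - (1.0 - tau) * q + harvested))
    return lhs - rhs


# Illustrative values only (assumed, not from the paper);
# harvested stands for eta * tau * ||g_T^i||^2 * p_T^i.
params = dict(rho=1.0, eps=0.5, r=0.1, mu=0.1, tau=0.5, gamma=10.0, harvested=0.4)
for x in (0.0, 1.0, 2.5, 10.0):
    print(x, hjb_residual(x, **params))
```

A residual near zero for every state confirms that the linear form in (A12) solves Equation (16) for the chosen parameters.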

References

1. Akan, O.B.; Isik, M.T.; Baykal, B. Wireless passive sensor networks. IEEE Commun. Mag. 2009, 47, 92–99.
2. Compte, S.S.; Lloret, J.; Pineda, M.G.; Alarcón, T. Power saving and energy optimization techniques for Wireless Sensor Networks. J. Commun. Eng. Technol. Publ. 2011, 6, 439–459.
3. Niyato, D.; Hossain, E.; Rashid, M.M.; Bhargava, V.K. Wireless sensor networks with energy harvesting technologies: A game-theoretic approach to optimal energy management. IEEE Wirel. Commun. 2007, 14, 90–96.
4. Bi, S.; Zeng, Y.; Zhang, R. Wireless powered communication networks: An overview. IEEE Wirel. Commun. 2016, 23, 10–18.
5. Lee, S.; Zhang, R. Cognitive wireless powered network: Spectrum sharing models and throughput maximization. IEEE Trans. Cogn. Commun. Netw. 2015, 1, 335–346.
6. Ma, Y.; Chen, H.; Lin, Z.; Li, Y.; Vucetic, B. Distributed resource allocation for power beacon-assisted wireless-powered communications. In Proceedings of the 2015 IEEE International Conference on Communications (ICC), London, UK, 8–12 June 2015; pp. 3849–3854.
7. Liu, L.; Zhang, R.; Chua, K.C. Multi-antenna wireless powered communication with energy beamforming. IEEE Trans. Commun. 2014, 62, 4349–4361.
8. Huang, C.; Zhang, R.; Cui, S. Optimal power allocation for outage probability minimization in fading channels with energy harvesting constraints. IEEE Trans. Wirel. Commun. 2014, 13, 1074–1087.
9. Zhou, R.; Cheng, R.S. Optimal scheduling and power allocation for wireless powered two-way relaying systems. In Proceedings of the Wireless Communications and Networking Conference (WCNC), Doha, Qatar, 3–6 April 2016; pp. 1–6.
10. Kim, M.; Lee, K.; Cho, D.H. Proportional Fair Resource Allocation in Energy Harvesting-Based Wireless Networks. IEEE Syst. J. 2016.
11. Lee, J.H.; Pak, D. A Game Theoretic Optimization Method for Energy Efficient Global Connectivity in Hybrid Wireless Sensor Networks. Sensors 2016, 16, 1380.
12. Li, M.; Chen, P.; Gao, S. Cooperative Game-Based Energy Efficiency Management over Ultra-Dense Wireless Cellular Networks. Sensors 2016, 16, 1475.
13. Niyato, D.; Lu, X.; Wang, P.; Kim, D.I.; Han, Z. Distributed Wireless Energy Scheduling for Wireless Powered Sensor Networks. In Proceedings of the 2016 IEEE International Conference on Communications (ICC), Kuala Lumpur, Malaysia, 23–27 May 2016; pp. 1–6.
14. Zheng, Z.; Song, L.; Niyato, D.; Han, Z. Resource Allocation in Wireless Powered Relay Networks through a Nash Bargaining Game. In Proceedings of the 2016 IEEE International Conference on Communications (ICC), Kuala Lumpur, Malaysia, 23–27 May 2016; pp. 1–6.
15. Isaacs, R. Differential Games III; Dover Publications, Inc.: Mineola, NY, USA, 1954.
16. Xu, H.; Zhou, X. Optimal Power Control in Cooperative Relay Networks Based on a Differential Game. ETRI J. 2014, 36, 280–285.
17. Yin, S.; Qu, Z.; Wang, Z.; Li, L. Energy-efficient Cooperation in Cognitive Wireless Powered Networks. IEEE Commun. Lett. 2017, 21, 128–131.
18. Ju, H.; Zhang, R. Throughput maximization in wireless powered communication networks. IEEE Trans. Wirel. Commun. 2014, 13, 418–428.
19. Chingoska, H.; Hadzi-Velkov, Z.; Nikoloska, I.; Zlatanov, N. Resource Allocation in Wireless Powered Communication Networks with Non-Orthogonal Multiple Access. IEEE Wirel. Commun. Lett. 2016, 5, 684–687.
20. Wu, Y.; Chen, X.; Yuen, C.; Zhong, C. Robust Resource Allocation for Secrecy Wireless Powered Communication Networks. IEEE Commun. Lett. 2016, 20, 2430–2433.
21. Başar, T.; Olsder, G.J. Dynamic Noncooperative Game Theory, 2nd ed.; Academic Press: Cambridge, MA, USA, 1999.
22. Yeung, D.W.K.; Petrosjan, L.A. Cooperative Stochastic Differential Games; Springer Science & Business Media: Dordrecht, The Netherlands, 2006.
23. Osborne, M.J. An Introduction to Game Theory; Oxford University Press: Oxford, UK, 2004; Volume 9, pp. 841–846.

Figure 1. System model for a WPSN.

Figure 2. Two-step transmission phase.

Figure 3. Variation of $A_{i} (t)$ with time.

Figure 4. Variation of the optimal uplink power in finite-horizon.

Figure 5. Variation of the optimal uplink power in infinite-horizon.

Figure 6. Dynamic of power level (the optimal state trajectory).

Figure 7. Dynamic of revenue level.

Figure 8. Revenue level of each sensor.

Figure 9. Residual energy of sensors.

Figure 10. QoS revenue of sensors.

© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).
