1. Introduction
With the wide applications of sensor networks, the demand for long-term operation of the networks in various occasions is increasing, which makes the lifetime of wireless sensor networks become an increasingly important research point. A great part of the recent literature [
1,
2,
3,
4,
5] has focused on this problem from different aspects, such as charging process, efficient routing, dynamic user association, etc. Among them, routing and charging technology are two common ways. Routing energy-saving algorithms [
6,
7,
8,
9] are used to reduce nodes’ energy consumption by all means. However, the method cannot fundamentally solve the energy constraint problems; that the capacity of sensor battery is always small and needs to be replaced frequently for long-term operation. It is costly to replace battery for nodes in time when they deplete energy, especially for nodes in remote areas. The wireless energy transmission technology, allowing fixed or mobile energy supplier to transfer energy to nodes wirelessly, becomes a significant solution to charge nodes timely. Its high replenishment efficiency and strong controllability guarantee broad application prospects.
In wireless sensor network (WSN), the sensor nodes send packets to the sink node in the form of multi-hop. However, unreasonable routing strategy can easily lead to energy waste or uneven energy consumption, thus shortening the network lifetime. Before the emergence of charging technology, research mainly focused on optimizing routing strategies to reduce network energy consumption and balance network energy [
10,
11,
12,
13,
14]. Among them, the minimum path algorithm [
10] tends to minimize the total energy consumption of network. Distributed energy balanced routing (DEBR) [
11], one of the most effective energy aware routing protocol, takes the node’s residual energy into consideration to balance the energy utilization rate and residual energy. The energy aware routing protocol is extended to address big data routing in [
12], and the quality of service and priority levels are considered for routing selection. The authors in [
13] divided the network into clusters and a mobile collector to collect data from chosen cluster heads, and the concept of fuzzy if-then rules is adopted to choose the cluster head. The energy optimized dynamic clustering (EODC) [
14] also considers nodes’ location, energy, and link quality. And particle swarm optimization (PSO) and manhattan distance are used for the clustering of nodes, so as to optimize the consumption of energy.
Afterwards, the charging technology is applied to WSN, and wireless rechargeable sensor networks (WRSN) emerges. In WRSN, nodes are able to harvest energy from the radio frequency signals sent by the mobile wireless charging vehicle (WCV), which is the mainstream energy supplier. For charging strategy, one key issue is how to allocate the charging time and to adjust the charging sequence, which has a great impact on the charging effect. In [
15], the nodes, whose energy below a certain energy threshold are identified as the target charging nodes, and the charging time is distributed proportionally according to their energy requirements, so as to maximize the energy utilization. A double warning thresholds with double preemption (DWDP) charging scheme is proposed in [
16], in which double thresholds are used to adjust charging priorities of different sensors, so as to improve the charging throughput of the network and charging efficiency of WCV. In [
17], sensor nodes with charging requirements are firstly screened out, and those with low residual energy will be given priority to charge. After determining the charging sequence, the charging path is planned to extend the total charging time by reducing the moving time, so as to supplement more energy for the network. However, WCV in these literatures only charges for nodes whose energy are lower than the threshold in each charging cycle. Thus, the situation that most nodes need to be charged at the same cycle may occur after a period of time. At this time, premature death for some nodes may be inevitable when the WCV is unable to replenish corresponding energy for nodes. In [
18,
19], the charging operation is triggered periodically taking into account every node in the network. These algorithms mainly focus on the full coverage of the network, the charging points locations and the charging time allocation. In [
20], a charging path named TRIANGLE is proposed. In [
21], four different charging paths are compared and the influence of the number of charging points is also discussed. The authors in [
22] investigate a throughput maximization problem by allocating WCV charging time.
However, the above studies only consider the routing strategy or charging strategy unilaterally, ignoring the interaction between the charging strategy and routing algorithm. Articles which simultaneously optimize the network lifetime from both aspects show great research value. In [
23], the concept of energy-aware routing cost is proposed for nodes to choose the next node, and the charging path is planned according to residual node energy after routing process. The authors in [
24] investigate the problem of maximizing the minimum sampling rate of the nodes, by jointly considering routing and energy beamforming for wireless energy transmission. In [
25], the simultaneous wireless information and power transfer (SWIPT) technology is applied into the cluster sensor networks. Moreover, a non-convex constrained optimization problem is proposed and solved by the distributed iteration algorithm, which can reduce energy consumption and improve energy efficiency. Nevertheless, the above optimizations adopt the point-to-point charging mode, where WCV charges only one sensor node at a time. In a multi-node charging scheme, WCV is able to charge multiple neighboring nodes within its charging range simultaneously, which greatly improves the charging efficiency [
26]. The authors in [
27] adopt multi-node charging scheme and genetic algorithm to optimize the routing and charging process with the purpose of obtaining the best data transmission path and mobile path. In this way, the waste of charging time is reduced and sensor nodes can get more charging energy. The authors in [
28] develop a mathematical model for the WCV as a mobile platform for both WET and data collection, with the goal of minimizing energy consumption associated with the WCV. Gauangjie Han et al. [
29] propose a grid-based charging routing joint optimization algorithm, in which the WCV stays at fixed charging points in each charging cycle and replenishes energy according to node energy consumption rate to balance node energy; then, nodes select the next hop node according to their location and charging efficiency. The location of charging points is the same each charging cycle, which leads that nodes’ charging efficiency not change. The next hop node of each node is fixed during the data transmission process, and some nodes that remain in a high energy consumption state will die faster.
Based on the above analysis, we propose an energy balanced routing strategy that is adaptive to the charging method in multi-node charging mode. Firstly, the charging efficiency of the node is adjusted by changing the position of the charging points in different cycles, so as to improve the overall replenishment effect. Moreover, the WCV charges for nearby nodes according to their energy consumption rate to ensure the energy balance of network as far as possible. Meanwhile, the node makes different routing choices according to its own charging characteristics to make compromise between single node energy consumption and the whole energy consumption, with the goal to not only reduce the overall network energy consumption but also optimize the average hops. 
In 
Section 2, the system model is formulated in advance. The charging strategy is proposed in 
Section 3. The routing algorithm adaptive to charging process is presented in 
Section 4. Implementation guidelines and complexity analysis are discussed in 
Section 5. Simulation and comparison results are followed in 
Section 6. Finally, 
Section 7 highlights the conclusion.
  2. System Model 
As shown in 
Figure 1, we consider a network with sink node, a WCV and 
n nodes that are labeled as 
. Nodes are randomly deployed within a circular area of diameter 
L, and are responsible for sensing and transmitting data to the sink node. The WCV sends wireless signals to nodes through energy transmitting module, and nodes’ energy receiving module transforms them into energy. The whole sensing region is divided into different concentric static rings around the sink node. Here, the circumferences with different diameters represent charging paths, and the WCV will stop at the charging points on these paths to charge the nodes in its charging radius of 
. Moreover, the radius of circumference closest to the sink node is 
d, and each radius of other circumferences increases by 
d. For the convenience of expression, we marked the charging path of 
 away from sink node as 
. 
To ensure that nodes at any location can be charged, the charging points are evenly distributed on the charging path, and the set of charging points 
 is represented as:
      where 
 is the number of charging points on the 
, and it meets 
, in which 
l denotes the difference in the number of charging points between adjacent two paths, and its value also shows the number of charging points on 
. When 
l becomes larger, charging points on the paths will be denser, and the WCV will stay at each charging point for a shorter time when the charging period is fixed. In addition, 
 stands for the 
u-th charging points on 
. When WCV stays at charging point 
, it can energize a plurality of sensor nodes within 
. According to Friis propagation model [
18,
19], the charging efficiency of node 
I can be calculated by Equation (2): 
The equation often serves as a basic model for different applications. In WRSN, WCV transmits RF signals through circularly polarized antennas, which have transmit gain 
. Nodes receive the signal via linearly polarized dipole antennas, which have receive gain 
. 
λ is the wavelength. 
 is polarization loss, 
η can be referred as rectifier efficiency and 
β is a parameter to adjust the Friis’ free space equation for short distance transmission. Moreover, 
 is transmitting power and 
 is the distance between node 
i and the WCV. When 
, the equation is reformulated as 
 in [
19], where 
.
According to the charging method described above, it is obvious that the nodes may be charged multiple times in a charging cycle due to the overlapping of the charging areas of different charging points, and the energy of the sensor node after charging is:
      where 
 is the residual energy of node 
i before charging, 
 is the charging time that WCV stays at each charging point on 
, that is, 
 represents the energy that node 
i receives from the WCV at the 
u-th charging point on 
. 
 is the maximum storage energy threshold of nodes.
In WRSN, node 
i generates sensing data and then transmits these data to the nodes in its forward region 
NF(i) [
15]—nodes which are closer to sink node than node 
i and also are in node 
i’s transmitting area. Furthermore, energy consumption occurs when nodes transmit and receive data packets. According to energy loss model [
30], the receiving energy consumption is much less than the transmitting energy consumption, thus, it is ignored here.
To ensure the long-term stable operation of the network, it is necessary to optimize both charging and routing process to maximize the network lifetime, that is:
      Here, 
 means the survival time of the first death node in the network, which is generally called network lifetime [
31,
32,
33,
34]. The 
 denotes the energy consumption per transmitting data rate from node 
i to 
j, and 
 represents the transmission data rate from node 
i to node 
j. [
35] From this equation, it can be concluded that providing more energy or reducing the energy loss is conducive to prolonging the network lifetime. Since the nodes are randomly deployed in the network, it is necessary to set a reasonable location planning of charging points, in which WCV can supply energy for nodes at any position. The energy consumption state of nodes at different positions differs; thus, it is absolutely essential to allocate the charging time at each charging point reasonably to provide enough energy for nodes. When WCV stays at certain charging point, the power received by the surrounding nodes are different, thus the energy obtained by nodes are also different. The nodes with low charging efficiency may not obtain enough energy to maintain its energy consumption, then their energy gradually decline faster. Therefore, adjusting the node charging efficiency and balancing the node energy can avoid some nodes’ energy far less than the average level and lessening the network lifetime. 
As mentioned above, the charging efficiency, charging time and the energy after charging are different between nodes. If the routing algorithm only considers the current node energy and ignores the other charging characteristics, for some nodes with high residual energy and low charging efficiency, they will be selected as relay nodes with great probability and need to be recharged for a longer period of time to replenish the lost energy, which is unfavorable for the improvement of the overall network energy efficiency. Thus, it is important that selection of the next hop should consider the difference of charging characteristics of nodes.
  3. Charging Strategy
In WRSN, the sensor nodes send packets to the sink node hop-by-hop, and some nodes undertake more data transmission tasks and thus consume energy faster. In case of the early death of such nodes, we allocate charging time at different charging points on the basis of the average energy consumption of nearby nodes, and dynamically plan the location of the charging points to reduce the difference of the charging efficiency of nodes, so that those nodes that consume more energy can obtain more energy in the charging stage. Through this way, we achieve the purpose of balancing the network energy and prolonging the node survival time. 
According to Equation (3), the energy that nodes obtain in charging period is positively correlation with charging efficiency and charging time. For the nodes farther away from the charging points, their charging efficiency is lower, so those nodes tend to acquire less energy during the charging cycle. Moreover, the high energy consumption nodes with insufficient charging energy may stay in the low energy state, and then die earlier. Meanwhile, the nodes near the charging points have a high charging efficiency, and they often obtain more energy than average level in each charging cycle, which can easily lead to charging saturation. In order to change the imbalance of charging energy caused by the difference of charging efficiency, the positions of the charging points are changed at the end of each charging cycle, so that the charging efficiency of each node changes in different charging cycles. After several charging cycles, the energy obtained by nodes with the same distance from the charging path will be more balanced, and the charging effect can be improved.
The sum of charging time at each charging points is fixed, which is recorded as the charge cycle 
. After each charge cycle, the angle formed by a new position of charging point to sink node and the last position of charging point to sink node is 
. Here, it should be noted that the charging points turn back to original position after every 
nc cycles later. For example, when 
, the position before and after turning is shown in 
Figure 2. And 
 can be expressed as:
Hence, the charging point 
 on 
 will rotate to different positions in different charging cycles, and the energy consumptions of the surrounding sensor nodes will be different. Furthermore, we set the charging time of each charging point on the same path 
 to be the same, which is recorded as 
. It satisfies:
After that, in order to recharge more energy for those nodes with higher energy consumption in the network, the charging time of them should be extended appropriately. Since the charging time on the same charging path is the same, if the average energy consumption rate of the nodes within the charging range of the charging point on 
 is faster, the charging time 
 should be higher correspondingly. The average energy consumption of all nodes charged by WCV on 
 is recorded as:
      where 
 is the initial energy of sensor node, 
 is the residual energy of node 
i, 
 is the node set charged by the WCV on 
 and 
 is the total number of elements in the set 
.
To ensure that the charging time 
 is positively correlated with the average energy consumption of the sensor nodes, the charging time of the charging points on each path satisfies:
  4. Routing Strategy
In WRSN, nodes transmit information to sink nodes in the form of multi-hop, during which the WCV provides energy for them. The charging process will change the energy state of nodes, which greatly influences their next hop selection. When the source node 
i selects the node close to it as the next hop, it is beneficial to avoid the node 
i consuming too fast. However, the choice may cause a long transmission chain from the source node to sink node, increasing the total energy consumption on the chain. Similarly, if the node 
i selects the node closer to the sink node, its own energy consumption will increase, but other network performance indicators may be optimized, such as reducing the hop count and total energy consumption. Meanwhile, as shown by Equation (8), nodes that consume energy quickly tend to obtain more charging time. For nodes with high charging efficiency, more charging time means more charging energy; thus, appropriately increasing the energy consumption speed of those nodes is beneficial to utilize the charging energy effectively. Additionally, for nodes with low charging efficiency, the energy they obtained in charging part may not be able to compensate for their energy loss in the routing process. For a long run of high consumption in routing, they will gradually decline residual energy until death. Therefore, the routing strategy needs to be adjusted according to the impact of the charging process. Different routes should be selected based on nodes’ residual energy and charging characteristic. For nodes with high charging efficiency, the energy consumption rate can be increased appropriately to improve the network performance. For nodes with low residual energy and low charging efficiency, the energy consumption should be reduced to prolong their lifetime. Based on this, this section proposes a routing strategy adaptive to the node charging characteristics, which takes the energy of node 
i, 
j and the relevant charging characteristics as the deciding factors of the routing cost 
. In this strategy, 
 can be calculated as:
      where 
, 
 are the energy of node 
i, 
j after being charged, 
 is the energy consumption of node 
i sending unit data packet to node 
j, and 
 is the forward energy consumption of node 
j. Considering from the perspective of node 
i, 
 is smaller when node 
j closer to node 
i is selected, which is beneficial to extend the life of node 
i. However, the forward energy consumption will be larger for node 
j. From the point of view of the next hop node 
j, when node 
j is closer to sink node, the forward energy consumption will be smaller. It is conducive to prolong the lifetime of node 
j. However, the long transmission distance from node 
i to node 
j will increase the energy consumption of node 
i. Therefore, the selection of the next hop node needs to adjust the two parts to achieve a balance, reducing the total energy consumption and optimizing the network performance indicators.
In WRSN, the node energy is obtained by the joint influence of node energy consumption stage and charging stage. From the previous analysis, the charging energy obtained by nodes with the same distance to the charging path can be adjusted to a basic balance by charging strategy; however, the charging efficiency of the nodes closer to the charging path is still higher than that of the nodes away from the charging path, so that more energy can be obtained by those nodes in the same charging time. For convenience of description, 
 is used to characterize how much energy node 
i gets in a certain charging time. For the node 
i between charging path 
 and charging path 
, 
 is:
      among them, 
, 
 is the distance from node 
i to sink node. Equation (10) shows that the closer node 
i is to the charging path, the smaller 
 is, then the more energy can be obtained by node 
i in the same charging time. For node 
i with a small value 
, high energy consumption can get node 
i more energy in the charging cycle to slow down its energy consumption speed; thus, increasing 
 properly will not have a great impact on the lifetime of node 
i. On the contrary, for node 
i with a large value 
, the energy obtained through charging stage may not be able to make up for high consumption, which may lead to node’s death, thus energy consumption should be reduced as far as possible. Therefore, adjusting the next hop selection through 
 can effectively improve the overall performance of the network and prolong the network lifetime. When 
, the energy consumption of node 
i in the routing phase can be increased appropriately, i.e., the 
 increases. In this case, when node 
i chooses the node 
j closer to sink node, the overall energy consumption and the number of network hops can be reduced. When 
, choosing a closer node 
j can extend the network lifetime effectively.
Equation (9) shows that when 
 is small, 
 can be adjusted to be small, and the part that affects the 
 value is mainly 
. The node 
i selecting node 
j closer to sink node is advantageous for lowering energy consumption of the node 
j and vice versa. In summary, node 
i will preferably select the node with smaller 
 as the next hop in 
:
In routing, node i selects node whose  and  are small to be the next hop, ensuring that the total distance of the route approaches the linear distance from node i to sink node, thereby reducing the route consumption. In addition, the algorithm can adaptively select the next hop node according to the characteristics of different nodes. On the one hand, selecting the node j closer to the sink node for the node i with higher residual energy and better charging character in the charging phase can reduce the overall network average hop count and enhance energy utilization. For nodes with low energy, reducing their transmission distance helps to extend network lifetime. On the other hand, appropriately increasing the energy consumption speed of nodes with better charging character is beneficial to guiding the charging phase to allocate more charging time, thereby improving the replenishing effect of the charging phase.
  5. Implementation Guidelines and Complexity Analysis
Here, we give simple implementation guidelines of our algorithm. As a data aggregation and processing center, sink node has sufficient energy and computing capacity. To lessen the burden of WCV and nodes, some tasks with high computing demands (such as charging plan formulation) can be processed in a centralized computing infrastructure by sink node. As for the routing selection, it is easy for each node to locally execute this process.
In the system, nodes send data packets containing the sensing data and their energy information to sink node, then sink node calculates the positions of charging points and all the , nodes set charged by WCV on . In the worst case, the positions of charging points can be calculated according to d and l in  steps. Moreover, considering that there will always be only nc different kinds of charging points distribution, the total complexity of the calculation of charging positions is . To determine the  and calculate the average consumption , there will be totally  steps, so the time complexity is . 
Then, the sink node sends the charging scheduling to WCV for the charging cycle. Considering the worst case where each node can be charged by  charging points, the  can be gained after  times computation. In this case, the complexity to calculate  for n nodes is .
In the routing part, node i chooses the next hop from the forward neighbor nodes set . In the worst case, the number of nodes in  is . Then, node i chooses  with the minimum value  as its next hop node. For all sensor nodes in the network, the operation needs to be repeated n times, so that the time complexity is .
As a result, we conclude that the worst computing time complexity of the whole algorithm is .