You are currently viewing a new version of our website. To view the old version click .
Sensors
  • Article
  • Open Access

Published: 6 September 2016

A Greedy Scanning Data Collection Strategy for Large-Scale Wireless Sensor Networks with a Mobile Sink

,
,
,
and
1
Department of Information and Communication Systems, Hohai University, Changzhou 213000, China
2
National Institute of Telecommunications (Inatel), Minas Gerais 30000-000, Brazil
3
The Instituto de Telecomunicações, Universidade da Beira Interior, Covilhã 6200-062, Portugal
4
The University ITMO, St. Petersburg 190000, Russia
This article belongs to the Special Issue New Paradigms in Cyber-Physical Social Sensing

Abstract

Mobile sink is widely used for data collection in wireless sensor networks. It can avoid ‘hot spot’ problems but energy consumption caused by multihop transmission is still inefficient in real-time application scenarios. In this paper, a greedy scanning data collection strategy (GSDCS) is proposed, and we focus on how to reduce routing energy consumption by shortening total length of routing paths. We propose that the mobile sink adjusts its trajectory dynamically according to the changes of network, instead of predetermined trajectory or random walk. Next, the mobile sink determines which area has more source nodes, then it moves toward this area. The benefit of GSDCS is that most source nodes are no longer needed to upload sensory data for long distances. Especially in event-driven application scenarios, when event area changes, the mobile sink could arrive at the new event area where most source nodes are located currently. Hence energy can be saved. Analytical and simulation results show that compared with existing work, our GSDCS has a better performance in specific application scenarios.

1. Introduction

With the development of wireless communication technologies, wireless sensor networks (WSNs) have been widely used in various applications, e.g., industry monitoring [,], scientific data collection [,,], target tracking [], military surveillance, and even underwater environment []. In these applications, the sensory data needs to be uploaded to the sink quickly and accurately; thus, data collection plays an important role in WSNs. In most of application scenarios, energy of sensor node is limited and can not be replenished [,,], although some charging algorithms are proposed recently [,]. Hence, it is significant to economize energy consumption in data collection. Traditional methods of data collection are to utilize static sink or base station []. These methods always lead to “hot spot” problem. Hence, mobile sinks are introduced to solve this problem. Recent researches reveal that using the mobile property of sink is more promising for energy efficient data collection. In these researches, vehicles or aircrafts, which are equipped with wireless communication device, can move to the source nodes and collect data from them directly. It results in energy consumption balanced of the whole network and the lifetime of networks prolonged. In some application scenarios, nodes in WSNs may not be fully connected because of the presence of dead nodes or obstacles. Appointing a mobile sink to move into each unconnected region to collect data is a feasible solution. In this way, more information can be collected [].
However, data collection using a mobile sink introduces new challenges for WSNs. One challenge is how to update the current location of sink with lower overheads and route data to it. The mobile sink can collect data in non-real time or real-time. In non-real time data collection, data is routed and buffered at certain nodes before it is collected by a periodically visiting mobile sink. It results in a high latency of data and the tendency of nodes’ memory overflow. In real-time data collection, the mobile sink moves around, and data is routed to the location of mobile sink, which requires nodes to obtain the current sink location []. It involves high control overheads for location updating and routing hand-offs. In this paper, we focus on real-time data collection using a mobile sink.
Another challenge is how to set a relative optimal mobility pattern for the mobile sink. There are three main mobility patterns: trajectory with dynamic adjustment, random walking and predetermined trajectory. The performance of a mobility pattern is judged by the following two parameters: energy consumption efficient, and network lifetime.
Based on the analysis mentioned above, we propose a greedy scanning data collection strategy for large-scale WSNs with a mobile sink. Our contributions are the following:
  • Sink Location Local Updating Method: In our data collection strategy, the network is divided into virtual grids. When the mobile sink moves from one grid to another, only a few of the grids need to update the location of mobile sink. Sensory data from the whole network can be routed to the sink timely.
  • Greedy Scanning Mobility Pattern: The mobile sink moves like a scanning curve in the screen of radar. According to the sum of data received from each direction, the mobile sink makes the decision to move to the area with more sensory data.
The remainder of this paper is organized as follows. Firstly, the related work of data collection schemes with a mobile sink is introduced in Section 2. The detail of GSDCS is described in Section 3. The simulation results and performance evaluations are given in Section 4. Finally, Section 5 concludes the paper.

3. Greedy Scanning Data Collection Strategy

3.1. Network Model

Data gathering strategies using mobile sink can prolong the network lifetime. However, for event area changing continuously situation, normal data gathering strategies with mobile sinks are not suitable. Especially, when the events involve animals migrating, enemy troops’ intrusion, and fire monitoring of forest, the events area appears dynamically, and regionally. Our GSDCS is designed for this situation. The typical application for this situation is animals monitoring. In this kind of application scenario, the wild animals have many group behaviors in many cases [,,]. Hence, we set our application scenario as social animals monitoring in a field. Then the event area is local, so we use a circular to represent the event area. As the animals may migrate or interest of monitoring may change, the location of circular event area may change with time. As illustrated in Figure 2, sensor nodes are deployed randomly in a rectangle area. The network is divided into grids, consists of N nodes, and one mobile sink, which collects data from the whole network. The number of virtual grid cells are M. All nodes are well connected, and the network is full covered. Sensor nodes are static and location aware (i.e., equipped with GPS-capable antennae). Each node has the same initial energy, sensing radius r s , and communication radius R. To guarantee the full cover of network, N M a 2 / π r s 2 is necessary. However, in the simulation or practical application, the N adopted is much larger than M a 2 / π r s 2 . Hence, we assume that each virtual grid cell always has sensor nodes. The mobile sink is not constrained by energy and moves without predefined trajectory. The trajectory of mobile sink is constructed dynamically according to the current status of network. The situations of obstacle or multi-path are not discussed in this paper. In our GSDCS, the notation and corresponding definition is shown in Table 2.
Figure 2. A wireless sensor network with virtual grid structure.
Table 2. Notations and definitions.
Our GSDCS is composed of three phases:
  • Network initialization: To achieve the goal of updating sink location in a local area, the network area is divided into virtual grids. Nodes in the same grid form a group. The initialization process includes the grid structure establishment, head nodes election and neighbor table establishment. The details are discussed in Section 3.2.
  • Data collection: In this phase, the way of data routing is proposed first. With the cooperation of virtual grid structure, this data routing way can deliver sensory data to the mobile sink easily. Whereafter, we talk about the moving pattern of the mobile sink with which energy consumption of network can be balanced. To minimize the energy cost in updating sink location information, the local updating with virtual grid structure is discussed at the end of this phase.
  • Head nodes re-election: The long time working of head nodes may lead to network energy being unbalanced and lifetime being decreased. To improve the performance of the network, the re-election of head nodes is discussed in Section 3.4.

3.2. Network Initialization

As described in Section 3.1, after the network deployment, the first phase is initializing the network. It includes three sub-phases, the establishment of grid-based virtual structure, the election of head nodes and the establishment of a neighbor table.

3.2.1. The Establishment of Virtual Grid Structure

As the deployment area is a rectangle, we adopt Cartesian coordinates for convenience. The origin of coordinates is located at deployment area center. The sink is static during the establishment. The process of establishing the virtual grid structure is as follows:

Step 1. Calculate the Virtual Grid Cell Side Length 

In order to guarantee that each node can communicate with all nodes in its neighbor grids (i.e., UDLR of a grid), The relation of grid cell side length a and node communication range R must satisfy a 5 R / 5 , as shown in Figure 3. In our simulation, we adopt the a as 5 R / 5 for energy efficient purpose.
Figure 3. The relation of communication radius with side length of grid cell.

Step 2. Broadcast the Sink Initial Location and Grid Cell Side Length 

After calculating the grid cell side length, the mobile sink broadcasts a H E L L O packet to all nodes, which includes the side length of grid a and the coordinate of sink. The initial location of sink is ( x 0 , y 0 ) . The H E L L O packet is shown in Figure 4.
Figure 4. Format of H E L L O packet.

Step 3. Calculate the R C N for Every Grid Cell 

To identify each grid cell and their relative position in the deployment area, we introduce R C N (Row Column Number), which is composed of row number ( M r ) and column number ( M c ), as illustrated in Figure 5. For each node i can compute the R C N it belongs to by Equations (4) and (5) based on its location ( x i , y i ) ( 1 i N ) , and the broadcast H E L L O package received in Step 1. As all nodes are static, the R C N of each grid cell is determined and unchangeable after the network is deployed.
R C N = ( M r , M c )
M r = y i y 0 a 2 a
M c = x i x 0 a 2 a
Figure 5. The row column number ( R C N ) of each virtual grid cell.

Step 4. Calculate the D N for Every Grid Cell 

The mobile sink is moving from one grid to another during the data collection. To reduce the location broadcasting scope, each grid cell calculates the relative direction to the mobile sink. We introduce the concept D N (Direction Number) for each grid cell to describe the relative direction, as shown in Figure 6. The grid cell in which mobile sink is located is marked as G S (Grid of Sink). The value of D N and its meaning are as follows:
  • The current grid cell is the G S ;
  • The current grid cell is located in the right side of column that G S belongs to;
  • The current grid cell is located in the left side of column that G S belongs to;
  • The current grid cell is right above G S ;
  • The current grid cell is right under G S ;
Figure 6. The direction number ( D N ) of each grid cell.
Each node computes its D N according to the H E L L O packet. The process of calculation is shown in Equation (6).
D N = 0 , x i x 0 a 2 & y i y 0 a 2 1 , ( x i x 0 ) > a 2 2 , ( x i x 0 ) < a 2 3 , x i x 0 a 2 & ( y i y 0 ) > a 2 4 , x i x 0 a 2 & ( y i y 0 ) < a 2
As the D N indicates the relative position between the current grid cell and the mobile sink, the D N s of some grid cells will be modified when the mobile sink moves into another grid cell.

3.2.2. The Election of Head Nodes

It will cause significant energy by broadcasting the mobile sink new location to every node, routing the sensory data to the sink hop by hop. In our GSDCS, the concept “head nod” is introduced, it has three tasks: (1) collecting sensory data from the same grid cell nodes; (2) routing the collected data and the packets from neighbor head nodes to the mobile sink; (3) maintaining the D N value of its current grid cell when the latter is moving. After the mobile sink moves into a new grid, it informs some grid cell head nodes to update their D N value.
Two parameters are used to determine which nodes can be chosen as the head nodes: (1) residual energy of each node; (2) distance between each node and center point of corresponding grid cell. The node which has more energy and shorter distance is more suitable to be a head node in the same grid cell. Here we have Equation (7). Each node broadcasts its K value in the current grid cell. The node with a minimum K is elected as the current grid cell head node.
K = D T o C e n t e r E r e s i d u a l

3.2.3. The Establishment of Neighbor Table

After each grid cell has elected its head node, these head nodes need to establish their own neighbor table. Firstly, these head nodes broadcast their R C N s, and coordinates in their communication range. As described in Section 3.2.1, all the head nodes in neighbor grids can receive the information. According to the broadcast information, neighbor head nodes can be classified into four categories, as shown in Table 3. From the neighbor table, each head node can get the neighbor head node’s coordinate in corresponding direction. For example, if a head node is in the left neighbor grid, its coordinate will be written in the column of L G .
Table 3. Neighbor table.
We assume that the R C N of a head node is ( M r , M c ) and the R C N of its a neighbor head node is ( M r , M c ) . The criterion of classification for neighbor head nodes is shown in Table 4.
Table 4. The criterion of neighbor table establishment.

3.3. Data Collection

In the phase of data collection, there involves to elements, the nodes and the mobile sink. Hence, we discuss this phase from two aspects: the data routing and the sink trajectory planning.

3.3.1. Data Routing

The purpose of data routing is to deliver sensory data packets to the mobile sink from source nodes. When source nodes sense data, they encapsulate the sensory data and R C N of the source node together into a data packet, as shown in Figure 7. These data packets are routed to the sink hop by hop among head nodes.
Figure 7. The data packet structure of sensory data.
When a node generates a data packet, it transmits the latter to its head node in the same grid cell. That means each head node collects data packets from source nodes which are firstly located in the same grid. Then, these head nodes transmit the data packets to a neighbor head node. After the neighbor head node gets the data packets, it relays the data packets to its neighbor head node until the D N of the current head node is 0. Finally, the head node transmits the data packet to the sink, and the transmission of source data is completed. By checking the D N , one head node can figure out the direction to transmission. Hence, based on the direction to transmission, the head node can locate the next hop neighbor head node from the neighbor table, as shown in Table 5.
Table 5. The way to choose a neighbor head node.
For example, if the D N of a head node A is 1, then node A checks L G column of its neighbor table (this column records the coordinate of the left neighbor head node). We assume the node recorded in L G column is node B. Node B is designated as the next hop node, and the data packet is delivered to node B. When node B receives the data packet, it also checks its D N and executives the same procedure. Finally, the data packet will reach to node X which D N is 0. This condition indicates that node X and sink are in the same grid. Hence, node X uploads the data packet directly to the sink. The routing process can be described by the flow chart in Figure 8, and we assume the head node which holds the data packet is N D P for short.
Figure 8. Flow chart of routing process.

3.3.2. Sink Trajectory Planning

In our GSDCS, the mobile sink moves from one center point of a grid cell to another, and collects sensory data. As describe in previous subsection, each node routing the sensory data to the mobile sink. Hence, GSDCS needs to address two problems for mobile sink moving: (1) mobile sink moving pattern; (2) location information local updating.
In our strategy, the moving pattern of the mobile sink is designed like scanning curve in CRT (Cathode Ray Tube) screen. That is why our strategy is called scanning strategy. The deployment area is divided into grid cells, and the mobile sink traverses along one column as one round trip, which is called one collecting period. After each collecting period, the sink switches to another column to start its next round trip based on our greedy strategy. In one collecting period, the sink moves to grid cell whose D N is 3, firstly. When the sink moves to a new grid cell, it parks in the center of the new grid cell for a while. The mobile sink may collect data when it is moving. After the mobile sink has parked in a boundary grid cell for a while, the sink moves to the opposite direction, as shown in Figure 9. When the mobile sink arrives at another boundary grid, it changes its direction again. If the mobile sink arrives at the gird cell where it starts the collecting period, the collecting period ends.
Figure 9. The trajectory of the sink in one collecting period.
Since the data packets contain the sensory data and the corresponding R C N , the mobile sink can count the total source nodes in each row, in the current column, in the current column left area, and in the current column right area, respectively. The mobile sink decides the column to move to in the next collecting period based on this counted information. The counted information is also used by the mobile sink to decide the parking time in each grid. If one row which a grid is in has more source nodes, the mobile sink will park in this grid for a longer time, dynamically.
We assume that the speed of the mobile sink is a constant v, and the longitudinal length of network is L. The time of moving in a period is T = 2 ( L a ) / v . We also assume that the total time of static state for parking is equal to the total time for moving in a collecting period. Hence, the total time of a collecting period is 2 T . In the first collecting period, the mobile sink spends the same time in each grid cell. The time sink spends in a grid consists of parking time and moving time. The total time sink spends in each grid is 2 T / L / a . When the sink has been in a non-boundary grid cell for T / L / a , it moves into the next one. After the sink has been in a boundary grid cell for 2 T / L / a , it also moves into the next one. The mobile sink maintains four kinds of counter, C o u n t e r ( i ) , C o u n t e r l e f t , C o u n t e r m i d d l e , C o u n t e r r i g h t . The i represents M r of one row. As we have mentioned above, the data packet consists of R C N , and sensory data. M r and M c of R C N give information about which row and column the data packet comes from. When the mobile sink gets one data packet, it adds the corresponding counter, respectively. The C o u n t e r ( i ) can reflect how many data packets are collected from each row. The mobile sink can deduce the direction this data packet comes from: left side columns, right side columns or the current column.
When the first collecting period is completed, the mobile sink may switch the column based on our pervious description. It also needs to adjust the parking time in each grid cell according to the collecting period counter values.
There are three choices for the sink: the left column, the current column and the right column. The mobile sink chooses the column C s = m a x ( C o u n t e r l e f t , C o u n t e r m i d d l e , C o u n t e r r i g h t ) as its next collecting period column. When two or more counter values are equal, the mobile sink stays in the original column. The moving operation is shown in Figure 10.
Figure 10. Sink moves to next column.
As mentioned above, in the first collecting period, the parking time in each grid cell is equal. However, in real environments, obstacles and external disturbance may cause the distribution of sensor node and source events density uneven. Our GSDCS takes advantage of C o u n t e r ( i ) to adjust the parking time for each gird cell in the following collecting periods. If a grid cell has a larger C o u n t e r ( i ) , the mobile sink will park in the grid for longer. Then the mobile sink calculates the ratio of the total data from one row to all rows, which can be formulated by C o u n t e r ( i ) C o u n t e r ( i ) . The adjustment the parking time of mobile sink in the grid cell of row i is shown as follows:
(1)
If the grid of row i is a non-boundary grid, then the parking time in this grid is:
t p a r k ( i ) = T × C o u n t e r ( i ) C o u n t e r ( i ) 2
The total time sink spends in this grid for a single trip is:
t ( i ) = T × C o u n t e r ( i ) C o u n t e r ( i ) 2 + a v
(2)
If the grid of row i is a boundary grid, then the parking time in this grid is:
t p a r k ( i ) = T × C o u n t e r ( i ) C o u n t e r ( i )
The total time sink spends in this grid is:
t ( i ) = T × C o u n t e r ( i ) C o u n t e r ( i ) + a v
Note, when a grid is a non-boundary grid, the mobile sink would travel through this grid twice in one collecting period. All the counters are set to zero at the beginning of a collecting period. When one collecting period completes, the mobile sink calculates the parameter t ( i ) of next period for each grid according to all the counters. The sink moving process can be described by the flow chart in Figure 11.
Figure 11. Flow chart of sink moving process.

3.3.3. Location Information Local Updating

To reduce the energy consumption for updating sink current location, our GSDCS only updates some head nodes’ D N when the mobile sink moves into a different grid. In our GSDCS, the D N is responsible for representing the relative direction of a grid to the mobile sink. Once the mobile sink moves into a new grid cell, it broadcasts a notification packet to the head nodes of corresponding grid cells for updating the D N value. There are two cases for updating the location information when the mobile sink moves into a neighbor grid cell: (1) the mobile sink moves along the column in one collecting period; (2) the mobile sink moves into another column when goes on to the next collecting period. If the mobile sink moves along the column in one collecting period, then two head nodes need to update their D N s. One head node is in the grid cell which the mobile sink used to be, the other is in the grid cell which the mobile sink parks in currently. The procedure can be described in Figure 12. The D N s of two head nodes in the marked grids are updated. That means only two nodes consume energy when the mobile sink moves to a new grid cell along the column direction.
Figure 12. The way of updating when the mobile sink moves along the column in one collecting period. (a) The mobile sink moves to the upside grid cell; (b) The head nodes of the marked grid cells are needed to be updated.
If the mobile sink moves into another column when goes on to the next collecting period, all the head nodes which are in the two columns need to update their D N s. One column is which sink used to be in. The other is the one sink parks in currently. This procedure can be described in Figure 13. The head nodes in the marked grid cells need to update their D N s. That means 16 head nodes consume energy when the mobile sink moves into another column in our example.
Figure 13. The way of updating when the mobile sink moves into another column when goes on to the next collecting period. (a) The mobile sink moves to the left neighbor column to start a new collecting period; (b) The head nodes of the marked grid cells are needed to be updated.

3.4. Head Nodes Re-Election

To achieve balanced energy consumption, the head node should be re-elected when its energy is below a certain threshold. Here we appoint E as the current energy of the head node, while E is the original energy when it is elected as head node. If the ratio of E to E is below a threshold T h , then the operation of re-election will be started. The current node informs other nodes in the same grid cell, and the latter compete for the role of head node with the new K. After a new head node is elected, it broadcasts in its communication range. The nodes in the same grid could know which one is the current head node. The relation of grid cell side length a and node communication range R must satisfy a 5 R 5 as described in Section 3.2.1, all the neighbor head nodes can receive this information. According to this broadcasting information, the neighbor head nodes are also informed about the change of head node in this grid. Then, these neighbor head nodes update their neighbor table, and the former head node transfers its D N and neighbor table to the current head node. In the re-election process, the node which is going to retire still has the duty of head node until the new elected node becomes a head node. That is to say, the old head node is still responsible for collecting and routing sensory data before the re-election process is completed.

4. Simulation and Performance Evaluation

4.1. Simulation Model

The simulation parameters and corresponding range of values are listed in Table 6. Then we talk about these parameters in detail. In our simulation, sensor nodes are deployed in a square area, and the side length of this area is 200 m. The number of nodes varies from 200 to 600. All nodes have the same communication range and initial energy. They are 75 m and 2 J respectively. The energy model in HEED [] is adopted. To receive n b b i t s at the receiver, the radio expends n b × E e l e c J . The energy consumption of transmitting n b b i t s at the sender has two cases. Here we define the distance between a sender and a receiver as d, and threshold distance of communication is d 0 . In the first case, if d < d 0 is occurred, the sender consumes n b × ( E e l e c + E f s × d 2 ) J . If d d 0 , n b × ( E e l e c + E m p × d 4 ) J is consumed. The value of E f s is 10 pJ/bit/m2, and E m p is 0.0013 pJ/bit /m4. The control message size is 25 bytes, and data packet size is 100 bytes. To simulate the uneven event source, we assume the event area is a circular area, and all the nodes in this area are appointed as source nodes. Sink moves at a constant velocity, and source nodes have a constant rate of producing sensory data. The lifetime of network is the time when the first node runs out of its energy. The performance metrics are as follows:
  • Lifetime: The time when the first node runs out of its energy.
  • Average Residual Energy: The average residual energy of all nodes when the network is ended.
  • Length of Sink’s Movement: The length of sink’s trajectory when the network is end.
  • Variance of Residual Energy: The variance of all nodes’ residual energy when the network is ended.
  • Number of Data Packets Collected: The number of data packets collected by sink when the network is ended.
Table 6. Simulation parameters.

4.2. Performance Analysis under Different Parameters

4.2.1. The Impact of Head Nodes’ Threshold T h

We first investigate the impact of head node re-election threshold on the performance of our algorithm by varying the value of T h . The parameters we set are shown as follow: (1) the number of all nodes is 500; (2) r e is 30 m; (3) the event area changes by period, and the period is 800 s; (4) one source node produces one data packet in one second; (5) the velocity of sink is 5 m/s; (6) T h can be 10 % , 20 % , 30 % , 40 % , 50 % , 60 % , 70 % , 80 % and 90 % . As we mentioned in Section 3.4, E as the current energy of the head node, while E is the original energy when it is elected as head node. If the ratio of E to E is below a threshold T h , then head nodes will be re-elected.
In Figure 14, (1) black line represents the impact of T h on the lifetime of network. The network can obtain a relatively long lifetime when T h increased from 20 % to 70 % . A lifetime achieves the biggest value when T h is 70 % . When T h is too low or too high, network dies very soon. According to the definition of T h , head nodes with a small T h consumes more energy when they retire. That means these retired head nodes have little energy while other nodes have much energy. These retired head nodes are easy to die. Network dies very soon with a small T h . In the opposite condition, if T h has a big value, re-election will be carried out frequently. As the re-election process consumes energy, hence energy is consumed fast. Then network also dies soon. When T h is in the middle area, the network has a relative balanced condition, which leads to a longer network lifetime.
Figure 14. The impact of T h on the network performance.
(2) Red line represents the impact of T h on the average residual energy. When T h is 10 % , the average residual energy is very high. As the analysis in (1), the end of network is just due to that some retired head nodes die soon. hence many nodes haven’t consumed much energy. So the residual energy is high. When T h increases, residual energy decreases with longer lifetime. When T h reaches 80 % , re-election is carried out frequently, and residual energy increases with a decreased lifetime. When T h is 90 % , much energy is spent on re-election process. The residual energy decreases although this condition has a short lifetime. The lower residual energy is, the better network runs. When T h is 70 % , the residual energy is lowest.
(3) Blue line represents the impact of T h on variance of residual energy. The variance can reflect the balance of all nodes’ energy consumption. The network is more balanced when the variance is lower. At first, the variance is very low, that is because retired head nodes die so fast. When the first node dies, most nodes consume little energy. However, with the increase of T h , such as 20 % , more nodes consume much energy while others consume little. The peak value is occurred in 20 % . The variance is decreased from 20 % to 90 % . That is because the frequent re-election, and head nodes do not need to retire until much energy is consumed.
According to (1), (2) and (3), network can get the longest lifetime, the lowest average residual energy and a very low variance of residual energy when 70 % is adopted.

4.2.2. The Impact of Event Area’s Radius r e

We study the impact of event area’s radius r e on the network performance. The event area’s radius increases from 20 m to 60 m. The parameters we set are shown as follows: (1) the number of all nodes is 500; (2) T h is 70 % ; (3) The event area changes by period, and the period is 800 s; (4) one source node produces one data packet in one second; (5) the velocity of sink is 5 m/s. The performance metrics here are network’s lifetime, residual energy’s variance and number of data packets collected.
In Figure 15, black line represents the impact of r e on lifetime of network. The network’s lifetime decreases by radius increasing. The network’s lifetime decreases fast firstly. Then speed of decrease becomes slow after that. As radius becomes larger, more nodes become source nodes. That means sink can collect more data packets in the same time. However, the lifetime of network is decreasing. The blue line represents the whole number of data collected in lifetime. We can find that the number of data packets collected is also decreased, and it decreases fast from 40 to 50. We can study the balance condition of network from red line. We can find that the red line is generally on the rise. Hence the network is balanced well when the radius is 20.
Figure 15. The impact of r e on the network performance.
According to the three performance metrics, we can find that our algorithm can have a better performance with a small event area.

4.2.3. The Impact of Period of Changing Event Area

In this section, we study the impact of period of changing event area. As we defined in Section 3.3.2, the time sink spend to complete a column running is 2 T , so we appoint 2 T as a unit to study this topic. We call 2 T as column period. The parameters we set is shown as follow: (1) the number of all nodes is 500; (2) r e is 30 m; (3) T h is 70 % ; (4) one source node produces one data packet in one second; (5) the velocity of sink is 5 m/s. The period of changing varies from 2 to 16 column period.
In Figure 16, the black line represents the lifetime of the network. The network’s lifetime increases firstly, and it gets the peak value at 10. Then the network’s lifetime decreases. When the period of changing is 2 column period, the lifetime is the shortest. The blue line represents the number of data collected. The blue line is almost coincident with the black line. The red line represents variance of residual energy. Before 14 column period, the variance is relatively stable and small. When changing period is greater than or equal to 14 column period, variance increases fast. As we introduced in Section 3.3.2, sink has a trend to move to the area with more source data. The sink also has the trend to stay for longer in this area. Hence sink moves to event area in the experiment. According to the relationship between grid length and communication radius, our virtual structure has 6 × 6 grids. So when changing period adopts 2 column period, sink always can not reach the event area before event area changes. Energy couldn’t be saved and network dies soon. By the increasing of changing period, sink has sufficient time to reach event area and stays for more time in the event area. Then much energy can be saved and network works longer. We can find that network’s lifetime is longest when 10 column period is adopted. When changing period is greater than 10 period column, lifetime of the network decreases quickly. In this situation, event area stays in someplace for long time, and sink will always travel in the columns with source nodes. Although much energy is saved in overall, the nodes in the same columns with source nodes consume much energy. The nodes in other columns consume almost no energy. The network’s lifetime becomes shorter, and the variance of residual energy becomes higher. According to this experiment, the network can get a good performance when 10 column period is adopted.
Figure 16. The impact of period of changing event area on the network performance.

4.2.4. The Impact of Velocity of Source Data

In this section, we study the impact of velocity of source data on the performance of our algorithm. The parameters we set are shown as follows: (1) the number of all nodes is 500; (2) r e is 30 m; (3) T h is 70 % ; (4) the event area changes by period, and the period is 800 s; (5) the velocity of sink is 5 m/s. The number of data packets source nodes produce varies from 0 . 2 to 1 . 8 in one second.
In Figure 17, the black line represents network’s lifetime. The network’s lifetime decreases with the increasing of velocity of source data. When velocity is less than or equal to 0 . 8 , network’s lifetime decreases fast. When velocity is greater than 0 . 8 , network’s lifetime decreases very slowly. The blue line represents the number of data packets collected. The number is decreased in the mass. However, the number gets the peak value when velocity is 1 . 0 . That means sink gets the most data although the lifetime of this situation is short. The red line represents the variance of residual energy. The red line is increased in the mass, so the network could be more balanced if the velocity is smaller.
Figure 17. The impact of velocity of source data on the network performance.
According to the experiment results and analysis above, network can achieve a better performance if velocity is smaller in the mass. There is a special case in it. When velocity adopts 1.0, sink can collect the most data although the lifetime is short.

4.2.5. The Impact of Velocity of Sink

In this section, we study the impact of velocity of sink on the performance of network. The parameters we set are shown as follows: (1) the number of all nodes is 500; (2) r e is 30 m; (3) T h is 70 % ; (4) the event area changes by period, and the period is 800 s; (5) one source node produces one data packet in one second. The velocity of sink varies from 2 m/s to 16 m/s. The result of experiment is shown in Figure 18.
Figure 18. The impact of velocity of sink on the network performance.
The black line represents network’s lifetime. Network’s lifetime increases fast at first, and it gets a peak value at 8 m/s. Then network’s lifetime decreases slowly. In our algorithm, if sink moves slowly, then nodes around sink may consume much energy. Hot spot problem will occur. If sink moves fast, the network needs to update frequently. The operation of updating also consumes energy, and the energy consumed of updating is much less than energy consumed of transmitting data, thus there will be a point where the most balanced situation is achieved, and 8 m/s is the point. Network’s lifetime increases fast from 2 m/s to 8 m/s. That is because when sink moves faster, the energy consumption will not be always concentrated in a region. Hence network is becoming more and more balanced, and lifetimes becomes longer. When more than 8 m/s, there is a slight decline. That is the impact of too frequent updating operation. The red line represents average residual energy. Corresponding with the black line, average residual energy is low when network’s lifetime is high. The blue line represents the variance of residual energy, and it is decreased in the mass. That means the network is more balanced when sink moves faster.
According to the data obtained from this experiment, the network gets the best performance at 8 m/s, and the performance of network is relatively good when velocity of sink is greater than 8 m/s.

4.3. Comparison with VGDRA

In this section, we compare the performance of proposed strategy with VGDRA []. VGDRA also adopts virtual grid structure and mobile sink, but sink’s trajectory is fixed. We compare the two algorithms from three parts. Part 1: The source nodes are distributed unevenly in a small area of network, as we can see in Figure 19a; Part 2: The source nodes are evenly distributed in the network, as we can see in Figure 19b; Part 3: Only the energy consumption of updating routes is considered. In the Figure 19, the red lines indicate the routing path. The dots with green border are source nodes. The red star is the mobile sink.
Figure 19. Two application scenarios (a) Source nodes are distributed in a local area; (b) Source nodes are evenly distributed.

4.3.1. Source Nodes Are Distributed Unevenly in the Network

In this part, the application scenario in which source nodes are distributed in a small area of network is set. The parameters we set is shown as follow: (1) the number of nodes varies from 200 to 600; (2) r e is 30 m; (3) T h is 70 % ; (4) the event area changes by period, and the period is 800 s; (5) one source node produces one data packet in one second; (6) the velocity of sink is 5 m/s. Figure 20a shows the impact of nodes’ number on the lifetime of network. As expected, the lifetime of our strategy is far greater than VGDRA’s. That is because the sink in our strategy can move toward the event area. However, in VGDRA, sink always moves along the fixed trajectory no matter where the event area is. The network of our strategy gets the maximum value when nodes number is 500. With the increase of nodes’ number, the lifetime of VGDRA is almost unchanged. Hence in this application scenario, the number of nodes has little impact on VGDRA’s lifetime. Figure 20b shows the impact of nodes’ number on average residual energy. Our strategy’s average residual energy is lower than VGDRA’s. Our strategy can make the network have a higher use ratio in this application scenario. Figure 20c shows the impact of nodes’ number on residual energy variance. Our strategy’s variance is little higher than VGDRA’s. According to the data of VGDRA’s network lifetime, VGDRA’s lower variance is due to that most nodes have not started working yet when the first node runs out of its energy. So our strategy has a better performance in this application scenario.
Figure 20. The comparison with VGDRA when source nodes are distributed unevenly. (a) Lifetime of network vs. the number of nodes; (b) Average residual energy vs. the number of nodes; (c) Variance of residual energy vs. the number of nodes.

4.3.2. Source Nodes Are Distributed Evenly in the Network

In this application scenario, source nodes are evenly distributed. Each place has the same probability of source node. The parameters we set is shown as follow: (1) the number of nodes varies from 200 to 600; (2) the source nodes to all the nodes ratio is 20 % ; (3) T h is 70 % ; (4) the event area changes by period, and the period is 800 s; (5) one source node produces one data packet in one second; (6) the velocity of sink is 5 m/s. Figure 21a shows the impact of nodes’ number on the lifetime of network. Figure 21b shows the impact of node’ number on the average residual energy. In our strategy, much energy hasn’t be used when network stops to work. The residual energy is wasted. From this perspective, VGDRA has a better performance. Figure 21c shows the impact of nodes’ number on the residual energy variance. We can find that our GSDCS’s variance is higher than VGDRA’s. When our strategy is used in this application scenario, the number of data packets sink collected from left side and right side may be almost the same. Then sink may always moves in the middle columns of network. Nodes in these middle columns may have “hot spot” problem. However in VGDRA, sink always moves along the boundary of network, so VGDRA can make network balanced. So VGDRA can work better in this application scenario.
Figure 21. The comparison with VGDRA when source nodes are evenly distributed. (a) Lifetime of network vs. the number of nodes; (b) Average residual energy vs. the number of nodes; (c) Variance of residual energy vs. the number of nodes.

4.3.3. Only the Energy Consumption of Updating Routes Is Considered

In this part, we study the cost of updating routes. We only deduct the energy consumption of updating routes. Here we adopt the initial energy in [], and the initial energy is 1 mJ. As the distribution of source nodes has impact on results, so we talk separately. We use the length of sink moving as performance metric. That means we count how long sink has moved before the first node dies. Figure 22, shows the result of experiment. The blue line represents the length sink moves in VGDRA. The blue line is the lowest, and the length is almost unchanged with the increasing of nodes. The red line is the case of our GSDCS in an evenly distribution application scenario. The red line is higher than blue line, and the red line is increasing. The black line represents the lifetime of our GSDCS in the application scenario where source nodes are distributed in a small area of network. The black line is the highest, and it is increasing in mass. Hence, the cost of updating routes in our strategy is far below VGDRA’s.
Figure 22. The comparison with VGDRA when only the updating energy is considered.

5. Conclusions

In this paper, we have discussed the strategy of data collection with a mobile sink which was named GSDCS. We first proposed a virtual grid structure to assist data collection. Each grid cell was given R C N (row column number) and D N (direction number). Based on the structure, a way of data routing was proposed later. Combined with R C N and D N , data packets could be routed to sink easily. Then we proposed the trajectory planning of mobile sink. In the sink trajectory planning, sink has a tendency to move to the event area where most source nodes are located. By doing so, the total length of routing paths is shortened. Finally, to make the network more balanced, we proposed a way to re-elect the head node in each virtual grid cell. Compared with VGDRA in the simulation, GSDCS has a better performance in specific application scenarios.

Acknowledgments

The work is supported by “Qing Lan Project” and “the National Natural Science Foundation of China under Grant 61572172, 61602152 and 61601169”, supported by “the Fundamental Research Funds for the Central Universities, No.2016B10714 and No.2016B03114”, supported by “Science & Technology Pillar Program (Social development) of Changzhou Science and Technology Bureau, No.CE20165023”, and supported by “Finep, with resources from Funttel, Grant No. 01.14.0231.00, under the Radiocommunication Reference Center (Centro de Referência em Radiocomunicações - CRR) project of the National Institute of Telecommunications (Instituto Nacional de Telecomunicações - Inatel), Brazil, by National Funding from the FCT - Fundação para a Ciência e a Tecnologia through the UID/EEA/50008/2013 Project, by Government of Russian Federation, Grant 074-U01".

Author Contributions

Chuan Zhu researched the literature, conceived of the study concepts, and drafted the manuscript; Sai Zhang and Guangjie Han carried out the simulation and analyzed the simulation results; Jinfang Jiang and Joel J.P.C Rodrigues provided useful suggestions and helped revise the manuscript. All authors read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Quang, P.T.A.; Kim, D.S. Enhancing real-time delivery of gradient routing for industrial wireless sensor networks. IEEE Trans. Ind. Inform. 2012, 8, 61–68. [Google Scholar] [CrossRef]
  2. Han, G.; Liu, L.; Jiang, J.; Shu, L.; Hancke, G. Analysis of energy-efficient connected target coverage algorithms for industrial wireless sensor networks. IEEE Trans. Ind. Inform. 2016. [Google Scholar] [CrossRef]
  3. Zhao, M.; Yang, Y. Bounded relay hop mobile data gathering in wireless sensor networks. IEEE Trans. Comput. 2012, 61, 265–277. [Google Scholar] [CrossRef]
  4. Tseng, Y.C.; Wu, F.J.; Lai, W.T. Opportunistic data collection for disconnected wireless sensor networks by mobile mules. Ad Hoc Netw. 2013, 11, 1150–1164. [Google Scholar] [CrossRef]
  5. Dong, M.; Ota, K.; Liu, A. RMER: Reliable and Energy Efficient Data Collection for Large-scale Wireless Sensor Networks. IEEE Internet Things J. 2016, 3, 511–519. [Google Scholar] [CrossRef]
  6. Han, G.; Jiang, J.; Zhang, C.; Duong, T.; Guizani, M.; Karagiannidis, G. A Survey on Mobile Anchor Node Assisted Localization in Wireless Sensor Networks. IEEE Commun. Surv. Tutor. 2016, 18, 2220–2243. [Google Scholar] [CrossRef]
  7. Shen, J.; Tan, H.W.; Wang, J.; Wang, J.W.; Lee, S.Y. A novel routing protocol providing good transmission reliability in underwater sensor networks. J. Internet Technol. 2015, 16, 171–178. [Google Scholar]
  8. Xie, S.; Wang, Y. Construction of tree network with limited delivery latency in homogeneous wireless sensor networks. Wireless Pers. Commun. 2014, 78, 231–246. [Google Scholar] [CrossRef]
  9. Zhang, Y.; Sun, X.; Wang, B. Efficient Algorithm for K-Barrier Coverage Based on Integer Linear Programming. China Commun. 2016, 13, 16–23. [Google Scholar] [CrossRef]
  10. Guo, P.; Wang, J.; Li, B.; Lee, S.Y. A Variable Threshold-value Authentication Architecture for Wireless Mesh Networks. J. Internet Technol. 2014, 15, 929–936. [Google Scholar]
  11. Han, G.; Dong, Y.; Guo, H.; Shu, L.; Wu, D. Cross-layer optimized routing in wireless sensor networks with duty cycle and energy harvesting. Wirel. Commun. Mob. Comput. 2015, 15, 1957–1981. [Google Scholar] [CrossRef]
  12. Han, G.; Qian, A.; Jiang, J.; Sun, N.; Liu, L. A grid-based joint routing and charging algorithm for industrial wireless rechargeable sensor networks. Comput. Netw. 2016, 101, 19–28. [Google Scholar] [CrossRef]
  13. Dong, M.; Ota, K.; Yang, L. T.; Liu, A.; Guo, M. LSCD: A Low-Storage Clone Detection Protocol for Cyber-Physical Systems. IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst. 2016, 35, 712–723. [Google Scholar] [CrossRef]
  14. Zhu, C.; Zhang, H.; Han, G.; Shu, L.; Rodrigues, J.J. BTDGS: Binary-Tree based Data Gathering Scheme with Mobile Sink for Wireless Multimedia Sensor Networks. Mob. Netw. Appl. 2015, 20, 604–622. [Google Scholar] [CrossRef]
  15. Rao, J.; Biswas, S. Analyzing multi-hop routing feasibility for sensor data harvesting using mobile sinks. J. Parallel Distrib. Comput. 2012, 72, 764–777. [Google Scholar] [CrossRef]
  16. Salarian, H.; Chin, K.W.; Naghdy, F. An energy-efficient mobile-sink path selection strategy for wireless sensor networks. IEEE Trans. Veh. Technol. 2014, 63, 2407–2419. [Google Scholar] [CrossRef]
  17. Zhu, C.; Wu, S.; Han, G.; Shu, L.; Wu, H. A Tree-Cluster-Based Data-Gathering Algorithm for Industrial WSNs With a Mobile Sink. IEEE Access 2015, 3, 381–396. [Google Scholar] [CrossRef]
  18. Ma, M.; Yang, Y.; Zhao, M. Tour planning for mobile data-gathering mechanisms in wireless sensor networks. IEEE Trans. Veh. Technol. 2013, 62, 1472–1483. [Google Scholar] [CrossRef]
  19. Zhao, M.; Yang, Y.; Wang, C. Mobile data gathering with load balanced clustering and dual data uploading in wireless sensor networks. IEEE Trans. Mob. Comput. 2015, 14, 770–785. [Google Scholar] [CrossRef]
  20. Kinalis, A.; Nikoletseas, S.; Patroumpa, D.; Rolim, J. Biased sink mobility with adaptive stop times for low latency data collection in sensor networks. Inf. Fusion 2014, 15, 56–63. [Google Scholar] [CrossRef]
  21. Kumar, A.K.; Sivalingam, K.M.; Kumar, A. On reducing delay in mobile data collection based wireless sensor networks. Wirel. Netw. 2013, 19, 285–299. [Google Scholar] [CrossRef]
  22. Tashtarian, F.; Moghaddam, M.H.Y.; Sohraby, K.; Effati, S. On Maximizing the Lifetime of Wireless Sensor Networks in Event-Driven Applications With Mobile Sinks. IEEE Trans. Veh. Technol. 2015, 64, 3177–3189. [Google Scholar] [CrossRef]
  23. Tashtarian, F.; Moghaddam, M.Y.; Sohraby, K.; Effati, S. ODT: Optimal deadline-based trajectory for mobile sinks in WSN: A decision tree and dynamic programming approach. Comput. Netw. 2015, 77, 128–143. [Google Scholar] [CrossRef]
  24. Khan, A.W.; Abdullah, A.H.; Razzaque, M.A.; Bangash, J.I. VGDRA: A virtual grid-based dynamic routes adjustment scheme for mobile sink-based wireless sensor networks. IEEE Sens. J. 2015, 15, 526–534. [Google Scholar] [CrossRef]
  25. Shin, K.; Kim, S. Predictive routing for mobile sinks in wireless sensor networks: A milestone-based approach. J. Supercomput. 2012, 62, 1519–1536. [Google Scholar] [CrossRef]
  26. Luo, H.; Ye, F.; Cheng, J.; Lu, S.; Zhang, L. TTDD: Two-tier data dissemination in large-scale wireless sensor networks. Wirel. Netw. 2005, 11, 161–175. [Google Scholar] [CrossRef]
  27. Tunca, C.; Isik, S.; Donmez, M.Y.; Ersoy, C. Ring routing: An energy-efficient routing protocol for wireless sensor networks with a mobile sink. IEEE Trans. Mob. Comput. 2015, 14, 1947–1960. [Google Scholar] [CrossRef]
  28. Chen, X.; Xu, M. A geographical cellular-like architecture for wireless sensor networks. In Proceedings of the First International Conference on Mobile Ad-Hoc and Sensor Networks, Wuhan, China, 13–15 December 2005; pp. 249–258.
  29. Chen, T.S.; Tsai, H.W.; Chang, Y.H.; Chen, T.C. Geographic convergecast using mobile sink in wireless sensor networks. Comput. Commun. 2013, 36, 445–458. [Google Scholar] [CrossRef]
  30. Han, S.W.; Jeong, I.S.; Kang, S.H. Low latency and energy efficient routing tree for wireless sensor networks with multiple mobile sinks. J. Netw. Comput. Appl. 2013, 36, 156–166. [Google Scholar] [CrossRef]
  31. Ahmadi, M.; He, L.; Pan, J.; Xu, J. A partition-based data collection scheme for wireless sensor networks with a mobile sink. In Proceedings of the 2012 IEEE International Conference on Communications (ICC), Ottawa, ON, Canada, 10–15 June 2012; pp. 503–507.
  32. Liu, X.; Zhao, H.; Yang, X.; Li, X. SinkTrail: A proactive data reporting protocol for wireless sensor networks. IEEE Trans. Comput. 2013, 62, 151–162. [Google Scholar] [CrossRef]
  33. Mir, Z.H.; Ko, Y.B. A quadtree-based hierarchical data dissemination for mobile sensor networks. Telecommun. Syst. 2007, 36, 117–128. [Google Scholar] [CrossRef]
  34. Shi, L.; Yao, Z.; Zhang, B.; Li, C.; Ma, J. An efficient distributed routing protocol for wireless sensor networks with mobile sinks. Int. J. Commun. Syst. 2015, 28, 1789–1804. [Google Scholar] [CrossRef]
  35. Shin, J.H.; Park, D. A virtual infrastructure for large-scale wireless sensor networks. Comput. Commun. 2007, 30, 2853–2866. [Google Scholar] [CrossRef]
  36. Heinzelman, W.B.; Chandrakasan, A.P.; Balakrishnan, H. An application-specific protocol architecture for wireless microsensor networks. IEEE Trans. Wirel. Commun. 2002, 1, 660–670. [Google Scholar] [CrossRef]
  37. Manjeshwar, A.; Agrawal, D.P. TEEN: A Routing Protocol for Enhanced Efficiency in Wireless Sensor Networks. In Proceedings of the 15th International Parallel and Distributed Processing Symposium, San Francisco, CA, USA, 23–27 April 2000.
  38. Shi, G.; Zheng, J.; Yang, J.; Zhao, Z. Double-blind data discovery using double cross for large-scale wireless sensor networks with mobile sinks. IEEE Trans. Veh. Technol. 2012, 61, 2294–2304. [Google Scholar]
  39. Yuan, X.X.; Zhang, R.H. An energy-efficient mobile sink routing algorithm for wireless sensor networks. In Proceedings of the 7th International Conference on Wireless Communications, Networking and Mobile Computing (WiCOM), Wuhan, China, 23–25 September 2011; pp. 1–4.
  40. Zhao, H.; Guo, S.; Wang, X.; Wang, F. Energy-efficient topology control algorithm for maximizing network lifetime in wireless sensor networks with mobile sink. Appl. Soft. Comput. 2015, 34, 539–550. [Google Scholar] [CrossRef]
  41. Mottaghi, S.; Zahabi, M.R. Optimizing LEACH clustering algorithm with mobile sink and rendezvous nodes. AEU—Int. J. Electron. Commun. 2015, 69, 507–514. [Google Scholar] [CrossRef]
  42. Konstantopoulos, C.; Pantziou, G.; Gavalas, D.; Mpitziopoulos, A.; Mamalis, B. A rendezvous-based approach enabling energy-efficient sensory data collection with mobile sinks. IEEE Trans. Parallel Distrib. Syst. 2012, 23, 809–817. [Google Scholar] [CrossRef]
  43. He, L.; Pan, J.; Xu, J. A progressive approach to reducing data collection latency in wireless sensor networks with mobile elements. IEEE Trans. Mob. Comput. 2013, 12, 1308–1320. [Google Scholar] [CrossRef]
  44. Handcock, R.N.; Swain, D.L.; Bishop-Hurley, G.J.; Patison, K.P.; Wark, T.; Valencia, P. Monitoring animal behaviour and environmental interactions using wireless sensor networks, GPS collars and satellite remote sensing. Sensors 2009, 9, 3586–3603. [Google Scholar] [CrossRef] [PubMed]
  45. Akbas, M.I.; Brust, M.R.; Ribeiro, C.H.; Turgut, D. fAPEbook-animal social life monitoring with wireless sensor and actor networks. In Proceedings of the 2011 IEEE Global Telecommunications Conference, Houston, TX, USA, 5–9 December 2011; pp. 1–5.
  46. Mainwaring, A.; Culler, D.; Polastre, J.; Szewczyk, R.; Anderson, J. Wireless sensor networks for habitat monitoring. In Proceedings of the 1st ACM international workshop on Wireless sensor networks and applications, Atlanta, GA, USA, 28 September 2002; pp. 88–97.
  47. Younis, O.; Fahmy, S. HEED: A hybrid, energy-efficient, distributed clustering approach for ad hoc sensor networks. IEEE Trans. Mob. Comput. 2004, 3, 366–379. [Google Scholar] [CrossRef]

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.