Open Access
This article is
 freely available
 reusable
Sensors 2018, 18(8), 2487; doi:10.3390/s18082487
Article
A Collaborative Data Collection Scheme Based on Optimal Clustering for Wireless Sensor Networks
^{1}
School of Computer Science and Engineering, Northeastern University, Shenyang 110819, China
^{2}
Laboratory of Language Engineering and Computing, Guangdong University of Foreign Studies, Guangzhou 510006, China
^{3}
School of Information Science and Technology, Guangdong University of Foreign Studies, Guangzhou 510006, China
^{4}
School of Software, University of Technology Sydney, Sydney 2007, Australia
^{5}
School of Information Science and Engineering, Central South University, Changsha 410083, China
^{6}
College of Information Science and Engineering, Jishou University, Jishou 416000, China
^{*}
Author to whom correspondence should be addressed.
Received: 25 June 2018 / Accepted: 26 July 2018 / Published: 1 August 2018
Abstract
:In recent years, energyefficient data collection has evolved into the core problem in the resourceconstrained Wireless Sensor Networks (WSNs). Different from existing data collection models in WSNs, we propose a collaborative data collection scheme based on optimal clustering to collect the sensed data in an energyefficient and loadbalanced manner. After dividing the data collection process into the intracluster data collection step and the intercluster data collection step, we model the optimal clustering problem as a separable convex optimization problem and solve it to obtain the analytical solutions of the optimal clustering size and the optimal data transmission radius. Then, we design a Cluster Heads (CHs)linking algorithm based on the pseudo Hilbert curve to build a CH chain with the goal of collecting the compressed sensed data among CHs in an accumulative way. Furthermore, we also design a distributed clusterconstructing algorithm to construct the clusters around the virtual CHs in a distributed manner. The experimental results show that the proposed method not only reduces the total energy consumption and prolongs the network lifetime, but also effectively balances the distribution of energy consumption among CHs. By comparing it o the existing compressionbased and noncompressionbased data collection schemes, the average reductions of energy consumption are 17.9% and 67.9%, respectively. Furthermore, the average network lifetime extends no less than 20times under the same comparison.
Keywords:
wireless sensor networks; data collection; compressed sensing; clustering; optimization1. Introduction
Nowadays, wireless sensor networks have emerged as a powerful and lowcost platform for connecting the physical world to the digital world [1]. By deploying numerous tiny and inexpensive sensor nodes in the surveillance area, we can monitor the surrounding environment through periodically collecting the sensed data, which are transmitted back to the sink in a wireless and multihop way. Currently, WSNs have found applications in various scenarios, such as environment monitoring, industrial automation, precision agriculture, smart homes, structural health monitoring, military surveillance, and so on [2].
However, the computational power and the energy resources of wireless sensor nodes are strictly constrained. Compared to the data sampling and the data computation, the data communication usually consumes a large portion of energy in wireless sensor nodes. For example, the statistics in [3] demonstrate that the energy consumption of the data communication occupies nearly 86% of the total energy consumption. Therefore, how to reduce the energy consumption of the data communication with constrained computational support is the key problem in wireless sensor networks. Although numerous data collection schemes have been proposed in the literature, the energy efficiency of the periodical data collection and the lifetime of the network still need to be improved [4].
The emerging theory of Compressed Sensing (CS) provides a new paradigm for data collection in WSNs [5]. The compressibility, computational asymmetry, robustness and stability of the compressed sensing theory have made it very suitable for resourceconstrained wireless sensor networks. However, it would not be of much benefit to simply apply the compressed sensing theory to data collection in WSNs without considering their features and network structure [6]. Although the treebased data collection structure can reduce the amount of data transmission, the fault tolerance and the balance of energy consumption are ruined at the same time. Since the clusterbased data collection structure presents many advantages over the flat structure and the treebased structure [7,8], the CSbased clustered data collection schemes were studied comprehensively in the literature [9,10,11,12,13,14]. By carefully examining these schemes, we identify the following technical challenges that are still posed in the data collection problem of WSNs.
Firstly, how does one determine the optimal cluster size? The number of cluster members has an important influence on the balance of intercluster and intracluster energy consumption. Whether the cluster head or the cluster member runs out of energy, the data collection in wireless sensor networks is terminated. Secondly, how does one collect the compressed sensed data among cluster heads in an energyefficient and loadbalanced manner? The existing backbone routing tree gives rise to the unbalanced energy consumption in cluster heads. The larger the number of child CHs, the more energy the cluster head consumes. Thirdly, how does one choose the cluster heads and construct the corresponding cluster structure in a distributed manner? In distributed autonomous WSNs, it is usually too costly or even impossible to acquire the specific localizations of all sensor nodes. Thus, only the distributed cluster heads’ election and cluster constructing method is feasible in the real scenario.
Aiming at answering the above questions, we propose a collaborative data collection scheme based on optimal clustering for periodical data collection in WSN. First of all, we divide the data collection process into the intracluster data collection step and the intercluster data collection step. After evaluating the intracluster energy consumption and the intercluster energy consumption individually, we model the optimal clustering problem as a separable convex optimization problem and solve it to obtain the analytical solutions of the optimal clustering size and the optimal data transmission radius. Then, we propose a cluster headslinking algorithm based on the pseudo Hilbert curve to collect the compressed sensed data among cluster heads in an accumulative way. In addition, a distributed clusterconstructing algorithm is also proposed to construct the clusters around virtual cluster heads in a distributed manner. In summary, the main contributions of this paper are listed as follows.
 (1)
 We model the optimal clustering problem as a separable convex optimization problem and solve it analytically to obtain the optimal clustering size and the optimal transmission radius.
 (2)
 We design a cluster headslinking algorithm based on the pseudo Hilbert curve to collect the compressed sensed data among cluster heads in a collaborative and accumulative manner.
 (3)
 We design a distributed clusterconstructing algorithm to construct the intercluster data collection structure around virtual cluster heads in a wireless sensor network.
The remainder of this paper is organized as follows. In Section 2, we summarize the existing CSbased data collection schemes in WSNs. In Section 3, we model the optimal clustering problem and solve it analytically to obtain the optimal network parameters. In Section 4, we present the cluster headslinking algorithm based on the pseudo Hilbert curve and the distributed clusterconstructing algorithm to construct the underlying data collection structure. In Section 5, we compare and analyse the experiment results of our proposed scheme with other stateoftheart data collection schemes. Finally, we conclude this paper, as well as discuss our future research in Section 6.
2. Related Work
In recent years, the application of compressed sensing theory to data collection in wireless sensor networks has been receiving increasing attention. Bajwa et al. were the first to apply the compressed sensing theory to data collection in wireless sensor networks [15]. They proposed a joint sourcechannel communication architecture and analysed the relationships among power consumption, distortion and latency for WSNs. However, only singlehop communication was considered, which is not suitable for largescale wireless sensor networks.
For multihop WSNs, Luo et al. proposed the Compressive Data Gathering (CDG) scheme, which was the first complete design to apply the compressed sensing theory to largescale sensor data collection [16]. After having received the weighted sums of the sensed data from its previous node in the data collection route, each sensor node multiplied its own sensed data with the corresponding coefficients of the measurement matrix and sent the newlycomputed weighted sums to its next neighbour node. Other researchers argued that it would not be of much benefit to apply the compressed sensing theory to the data collection problem in WSNs naively [6]. Hence, they proposed a data collection scheme based on hybrid compressed sensing, which treats sensor nodes differently according to their levels in the data collection tree. Furthermore, Xiang et al. formulated the CSbased data collection process as an integer programming problem and solved it with a greedy heuristic algorithm in [17]. Based on the theory of the expander graph and compressed sensing, Zheng et al. constructed a random matrix with a nonuniform distribution and proposed a corresponding random walk routing algorithm to collect the sensed data in WSNs [18].
The characteristics of the cluster structure, such as fault tolerance and traffic loads’ balance, make the CSbased clustered data collection schemes present competitive advantages over other type of data collection schemes. Aiming at minimizing the overall innetwork communication and resolving the energy hole problem, Singh et al. designed two data collection schemes based on the compressed sensing theory and the hierarchical clustered structure in [9,10]. In their schemes, sensor nodes carried out different kinds of compression operations according to their levels in the hierarchical structure. In a similar hierarchical data compression scenario, Lan et al. proposed a CompressibilityBased Clustering Algorithm (CBCA), which compresses the sensed data based on the spatial correlation of the readings in the cluster members [11]. In [12], Qiao et al. proposed a compressed data gathering method based on even projection to solve the problem of unstable and unbalanced projection nodes in WSNs with either uniformly or unevenlydistributed sensor nodes. Zhang et al. substituted the widelyadopted discrete cosine transform basis with the newlyproposed treelet transform basis to sparsify the sensed data and designed the Treeletbased Clustered Compressive Data Aggregation (TCCDA) scheme in [13]. Furthermore, the morphological watershed transform was also utilized by Hammoudeh et al. in [19] to gather the sensed data in a WSN with a specified level of accuracy in a timely and powerefficient approach.
Furthermore, the spatial and temporal correlation of sensed data can be utilized simultaneously to further improve the data compression ratio of CSbased data collection schemes. In [20], Quan et al. proposed a neighbouraided compressive datagathering scheme to collect the spatially and temporallycorrelated sensed data by integrating the structured random matrix with the Kronecker compressed sensing model. Based on the theory of TwoDimensional Compressive Sensing (2DCS), Wang et al. proposed a new data collection scheme, CS${}^{2}$ collector, to exploit both the spatial and temporal sparsity of the sensed data in WSNs at the same time [3]. Recently, Matrix Completion (MC) and Matrix Approximation (MA), which can be seen as a natural extension to the compressed sensing theory, were also developed to solve the compressive data collection problem in WSNs. By taking advantage of the lowrankness and the shortterm stability of the sensed data, Cheng et al. designed the SpatialTemporal Compressive Data Gathering (STCDG) scheme based on matrix completion in [21]. Based on a similar idea, a correlated spatiotemporal data collection method based on low rank matrix approximation and optimized node sampling was proposed by Piao et al. in [22]. The Gini index was utilized to measure both the spatial distribution of the active nodes and the evenness of the network energy status. In order to obtain the sensed data matrix without missing and corrupted entries, Xie et al. proposed a twophase data recovery scheme based on matrix completion by exploiting the inherent lowrankness features in the spatiotemporallycorrelated sensed data [23].
3. System Model and Clustering Analysis
3.1. Overview of the Compressed Sensing Theory
Compressed sensing theory (also known as compressive sampling theory or compressive sensing theory) provides a suite of new signal processing principals and techniques for efficiently acquiring and recovering a signal from a set of underdetermined linear systems. By effectively exploiting the sparsity of a signal, we can reconstruct it from far fewer samples than required by the classical Shannon–Nyquist sampling theorem [24,25].
According to the compressed sensing theory, the data sensing step and the data compression step can be combined in the following one step:
where $x\in {R}^{n}$ is the original data, $y\in {R}^{m}$ is the measurement data and $\Phi \in {R}^{m\times n}$ is the measurement matrix. Generally, the size of the original data is much larger than that of the measurement data, i.e., $n\gg m$. The usually chosen measurement matrix $\Phi $ is a Gaussian or Bernoulli random matrix [26]. It is well known that most natural sensed data are sparse under a certain transform basis [27]. Therefore, the original data x can be transformed to k sparse data $s\in {R}^{n}$ as follows:
where $\mathsf{\Psi}\in {R}^{n\times n}$ is the transform matrix and only k elements of s have nonzero values. By combining Equations (1) and (2) together, the complete compressed sensing process can be expressed as:
where $A=\Phi \mathsf{\Psi}\in {R}^{m\times n}$ is referred to as the sensing matrix.
$$y=\Phi x$$
$$x=\mathsf{\Psi}s$$
$$y=As$$
We can recover the original data x from the measurement data y by solving the following ${l}_{0}$ minimization problem:
and then transform s back to x through Equation (2). However, the minimization problem in Equation (4) is NPhard since all kdimensional subspaces of the n dimensions have to be traversed in order to finding the sparsest solution. Usually, we replace the object function in Equation (4) with the convex ${l}_{1}$ norm of the optimization variable and solve the following ${l}_{1}$ minimization problem:
$$\begin{array}{cc}\hfill \underset{s}{min}& {\u2225s\u2225}_{0}\hfill \\ \hfill s.t.\phantom{\rule{4pt}{0ex}}& y=As\hfill \end{array}$$
$$\begin{array}{cc}\hfill \underset{s}{min}& {\u2225s\u2225}_{1}\hfill \\ \hfill s.t.\phantom{\rule{4pt}{0ex}}& y=As\hfill \end{array}$$
It has been proven that the solutions of Equations (4) and (5) are identical with a very high probability as long as the number of measurements m satisfies $m\ge cklog\frac{n}{k}$ and the sensing matrix A satisfies the following Restricted Isometry Property (RIP):
where c is a constant and ${\delta}_{s}\in [0,1)$ is the RIP constant [28].
$$\left(1{\delta}_{s}\right){\u2225s\u2225}^{2}\le {\u2225As\u2225}^{2}\le \left(1+{\delta}_{s}\right){\u2225s\u2225}^{2}$$
A number of data reconstruction algorithms have been proposed in the literature to solve Equation (5). Essentially, these algorithms can be generally classified into two categories, i.e., the optimizationbased algorithms and the greedy pursuit algorithms. In the optimizationbased algorithms, different kinds of convex optimization methods are utilized to recover the original data. The representative optimizationbased algorithms include the interiorpointbased algorithm, the Gradient Projection for Sparse Reconstruction (GPSR) algorithm [29], the homotopybased algorithm [30], etc. In the greedy pursuit algorithms, different rules of basis selection are designed and applied iteratively to select the optimal support set of the original data. The representative greedy pursuit algorithms include the Orthogonal Matching Pursuit (OMP) algorithm, the Compressive Sampling Matching Pursuit (CoSaMP) algorithm, the Partial Hard Thresholding (PHT) algorithm [31], etc.
3.2. System Model
The system model of clustered wireless sensor networks is shown in Figure 1. For illustrative purposes, the data transmission process in one round of data collection is divided into the intracluster data transmission step and the intercluster data transmission step. In the intracluster data transmission step, the cluster members send the sensed data to their cluster heads without using the compressed sensing theory. Thus, there is no data compression in the intracluster data transmission step. In the intercluster data transmission step, each cluster head compresses the sensed data that are received from its cluster members based on the compressed sensing theory. After that, the compressed results are summed with the compressed data of its predecessor in the CH chain and then forwarded to the next cluster head. The above collaborative compression continues until the accumulative compressed results reach the sink.
In Figure 2, we show the aforementioned intercluster data collection step in detail. Assume each cluster head $C{H}_{j}$ is loaded with the projection vectors ${\Phi}_{{c}_{j}}$, which are the corresponding columns of the measurement matrix $\Phi $ for its cluster members. Those coefficients can also be generated in situ with a unified seed and the identifiers of its cluster members. After receiving the projection sums of the sensed data ${\sum}_{i=1}^{j1}{\Phi}_{{c}_{i}}{x}_{{c}_{i}}$ from its previous cluster head $C{H}_{j1}$, $C{H}_{j}$ computes the new projection sums ${\sum}_{i=1}^{j}{\Phi}_{{c}_{i}}{x}_{{c}_{i}}$ with the projection vectors ${\Phi}_{{c}_{j}}$ and the sensed data ${x}_{{c}_{j}}$ that are received from its cluster members. The above data compression process starts from the first cluster head $C{H}_{1}$ and continues until the last cluster head $C{H}_{l}$ sends the whole projection sums ${\sum}_{i=1}^{l}{\Phi}_{{c}_{i}}{x}_{{c}_{i}}$ to the sink, where l is the number of cluster heads in the CH chain.
It should be noted that the chainbased data collection structure is superior to the treebased data collection structure in terms of traffic loads’ balance and energy consumption. A treebased compressed data collection structure is shown in Figure 3. Obviously, the number of received messages of a nonleaf node is in proportion to the number of its child nodes. Thus, the energy consumption of cluster heads with more children is obviously higher than that of less children. Correspondingly, the cluster heads in a treebased data collection structure usually run out of energy earlier than in a chainbased structure. Therefore, we can avoid the energy hole problem and prolong the network lifetime by collecting the compressed sensed data along the CH chain. Finally, the sink runs the data reconstruction algorithm to recover the original sensed data of the whole network.
3.3. Clustering Analysis
We make the following assumptions in the proposed collaborative data collection scheme.
 (1)
 All sensor nodes are randomly distributed in the surveillance area with an independent and identical distribution, which can be modelled as a Poisson point process with parameter $\lambda $.
 (2)
 All sensor nodes are set to the same level of data transmission power and data transmission rate. Therefore, the data transmission range of all sensor nodes is identical.
 (3)
 Every sensor node is aware of its location. A number of sensor localization algorithms for WSNs can be used for this purpose [32].
Different from previous research [33], the cluster sizes are identical instead of increasing gradually from the sink towards the network periphery. The CSbased data collection process along the CH chain can balance the energy consumption of all cluster heads and avoid the energy hole problem. Moreover, it is also easy to implement and analyse.
To evaluate the total energy consumption in one round of data collection, we utilize the following energy consumption model, which was also adopted by other researchers [34,35]:
In Equation (7), ${E}_{t}(b,d)$ denotes the energy consumption for transmitting b bits of data to a receiver of d meters away. ${E}_{elec}$ and ${E}_{amp}$ denote the energy consumption of the transmission circuit and the power amplifying circuit to process one bit of data, respectively. According to the relationship between the distance d and the threshold ${d}_{0}$, the propagation loss can be modelled as either the freespace power loss model or the multiple path fading power loss model. In Equation (8), ${E}_{r}\left(b\right)$ denotes the energy consumption for receiving b bits of data from the transmitter.
$${E}_{t}(b,d)=\left\{\begin{array}{cc}\hfill b({E}_{elec}+{d}^{2}{E}_{amp})\phantom{\rule{4pt}{0ex}},& \hfill if\phantom{\rule{4pt}{0ex}}d\le {d}_{0}\\ \hfill b({E}_{elec}+{d}^{4}{E}_{amp})\phantom{\rule{4pt}{0ex}},& \hfill if\phantom{\rule{4pt}{0ex}}d>{d}_{0}\end{array}\right.$$
$${E}_{r}\left(b\right)=b{E}_{elec}$$
We show the clustering analysis model in Figure 4, where the whole surveillance area is partitioned into a number of small grids of size $a\times a$. The transmission radius of each sensor node should satisfy $r\ge \sqrt{2}a$. Thus, the sensor nodes that are deployed in the same grid are able to communicate with each other. We stack $D\times D$ grids into a square to form a cluster. In order to reduce the total intracluster energy consumption, the cluster head should be placed at the centre of the cluster. After compressing the sensed data in the current cluster, cluster heads forward the accumulative compressed projections to the sink along the CH chain.
Suppose there are n sensor nodes deployed in the surveillance area and the sensed data of each sensor node can be encoded into a data package of p bits. By evaluating the intracluster energy consumption and the intercluster energy consumption individually, we can formulate the optimal clustering strategy.
(1) The intracluster energy consumption:
Considering the sensor node that is in the same grid as the cluster head, only one hop is required to transmit the sensed data to the cluster head. Thus, the transmission energy consumption for one node that is in the centre grid of the cluster (also known as Layer 1) is $p\left({E}_{elec}+\delta (r\le {r}_{d}){r}^{2}{E}_{amp}+\delta (r>{r}_{d}){r}^{4}{E}_{amp}\right)$, where $\delta \left(\right)$ is the indicator function. For presentation purposes, we only describe the case where the transmission distance d is less than or equal to the threshold ${d}_{0}$. The following analysis can be easily modified to suit the opposite case. Then, the above transmission energy consumption can be simplified to $p\left({E}_{elec}+{r}^{2}{E}_{amp}\right)$. There are $\lambda {a}^{2}$ nodes deployed in each grid, where $\lambda $ is the density of the Poisson point distribution. Thus, the total transmission energy consumption for nodes in the first layer of one cluster is:
$${E}_{tI}^{L1}=p\lambda {a}^{2}\left({E}_{elec}+{r}^{2}{E}_{amp}\right)$$
Based on a similar analysis, the total receiving energy consumption for nodes in the first layer of one cluster is:
$${E}_{rI}^{L1}=p\lambda {a}^{2}{E}_{elec}$$
Next, we consider the nodes that are in the hth layer of one cluster, where $h\ge 2$. There are $(D+1)/2$ layers in each cluster and $8(h1)$ grids in the hth layer. Meanwhile, the number of hops required to transmit the sensed data collected in the hth layer to the cluster head is bounded by ;$[h1,h]$. For example, Node B in Layer 2 requires two hops to reach the cluster head CH in Figure 4. However, only one hop is required for Node C to reach CH because Node C is closer to CH than Node B, although they are in the same layer. Therefore, the lower bound and the upper bound of the total transmission energy consumption for nodes in one cluster except the first layer can be evaluated as:
and:
respectively. Similar to the above analysis, the lower bound and the upper bound of the total receiving energy consumption for nodes in the same area can be evaluated as:
and:
respectively. For the sake of convenience, we use the lower bounds ${\underline{E}}_{tI}^{L2+}$ and ${\underline{E}}_{rI}^{L2+}$ in the following evaluations.
$$\begin{array}{cc}\hfill {\underline{E}}_{tI}^{L2+}=& \sum _{h=2}^{\frac{D+1}{2}}8{(h1)}^{2}p\lambda {a}^{2}\left({E}_{elec}+{r}^{2}{E}_{amp}\right)\hfill \\ \hfill =& \frac{D\left({D}^{2}1\right)}{3}p\lambda {a}^{2}\left({E}_{elec}+{r}^{2}{E}_{amp}\right)\hfill \end{array}$$
$$\begin{array}{cc}\hfill {\overline{E}}_{tI}^{L2+}=& \sum _{h=2}^{\frac{D+1}{2}}8(h1)hp\lambda {a}^{2}\left({E}_{elec}+{r}^{2}{E}_{amp}\right)\hfill \\ \hfill =& \frac{(D+3)\left({D}^{2}1\right)}{3}p\lambda {a}^{2}\left({E}_{elec}+{r}^{2}{E}_{amp}\right)\hfill \end{array}$$
$$\begin{array}{cc}\hfill {\underline{E}}_{rI}^{L2+}=& \sum _{h=2}^{\frac{D+1}{2}}8{(h1)}^{2}p\lambda {a}^{2}{E}_{elec}\hfill \\ \hfill =& \frac{D\left({D}^{2}1\right)}{3}p\lambda {a}^{2}{E}_{elec}\hfill \end{array}$$
$$\begin{array}{cc}\hfill {\overline{E}}_{rI}^{L2+}=& \sum _{h=2}^{\frac{D+1}{2}}8(h1)hp\lambda {a}^{2}{E}_{elec}\hfill \\ \hfill =& \frac{(D+3)\left({D}^{2}1\right)}{3}p\lambda {a}^{2}{E}_{elec}\hfill \end{array}$$
The number of clusters in the surveillance area can be computed as $n/\lambda {D}^{2}{a}^{2}$, because there are $\lambda {D}^{2}{a}^{2}$ sensor nodes in each cluster. Therefore, the total intracluster energy consumption is:
$$\begin{array}{cc}\hfill {E}_{I}=& \left({E}_{tI}^{L1}+{E}_{rI}^{L1}+{\underline{E}}_{tI}^{L2+}+{\underline{E}}_{rI}^{L2+}\right)\frac{n}{\lambda {D}^{2}{a}^{2}}\hfill \\ \hfill =& \left(\frac{D}{3}\frac{1}{3D}+\frac{1}{{D}^{2}}\right)pn\left(2{E}_{elec}+{r}^{2}{E}_{amp}\right)\hfill \\ \hfill \approx & \left(\frac{D}{3}\frac{1}{3D}\right)pn\left(2{E}_{elec}+{r}^{2}{E}_{amp}\right)\hfill \end{array}$$
(2) The intercluster energy consumption:
As shown in the system model, each cluster head compresses the sensed data into a data package of $mp$ bits with the measurement matrix $\Phi $. Then, it sends the accumulative compressed projections to the sink along the CH chain. There are D hops between two adjacent cluster heads and $\sqrt{2}D/2$ hops between the sink and the last forwarding cluster head. Thus, the total energy consumption of the intercluster data transmission can be evaluated as:
$${E}_{tII}=\left(\frac{n}{\lambda {D}^{2}{a}^{2}}1\right)Dmp\left({E}_{elec}+{r}^{2}{E}_{amp}\right)+\frac{\sqrt{2}D}{2}mp\left({E}_{elec}+{r}^{2}{E}_{amp}\right)$$
Similarly, the total energy consumption of the intercluster data receiving can be evaluated as:
$${E}_{rII}=\left(\frac{n}{\lambda {D}^{2}{a}^{2}}1\right)Dmp{E}_{elec}+\frac{\sqrt{2}D}{2}mp{E}_{elec}$$
Therefore, the total intercluster energy consumption is:
$$\begin{array}{cc}\hfill {E}_{II}=& {E}_{tII}+{E}_{rII}\hfill \\ \hfill =& \left(\frac{n}{\lambda {D}^{2}{a}^{2}}1\right)Dmp\left(2{E}_{elec}+{r}^{2}{E}_{amp}\right)+\frac{\sqrt{2}D}{2}mp\left(2{E}_{elec}+{r}^{2}{E}_{amp}\right)\hfill \end{array}$$
(3) The optimal clustering strategy:
By adding the intracluster energy consumption Equation (15) and the intercluster energy consumption Equation (18) together, we can obtain the total energy consumption in one round of data collection as follows:
where ${c}_{1}=n/3(2\sqrt{2})m/2$, ${c}_{2}=nm/\lambda {a}^{2}n/3$, ${c}_{3}=2p{E}_{elec}$ and ${c}_{4}={E}_{amp}/2{E}_{elec}$.
$$\begin{array}{cc}\hfill E=& {E}_{I}+{E}_{II}\hfill \\ \hfill =& \left(\frac{Dn}{3}\frac{n}{3D}+\frac{nm}{\lambda D{a}^{2}}Dm\right)p\left(2{E}_{elec}+{r}^{2}{E}_{amp}\right)+\frac{\sqrt{2}D}{2}mp\left(2{E}_{elec}+{r}^{2}{E}_{amp}\right)\hfill \\ \hfill =& \left(\left(\frac{n}{3}\frac{(2\sqrt{2})m}{2}\right)D+\left(\frac{nm}{\lambda {a}^{2}}\frac{n}{3}\right)\frac{1}{D}\right)p\left(2{E}_{elec}+{r}^{2}{E}_{amp}\right)\hfill \\ \hfill =& \left({c}_{1}D+\frac{{c}_{2}}{D}\right){c}_{3}\left(1+{c}_{4}{r}^{2}\right)\hfill \end{array}$$
In order to keep all sensor nodes connected to each other, we should guarantee that at least one additional sensor node is located in the coverage area of each sensor node. Thus, the transmission radius should satisfy $r\ge \sqrt{2/\lambda \pi}$. Furthermore, the maximal value of D is obtained when all sensor nodes are in the same cluster, i.e., $D\le \sqrt{2n/\lambda {r}^{2}}$. Therefore, the optimal clustering problem can be cast as the following optimization problem:
$$\begin{array}{cc}\hfill \underset{D,r}{min}& \left({c}_{1}D+\frac{{c}_{2}}{D}\right){c}_{3}\left(1+{c}_{4}{r}^{2}\right)\hfill \\ \hfill s.t.\phantom{\rule{4pt}{0ex}}& 0<D\le \sqrt{\frac{2n}{\lambda {r}^{2}}}\hfill \\ & r\ge \sqrt{\frac{2}{\lambda \pi}}\hfill \end{array}$$
According to the compressed sensing theory, the size of the original data is much larger than that of the measurement data, i.e., $n\gg m$. Thus, ${c}_{1}>0$. Meanwhile, the number of compressed sensed data in a cluster is much larger than that of uncompressed sensed data in a small grid. Thus, ${c}_{2}>0$. It is obvious that ${c}_{3}>0$ and ${c}_{4}>0$. Let $f(D,r)=\left({c}_{1}D+{c}_{2}/D\right){c}_{3}\left(1+{c}_{4}{r}^{2}\right)$. Since $dom\left(f\right)$ is convex and ${\nabla}^{2}f(D,r)\u2ab00$, the optimization problem in Equation (20) is convex. Meanwhile, it is trivial to see that problem in Equation (20) is separable and can be solved by splitting it into the following two independent optimization subproblems:
and:
$$\begin{array}{cc}\hfill \underset{D}{min}& \phantom{\rule{4pt}{0ex}}{c}_{1}D+\frac{{c}_{2}}{D}\hfill \\ \hfill s.t.\phantom{\rule{4pt}{0ex}}& 0<D\le \sqrt{\frac{2n}{\lambda {r}^{2}}}\hfill \end{array}$$
$$\begin{array}{cc}\hfill \underset{r}{min}& \phantom{\rule{4pt}{0ex}}1+{c}_{4}{r}^{2}\hfill \\ \hfill s.t.\phantom{\rule{4pt}{0ex}}& r\ge \sqrt{\frac{2}{\lambda \pi}}\hfill \end{array}$$
The optimal solution of Equation (21) can be obtained by computing the stationary point of the object function and comparing it with the constraint criterion. Thus, the optimal clustering size can be determined as:
$${D}^{*}=\left\{\begin{array}{cc}\sqrt{\frac{{c}_{2}}{{c}_{1}}}& \left(6\frac{3\sqrt{2}}{2}\right)mn<\lambda {a}^{2}\\ \sqrt{\frac{2n}{\lambda {r}^{2}}}& otherwise\end{array}\right.$$
It is trivial to obtain the optimal transmission radius ${r}^{*}=\sqrt{2/\lambda \pi}$ because the objection function of Equation (22) is monotonically increasing.
4. The Collaborative Data Collection Scheme
In wireless sensor networks, the sink is aware of the surveillance area in which all sensor nodes are deployed. However, it does not need, nor is it even possible to know, the specific deployment localizations of all sensor nodes. Assume the surveillance area is a rectangle with length L and width W. After determining the optimal clustering size ${D}^{*}$ according to Equation (23), the sink generates $l=2LW/{\left({D}^{*}{r}^{*}\right)}^{2}$ virtual cluster heads and sets them to the centre of each cluster individually. Note that these virtual cluster heads are only virtual agents used to build the CH chain, instead of real sensor nodes.
Firstly, the sink runs the cluster headslinking algorithm based on the pseudo Hilbert curve to build the CH chain. Afterwards, the distributed clusterconstructing algorithm is executed with the goal of selecting the proper cluster heads and forming the underlying data collection structure. Finally, all sensor nodes send the sensed data to the sink by following the same principles that have been explained in detail in the system model. It should be noted that the above two algorithms are only required to be executed once. Therefore, the cost of constructing the underlying data collection structure is trivial in comparison to that of the periodical data collection.
4.1. The Cluster HeadsLinking Algorithm Based on the Pseudo Hilbert Curve
By designing a chainbased data collection structure among CHs, the energy consumption of all cluster heads can be well balanced. Essentially, the CH chain is a Hamilton path in the CH graph. It traverses all cluster heads and accesses each cluster head only once. The Hilbert curve is the most popular spacefilling curve, which presents the strong locality preserving property and can be used as a Hamilton path for any region of size ${2}^{q}\times {2}^{q}$, where $q\in N$. However, the surveillance areas of WSNs are usually rectangles of arbitrary sizes instead of squares of exponential sizes. Therefore, we generate a pseudo Hilbert curve of arbitrary dimension for general WSN applications.
A Hilbert curve can be decomposed into a set of primitive curves. In order to connect the end points of all primitive curves correctly, four primitive curves with different orientations are defined and shown in Figure 5. Meanwhile, an orientation transformation rule for different orientations is also defined in Table 1. After partitioning the parent Hilbert curve into four parts $P1$, $P2$, $P3$ and $P4$ in the order of lowerleft, lowerright, upperleft and upperright, the orientations of each parts ${O}_{1}$, ${O}_{2}$, ${O}_{3}$ and ${O}_{4}$ can be determined by looking up in the table according to the parent orientation of the curve.
A simple $4\times 4$ Hilbert curve is shown in Figure 6 to demonstrate the aforementioned orientation transformation rule, where the parent orientation is drawn in a bold arrow. Since the parent orientation of the Hilbert curve is I, the lowerleft, lowerright, upperleft and upperright parts of the Hilbert curve are arranged as the primitive Hilbert curves II, IV, I and I, respectively. After connecting the end points of these four parts according to the orientation of the Hilbert curve I, a $4\times 4$ Hilbert curve can be built.
We adopt the primitive Hilbert areas and the primitive Hilbert curves designed by Wu et al. in [36] to build the CH chain. In addition to the original primitive Hilbert curves I to IV, extra eight primitive pseudo Hilbert curves are designed and shown in Figure 7. Since the horizontal and vertical lengths of all primitive Hilbert curves are at most two units, we first partition the whole surveillance area into a number of primitive Hilbert areas, the length of each side of which is less than three units. Then, we fill each subarea with a primitive (pseudo) Hilbert curve based on its size, orientation and the orientation transformation rule. In order to build the whole spacefilling curve, we connect all primitive (pseudo) Hilbert curves headtotail in sequence. Finally, we traverse the resultant pseudo Hilbert curve to record the indexes of all cluster heads with the goal of forwarding the compressed data along the CH chain in the intercluster data transmission step. The pseudo code of the cluster headslinking algorithm based on the pseudo Hilbert curve is shown in Algorithm 1. Please refer to [36] for the pseudo Hilbert curve generation details.
Algorithm 1 Cluster headslinking algorithm based on the pseudo Hilbert curve. 

In Figure 8, we show a CH chain that is generated by Algorithm 1 in a $5\times 7$ surveillance area. The number in each cluster is the chain index, which is assigned in the pseudo Hilbert curve traversing step. By sending the compressed sensed data along the CH chain in the reverse order of cluster indexes, we can collect the compressed sensed data of the whole sensor network in an accumulative manner.
4.2. The Distributed Cluster Constructing Algorithm
After generating the CH chain, the sink broadcasts the locations and the indexes of all virtual cluster heads to the network. Since there are usually no sensor nodes deployed in the exact locations of virtual cluster heads, the same number of sensor nodes that are closest to the virtual cluster heads should be selected as CHs on behalf of their clusters. In Section 3, we have restricted that the optimal transmission radius ${r}^{*}$ of each sensor node should satisfy ${r}^{*}\ge \sqrt{2/\lambda \pi}$. Hence, at least one sensor node can be guaranteed to reside in the circle with radius ${r}^{*}$ and centred at the virtual cluster head.
In the distributed clusterconstructing algorithm, the sensor nodes that are in the range of ${r}^{*}$ from any virtual cluster head will challenge to be CHs. These candidate sensor nodes encapsulate their identifiers and locations into the CH election messages and then broadcast the messages to all sensor nodes within two hops. After a timeout, the sensor nodes that are nearest to the virtual cluster heads will be elected as CHs. Then, these new elected cluster heads broadcast the cluster invitation messages to all sensor nodes within $({D}^{*}+1)/2$ hops to construct the intracluster data collection structure. After recording or updating the shortest route to a cluster head, each sensor node joins a cluster and points to a parent node in the intracluster data collection route. The pseudo code of the distributed clusterconstructing algorithm is summarized in Algorithm 2.
Algorithm 2 Distributed clusterconstructing algorithm. 

Note that a new set of cluster heads can be reelected by following the same rules presented in Algorithm 2 when the structure of the network has changed or new sensor nodes have been added to the network. The residual energy of each sensor node can also be considered as an additional factor to balance the intercluster energy consumption in Algorithm 2. Moreover, there is no need to regenerate the CH chain when updating the cluster heads, since the locations and the indexes of the virtual cluster heads are fixed.
5. Performance Evaluations
In this section, we evaluate the performance of our proposed collaborative data collection scheme and compare it to the other four data collection schemes through experiments. In the cluster with CS scheme, the whole sensor network is partitioned into a number of clusters based on the number of data transmission, and then, the compressed sensed data are collected along a backbone tree among CHs [37]. In the cluster without CS scheme, the same cluster structure is used, but without compressing the sensed data. In the Shortest Path Tree (SPT) scheme, a shortest path tree is built to collect the plain sensed data in the network. In the SPT with CS scheme, the sensed data are compressed using CS and then transmitted back to the sink along the shortest path tree. We use the total energy consumption, the network lifetime and the distribution of energy consumption as three metrics to compare their performance. Note that the network lifetime is defined as the maximum number of cycles before the first node runs out of energy.
In the experiment, a number of sensor nodes are deployed uniformly and independently in a rectangle surveillance area. We set a sink at the lowerleft corner of the surveillance area to collect the sensed data. The number of sensor nodes n varies from 200 to 1000. The surveillance area S varies from 200 to 600 square meters. The data compression ratio $\rho =n/m$ varies from five to 20. We assume each sensor node is equipped with one 1.5V 1000mAh nonrechargeable battery. Therefore, the theoretical initial energy of each sensor node is 5400 J. The energy consumptions of the transmission circuit ${E}_{elec}$ and the power amplifying circuit ${E}_{amp}$ to process one bit of data are 50 nJ/bit and 10 pJ/bit/m${}^{2}$, respectively. Furthermore, the TOS_MSG structure in TinyOS is utilized to encapsulate the sensed data transmitted in the network. Therefore, an extra seven bytes of data should be transmitted besides the sensed data.
5.1. Performance Analysis
In the proposed collaborative data collection scheme, the energy consumption in one round of data collection with different number of nodes n and data compression ratio $\rho $ is shown in Figure 9. We can see that the total consumed energy increases with the number of nodes n and decreases with the compression ratio $\rho $. This is obvious since a smaller number of sensor nodes and a higher data compression ratio would reduce the number of transmitted sensed data in the network. Furthermore, we can also see that the effect of increasing the compression ratio $\rho $ on the total energy consumption degrades gradually. It should be noted that a higher data compression ratio would also degrade the data reconstruction accuracy according to the compressed sensing theory. By comparing Figure 9a with Figure 9b, we can also conclude that more energy is consumed in a larger surveillance area under the same number of nodes and the compression ratio.
In Figure 10, we show the energy consumption of our proposed collaborative data collection scheme in one round of data collection with different surveillance area size S and compression ratio $\rho $. Obviously, the expansion of the surveillance area would lead to the increasing of distances between any pair of sensor nodes. Then, the transmitting energy consumption ${E}_{t}$ of each sensor node in Equation (7) would also increase correspondingly. Furthermore, the total energy consumption decreases with the increase of the data compression ratio $\rho $, and the effect degrades gradually, which is in accordance with the conclusions drawn from Figure 9. By comparing Figure 10a to Figure 10b, we can also conclude that more energy is consumed when the number of sensor nodes n increases. This is in accordance with the conclusions drawn from Figure 9.
5.2. Performance Comparison
We evaluate the total energy consumption in one round of data collection of our proposed collaborative data collection scheme and the other four schemes with different numbers of sensor nodes. The consumed energy of the aforementioned five data collection schemes is shown in Table 2. It is obvious that the proposed collaborative data collection scheme is superior to the other four schemes in terms of energy consumption. Moreover, we can also conclude that the larger the number of sensor nodes, the more energy consumed in all data collection schemes. This is due to the fact that more sensed data will be compressed (for our proposed scheme, the cluster with CS scheme and the SPT with CS scheme) and then transmitted back to the sink.
To further demonstrate the energy efficiency of our proposed collaborative data collection scheme, we take the cluster without CS scheme as the benchmark and evaluate the ratio of the energy consumption of all other schemes compared to that of the benchmark. Note that the cluster without CS scheme is the most energyintensive scheme, which can be seen clearly from Table 2. We show the comparison of energy efficiency for five schemes in Figure 11, where the compression ratio $\rho $ equals 10 and 15, respectively. We can see that our proposed collaborative data collection scheme presents the highest energy efficiency, followed by the cluster with CS scheme. By comparing the SPT with CS scheme to the SPT scheme and the cluster with CS scheme to the cluster without CS scheme, we can conclude that the compressed sensing theory saves a great deal of energy in the data collection of wireless sensor networks. It is obvious that the top three energyefficient data collection schemes in our experiment are all CSbased data collection schemes.
However, higher energy efficiency does not mean longer network lifetime directly. Therefore, we record the number of data collection cycles before the first sensor node runs out of energy for all five data collection schemes and report the result in Figure 12. We can see that our proposed collaborative data collection scheme achieves the longest network lifetime among all five data collection schemes. Moreover, the network lifetime of the cluster without CS scheme is longer than that of the SPT scheme, although the latter consumes less energy in one round of data collection. This is due to the unbalanced energy consumption in sensor nodes. Obviously, the sensor nodes that are located closer to the sink need to consume extra energy to forward the sensed data of their children in the data collection tree.
In order to investigate the distribution of energy consumption in WSNs, we record the consumed energy of all sensor nodes in five data collection schemes and show the result in Figure 13. Firstly, we can see that the distribution of energy consumption in the cluster heads of our proposed scheme is well balanced in comparison to other clusterbased data collection schemes (i.e., the cluster with CS scheme and the cluster without CS scheme). This is due to the fact that the CH chain can provide more balanced energy consumption among cluster heads than the treebased data collection structure. If we update the cluster heads in all clusters periodically, we can further average the energy consumption of all sensor nodes. Secondly, we can see that the energy consumptions of our proposed scheme, and that of the cluster with CS scheme is obviously lower than those of the other schemes. Therefore, the optimal clustering analysis can provide energyefficient data collection for WSNs. Finally, the total energy consumptions of nonclusterbased data collection schemes are higher than those of clusterbased schemes, which verifies the advantages of the clusterbased data collection structure.
6. Conclusions and Future Research
In this paper, we presented a collaborative data collection scheme based on optimal clustering for wireless sensor networks. By evaluating the intracluster energy consumption and the intercluster energy consumption individually, we abstracted the optimal clustering problem as a separable convex optimization problem and then solved it to obtain the optimal network parameters. A cluster headslinking algorithm based on the pseudo Hilbert curve was designed to realize the accumulative intercluster data collection. Furthermore, a distributed clusterconstructing algorithm was also designed to construct the intracluster data collection structure. Experimental results showed that the proposed scheme outperformed some existing data collection schemes in terms of energy consumption, network lifetime and load balance. In the future, we will plan to extend the proposed scheme with historical data to improve the overall performance. In addition, we will also plan to modify our scheme to monitor the irregular surveillance area.
Author Contributions
G.L. designed the algorithms, performed the theoretical analysis and wrote the entire manuscript. H.C. carried out all the simulations. S.P. revised the manuscript. X.L. provided some suggestions on the proposed algorithms. C.W. and S.Y. contributed to polishing the revised manuscript and gave some suggestions on the simulation results. P.Y. checked the revised manuscript and provided some constructive suggestions.
Funding
This work was supported in part by the National Natural Science Foundation of China through Grants 61402094 and 61702089, the Natural Science Foundation of Hebei Province through Grants F2016501076 and F2016501079, the Natural Science Foundation of Liaoning Province through Grant 201602254, the Fundamental Research Funds for the Central Universities through Grant N172304022, the Science and Technology Plan Project of Guangzhou through Grant 201804010433, and the Bidding Project of Laboratory of Language Engineering and Computing through Grant LEC2017ZBKT001.
Conflicts of Interest
The authors declare no conflict of interest.
References
 Rawat, P.; Singh, K.D.; Chaouchi, H.; Bonnin, J.M. Wireless sensor networks: A survey on recent developments and potential synergies. J. Supercomput. 2014, 68, 1–48. [Google Scholar] [CrossRef]
 Borges, L.M.; Velez, F.J.; Lebres, A.S. Survey on the characterization and classification of wireless sensor network applications. IEEE Commun. Surv. Tutor. 2014, 16, 1860–1890. [Google Scholar] [CrossRef]
 Yong, W.; Yang, Z.; Zhang, J.; Feng, L.; Wen, H.; Shen, Y. CS2Collector: A new approach for data collection in wireless sensor networks based on twodimensional compressive sensing. Sensors 2016, 16, 1318. [Google Scholar]
 AbdulSalaam, G.; Abdullah, A.H.; Anisi, M.H.; Gani, A.; Alelaiwi, A. A comparative analysis of energy conservation approaches in hybrid wireless sensor networks data collection protocols. Telecommun. Syst. 2016, 61, 159–179. [Google Scholar] [CrossRef]
 Middya, R.; Chakravarty, N.; Naskar, M.K. Compressive sensing in wireless sensor networks—A survey. IETE Tech. Rev. 2016, 33, 1–13. [Google Scholar] [CrossRef]
 Luo, J.; Xiang, L.; Rosenberg, C. Does compressed sensing improve the throughput of wireless sensor networks? In Proceedings of the IEEE International Conference on Communications, Cape Town, South Africa, 23–27 May 2010; pp. 1–6. [Google Scholar]
 Sucasas, V.; Radwan, A.; Marques, H.; Rodriguez, J.; Vahid, S.; Tafazolli, R. A survey on clustering techniques for cooperative wireless networks. Ad Hoc Netw. 2016, 47, 53–81. [Google Scholar] [CrossRef]
 Jan, B.; Farman, H.; Javed, H.; Montrucchio, B.; Khan, M.; Ali, S. Energy efficient hierarchical clustering approaches in wireless sensor networks: A survey. Wirel. Commun. Mobile Comput. 2017, 2017, 1–14. [Google Scholar] [CrossRef]
 Singh, V.K.; Kumar, M. A compressed sensing approach to resolve the energy hole problem in large scale WSNs. Wirel. Pers. Commun. 2018, 99, 185–201. [Google Scholar] [CrossRef]
 Singh, V.K.; Kumar, M. Innetwork data processing in wireless sensor networks using compressed sensing. Int. J. Sens. Netw. 2018, 26, 174–189. [Google Scholar] [CrossRef]
 Lan, K.C.; Wei, M.Z. A compressibilitybased clustering algorithm for hierarchical compressive data gathering. IEEE Sens. J. 2017, 17, 2550–2562. [Google Scholar] [CrossRef]
 Qiao, J.; Zhang, X. Compressive data gathering based on even clustering for wireless sensor networks. IEEE Access 2018, 6, 24391–24410. [Google Scholar] [CrossRef]
 Zhao, C.; Zhang, W.; Yang, Y.; Yao, S. Treeletbased clustered compressive data aggregation for wireless sensor networks. IEEE Trans. Veh. Technol. 2015, 64, 4257–4267. [Google Scholar] [CrossRef]
 Li, X.; Tao, X.; Mao, G. Unbalanced expander based compressive data gathering in clustered wireless sensor networks. IEEE Access 2017, 5, 7553–7566. [Google Scholar] [CrossRef]
 Bajwa, W.; Haupt, J.; Sayeed, A.; Nowak, R. Joint sourcechannel communication for distributed estimation in sensor networks. IEEE Trans. Inf. Theory 2007, 53, 3629–3653. [Google Scholar] [CrossRef]
 Luo, C.; Wu, F.; Sun, J.; Chen, C.W. Compressive data gathering for largescale wireless sensor networks. In Proceedings of the International Conference on Mobile Computing and Networking, Beijing, China, 20–25 September 2009; pp. 145–156. [Google Scholar]
 Xiang, L.; Luo, J.; Rosenberg, C. Compressed data aggregation: energyefficient and highfidelity data collection. IEEE/ACM Trans. Netw. 2013, 21, 1722–1735. [Google Scholar] [CrossRef]
 Zheng, H.; Yang, F.; Tian, X.; Gan, X.; Wang, X.; Xiao, S. Data gathering with compressive sensing in wireless sensor networks: A random walk based approach. IEEE Trans. Parallel Distrib. Syst. 2015, 26, 35–44. [Google Scholar] [CrossRef]
 Hammoudeh, M.; Newman, R. Information extraction from sensor networks using the Watershed transform algorithm. Inf. Fusion 2015, 22, 39–49. [Google Scholar] [CrossRef]
 Quan, L.; Xiao, S.; Xue, X.; Lu, C. Neighboraided spatialtemporal compressive data gathering in wireless sensor networks. IEEE Commun. Lett. 2016, 20, 578–581. [Google Scholar] [CrossRef]
 Cheng, J.; Ye, Q.; Jiang, H.; Wang, D. STCDG: An efficient data gathering algorithm based on matrix completion for wireless sensor networks. IEEE Trans. Wirel. Commun. 2013, 12, 850–861. [Google Scholar] [CrossRef]
 Piao, X.; Hu, Y.; Sun, Y.; Yin, B.; Gao, J. Correlated spatiotemporal data collection in wireless sensor networks based on low rank matrix approximation and optimized node sampling. Sensors 2014, 14, 23137–23158. [Google Scholar] [CrossRef] [PubMed]
 Xie, K.; Ning, X.; Wang, X.; Xie, D.; Cao, J.; Xie, G.; Wen, J. Recover corrupted data in sensor networks: A matrix completion solution. IEEE Trans. Mobile Comput. 2017, 16, 1434–1448. [Google Scholar] [CrossRef]
 Candes, E.J.; Wakin, M.B. An introduction to compressive sampling. IEEE Signal Process. Mag. 2008, 25, 21–30. [Google Scholar] [CrossRef]
 Donoho, D.L. Compressed sensing. IEEE Trans. Inf. Theory 2006, 52, 1289–1306. [Google Scholar] [CrossRef]
 Qaisar, S.; Bilal, R.M.; Iqbal, W.; Naureen, M. Compressive sensing: From theory to applications, a survey. J. Commun. Netw. 2013, 15, 443–456. [Google Scholar] [CrossRef]
 Campobello, G.; Segreto, A.; Serrano, S. Data gathering techniques for wireless sensor networks: A comparison. Int. J. Distrib. Sens. Netw. 2016, 12, 1–17. [Google Scholar] [CrossRef]
 Candes, E.; Romberg, J.; Tao, T. Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Trans. Inf. Theory 2006, 52, 489–509. [Google Scholar] [CrossRef]
 Figueiredo, M.A.T.; Nowak, R.D.; Wright, S.J. Gradient projection for sparse reconstruction: Application to compressed sensing and other inverse problems. IEEE J. Sel. Top. Signal Process. 2008, 1, 586–597. [Google Scholar] [CrossRef]
 Soussen, C.; Idier, J.; Duan, J.; Brie, D. Homotopy based algorithms for l0regularized leastsquares. IEEE Trans. Signal Process. 2014, 63, 3301–3316. [Google Scholar] [CrossRef]
 Jain, P.; Tewari, A.; Dhillon, I.S. Partial hard thresholding. IEEE Trans. Inf. Theory 2017, 63, 3029–3038. [Google Scholar] [CrossRef]
 Han, G.; Xu, H.; Duong, T.Q.; Jiang, J.; Hara, T. Localization algorithms of wireless sensor networks: A survey. Telecommun. Syst. 2013, 52, 2419–2436. [Google Scholar] [CrossRef]
 Xu, G.; Zhu, M.; Luo, X.; Wu, M.; Ren, F. An unequal clustering algorithm based on energy balance for wireless sensor networks. IEEJ Trans. Electr. Electron. Eng. 2012, 7, 402–407. [Google Scholar] [CrossRef]
 Li, H.; Liu, Y.; Chen, W.; Jia, W.; Li, B.; Xiong, J. COCA: Constructing optimal clustering architecture to maximize sensor network lifetime. Comput. Commun. 2013, 36, 256–268. [Google Scholar] [CrossRef]
 Zhang, J.; Feng, X.; Liu, Z. A gridbased clustering algorithm via load analysis for industrial Internet of things. IEEE Access 2018, 6, 13117–13128. [Google Scholar] [CrossRef]
 Wu, C.; Chang, Y. Approximately even partition algorithm for coding the Hilbert curve of arbitrarysized image. IET Image Process. 2012, 6, 746–755. [Google Scholar] [CrossRef]
 Xie, R.; Jia, X. Transmissionefficient clustering method for wireless sensor networks using compressive sensing. IEEE Trans. Parallel Distrib. Syst. 2014, 25, 806–815. [Google Scholar]
Parent Orientation  ${\mathit{O}}_{1}$  ${\mathit{O}}_{2}$  ${\mathit{O}}_{3}$  ${\mathit{O}}_{4}$ 

I  II  IV  I  I 
II  I  II  III  II 
III  III  III  II  IV 
IV  IV  I  IV  III 
Table 2.
The energy consumption of five schemes in one round of data collection (J). SPT, Shortest Path Tree.
Scheme  Number of Nodes  

200  400  600  800  1000  
Proposed  0.1012  0.3536  0.7777  1.43234  2.0637 
Cluster with CS  0.1065  0.3557  0.8078  1.4724  2.1138 
Cluster w/oCS  0.3663  1.2067  2.5087  4.4584  6.5277 
SPT with CS  0.1441  0.4922  1.2334  2.0445  3.4418 
SPT  0.3571  1.0301  2.3671  3.8858  5.6929 
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).