Article

Sub-Sampling Framework Comparison for Low-Power Data Gathering: A Comparative Analysis

1 DEI, University of Bologna, 40123 Bologna, Italy
2 DA, Fondazione Bruno Kessler, I-38123 Trento, Italy
3 DII, University of Trento, I-38123 Trento, Italy
4 IIS, ETH, 8092 Zurich, Switzerland
* Author to whom correspondence should be addressed.
Sensors 2015, 15(3), 5058-5080; https://doi.org/10.3390/s150305058
Submission received: 28 November 2014 / Revised: 5 February 2015 / Accepted: 17 February 2015 / Published: 2 March 2015
(This article belongs to the Special Issue Wireless Sensor Networks and the Internet of Things)

Abstract

A key design challenge for successful wireless sensor network (WSN) deployment is a good balance between the collected data resolution and the overall energy consumption. In this paper, we present a WSN solution developed to efficiently satisfy the requirements for long-term monitoring of a historical building. The hardware of the sensor nodes and the network deployment are described and used to collect the data. To improve the network's energy efficiency, we developed and compared two approaches, sharing similar sub-sampling strategies and data reconstruction assumptions: one is based on compressive sensing (CS) and the second is a custom data-driven latent variable-based statistical model (LV). Both approaches take advantage of the multivariate nature of the data collected by a heterogeneous sensor network and reduce the sampling frequency to sub-Nyquist levels. Our comparative analysis highlights the advantages and limitations of each: signal reconstruction performance is assessed jointly with network-level energy reduction. The performed experiments include detailed performance and energy measurements on the deployed network and explore how the different parameters affect the overall data accuracy and the energy consumption. The results show how the CS approach achieves better reconstruction accuracy and overall efficiency, except in cases with very aggressive sub-sampling policies.

1. Introduction

The recent technological evolution of sensing devices for wireless sensor networks (WSNs) has triggered research activities in the field of data gathering and compression. The goal is to minimize sampling, storage and communication costs to extend as much as possible the lifetime of the network, while maintaining the desired data accuracy [1].

In a typical WSN scenario, the communication pattern consists of a very large number of small and low-cost devices that periodically sample the sensors' data and transmit them to a central collecting sink [2]. For large scenarios, where the device transmission range is smaller than the area to cover, multi-hop communication is used to efficiently gather the desired data. In this case, the network topology organization and the data transmission and routing approaches should be optimized to ensure the correct network coverage and connectivity and to balance data acquisition quality and power consumption [3].

Common approaches to extend the uptime of a sensor node try to enhance the battery life directly by harvesting energy from the environment and employing low-power hardware architectures [4] or using improved wireless protocols and distributed data processing [5]. More recently, researchers have been optimizing the battery life indirectly by reducing the overall amount of data sensed and transmitted through the network [6,7]. When the considered signals present spatio-temporal correlations, those can be used to reconstruct the desired data from only a limited portion of collected samples [8]. The ability to reconstruct missing data enables the adoption of aggressive duty cycling policies on individual sensor nodes, sampling only a limited part of the data, thereby reducing the energy consumption [9].

Among the different proposed approaches to increase the energy efficiency of a sensor network, we focus on strategies that leverage correlations to reconstruct sub-sampled signals. Two of the most promising techniques capable of successfully recovering the signal from highly incomplete versions are compressive sensing (CS) [10,11] and a data-driven statistical model based on latent variables (LV) [12,13]. These techniques are both able to recover the original signals from a smaller sub-sampled version obtained by skipping samples during the acquisition phase.

CS theory claims that if a signal can be compressed using classical transform coding techniques and its representation is sparse on some basis, then a small number of projections on random vectors contains enough information for approximate reconstruction [10]. Natural signals usually have a relatively low information content, as measured by the sparsity of their spectrum [14]; therefore, the theory of CS suggests that randomized low-rate sampling may provide an efficient alternative to high-rate uniform sampling. This peculiar form of CS is a novel strategy to sample and process sparse signals at a sub-Nyquist rate [15].

The second considered framework is an energy-efficient and data-driven technique to estimate missing data within a heterogeneous sensor network, based on latent variables [12]. This approach extends the standard latent variable factorization model [16], which typically considers only dyadic interactions in data, to multivariate spatio-temporal data, by applying tensor decomposition techniques [17]. The key advantage of using a latent variable model is that it provides a compact representation of the gathered data that can be used to recover the missing samples. To perform well under extreme sub-sampling conditions, the standard technique is extended to explicitly incorporate the spatial, temporal and inter-sensor correlations present in the considered data [13].

In this work, we focus on the comparison between these two approaches and their impact on the energy consumption of the network nodes. We describe a real deployment of a WSN for the environmental monitoring of a heritage building, which is used as a common platform and testbed for the evaluation. The two techniques share the same sub-sampling strategy, but address the reconstruction problem from different theoretical bases; hence, a natural question is which one is better suited as an energy-minimization technique for WSNs. To the best of our knowledge, this is the first work in which the two techniques are directly compared, using the same dataset and a detailed power consumption model to evaluate their reconstruction abilities and energy efficiency.

To compare the performance of the two techniques, evaluating the best trade-off between signal recovery and energy savings, we explore the impact of features such as the use of network-wide correlations to improve the reconstruction accuracy and the length of the data block to be reconstructed. For both the hardware characteristics and the gathered data, we rely on a real deployed sensor network. Power consumption measurements are used to model the node's power consumption, allowing an easy evaluation of the energy expenditure for the different algorithms and sampling strategies.

The rest of the paper is organized as follows: In Section 2, the related works are introduced. In Sections 3 and 4, the mathematical background for CS and LV is presented, while in Section 5 the models for the hardware, network and the power consumption are described. The simulation results are shown in Section 6, and the conclusions are in Section 7.

2. Related Work

The problem of data gathering, compression and signal reconstruction in WSNs is well explored in the literature. Even though the majority of the works deal with reconstruction algorithms and mathematical aspects, practical aspects and low power implementation problems have lately been gaining interest.

2.1. Theoretical Approaches

As seen in the previous section, CS builds on several works, like [10,11], which show that if a signal can be compressed using classical transform coding techniques and its representation is sparse on some basis, then a small number of projections on random vectors contain enough information for approximate reconstruction.

For WSN data gathering applications, CS can be used to improve the overall efficiency of the network. One approach is to use CS to aggregate the data as it is collected by a multi-hop network, avoiding a progressive increase of the data to be transmitted as we travel from the peripheral nodes to the sink. The work in [18] introduces an approach to minimize the network energy consumption through joint routing and compressed aggregation. Finding the optimal solution of this problem is NP-complete, so the authors present a greedy heuristic that delivers near-optimal solutions for large-scale WSNs. Another data acquisition protocol for WSNs is presented in [19], where results from studies in vehicle routing are applied to data collection in WSNs and integrated with CS. This approach is also NP-hard, and the authors propose both a centralized and a distributed heuristic to reduce computational times and to improve the scalability of the approach. These works analyze the routing and aggregation problem in WSNs, while in our work, we focus on the spatio-temporal correlation in the observed phenomena, which can be used to improve the data reconstruction accuracy. Our approach allows the sensor nodes to duty-cycle and skip a significant amount of samples, thus saving energy. Our analysis focuses on joint data reconstruction and on power consumption impacts in a real deployment scenario.

When CS is used in a scenario where several sensors acquire data from the same environment, we can expect the sensed data to share information that can be exploited to perform a better reconstruction. The best known technique for exploiting the inter- and intra-correlation among several nodes in a WSN is distributed CS (DCS), introduced in [20]. In that work, the authors analyze three different Joint Sparsity Models (JSMs): JSM-1, JSM-2 and JSM-3, to describe most signal ensembles; and for each sparsity model, they present a different reconstruction algorithm. Among other works dealing with joint sparse recovery, we can cite [21] or Kronecker CS, introduced in [22]. Different from these works, in this paper, we focus on a specific technique for the reconstruction of jointly sparse solutions, known as multiple measurement vector (MMV), based on ℓ2,1-regularization, which is introduced in [23].

Latent variables and decomposition-based techniques have also been proposed in the literature, with their most notable applications in collaborative filtering [24] and recommender systems [25]. A standard way to learn a set of latent variables for a two-dimensional dataset is to apply a matrix factorization technique. A recent review of matrix factorization techniques with applications to recommender systems can be found in [16]. To achieve a better latent variable model and to allow the prediction of the missing entries in the original data matrix, standard matrix factorization techniques have been extended to incorporate temporal dynamics and, thus, better capture the temporal evolution of the data [12,26]. When dealing with multivariate datasets, which can be represented by N-dimensional data arrays, tensor factorization techniques can be exploited to further improve the performance of the models [27]. In our work, we applied a tensor factorization approach to WSNs, taking advantage of the three-dimensional nature of the data gathered from a heterogeneous sensor network. We further extended this approach, incorporating for each data dimension a correlation model learned from the data itself, to enhance the capabilities to reconstruct the missing data.

2.2. WSN-Related Practical Implementations

The general problem of using CS in WSNs is investigated in several works. In [28], the measurement matrix is created jointly with the data routing policies, trying to achieve an efficient reconstruction quality. Furthermore, in [29], the authors try to improve the reconstruction by reordering the input data to achieve a better compressibility. The main focus of these works is to investigate the signal reconstruction problem, but they do not consider how the use of CS impacts the power consumption. While there is no doubt that CS is a powerful technique for data size reduction and compression, its usage and impact on network lifetime, when real hardware and COTS nodes are used, is still marginally addressed in the literature.

The work in [30] is one of the first papers trying to address the issue of energy consumption for data compression, dealing with the problem of generating a good measurement matrix using as low energy as possible. In this work, the research is focused on wireless body sensor networks (WBSNs) for real-time energy-efficient ECG compression. This is quite a different application when compared to WSNs, where the presence of several nodes, sensing the same environment, permits one to exploit the distributed nature of the signals to improve the quality of the data recovery.

Other works deal with the use of CS when data are gathered from different joint sources, using DCS to improve the recovery quality. In [31], DCS and principal component analysis are used to reconstruct spatially- and temporally-correlated signals in a sensor network, but once again, the contribution of the power consumption for compression in the network lifetime is neglected.

In this paper, we deal with CS when the signals are sampled at a sub-Nyquist frequency, resembling a technique that is referred to as analog CS in the literature. The name derives from the fact that the sub-sampling is performed by dropping samples during the acquisition and analog-to-digital conversion (ADC) stage [32]. One notable example of this technique is in [33], where the effects of circuit imperfections in analog compressive sensing architectures are discussed. In the framework proposed in this work, samples are not discarded by analog circuits; they are simply never acquired, saving the energy needed to wake up the node.

In the literature, other works investigate the problem when the samples are discarded by the device performing the sensing rather than the ADC. For example, in [34,35], the analysis of energy consumption is totally neglected, and the recovery is strictly related to the specific applications described. Different from environmental signals, used in our work, the signals coming from the oximeter present a much higher temporal correlation, with small variations in their temporal evolution, facilitating their reconstruction.

Matrix factorization learning of latent variables has been used for recovering missing data in sensor networks in [12], where temporal correlations found in the dataset are used to infer the missing variables. Tensor decomposition techniques have been applied on WSNs in [36], where the learned models are used to find the damage in a structural health monitoring application. The previously presented algorithms only consider homogeneous sensor streams, dealing with one sensor at a time, and do not consider the energy costs across the network. Instead, our approach focuses on the multivariate nature of the collected data, and it expands the tensor factorization techniques by employing spatio-temporal and intra-sensor correlations for more robust and better results than the existing methods.

The energy consumption during sub-sampling operations is instead considered in [37], where the authors use a sparse, randomly generated measurement matrix, adjusting the sampling rate to maintain an acceptable reconstruction performance while minimizing the energy consumption. Different from our work, they do not consider inter- and intra-correlation among signals and do not use any group sparsity-enhancing algorithm to perform a better recovery. In addition, our work presents a real network deployment and takes advantage of the gathered data and power consumption measurements to accurately evaluate the two proposed frameworks in a common testbed.

3. Compressive Sensing and Group Sparsity

3.1. Mathematical Background

Consider a continuous signal x(t) of duration T, and let x(n), 1 ≤ n ≤ N, be its discrete version. The Nyquist theorem states that, in order to perfectly capture the information of the signal x(t), having a bandwidth of Bnyq/2 Hz, we must sample the signal at its Nyquist rate of Bnyq samples per second.

In formula:

$$x(n) = x(t)\big|_{t = nT_s}$$
where Ts ≤ 1/Bnyq and NTs < T. The sampled signal x(n) is represented by an N-dimensional vector of real numbers x.

If the vector x is sparse, then CS is able to recover this finite-dimensional vector x ∈ ℝN from a very limited number of measurements of the original signal x(t). The sparsity of a signal α is usually indicated as the ℓ0-norm of the signal, where the ℓp-norm ‖ · ‖p is defined as:

$$\|\alpha\|_p = \left( \sum_{i=0}^{N-1} |\alpha_i|^p \right)^{1/p}$$
with α ∈ ℝ^N. If the signal x is sparse, then there exists some N × N basis or dictionary Ψ ∈ ℝ^(N×N), such that there is an N-dimensional vector α with x = Ψα and ‖α‖₀ ≤ K, with K ≪ N. CS theory demonstrates that it is possible to compress this kind of sparse signal using a second, different measurement matrix Φ ∈ ℝ^(M×N) with M ≪ N [11]. The compression can be written as y = Φx, where y is the M-dimensional measurement vector. While Ψ is usually defined by the signal's characteristics, Φ has to be designed so that M is much smaller than N.
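For concreteness, the following minimal Python sketch (our illustration; the sizes and the choice of a DCT dictionary are assumptions, not the paper's setup) builds a K-sparse signal x = Ψα and its compressed measurements y = Φx:

```python
import numpy as np
from scipy.fft import idct

# Toy illustration: a length-N signal that is K-sparse in the DCT basis,
# compressed by a dense i.i.d. Gaussian measurement matrix Phi.
rng = np.random.default_rng(0)
N, M, K = 256, 64, 8

# Sparsifying basis Psi: columns are DCT basis vectors, so x = Psi @ alpha.
Psi = idct(np.eye(N), axis=0, norm='ortho')

alpha = np.zeros(N)
support = rng.choice(N, K, replace=False)
alpha[support] = rng.standard_normal(K)
x = Psi @ alpha                      # K-sparse signal in the DCT domain

Phi = rng.standard_normal((M, N))    # dense Gaussian measurement matrix
y = Phi @ x                          # M compressed measurements, M << N
Theta = Phi @ Psi                    # combined matrix used for recovery
```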

Having the measurements vector y, the recovery of the original signal x can be obtained by the inverse of the measurement problem:

$$y = \Theta\alpha = \Phi\Psi\alpha$$

Even though the inversion is not an easy task, since the matrix Θ ∈ ℝ^(M×N) is rectangular with M ≪ N, the fact that x is sparse can relax the problem by opening the way to the use of optimization-based reconstruction or iterative support-guessing reconstruction.

The most common optimization-based method, here reported for the sake of clarity, is the basis pursuit (BP) [38] method, which searches for the sparsest solution, i.e., the one for which ‖α‖₁ is minimum:

$$\hat{\alpha} = \arg\min_{\alpha} \|\alpha\|_1 \quad \text{subject to} \quad y = \Theta\alpha = \Phi\Psi\alpha$$
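As a hedged illustration, BP can be cast as a linear program by splitting α into its positive and negative parts; the sketch below (ours) uses SciPy's general-purpose LP solver only to make the formulation concrete, while dedicated solvers are used in practice:

```python
import numpy as np
from scipy.optimize import linprog

def basis_pursuit(Theta, y):
    """Solve min ||alpha||_1 s.t. Theta @ alpha = y as a linear program.

    Standard split alpha = u - v with u, v >= 0, so ||alpha||_1 = sum(u + v).
    A minimal sketch; specialized solvers scale far better for large N.
    """
    M, N = Theta.shape
    c = np.ones(2 * N)                      # objective: sum of u and v
    A_eq = np.hstack([Theta, -Theta])       # Theta @ (u - v) = y
    res = linprog(c, A_eq=A_eq, b_eq=y,
                  bounds=[(0, None)] * (2 * N), method='highs')
    u, v = res.x[:N], res.x[N:]
    return u - v                            # recovered sparse coefficients

# Continuing the toy example above:
# alpha_hat = basis_pursuit(Theta, y); x_hat = Psi @ alpha_hat
```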

CS theory proves that if the two matrices Φ and Ψ are incoherent (elements of the matrix Φ are not sparsely represented in the basis Ψ) and the original signal x is compressible or sparse, we can recover α with high probability.

To further enhance the recoverability, recent studies propose taking into account additional information about the underlying structure of the solutions [39]. When the signals to compress and recover are obtained from sensors deployed close to each other in the environment, we can expect that the ensemble of these signals presents an underlying joint structure. This characteristic can be exploited to further compress the data, without a loss in reconstruction accuracy. In practice, this class of solutions is known to have a certain group sparsity structure. This means that the solution has a natural grouping of its components, and the components within a group are likely to be either all zeros or all non-zeros. Encoding the group sparsity structure can reduce the degrees of freedom in the solution, thereby leading to better recovery performance.

Having an ensemble of J signals, we can denote each signal with x_j ∈ ℝ^N with j ∈ {1, 2, …, J}. For each signal x_j in the ensemble, we have a sparsifying basis Ψ ∈ ℝ^(N×N) and a measurement matrix Φ_j ∈ ℝ^(M_j×N), such that, as before, y_j = Φ_j x_j with M_j ≪ N and x_j = Ψα_j. The reconstruction of jointly sparse solutions, also known as the multiple measurement vector (MMV) problem, has its origin in sensor array signal processing and has recently received much interest as an extension of single sparse solution recovery in compressive sensing. The recovery problem can be formulated as:

$$\min_{\tilde{\alpha}} \|\tilde{\alpha}\|_{w,2,1} := \sum_{i=1}^{N} w_i \|\tilde{\alpha}_i\|_2 \quad \text{subject to} \quad \tilde{\Theta}\tilde{\alpha} = \tilde{Y}$$
where $\tilde{Y} = [y_1^T\, y_2^T\, \cdots\, y_J^T]^T$, $\tilde{\alpha} = [\alpha_1^T\, \alpha_2^T\, \cdots\, \alpha_J^T]^T$, $w_i$ is the weight of the i-th group and $\tilde{\Theta} \in \mathbb{R}^{JM \times JN}$ is a block-diagonal matrix whose diagonal blocks are $\Theta_j = \Phi_j\Psi$ for $j \in \{1, 2, \ldots, J\}$.
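A common way to approach this problem numerically is to relax the equality constraint into a penalized least-squares objective and apply proximal gradient descent with group (row-wise) soft-thresholding. The sketch below makes the simplifying assumption, not made in the paper, that all nodes share the same Θ, so the J coefficient vectors can be stacked as the columns of one matrix and each group is a row:

```python
import numpy as np

def mmv_ista(Theta, Y, lam=0.1, n_iter=500):
    """Jointly sparse (l_{2,1}) recovery sketch for the MMV problem.

    Simplifying assumption (ours): every node shares the same
    Theta = Phi @ Psi, so the J coefficient vectors are the columns of A.
    Solves the unconstrained relaxation
        min 0.5 * ||Theta @ A - Y||_F^2 + lam * sum_i ||A[i, :]||_2
    by proximal gradient descent with row-wise soft-thresholding.
    """
    N, J = Theta.shape[1], Y.shape[1]
    A = np.zeros((N, J))
    L = np.linalg.norm(Theta, 2) ** 2      # Lipschitz constant of the gradient
    for _ in range(n_iter):
        G = A - (Theta.T @ (Theta @ A - Y)) / L          # gradient step
        row_norms = np.linalg.norm(G, axis=1, keepdims=True)
        shrink = np.maximum(1 - (lam / L) / np.maximum(row_norms, 1e-12), 0)
        A = G * shrink                                   # group soft-threshold
    return A
```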

3.2. CS and Sub-Nyquist Sampling

As discussed in the previous section, to successfully recover the original signal from its sampled version, the samples are taken regularly on the time axis at a rate not lower than the Nyquist one. With respect to CS, this requirement means that the measurement matrix Φ is a dense matrix (usually an independent and identically distributed Gaussian matrix). A particular form of CS, usually referred to as analog CS, relies on random sampling and aims to produce a number of measurements smaller than with Nyquist sampling, while still enabling the reconstruction of the original signal. While analog CS is usually performed by means of specialized hardware encoders, it is also a suitable technique to be performed on WSN nodes, by appropriately skipping samples during the acquisition phase.

From a mathematical point of view, the problem is still the same as in Equation (3); what changes is the form of the measurement matrix Φ, which is not dense, but a sparse measurement matrix. More precisely, if B is an M-dimensional vector whose elements are M distinct random integers between 1 and N, then the matrix Φ in analog CS is a sparse M × N measurement matrix in which the i-th row is all zeros except for a "1" at the column given by the i-th element of B. This is a very simple measurement matrix, energetically inexpensive to generate and store, and it permits a large reduction in the duty cycle of the nodes.
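A sketch of how such a matrix can be generated follows (the function name and seeding scheme are our own illustration; a real node would only store the M indices, never the full matrix):

```python
import numpy as np

def analog_cs_matrix(N, M, seed):
    """Sparse M x N measurement matrix for analog CS (a minimal sketch).

    Picks M distinct sample instants out of N; row i has a single '1' at
    the i-th selected instant, so y = Phi @ x is simply x at those instants.
    """
    rng = np.random.default_rng(seed)
    B = np.sort(rng.choice(N, size=M, replace=False))  # sampled instants
    Phi = np.zeros((M, N))
    Phi[np.arange(M), B] = 1.0
    return Phi, B

# e.g., with a seed derived from node ID and time-stamp, as in Section 6:
# Phi, B = analog_cs_matrix(N=512, M=128, seed=node_id ^ timestamp)
```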

4. Latent Variables and Tensor Factorization

Latent variable-based factorization is a simple, yet powerful framework for modeling data and has been successfully applied in several application domains [16]. The main idea behind this framework is to model the large number of observed variables (the observed data) in terms of a much smaller number of unobserved variables (the latent variables). The latent variables are learned from the observed data and are used to estimate the missing samples, modeling complex interactions between the observed variables through simple interactions between the latent variables.

More specifically, given some multivariate data that are collected by a heterogeneous WSN in a large field over time, we can naturally organize these into a three-dimensional data array (or a three-tensor). Each of the three dimensions corresponds to a different variate of a particular measurement (e.g., the time, the location and the sensor type associated with each reading). Once the data are organized in this way, we can then associate a low-dimensional latent variable with each unique location, time slice and sensor type. We can thus model a particular observation (at a given location, time and type) as a noisy combination of the associated latent variables. In many scenarios, a multiplicative combination of these latent variables is able to capture intricate dependencies in the data [17,40]. The goal then is to learn a good set of latent variables (that is, find a factorization) that can efficiently represent our observed data.

4.1. Modeling Details

We now formally define our model. For each unique time instance t, sensor type s and node location n, we associate a unique K-dimensional vector, at, bs and cn, respectively. These unobserved vectors are called the latent factors or variables and are assumed to control the location-, time- and sensor-specific interactions present in the observed data.

Then, given a [T × S × N] tensor χ of sensor readings from S different sensor types collected at N different nodes and T different time instances, with possible missing entries, we model χ as follows (see Figure 1). We assume that each reading xtsn (the reading at time t, for sensor type s, at node location n) is a noisy realization of the underlying true reading that is obtained by the interaction of the time-specific latent variable at, with the sensor-specific latent variable bs and with the location-specific variable cn.

That is,

$$x_{tsn} = \sum_{k=1}^{K} a_{tk}\, b_{sk}\, c_{nk} + \varepsilon$$
where ε is modeled as independent zero-mean Gaussian noise (ε ∼ 𝒩(0, σ²)). Observe that under this model, once all of the latent variables are known, one can recover the true readings of all sensors at all locations and times. Thus, the goal is to find the most predictive set of vectors at, bs and cn for all t = 1,…, T, s = 1,…, S, n = 1,…, N. Such a representation models the entire data of size [T · S · N] by just [K · (T + S + N)] modeling parameters. The choice of the free parameter K provides a key trade-off: a large K increases the number of modeling parameters and, thus, can help model the observed data exactly; however, it tends to overfit, losing the capability to predict unobserved/missing data. A small K, on the other hand, escapes the overfitting problem, but the corresponding model lacks sufficient richness to capture salient data trends. The exact choice of a good K is typically application dependent and is derived empirically.
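In code, evaluating the model of Equation (6) from the three factor matrices is a one-line tensor contraction; the sketch below (ours) uses NumPy's einsum:

```python
import numpy as np

# Sketch: reconstruct the full [T x S x N] tensor of Equation (6) from
# the latent factor matrices A (T x K), B (S x K) and C (N x K).
def reconstruct(A, B, C):
    # x_tsn = sum_k A[t, k] * B[s, k] * C[n, k]
    return np.einsum('tk,sk,nk->tsn', A, B, C)
```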

4.1.1. Learning the Latent Variables

Finding the optimal set of K-dimensional latent variables given the observations is equivalent to factorizing the given tensor into three matrices, each of rank at most K [17]. Thus, assuming that all of the data are known (that is, every entry in the tensor is observed), we can find the latent factors by employing the canonical decomposition/parallel factors (CP) tensor factorization [17]. This is simply a higher-order generalization of the matrix singular value decomposition (SVD) and decomposes a generic third-order tensor into three matrix factors A, B and C. Restricting the rank of each of the matrix factors to at most K yields the best rank-K approximation. Algorithmically, the matrix factors are found by an alternating least squares (ALS) approach [40], which starts from a random initialization and iteratively optimizes one matrix factor at a time, while keeping the other two fixed.
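The following is a minimal ALS sketch for a fully observed tensor, written for illustration only; it omits the missing-data handling and correlation priors described next, and the unfolding and Khatri-Rao conventions are our own:

```python
import numpy as np

def khatri_rao(U, V):
    """Column-wise Kronecker product of U (I x K) and V (J x K) -> (I*J x K)."""
    K = U.shape[1]
    return (U[:, None, :] * V[None, :, :]).reshape(-1, K)

def cp_als(X, K, n_iter=100, seed=0):
    """Plain CP/ALS sketch for a fully observed 3-tensor X of shape (T, S, N).

    Alternately solves least squares for each factor with the other two
    fixed; a minimal illustration, not the paper's full missing-data model.
    """
    rng = np.random.default_rng(seed)
    T, S, N = X.shape
    A = rng.standard_normal((T, K))
    B = rng.standard_normal((S, K))
    C = rng.standard_normal((N, K))
    X0 = X.reshape(T, -1)                        # mode-0 unfolding (S*N cols)
    X1 = np.moveaxis(X, 1, 0).reshape(S, -1)     # mode-1 unfolding (T*N cols)
    X2 = np.moveaxis(X, 2, 0).reshape(N, -1)     # mode-2 unfolding (T*S cols)
    for _ in range(n_iter):
        A = X0 @ np.linalg.pinv(khatri_rao(B, C)).T
        B = X1 @ np.linalg.pinv(khatri_rao(A, C)).T
        C = X2 @ np.linalg.pinv(khatri_rao(A, B)).T
    return A, B, C
```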

This technique can be generalized to work with tensors that have missing entries. Since sensor nodes can periodically go offline due to duty-cycling or running out of energy (preventing all sensors on a node from collecting any data for an extended period of time), we need to extend our basic model to deal with data missing from multiple sensors or nodes, resulting in entries and rows of missing data in the collected tensor. In order to do well in this regime, we extend the basic tensor factorization model to explicitly incorporate spatially-, temporally- and sensor-specific information from neighboring observations by explicitly learning and enforcing the corresponding correlations.

4.2. Incorporating Correlations

To successfully interpolate the sensor interactions to contiguously missing blocks of data, we need to explicitly model spatially-, temporally- and sensor-specific trends within each of our latent variables at, bs and cn. Such trends ensure that the latent variables at and at′ (respectively, bs and bs′ and cn and cn′) take similar values when times t and t′ are "similar" (respectively, sensor types s and s′ and locations n and n′). Note that similarity can mean anything based on the context. For locations, it can mean that variables associated with two locations that are close in distance should have similar characteristics, while for time, it can mean that variables associated with times that are the same hour of the day or the same day of the week should have similar characteristics. Here, we take a data-driven approach and infer the best notion of similarity using correlations directly computed from the data. We use a limited sample of data collected from the network to learn the correlations, which are then applied to the subsequent slot of data to reconstruct the unsampled values.

The similarity constraints are modeled in the same way for all three sets of latent variables; here, we illustrate the case for the at's. Since each at is a K-dimensional variable, let atk denote its k-th coordinate. We model atk (independently for each coordinate k) as:

$$a_{:k} = \mu_{a_k} + \alpha_{:k}, \qquad \alpha_{:k} \sim \mathcal{N}(0, \Sigma_a)$$
Here, a:k represents the collection of all at's (across t = 1,…,T) in the k-th coordinate and μ_ak represents their mean value. The distributional constraint over α:k (as 𝒩(0, Σa)) enforces the similarity constraints via the T × T covariance matrix Σa. By changing the (t, t′) entry of Σa, we can encourage/discourage the corresponding at and at′ to take similar values: a high positive value at Σa(t, t′) encourages a positive correlation; a high negative value encourages a negative correlation; while a value close to zero does not encourage any correlation. We enforce similar constraints on the bs and cn variables, as well.

To get the right similarity constraints Σa, Σb and Σc (for the latent variables at, bs and cn), we compute them from the data under consideration. For the spatial similarity constraints, we compute the pairwise Pearson correlation coefficients between data from different pairs of locations, averaged over the different times and sensors. We do the same to approximate the inter-sensor and temporal similarities. We thus have a single global correlation constraint along each of the dimensions.
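As an illustration of this data-driven step, the sketch below (ours; it assumes a fully observed training slot) estimates the spatial constraint Σc by averaging the node-to-node Pearson correlation matrices over sensor types:

```python
import numpy as np

def spatial_correlation(X):
    """Spatial covariance constraint Sigma_c from the data (a sketch).

    X is an observed [T x S x N] tensor; for each sensor type we compute
    the N x N Pearson correlation between node time series and then
    average over sensor types, following the approach of Section 4.2.
    """
    T, S, N = X.shape
    Sigma = np.zeros((N, N))
    for s in range(S):
        Sigma += np.corrcoef(X[:, s, :].T)   # N x N correlation across nodes
    return Sigma / S
```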

4.3. Parameter Learning

We can learn the underlying latent variables in a probabilistic framework using a maximum a posteriori (MAP) estimate. In particular, let θ denote all of the model parameters (i.e., θ = {{at}, {bs}, {cn}, σ}), then the optimum choice of parameters θMAP given the data χ is obtained by:

$$\theta_{MAP}(\chi) := \arg\max_{\theta} \underbrace{p(\chi|\theta)}_{\text{likelihood}}\ \underbrace{p(\theta)}_{\text{prior}} = \arg\max_{\theta} \sum_{(t,s,n)\ \mathrm{observed}} \log p(x_{tsn}|a_t, b_s, c_n, \sigma) + \sum_{k=1}^{K} \log p(a_{:k}) + \sum_{k=1}^{K} \log p(b_{:k}) + \sum_{k=1}^{K} \log p(c_{:k})$$
The first term (the likelihood) takes the form of Equation (6), and the other terms represent the priors for each latent variable; each one of them takes the form of Equation (7). We take a uniform prior over σ, the standard deviation of the residuals in Equation (6), so it does not explicitly show in the equation.

This optimization does not have a closed form solution. Standard gradient-based techniques can be used to get a locally optimal solution. Here, we can follow an alternating hill-climb approach by optimizing the value of one variable while keeping all others fixed to get a good solution. As for the standard factorization technique, this iterative approach uses a random initialization of the latent variables.

5. Hardware, Network and Power Models

In this section, we will describe the hardware and software characteristics of the WSN deployed as a testbed for this work [41]. In particular, we report the experience gained during the EU Project 3ENCULT (Efficient energy for EU cultural heritage, http://www.3encult.eu) regarding the design and implementation of a WSN for environmental monitoring of heritage buildings. We present the network solution developed to efficiently satisfy the requirements for long-term monitoring of a historical building, called Palazzina della Viola, located in the center of Bologna, Italy.

5.1. Hardware

The sensor nodes used in the network are W24TH nodes, shown in Figure 2a. The hardware core of each node is based on the NXP JN5148 module, an ultra-low-power, high-performance wireless microcontroller targeted at ZigBee PRO networking applications. The device features an enhanced 32-bit RISC processor, a 2.4-GHz IEEE 802.15.4-compliant transceiver, 128 kB of ROM, 128 kB of RAM and several analog and digital peripherals. Its current consumption is 15 mA in transmission and 18 mA in reception, roughly a 35% power saving when compared to other similar platforms (e.g., TelosB).

The board is equipped with several sensors: temperature, humidity, ambient light and a metal-oxide gas sensor. Each sensor is equipped with a specific conditioning circuit, making the board useful for various ambient monitoring applications. A MicroSD card reader is also available, intended for local back-up, data logging and firmware updates. Moreover, each node is equipped with a USB battery charger, a 32-kHz quartz oscillator and an expansion connector. The power management subsystem provides the ability to switch off the whole system by software, thus enabling a 10 μA sleep-mode current consumption. In addition, all sensors can be individually disconnected for the efficient management of the hardware resources, reducing the node's power consumption.

For our installation, a network of 23 low-power sensor nodes is deployed across the three floors of the historic building, as shown in Figure 2b. The topology is a tree network with a central collecting sink that is in charge of gathering the data from the nodes and forwarding them to a remote PC, used to log the data and reconstruct the missing samples.

5.2. Network

Generally, in low-power sensor nodes, the communication subsystem has a much higher energy consumption than the computation and sensing subsystems; hence, the radio management and duty cycling are crucial aspects for energy efficiency. Ideally, we could turn on the radio as soon as a new data packet becomes available for sending and switch it off again when the data are sent. The problem with this technique is that in collaborative networks, such as WSNs, coordination among nodes is required to enable data exchanging and forwarding from the source node to the sink. Thus, a sleep/wakeup scheduling algorithm is required to permit the correct operation of the network.

To solve this problem, we developed and implemented a new power management and scheduling protocol, named conservative power scheduling (CPS). This protocol is built on top of the IEEE 802.15.4 MAC protocol and adapts a slotted approach derived from time division multiple access (TDMA) schemes for the staggered wakeup pattern [42].

CPS is based on a centralized approach and provides a time synchronization by using a non-beacon-enabled mode and the dissemination of a service-packet. Moreover, the algorithm avoids collisions during the data packet transmission through a TDMA technique, where each node in the network transmits its data in a given time slot. The service-packet contains several pieces of information, including data for time synchronization, as well as the sequence of active sensors for all receivers. The transmission of the service-packet across the network is performed by using a novel approach based on a constructive interference of IEEE 802.15.4 symbols for fast network flooding and implicit time synchronization [43].

In this protocol, each communication period, identified by a service-packet, is divided into two parts: an active interval (AI), during which the node must keep its radio on to exchange packets with other nodes, and a sleep interval (SI), during which the node is sleeping. To permit the correct data exchange through the network, the talk interval between a node and its parent/child must overlap for at least a time slot, needed to exchange the desired data.

The real challenge of the CPS protocol is to correctly assign the time slots to each node, so as to minimize the power consumption. To achieve this goal, each node has to remain in the active interval for the shortest time possible, waking up just in time to send its sampled data to the parent node and going back to sleep as soon as it has forwarded all of the data coming from its child nodes. Different from other approaches found in the literature [44], CPS does not compute the slot scheduling at run-time, but uses a static schedule computed off-line. The slot-assignment algorithm takes as input the tree structure of the network and gives as output the time slots assigned to each node.

The algorithm consists of two steps: (1) ordering and (2) scheduling. In the ordering phase, each node in the network is assigned a unique ID that defines its priority, following a depth-first approach. After each node is tagged with an ID, the scheduling step computes the start and duration of the active time slot for each node. It ensures that each node waits until the data sent by its own parent have reached the sink before sending out its own data, avoiding collisions in the network. Afterwards, a node can switch to sleep mode only after having forwarded the data coming from all of its child nodes. An example of the protocol's scheduling is illustrated in Figure 3, and a simplified sketch of the slot-assignment logic is given below.
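The sketch below is a deliberately simplified rendition of this two-step logic (ours, not the deployed firmware): it assigns depth-first priorities and then conservatively serializes transmissions, so that each node's data, needing one slot per hop, has reached the sink before the next node in the ordering starts; this is stricter than CPS, but guarantees the parent-before-child property described above:

```python
def assign_slots(tree, root=0):
    """Simplified CPS-style slot assignment (a sketch, not the deployed code).

    `tree` maps each node to the list of its children; the root is the sink
    and transmits nothing. Phase 1 orders the nodes depth-first; phase 2
    gives each node a start slot once every higher-priority node's data
    (depth(node) hops, one hop per slot) has fully reached the sink.
    """
    order, depth = [], {root: 0}

    def dfs(n):                          # phase 1: depth-first priority IDs
        order.append(n)
        for c in tree.get(n, []):
            depth[c] = depth[n] + 1
            dfs(c)
    dfs(root)

    start, slot = {}, 0
    for n in order:                      # phase 2: serialized, collision-free
        start[n] = slot
        slot += depth[n]                 # n's data needs depth(n) hops
    return start

# Example: sink 0 with two branches
# start = assign_slots({0: [1, 4], 1: [2, 3], 4: [5]})
```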

In a practical implementation of CPS, the synchronization using broadcast packets from the coordinator can be problematic due to the delay in the packet relay through the network. For this reason, the margin for synchronization is relaxed, accounting for clock drift and delays in the transmission of synchronization packets.

5.3. Power Model

We introduce an architecture-level power model to evaluate the energy consumption of the node when the sub-sampling parameters are changed. Using this power model, fitted and validated with data from real hardware and measurements, we can easily evaluate how changing the parameters influences the energy consumption and the lifetime of the network.

Starting from the assumptions reported in the previous section, the average energy consumption in each period of duration Tk, for a sub-sampling factor ρ applied to a block of N sampling intervals, is:

$$E_k = \rho\,(E_{setup} + E_{sample} + E_{store}) + E_{sleep} + N^{-1}(E_{nv} + E_{send})$$
where Esleep is the energy spent in sleep mode, Esetup is the energy used for waking up and setting up the device, Esample is the energy for sampling each sensor, Esend is the energy used to send the acquired data, Estore is the energy needed to store the acquired sample in non-volatile memory and Env is the energy spent to read the data back from non-volatile memory.

Expanding each term, we have:

$$\begin{aligned} E_k = {} & \rho\,\big(T_{setup}(P_{mcu} + P_{soff} + P_{toff}) + T_{sample}(P_{sample} + P_{sactive} + P_{toff}) + T_{store}(P_{soff} + P_{toff} + P_{store})\big) \\ & + T_{sleep}(P_{sleep} + P_{soff} + P_{toff}) + N^{-1}\big(T_{nv}(P_{store} + P_{soff} + P_{toff}) + T_{send}(P_{comm} + P_{soff} + P_{send})\big) \end{aligned}$$
where Tsleep, Tsetup, Tsample, Tsend, Tstore and Tnv are the durations of the respective phases, Psleep is the power consumed in sleep mode, Psoff is the power absorbed by the sensors when sleeping, Ptoff is the power consumption of the transceiver when the node is in sleep mode, Pmcu is the power consumed by the MCU, Psample is the power spent for data acquisition, Psactive is the power consumed by the active sensors, Pcomm is the power consumption for filling the transceiver output buffer and, finally, Psend is the power for sending data. All of the power and timing values, reported in Table 1, were measured on real hardware or extracted from the datasheets.
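For reference, the model can be evaluated directly; the helper below (ours) assumes the N⁻¹ reading of the last term, i.e., that the non-volatile read and the transmission occur once per block of N sampling periods:

```python
def period_energy(rho, N, t, p):
    """Average energy per sampling period, following the model above (a sketch).

    `t` and `p` are dicts of the phase durations and power figures of
    Table 1 (e.g., t['setup'], p['mcu'], ...); units: seconds and watts,
    so the result is in joules.
    """
    e_setup = t['setup'] * (p['mcu'] + p['soff'] + p['toff'])
    e_sample = t['sample'] * (p['sample'] + p['sactive'] + p['toff'])
    e_store = t['store'] * (p['soff'] + p['toff'] + p['store'])
    e_sleep = t['sleep'] * (p['sleep'] + p['soff'] + p['toff'])
    e_nv = t['nv'] * (p['store'] + p['soff'] + p['toff'])
    e_send = t['send'] * (p['comm'] + p['soff'] + p['send'])
    return rho * (e_setup + e_sample + e_store) + e_sleep + (e_nv + e_send) / N
```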

6. Results

In this section, we compare the reconstruction performance of CS and the latent variable-based statistical model on the 3ENCULT deployment, considering data coming from the temperature, humidity and light sensors. We want to investigate which of the proposed techniques reconstructs the data better and how the sub-sampling parameters affect the recovery quality.

We consider that each node samples the signals for a period of time T, called the acquisition period, ideally gathering N = T · fs samples at a sampling frequency fs. The compression phase is the same for both frameworks: each node samples the signals of interest, gathering a subset of M of the N samples (M = ρN), with an under-sampling ratio 0 < ρ < 1. The under-sampling pattern is locally generated by each node using its own ID and the time-stamp as the seed for randomization. In the random sampling pattern, the inter-measurement intervals are always multiples of the minimum sampling period Tk = T/N.

After the acquisition period T = N·Tk, the gathered data are sent to the collecting sink through the network. The sampling time Tk in the following simulations is set to 600 s, and the results are averaged over 100 trials. Each trial is characterized by a different sampling pattern and a different considered portion of the signal. The reconstruction phase is fairly different between the two frameworks and determines the recovery quality of the original signal. For CS, the DCT matrix is used as the sparsifying matrix, which has already been demonstrated to be a good sparsifying basis for natural signals [45,46].

In the first simulation, we reconstruct the original signals from a sub-sampled version without exploiting any inter- or intra-correlation among them, just averaging the reconstruction quality over all of the signals, with a signal length of N = 512. For the latent variable approach, we used here the standard CP tensor factorization technique, without the contribution of any correlations in the data. The comparison is carried out by evaluating the signal-to-noise ratio (SNR), defined as:

$$\mathrm{SNR_{dB}} = 20 \log_{10} \frac{\|x\|_2}{\|x - \hat{x}\|_2}$$
where x is the original signal and x̂ is its recovered version. We show the average SNR across all of the network nodes.
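In code, this metric is a direct transcription of the definition above (a small helper of our own):

```python
import numpy as np

def snr_db(x, x_hat):
    """Reconstruction SNR in dB, as defined above."""
    return 20 * np.log10(np.linalg.norm(x) / np.linalg.norm(x - x_hat))
```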

From the plot shown in Figure 4a, we can infer how different the performance of the two techniques is: while the reconstruction performance of LV is fairly stable when varying the sub-sampling factor ρ, CS is much more affected by the compression factor. Still, CS achieves a better reconstruction than the latent variable-based technique for almost every sub-sampling factor.

Both frameworks seem to be greatly affected by the nature of the signal to reconstruct. In particular, from the plot, we can infer how difficult the recovery of the light signals is for the two proposed techniques. This is due to the nature of the light signal recorded inside the building: while the gathered temperature and humidity signals are continuous and only smoothly affected by human presence, the light signal is highly irregular and strongly influenced by the artificial lighting in single rooms. Moreover, some of the nodes are placed in the basement, where the light level is under the noise threshold of the light sensors, providing extremely noisy data.

To evaluate whether it is possible to exploit the correlations existing among sensors and nodes to improve the reconstruction, we performed the recovery on the same dataset using, for CS, the group sparse optimization (GS-CS), which exploits the joint sparsity of the signals, and, for LV, the maximum a posteriori optimization (LV-MAP) introduced in Equation (8). The results are reported in Figure 4b. While the performance of CS remains almost the same, the LV-MAP method guarantees a significant improvement in reconstruction, resulting in better performance than CS for small values of ρ, especially for the humidity and temperature signals.

The behavior of CS can be explained by looking at Figure 5. Here, we show how the union over all signals of the K best DCT basis vectors per signal has a size considerably greater than K. Practically, this means that GS-CS is able to exploit the inter-node correlation only to a small extent, since the shared information among different nodes is limited, and the recovery algorithm is not able to exploit this information to improve the recovery quality.

According to the model in Section 5.2, the simulations are performed with a sampling frequency fs = 1/600 Hz, and since the length of the data block is N = 512, this introduces a delay of about 3.5 days in the data delivery towards the collector; the recovered signal thus spans 3.5 days. Higher values of N mean that we have to wait a longer time to proceed with the data recovery. Therefore, we now investigate how the length of the block of data gathered by the sensors affects the two frameworks and whether a correlation between the recovery performance and the parameter N exists for GS-CS and LV-MAP.

In Figure 6a, the results for GS-CS when N is changed are presented. From the plot, we can infer how the length of the signal N does not greatly affect the reconstruction quality for all of the signals taken into consideration. Rather, we can see how the influence of the parameter N (and then, of the delay in the data collection) is only visible for small values of the sub-sampling factor ρ. Different from temperature and humidity, the light signal presents a peculiar behavior showing an increased reconstruction quality with the increase in the number of acquired samples.

The same results for the LV-MAP approach are presented in Figure 6b. The difference in the reconstruction error for the various values of N is more evident than in the GS-CS case. With small values of N, we registered difficulties in reconstructing the desired signals. The best recovery performance is achieved when considering 256–512 samples at a time, identifying the optimal trade-off between delay and reconstruction accuracy, since larger blocks of data again present a loss of accuracy. For delays smaller than N = 128, the reconstruction of the light signal is not feasible in either case, since the majority of the gathered samples are zeros, due to the lack of light at night.

Having evaluated the recovery performance and the influence of the gathering delay on reconstruction, it is interesting to investigate the power consumption involved in compression, according to the power model in Section 5.3. In Figure 7, we report the reconstruction quality against the energy consumption for one acquisition cycle. The plot clearly shows that a trade-off between energy consumption and transmission delay does exist in the GS-CS case (Figure 7a). Higher values of N, and thus higher delays in transmission, are able to guarantee a better reconstruction quality with considerably less energy than the N = 16 case (all of the other cases are not shown in the plot, since they fall between these two boundaries). The light signal is a special case, but we can draw the same conclusions as before. The LV-MAP case (Figure 7b) presents a similar behavior, but with a less pronounced increase in energy efficiency as the data size N increases. When comparing the two graphs, we can observe how the GS-CS case exhibits a slightly higher energy efficiency, allowing a higher reconstruction quality for the same energy consumption as the LV-MAP case. Only for extremely sub-sampled signals are the LV-MAP results better, benefiting greatly from the explicit correlation models incorporated in the data reconstruction. In both cases, we are able to obtain a better accuracy (or the same reconstruction quality with less energy) if we are willing to wait for a higher number of gathered samples before proceeding with the reconstruction.

The same conclusions are shown in Figure 8a, where the ratio between the reconstruction quality and the consumed energy is plotted against the sub-sampling factor ρ. Here, we can see a direct comparison of the two techniques for the case with the best reconstruction performance (N = 512). Again, the GS-CS case has a higher ratio than the LV-MAP case for almost all of the sub-sampling policies. Only when dealing with a very small amount of sampled data (20%) does the LV-MAP case show a better performance. This result can serve as a guide for WSN developers, suggesting the adoption of the LV method only when a very aggressive power-saving policy is needed. The plots of the two techniques are combined in Figure 8b, where the best ratio between the reconstruction accuracy and the consumed energy is plotted against the sub-sampling factor, showing the best achievable results for every sub-sampling factor.

To the best of our knowledge, there is no detailed analysis in the literature of joint sub-sampled data reconstruction and accurate power consumption estimation for heterogeneous sensor networks. In [28], an SNR of 40 to 68 dB is reported for the reconstruction of temperature signals from compressed data gathered at sub-sampling factors ranging from 0.1 to 0.8, but no power analysis is performed. The hybrid CS aggregation scheme of [18] achieves 20% to 50% energy savings for different aggregation factors, when compared to non-aggregation techniques on a simulated 1255-node grid network. The paper, however, does not report any detail on the power consumption models used, nor on the impact of the proposed CS aggregation algorithm on the reconstructed data quality. In this work, we presented two energy-efficient approaches, with a detailed data reconstruction and power consumption analysis on a heterogeneous WSN, introducing the use of spatio-temporal correlations to improve the performance. Considering the temperature signal, we achieve 40 to 70 dB of SNR for increasing sub-sampling factors, corresponding to average energy consumptions ranging from 63 to 128 μJ, with maximum energy savings of 58%. Our results are aligned with the presented literature and introduce a detailed multi-signal reconstruction and power analysis based on a real-life WSN deployment, which provides insights into the application of sub-sampling in heterogeneous WSNs and can serve as guidelines and future design specifications.

7. Conclusions

In this paper, we presented a wireless sensor network deployment and used it to compare two promising techniques for energy-efficient data gathering and reconstruction. One is based on the well-known compressive sensing framework, while the second is a latent variable-based statistical model. Both approaches try to recover the desired signal from a highly incomplete sub-sampled version, obtained by appropriately skipping samples during the acquisition phase. The two techniques exploit redundancies and correlations present in the gathered data to achieve a better reconstruction accuracy with a smaller number of collected samples and, thus, with a lower energy consumption. We described the hardware and software implementation of the deployed network, introduced an energy model for the sensor nodes and analyzed the gathered data, comparing the two reconstruction techniques. The results showed how the use of the data from the whole network and their correlations improved the recovery performance in both cases, when compared to the standard approaches, where individual nodes and signals are considered. The CS approach usually achieves a better reconstruction accuracy, except when very aggressive sub-sampling policies are used; this also leads to the better overall energy efficiency of the CS method.

Acknowledgments

The research leading to these results has received funding from EU Projects Genesi (Grant Agreement No. 257916) and 3ENCULT (Grant Agreement No. 260162), funded by the EU 7th Framework Programme. The authors would like to thank Tajana Simunic Rosing and Piero Zappi for helpful discussion, valuable comments and support.

Author Contributions

All of the authors have contributed to the development and writing of this article. Bojan Milosevic and Carlo Caione carried out the hardware deployment and measurements, developed the MATLAB code for data collection and for the testing of the reconstruction algorithms and worked on the draft of the paper. Elisabetta Farella and Davide Brunelli supervised the technical work and contributed to the writing of the paper. Luca Benini supervised the whole work, corrected the draft of the paper and contributed to the writing of the final revision of the paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Li, X.Y.; Wang, Y.; Wang, Y. Complexity of Data Collection, Aggregation, and Selection for Wireless Sensor Networks. IEEE Trans. Comput. 2011, 60, 386–399. [Google Scholar]
  2. Ammari, H.M. Investigating the Energy Sink-Hole Problem in Connected k-Covered Wireless Sensor Networks. IEEE Trans. Comput. 2013, 63, 2729–2742. [Google Scholar]
  3. Li, M.; Li, Z.; Vasilakos, A.V. A survey on topology control in wireless sensor networks: Taxonomy, comparative study, and open issues. IEEE Proc. 2013, 101, 2538–2557. [Google Scholar]
  4. Moser, C.; Thiele, L.; Brunelli, D.; Benini, L. Adaptive Power Management for Environmentally Powered Systems. IEEE Trans. Comput. 2010, 59, 478–491. [Google Scholar]
  5. Zappi, P.; Lombriser, C.; Stiefmeier, T.; Farella, E.; Roggen, D.; Benini, L.; Tröster, G. Activity Recognition from On-Body Sensors: Accuracy-Power Trade-Off by Dynamic Sensor Selection. Proceedings of the European Conference on Wireless Sensor Networks, Bologna, Italy, 30 January–1 February 2008; pp. 17–33.
  6. Veeravalli, V.V.; Varshney, P.K. Distributed inference in wireless sensor networks. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2012, 370, 100–117. [Google Scholar]
  7. Zou, Z.; Hu, C.; Zhang, F.; Zhao, H.; Shen, S. WSNs data acquisition by combining hierarchical routing method and compressive sensing. Sensors 2014, 14, 16766–16784. [Google Scholar]
  8. Aquino, A.L.; Junior, O.S.; Frery, A.C.; de Albuquerque, E.L.; Mini, R.A. MuSA: Multivariate sampling algorithm for wireless sensor networks. IEEE Trans. Comput. 2014, 63, 968–978. [Google Scholar]
  9. AbdelSalam, H.S.; Olariu, S. Toward adaptive sleep schedules for balancing energy consumption in wireless sensor networks. IEEE Trans. Comput. 2012, 61, 1443–1458. [Google Scholar]
  10. Candes, E.; Wakin, M. An introduction to compressive sampling. IEEE Signal Process. Mag. 2008, 25, 21–30. [Google Scholar]
  11. Duarte, M.; Eldar, Y. Structured compressed sensing: From theory to applications. IEEE Trans. Signal Process. 2011, 59, 4053–4085. [Google Scholar]
  12. Verma, N.; Zappi, P.; Rosing, T. Latent variables based data estimation for sensing applications. Proceedings of the IEEE International Conference on Intelligent Sensors, Sensor Networks and Information Processing, Adelaide, Australia, 6–9 December 2011; pp. 335–340.
  13. Milosevic, B.; Yang, J.; Verma, N.; Tilak, S.; Zappi, P.; Farella, E.; Rosing, T.S.; Benini, L. Efficient energy management and data recovery in sensor networks using latent variables based tensor factorization. Proceedings of the ACM/IEEE International Conference on Modeling, Analysis and Simulation of Wireless and Mobile Systems (MsWiM), Barcelona, Spain, 3–8 November 2013; pp. 247–254.
  14. Caione, C.; Brunelli, D.; Benini, L. Distributed Compressive Sampling for Lifetime Optimization in Dense Wireless Sensor Networks. IEEE Trans. Ind. Inform. 2012, 8, 30–40. [Google Scholar]
  15. Mamaghanian, H.; Khaled, N.; Atienza, D.; Vandergheynst, P. Design and Exploration of Low-Power Analog to Information Conversion Based on Compressed Sensing. IEEE J. Emerg. Sel. Top. Circuits Syst. 2012, 2, 493–501. [Google Scholar]
  16. Koren, Y.; Bell, R.; Volinsky, C. Matrix Factorization Techniques for Recommender Systems. Computer 2009, 42, 30–37. [Google Scholar]
  17. Kolda, T.G.; Bader, B.W. Tensor Decompositions and Applications. SIAM Rev. 2009, 51, 455–500. [Google Scholar]
  18. Xiang, L.; Luo, J.; Vasilakos, A. Compressed data aggregation for energy efficient wireless sensor networks. Proceedings of the Sensor, Mesh and Ad Hoc Communications and Networks (SECON), Salt Lake City, UT, USA, 27–30 June 2011; pp. 46–54.
  19. Yao, Y.; Cao, Q.; Vasilakos, A. EDAL: An Energy-Efficient, Delay-Aware, and Lifetime-Balancing Data Collection Protocol for Wireless Sensor Networks. Proceedings of the 2013 IEEE 10th International Conference on Mobile Ad-Hoc and Sensor Systems (MASS), Hangzhou, China, 14–16 October 2013; pp. 182–190.
  20. Baron, D.; Duarte, M.F.; Wakin, M.B.; Sarvotham, S.; Baraniuk, R.G. Distributed Compressive Sensing. Available online: arxiv.org/pdf/0901.3403 (accessed on 28 February 2015).
  21. Lee, K.; Bresler, Y.; Junge, M. Subspace Methods for Joint Sparse Recovery. IEEE Trans. Inf. Theory 2012, 58, 3613–3641. [Google Scholar]
  22. Duarte, M.; Baraniuk, R. Kronecker Compressive Sensing. IEEE Trans. Image Process. 2012, 21, 494–504. [Google Scholar]
  23. Deng, W.; Yin, W.; Zhang, Y. Group Sparse Optimization by Alternating Direction Method. Rice CAAM Report TR11-06. Available online: http://www.caam.rice.edu/optimization/L1/GroupSparsity/group110419.pdf (accessed on 28 February 2015).
  24. Su, X.; Khoshgoftaar, T.M. A survey of collaborative filtering techniques. Adv. Artif. Intell. 2009. [Google Scholar] [CrossRef]
  25. Yoshii, K.; Goto, M.; Komatani, K.; Ogata, T.; Okuno, H. An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model. IEEE Trans. Audio Speech Lang. Process. 2008, 16, 435–447. [Google Scholar]
  26. Koren, Y. Collaborative filtering with temporal dynamics. Commun. ACM 2010, 53, 89–97. [Google Scholar]
  27. Karatzoglou, A.; Amatriain, X.; Baltrunas, L.; Oliver, N. Multiverse recommendation: N-dimensional tensor factorization for context-aware collaborative filtering. Proceedings of the Fourth ACM Conference on Recommender Systems, Barcelona, Spain, 26–30 September 2010; pp. 79–86.
  28. Luo, C.; Wu, F.; Sun, J.; Chen, C.W. Efficient Measurement Generation and Pervasive Sparsity for Compressive Data Gathering. IEEE Trans. Wirel. Commun. 2010, 9, 3728–3738. [Google Scholar]
  29. Mahmudimanesh, M.; Khelil, A.; Suri, N. Reordering for Better Compressibility: Efficient Spatial Sampling in Wireless Sensor Networks. Proceedings of the 2010 IEEE International Conference on Sensor Networks, Ubiquitous, and Trustworthy Computing, Newport Beach, CA, USA, 7–9 June 2010; pp. 50–57.
  30. Mamaghanian, H.; Khaled, N.; Atienza, D.; Vandergheynst, P. Compressed Sensing for Real-Time Energy-Efficient ECG Compression on Wireless Body Sensor Nodes. IEEE Trans. Biomed. Eng. 2011, 58, 2456–2466. [Google Scholar]
  31. Makhzani, A.; Valaee, S. Reconstruction of a Generalized Joint Sparsity Model Using Principal Component Analysis. Proceedings of the 2011 4th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP), San Juan, Puerto Rico, 13–16 December 2011; pp. 269–272.
  32. Murray, T.S.; Pouliquen, P.O.; Andreou, A.G. Design of a Parallel Sampling Encoder for Analog to Information (A2I) Converters: Theory, Architecture and CMOS Implementation. Electronics 2013, 2, 57–79. [Google Scholar]
  33. Yu, Z.; Hoyos, S. Digitally Assisted Analog Compressive Sensing. Proceedings of the 2009 IEEE Dallas Circuits and Systems Workshop, Richardson, TX, USA, 4–5 October 2009; pp. 1–4.
  34. Baheti, P.; Garudadri, H. An Ultra Low Power Pulse Oximeter Sensor Based on Compressed Sensing. Proceedings of the Sixth International Workshop on Wearable and Implantable Body Sensor Networks, Berkeley, CA, USA, 3–5 June 2009; pp. 144–148.
  35. Garudadri, H.; Baheti, P.; Majumdar, S.; Lauer, C.; Massé, F.; van de Molengraft, J.; Penders, J. Artifacts mitigation in ambulatory ECG telemetry. Proceedings of the 2010 12th IEEE International Conference on e-Health Networking Applications and Services (Healthcom), Lyon, France, 1–3 July 2010; pp. 338–344.
  36. Prada, M.A.; Toivola, J.; Kullaa, J.; Hollmen, J. Three-way analysis of structural health monitoring data. Neurocomputing 2012, 80, 119–128. [Google Scholar]
  37. Chen, W.; Wassell, I. Energy efficient signal acquisition via compressive sensing in wireless sensor networks. Proceedings of the 2011 6th International Symposium on Wireless and Pervasive Computing (ISWPC), Hong Kong, China, 23–25 February 2011; pp. 1–6.
  38. Needell, D.; Vershynin, R. Uniform Uncertainty Principle and Signal Recovery via Regularized Orthogonal Matching Pursuit. Found. Comput. Math. 2009, 9, 317–334. [Google Scholar]
  39. Chen, W.; Rodrigues, M.; Wassell, I. Distributed compressive sensing reconstruction via common support discovery. Proceedings of the 2011 IEEE International Conference on Communications (ICC), Kyoto, Japan, 5–9 June 2011; pp. 1–5.
  40. Acar, E.; Yener, B. Unsupervised Multiway Data Analysis: A Literature Survey. IEEE Trans. Knowl. Data Eng. 2009, 21, 6–20. [Google Scholar]
  41. Aderohunmu, F.A.; Balsamo, D.; Paci, G.; Brunelli, D. Long Term WSN Monitoring for Energy Efficiency in EU Cultural Heritage Buildings. Lect. Notes Electr. Eng. 2014, 281, 253–261. [Google Scholar]
  42. Keshavarzian, A.; Lee, H.; Venkatraman, L. Wakeup scheduling in wireless sensor networks. Proceedings of the 7th ACM International Symposium on Mobile Ad Hoc Networking and Computing, Florence, Italy, 22–25 May 2006; pp. 322–333.
  43. Ferrari, F.; Zimmerling, M.; Thiele, L.; Saukh, O. Efficient network flooding and time synchronization with Glossy. Proceedings of the 2011 10th International Conference on Information Processing in Sensor Networks (IPSN), Chicago, IL, USA, 12–14 April 2011; pp. 73–84.
  44. Hohlt, B.; Doherty, L.; Brewer, E. Flexible power scheduling for sensor networks. Proceedings of the 3rd International Symposium on Information Processing in Sensor Networks, Berkeley, CA, USA, 26–27 April 2004; pp. 205–214.
  45. Caione, C.; Brunelli, D.; Benini, L. Compressive Sensing Optimization for Signal Ensembles in WSNs. IEEE Trans. Ind. Inform. 2014, 10, 382–392.
  46. Caione, C.; Brunelli, D.; Benini, L. Compressive sensing optimization over ZigBee networks. Proceedings of the 2010 International Symposium on Industrial Embedded Systems (SIES), Trento, Italy, 7–9 July 2010; pp. 36–44.
Figure 1. Tensor data representation and its matrix factorization.
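The factorization sketched in Figure 1 is the core of the LV approach: the incomplete sensor-by-time data is approximated by low-rank factors fitted only on the entries that were actually sampled, and the product of the factors fills in the skipped readings. As a rough illustration of the idea only, the sketch below fits a plain rank-R matrix factorization (not the paper's full tensor model) by alternating least squares on synthetic data; the sizes, rank, ridge term and iteration count are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic "sensors x time" matrix with low-rank structure (illustrative only).
n_sensors, n_samples, rank = 12, 512, 3
X_true = rng.standard_normal((n_sensors, rank)) @ rng.standard_normal((rank, n_samples))

# Sub-sampling mask: each node keeps only a fraction rho of its readings.
rho = 0.3
mask = rng.random(X_true.shape) < rho

# Alternating least squares restricted to the observed entries.
U = rng.standard_normal((n_sensors, rank))
V = rng.standard_normal((n_samples, rank))
lam = 0.1  # small ridge term keeps the per-row solves well conditioned
for _ in range(30):
    for i in range(n_sensors):   # update the factor of each sensor row
        obs = mask[i]
        A = V[obs].T @ V[obs] + lam * np.eye(rank)
        U[i] = np.linalg.solve(A, V[obs].T @ X_true[i, obs])
    for j in range(n_samples):   # update the factor of each time column
        obs = mask[:, j]
        A = U[obs].T @ U[obs] + lam * np.eye(rank)
        V[j] = np.linalg.solve(A, U[obs].T @ X_true[obs, j])

# The factor product estimates the readings that were never sampled.
X_hat = U @ V.T
rmse = np.sqrt(np.mean((X_hat[~mask] - X_true[~mask]) ** 2))
print(f"RMSE on the missing entries: {rmse:.3f}")
```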
Figure 2. (a) The W24TH sensor node and (b) the deployment map.
Figure 3. Example of scheduling using the conservative power scheduling (CPS) protocol.
Figure 4. Recovery comparison when reconstructing the original signals from sub-sampled versions (N = 512): (a) compressive sensing (CS) and latent variable (LV) tensor factorization, averaging the reconstruction quality over all of the nodes; (b) GS-CS and maximum a posteriori (MAP), exploiting the correlations among sensors and nodes.
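For readers who want to reproduce the flavor of the CS branch of this comparison: recovering a sub-sampled signal can be posed as basis pursuit, i.e., finding the DCT-sparsest signal that agrees with the samples the node actually collected. The following is a minimal single-signal sketch, assuming a DCT sparsity basis and using cvxpy as a generic ℓ1 solver; it is not the joint GS-CS reconstruction behind Figure 4b, and the signal itself is synthetic.

```python
import numpy as np
import cvxpy as cp
from scipy.fft import idct

rng = np.random.default_rng(1)

# Synthetic signal that is K-sparse in the DCT domain (stand-in for a sensor trace).
N, K = 512, 10
z_true = np.zeros(N)
z_true[rng.choice(N, K, replace=False)] = rng.standard_normal(K)
Psi = idct(np.eye(N), axis=0, norm="ortho")   # columns: inverse-DCT basis, x = Psi @ z
x_true = Psi @ z_true

# Sub-sampling: keep only M random time-domain samples (rho = M / N).
M = 128
keep = np.sort(rng.choice(N, M, replace=False))
y = x_true[keep]

# Basis pursuit: minimize ||z||_1 subject to agreement with the kept samples.
z = cp.Variable(N)
prob = cp.Problem(cp.Minimize(cp.norm1(z)), [Psi[keep] @ z == y])
prob.solve()

x_hat = Psi @ z.value
snr = 10 * np.log10(np.sum(x_true**2) / np.sum((x_true - x_hat) ** 2))
print(f"reconstruction SNR: {snr:.1f} dB")
```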
Figure 5. Number of DCT coefficients necessary to include the K largest coefficients for each signal (N = 512).
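The quantity plotted in Figure 5 can be computed directly from a signal: take its DCT, locate the K largest-magnitude coefficients, and report how long a prefix of leading coefficients must be to contain them all. A minimal sketch on synthetic traces (the signals are illustrative, not the deployment data):

```python
import numpy as np
from scipy.fft import dct

def prefix_for_top_k(x: np.ndarray, k: int) -> int:
    """Smallest number of leading DCT coefficients that contains
    the k largest-magnitude coefficients of x."""
    c = dct(x, norm="ortho")
    top_k = np.argsort(np.abs(c))[-k:]   # indices of the k largest coefficients
    return int(top_k.max()) + 1

# A smooth trace needs a short prefix; a noisy one spreads energy to high frequencies.
t = np.linspace(0.0, 1.0, 512)
smooth = np.sin(2 * np.pi * 3 * t) + 0.5 * np.cos(2 * np.pi * 7 * t)
noisy = smooth + 0.3 * np.random.default_rng(2).standard_normal(512)
print(prefix_for_top_k(smooth, 10), prefix_for_top_k(noisy, 10))
```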
Figure 6. Reconstruction quality varying the sub-sampling factor ρ and using the signal length N as the parameter: (a) group sparse (GS)-CS and (b) LV-MAP. From top to bottom: temperature, humidity, light.
Figure 7. Reconstruction quality vs. averaged per-cycle energy consumption varying the parameter N: (a) CS and (b) LV.
Figure 8. (a) Ratio between the recovery quality and energy spent in compression varying the sub-sampling factor ρ for the two approaches; (b) the combination of the two approaches.
Table 1. Characteristics of the device taken as a reference in the power model.

Parameter | Value | Description
Tsample | 600 s | Sampling period
Tsetup | 5e-4 s | Setup time
Tstore (16 bit) | 5.25e-5 s | Time to store data in NVM
Vbatt | 3.3 V | Battery voltage
Asleep | 1.3e-6 A | Sleep current
Amcu | 7.5e-3 A | Idle current
Asample | 1.1e-3 A | ADC current
Astore | 7.5e-3 A | Current when saving in NVM
Acomm | 1e-6 A | Current for filling the transceiver buffer
F | 24 MHz | Microcontroller frequency
Asend | 3.1e-2 A | Transmission current
Atoff | 10e-6 A | Sleep current of the transceiver
Stx | 150 kbps | Transmission throughput
Tsetup_radio | 5e-3 s | Transceiver setup time
Bpkt | 127 byte | Packet size
Bheader | 10 byte | Header size
Asactive,TH | 3e-4 A | Temperature and humidity (TH) sensor current consumption
Asoff,TH | 1.5e-7 A | TH sleep current
Tsampl,TH | 2e-5 s | TH sampling time
Asactive,AL | 150e-6 A | Ambient light (AL) sensor current consumption
Asoff,AL | 0.01e-6 A | AL sleep current
Tsampl,AL | 1e-5 s | AL sampling time
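To make the table concrete, a per-cycle energy estimate can be assembled as E = Vbatt · Σᵢ Iᵢ · tᵢ over the listed currents and activity times. The sketch below assumes an illustrative cycle (sample the TH and AL sensors, store two 16-bit readings, transmit one packet); this breakdown is our assumption for demonstration purposes, not the paper's exact power model.

```python
# Illustrative per-cycle energy estimate built from the Table 1 parameters.
# The activity breakdown (sample TH and AL, store two readings, send one
# packet) is an assumption for illustration, not the paper's exact model.
VBATT = 3.3            # V, battery voltage
T_SETUP = 5e-4         # s, setup time
A_SAMPLE = 1.1e-3      # A, ADC current
T_STORE = 5.25e-5      # s, per 16-bit word stored in NVM
A_STORE = 7.5e-3       # A, current when saving in NVM
A_SEND = 3.1e-2        # A, transmission current
T_SETUP_RADIO = 5e-3   # s, transceiver setup time
S_TX = 150e3           # bit/s, transmission throughput
B_PKT = 127 * 8        # bit, packet size

T_SAMPL_TH, T_SAMPL_AL = 2e-5, 1e-5   # s, per-sensor sampling times

def energy(current_a: float, time_s: float) -> float:
    """E = V * I * t, in joules."""
    return VBATT * current_a * time_s

e_sample = (energy(A_SAMPLE, T_SETUP + T_SAMPL_TH)
            + energy(A_SAMPLE, T_SETUP + T_SAMPL_AL))
e_store = 2 * energy(A_STORE, T_STORE)                 # two 16-bit readings
e_send = energy(A_SEND, T_SETUP_RADIO + B_PKT / S_TX)  # one full packet

print(f"sample: {e_sample * 1e6:.2f} uJ, store: {e_store * 1e6:.2f} uJ, "
      f"send: {e_send * 1e6:.2f} uJ")
```

Even this coarse estimate reflects the trade-off explored in Figures 7 and 8: the radio dominates the per-cycle budget, which is why reducing the number of transmitted samples pays off at the network level.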
