1. Introduction
Living with continuous exposure to high levels of traffic noise has been proved to be harmful for health, affecting the quality of life of people living in urban and suburban areas [
1]. Several actions have been conducted to address this problem, taking account of the European Noise Directive 2002/49/EC (END) [
2] and the consequent strategic noise mapping assessment CNOSSOS-EU [
3], which are the main instruments at the European level devoted to elaborate specific action plans to mitigate noise pollution.
Noise measurements in cities have been historically conducted by professionals, recording and analyzing the data in specific locations and time periods by using certified sound level meters. The noise maps are then generated from these noise level measurements by means of the application of complex acoustic models after data post-processing; maps that should be updated and published every five years to fulfill the END requirements [
2]. However, this approach becomes difficult to scale when more measurements and/or locations are needed. The Internet of Things and the Smart City frameworks have allowed a change of paradigm for the city noise management and policy generation by means of the deployment of Wireless Acoustic Sensor Networks (WASN) [
4]. The core idea of the development of these networks is designing and deploying low-cost reliable and scalable automatic sensing system. Several projects based on WASN have been developed in Europe, such as the RUMEUR network [
5] in France, which actually is an hybrid network: it includes both high accuracy equipment for critical places, like airports, where the focus is to obtain detailed acoustic information due to the intense noise environment, together with less precise measuring equipment in other locations where the goal is only the updating of the noise map. Other projects are the IDEA project [
6] in Belgium, or the SoNYC project [
7] in New York City, which provides a description of the acoustic environment of the city.
Recently, the DYNAMAP project aims at developing a dynamic noise mapping system to represent the acoustic impact of the road traffic noise in real time in two Italian pilot areas, Rome and Milan [
8]. The WASN envisioned for the DYNAMAP project is hybrid, composed of both low-cost high-capacity sensors and low-cost low-computation capacity sensors, as in RUMEUR network [
5]. The DYNAMAP project takes into account an acoustic measurement challenge not faced by the previous project as it considers the inclusion of an Anomalous Noise Event Detection algorithm [
9] to provide a reliable picture of actual road traffic noise (RTN). Moreover, anomalous noise events (e.g., aircrafts, trains, bells) should be automatically identified and removed in order to cope with the noise source distinction required by the END [
2]. In order to remove this kind of events from the RTN measurement of the WASN, the ANED algorithm has been designed as a two-class classifier (ANE vs. RTN) [
9], whose outputs are obtained following a two-level decision process; the frame-level binary decisions are integrated through a high-level decision based on a majority voting scheme computed in a certain interval. The algorithm has been developed and tested in the two pilot areas of the DYNAMAP project—the reader can find more information about the algorithm details and performance evaluation in [
9].
The ANED algorithm has been initially designed and developed to run on the high-capacity sensors taking advantage of their computing power. In this paper, the preliminary analysis of the viability to adapt the ANED to the low-capacity sensors (denoted as ANED LowCap) is presented. The core idea of the ANED LowCap proposal is to reduce its computational complexity by means of proposing a simplified classification scheme based on a signal threshold obtained directly from the A-weighted band energy filtering process of the raw acoustic signal. In this small piece of research we present the first analysis conducted over two recorded and labelled datasets in Milan and in Rome [
10], to test the viability of the band energy study as a basic binary classifier using a threshold.
This paper is structured as follows. In
Section 2, the proposal to adapt the ANED algorithm to run on the low-cost low-capacity acoustic sensors is presented. In
Section 3, we describe the analysis of viability of the approach over raw acoustic data collected from both suburban and urban scenarios of operation. In
Section 4, we discuss the open questions foreseen from the results obtained in the previous section, which should be tackled in future works in order to implement the ANED LowCap.
2. ANED LowCap Proposal
The ANED LowCap algorithm proposal is mainly focused on the reduction of the computational complexity of the ANED algorithm designed to run on the high-capacity sensors [
9] while maintaining its capability to identify and discard ANE from the RTN noise level computation. In order to implement the algorithm in an affordable low capacity computational platform with a microcontroller as a core, a change of paradigm is mandatory, both in terms of feature extraction and classification, so as to satisfy the severe decrease of the computational cost.
In
Figure 1, the block diagram of the ANED version of adapted to run on the low computation capacity sensors is depicted. The signal level computation block obtains a variable that is directly related to the signal acoustic level computed per several spectral bands, by including a band-pass filtering step. Then, the classification process consists of a simple comparison between the signal level measure and the optimized threshold
. With the aim of obtaining reliable output estimation of the RTN and ANE labels, the threshold
should be calculated in an optimization process based on audio databases analysis.
3. Analysis
The analysis of the viability of the aforementioned approach is based on the computation of the spectral envelopes of the recorded audio signals, windowed at 30 ms, and its subsequent clustering. The purpose of this cluster analysis is to evaluate to what extent there are significant spectral differences between RTN and ANE for each database (Rome and Milan) at certain spectral bands, from real-life collected acoustic data.
Two types of analysis have been conducted after the parametrization of the computed spectral envelopes of each database using Gammatone Cepstral Coefficients (GTCC) [
11]:
3.1. RTN Spectral Analysis
Figure 2 shows the clustering analysis of the RTN class in Rome for 1, 2, 4 and 6 clusters. The idea of testing with several clusters in the RTN class comes from the fact that there are several types of traffic noise, depending on the sound source. The ANE class has labelled each different type of sound depending on the source (e.g., people, siren, thunder), but in the RNT this labelling process would be diffuse, because sometimes several types of traffic occur together. Analyzing with different types of clusters for RTN will give information about the diversification of the traffic noise.
In the following figures, the mean frequencies are represented with solid lines while the mean plus and minus the standard deviation are depicted using dashed lines.
Figure 3, depicts the same information for the city of Milan.
It is worth noticing that RTN exhibits different patterns for the city of Rome (suburban environment) and the city of Milan (urban environment). Moreover, both
Figure 2 and
Figure 3 show that an increase in the number of GMM components makes the results to better reflect the different intensity profiles present in the RTN measurements. In this sense, in the suburban environment of the Rome ring, more homogeneous profiles are observed than in the urban context of the Milan city, as they contain RTN coming from more regular traffic conditions as the sensors are placed on the portals of the Rome’s A90 motorway.
3.2. ANE Spectral Analysis
In
Figure 4 and
Figure 5, the cluster analysis of the ANE class is depicted for both Rome and Milan cities. The depicted ANE are different in the suburban and urban scenarios. Again, the mean frequency is drawn using solid lines and the mean plus and minus the standard deviation is plotted using dashed lines.
From these results, on the one hand, we can observe that ANE and RTN exhibit similar patterns for each city database. On the other hand, subtle but not negligible differences can be observed when comparing statistics of the clustered spectral envelopes of the RTN and ANE classes across the 48 frequency bands.
Therefore, it can be concluded that the differences found between RTN and ANE for each scenario can be of use for the implementation of the ANED LowCap. Nevertheless, the specific study of which frequency subbands should be considered in the signal level module of
Figure 1 capable of providing the best discrimination capabilities is a matter of research left for future work.
4. Discussion
The results of the spectral analysis conducted over real-life acoustic data collected from both suburban and urban environments show significant differences between the RTN and the ANE spectral distributions. Thus, it seems viable to take into account these differences in order to implement the ANED LowCap version for low-cost low-capacity acoustic sensors. However, it is difficult to determine which are the GTCC bands where spectral differences between RTN and ANE occur, and hence, should be selected to derive a simple yet effective threshold-based binary classifier (i.e., a low-cost solution with higher reliability than using only LAeq computed from the wide-band audio signal spectrum). For this reason, an in-depth quantitative analysis based on the identification of the frequency bands yielding lower error probability rates is proposed to be the next step to design the ANED LowCap algorithm version.
The future analysis of the ANED LowCap is devised following the next steps: (i) first, a preliminary approach based on a simple thresholding considering the LAeq measures over the full-band signal can be firstly explored as a means of obtaining a baseline system with which to compare; (ii) second, the classification threshold should be adjusted to differentiate between ANE and RTN categories optimally for each frequency band of interest to define the decision rules of the low-capacity ANED; (iii) and finally, a validation of the algorithm is foreseen to assess the performance of the design in suburban and urban scenarios in comparison with the ANED version already developed for the low-cost high-capacity acoustic sensors composing the hybrid WASN of the DYNAMAP project.