1. Introduction
Networks of automatic measuring stations are used to define and monitor air quality on a local level. They are equipped with reference instruments for measuring concentrations of pollutants defined by the Air Quality Directive 2008/50/EC [
1] such as sulphur dioxide (SO
2), carbon monoxide (CO), nitrogen dioxide (NO
2), ozone (O
3) and particulate matter (PM
10 and PM
2.5). The gaseous pollutant measurements are obtained using high performance reference gas analyzers that are selective and precise towards targeted gases. The use of these analyzers is required for regulatory purposes; however, because of their high price point, complicated maintenance, calibration and use, performing measurements with them is challenging. In the last several years, we have seen the development of a complementary method for monitoring air quality in the use of sensors for many non-regulatory applications such as understanding local air quality trends, identifying hot spots and supplemental monitoring [
2,
3,
4,
5]. Sensors are small-scale devices that considerably reduce installation and maintenance cost and allow users larger spatial coverage [
6,
7], but the data quality from these technologies is highly variable [
8], and it can be influenced by meteorological conditions [
9].
The European Union (EU) and the United States have funded several projects in the recent years to evaluate sensor monitoring technologies and establish networks for testing periods [
10,
11,
12,
13]. There is consensus in the scientific community that the sensor equipment should be carefully characterized to meet the expectations of their specific applications as equipment to monitor outdoor or indoor air [
14]. Since the publication [
15], which recognized the role of sensors in monitoring air quality in the future, there has been a number of studies on the development and applications of sensors and their networks. Considering the fact that there are many versions and manufacturers of the sensors for each pollutant, they show different results and present issues when compared to each other and to reference methods. Some sensors have shown decent performance but only in certain conditions. Previous research papers either focused on characterizations and descriptions of one group of sensors, such as for monitoring particulate matter (PM) [
2,
16,
17,
18,
19,
20] and gaseous pollutants [
21] and for evaluating crowdsourced monitors or offering a general overview of the state of the art and relevant applications [
22,
23]. Other papers and studies on sensors are oriented to the better calibration of sensors [
24,
25,
26,
27,
28], but there are few studies in which sensor measurements are compared with reference methods [
29,
30].
Scientific papers and studies oriented toward the comparison and evaluation of particulate matter sensors usually compare results with non-reference methods for particulate matter analysis or with reference methods for specific purposes or for a shorter time frame [
31,
32,
33,
34,
35]. Automatic analyzers for particulate matter are not reference analyzers and as such show large discrepancies in results from the reference method. Because of that, it is necessary to perform equivalence testing for every non-reference method according to the Guide to the demonstration of equivalence of ambient air monitoring methods [
36].
Regarding the general air quality in Zagreb, the problem of PM air pollution is characteristic in the winter months, while the problem of ground-level ozone pollution is characteristic in the summer months. PM air pollution in Zagreb is primarily associated with the combustion of low-quality fuels, wood and coal in domestic heating sources. In addition to adverse meteorological conditions during the winter, gas and particle emissions generally increase. Long-term trends show that the PM levels are decreasing over the years [
37].
The aim of this paper, therefore, was to compare results from several sensors with validated reference data for the period of one year. With that comparison, we hoped to obtain a better picture on how sensors behave in different conditions in outdoor air. According to the results, we can plan future research on sensors and determine how to improve their results in real conditions. The aim of the study was also to provide insights into the behavior of the sensor using only factory calibration and to see how many deviations there are compared to reference methods if no calibrations are carried out at the beginning of the measurement campaign.
2. Materials and Methods
Starting with 2018, the project “Eco Map of Zagreb” (granted by the City of Zagreb) began, in which part of it included installing 8 sensor sets (type AQMeshPod) for air quality monitoring in the city. Zagreb is the capital of Croatia, situated in the northeastern part of the country, at the foot of Mt. Medvednica to the north and by the Sava River to the south with a continental climate. Although measurements of air quality have been performed continuously since 1999, until 2018 there were no measurements of air quality with air sensors.
From January to December of 2018, a set of sensors was set up at an automatic measuring station at the Institute for Medical Research and Occupational Health (IMROH, Zagreb, Croatia) for comparison with reference methods for air quality measurement. This automatic station is a city background station within the Zagreb network for air quality monitoring, where measurements of SO
2, CO, NO
2, O
3, PM
10 and PM
2.5 are performed using standardized methods accredited to EN ISO/IEC 17025 in accordance with the Air Protection Act [
38,
39,
40,
41] and the Regulation on Air Quality Monitoring [
42]
The air quality monitoring station where the experiment was conducted is located in the northern part of Zagreb in a predominantly residential area of the city. During the experiment, the sensor was placed on the inlet pipe of the gas analyzer at the automatic station, at a height of 4 m from the ground. The reference PM samplers were placed at the same location with inlets at a height of 1.5 m from the ground. At the time of the experiment, the station did not have meteorological parameters, so they could not be used in the comparison.
The sensor set used in the comparison is AQMeshPod, using electrochemical sensors for gas pollutants and an optical particle counter for particulate matter (
Table 1).
IMROH holds under its wing an accredited laboratory according to EN ISO/IEC 17025 and is also a reference laboratory for PM measurements for Croatia. The responsibility of the Institute as a reference laboratory is to perform equivalence testing for all non-reference PM methods in Croatia.
After one year of parallel measurements, the values were statistically compared. Hourly averages were compared for SO2, CO, NO2 and O3 gases, and concentrations of 24 h were averaged for particulate matter. The raw data of the sensor and validated data of the reference devices were used. The periods when the sensors did not work, as well as the periods during the calibration and maintenance of the reference devices, have been excluded from the statistical analysis. Data processing was performed using Excel 2016 and Statistica 14 software. Negative data values from the sensor were replaced by the minimum detection limits of the reference methods. Data were compared using Student’s t-test dependent samples and a regression analysis. A Grubb test was performed to remove the outliers of data pairs from the database. The Grubb test is not applicable for gaseous pollutants due to a high dispersion of results. For PM10 and PM2.5, the Grubb test was satisfactorily applied. In the data analysis, only the data of the continuous seasons during 2018 were taken to avoid the aging of the sensor and the decrease in the sensitivity of the sensor. The data for the continuous seasons was taken as follows: winter (January–March), spring (March–June), summer (June–September) and autumn (September–December). The reason for this was to avoid potential declines in sensor sensitivity.
In order to eliminate large discrepancies between the results for the sensors and the reference methods for gaseous pollution and to reduce the scatter of the results, data filtering was performed. In the filtering process, pairs of results were eliminated in which the deviation between the results determined by the sensors and the results determined by the reference methods was more than 90%. The newly acquired data set was again subjected to the Grubb test. The Grubb test was run until the critical factor of 5% confidence level.
3. Results
Table 2 shows the annual statistical parameters for the reference method and sensor during 2018. The data presented in
Table 2 are the number of pairs of results (N), data coverage (DC), lowest concentration in the indicated period (C
min), highest concentration in the indicated period (C
max) and median (Median). The lowest data coverage in the comparative was 77.7 for SO
2, while the highest was 98.9 for PM
2.5.
Table 3 shows the annual statistical parameters after the filtration process. It can be seen that the data coverage is the lowest for SO
2 at 33.9 and the highest for CO at 88.4. The filtering process was not performed for the particulate matter.
Table 4 shows the annual statistical significance for the reference method and sensor during 2018. The data presented in
Table 4 are the mean difference between pairs (Δ), standard deviation (Std (Δ)), standard error (SE (Δ)),
t-ratio in the
t test (
t), correlation coefficient (r) and t-ratio at the given confidence range (
tcrit (
p = 0.95)).
Table 5 shows the annual statistical significance after the filtration process. In the filtering process, pairs of results were eliminated in which the deviation between the results determined by the sensors and the results determined by the reference methods was more than 90%. The filtration process was not applied to the particulate matter.
A comparison of the results obtained by sensor measurements and the results obtained by reference methods indicated that the scattering was too large for all the gaseous pollutants determined (SO
2, NO
2, O
3 and CO) (
Table 2,
Figure 1,
Figure 2,
Figure 3 and
Figure 4). For this reason, data filtration was performed by rejection from further analysis of pairs of results in which the deviation between the results determined by the sensors and the results determined by the reference methods was more than 90%.
A comparison of the results determined by the sensors and the results determined by the reference methods for the PM
10 and PM
2.5 particulate matter fractions indicates satisfactory low scattering (
Figure 5). For this reason, no data filtering was necessary, and the requirement for data coverage under the Air Protection Act [
38,
39,
40,
41] and the Air Quality Monitoring Regulation [
42] was also satisfied.
In the winter part of the year, the concentration of some pollutants in the environment is higher, such as NO
2 and PM particles, while the concentration of the pollutant O
3 is lower. At low concentrations and small oscillations in temperature, the sensor for O
3 shows a better correlation with the reference method. At higher concentrations of PM particles, a good correlation with reference methods can be seen (
Table 6,
Figure 6).
With the beginning of the warmer parts of the year, SO
2 and O
3 concentrations increase, while PM particle concentrations decrease due to the reduction in pollution sources (house heating). With the increase in concentrations, the sensor for SO
2 shows a better correlation with the reference method, while for the sensor for O
3, the correlation worsens due to the greater difference in temperatures between morning and afternoon temperatures, which has a bad effect on the sensor’s operation. The decrease in the concentration of PM particles drastically worsens the correlation (
Table 7,
Figure 7).
In the summer, when temperatures are highest in Zagreb, there are differences between the morning temperature when the humidity is higher and the afternoon temperature when the humidity is low but the temperature is high, and this results in the sensors showing a bad correlation. By moving into cooler parts of the year, the correlations improve for SO
2, O
3, CO and PM (
Table 8 and
Table 9,
Figure 8).
The results of the regression analysis showed a strong seasonal dependence for all pollutants. The regression analysis for SO
2, CO, PM
10 and PM
2.5 shows a better correlation for seasons when concentrations were increased, while for seasons when concentrations were lower the correlation between sensor and reference data is very poor. For the O
3 sensor, the correlation is better for the seasons when the concentrations are lower compared to the seasons when the concentrations are higher. For the NO
2 sensor, there is no correlation by season, nor is it related to the concentration level. Observing unfiltered data (
Figure 6,
Figure 7,
Figure 8 and
Figure 9), the highest correlation between sensors and referent methods was obtained for CO and O
3.
For sulphur dioxide (SO2), the total data coverage for 2018 before filtering was 77.7% and, after filtering, 33.9%, which did not satisfy the criterion of data coverage. The statistics show that there was a significant statistical difference between the reference method and the sensors both for the whole year and in all seasons.
For nitrogen dioxide (NO2), the total data coverage for the year 2018 before filtering was 77.7% and after filtering the data amounted to 38.8%, thus not meeting the criterion of data coverage in accordance with the regulations. The statistics show that there was a significant statistical difference between the reference method and the sensors both for the whole year and in all seasons.
For ozone (O3), the total data coverage for 2018 was 81.1% and, after filtering the data, 77.8%, thus not meeting the data coverage criteria in accordance with the regulations. The statistics show that there was a significant statistical difference between the reference method and the sensors both for the whole year and in all seasons.
For carbon monoxide (CO), the total data coverage for 2018 before filtering was satisfactory (90.3%), but after filtering data it was 88.4% and thus did not meet the minimum data coverage criterion. The statistics show that there was a significant statistical difference between the reference method and the sensor, except in the fall.
For PM
10, the total coverage of data for 2018 was 94.5%, thus satisfying the criterion of data coverage according to the Air Protection Act and the Regulation on Air Quality Monitoring [
38,
39,
40,
41,
42]. The statistics show that there was a significant statistical difference between the reference method and the sensor.
For PM
2.5, the total data coverage for 2018 was 98.9%, thus satisfying the data coverage criterion according to the Air Protection Act and the Regulation on Air Quality Monitoring [
38,
39,
40,
41,
42]. The statistics show that there was no significant statistical difference between the reference method and the sensors for the whole year and summer, while in the other seasons (winter, spring and autumn) there was a statistically significant difference.
The main problem in this project is the calibration of sensors, and many papers have suggested that better results are obtained with better calibration [
25,
26]. Also, as said before, the first step before stating the measuring campaign is to perform field calibration, and as that calibration was not performed, the results were not so promising. But, according to many papers which oriented on the calibration of sensors, it is clear that not only field calibration before measurements is needed but also calibrations before every season of the measurements [
13,
30].
4. Conclusions
This study has shown that the results obtained from sensory measurement techniques are highly seasonally dependent, suggesting the sensitivity of meteorological conditions to the measurement technique. Seasonal dependence, as well as poor data coverage due to failures (especially in the determination of SO2, NO2 and O3) and wastefulness of the results, call into question their application in the continuous monitoring of air quality. The best agreement with the reference method was shown by CO sensors and the PM2.5 particulate matter fractions, while the PM10 and PM2.5 sensors showed the least scatter and satisfactory data coverage. The results also indicate the need to make calibrations at the measurement site before starting the measurement campaign itself. The calibration was not performed in the field, but a factory calibration was used, which was found to be insufficient.
This study indicates that the use of factory calibration is not sufficient to obtain high-quality measurements by sensors and that seasonal calibration is needed. Factory calibrations show good calibration at higher concentrations for most pollutants except for O
3, but such calibration turned out to be very poor at low concentrations of most pollutant parameters except for O
3. The experiment shows the need for field calibration at least for warmer and colder parts of the year, although field calibration for each season would be better. Future research will focus on improving seasonal calibrations using calibration chambers for gaseous sensors and field calibrations for PM sensors. It is also necessary to examine the transfer of calibrations to different types of stations, which was attempted in some experimental measurements [
13,
30]. Today’s use of artificial intelligence (machine learning and neural networks) can greatly help in better calibration of sensors for specific climate conditions.
Today’s policy of the European Union, influenced by AQUIL-a and reference laboratories of the European Union, is to get better control of the use and calibration of sensors. Within a few years, the plan is to pass legislation related to sensor calibration. This is why it is essential to research better calibration methods using newer tools in the field of measurements and computer science.