Assessment of the Variability of Air Pollutant Concentrations at Industrial, Trafﬁc and Urban Background Stations in Krakow (Poland) Using Statistical Methods

: In cities with an extensive air quality monitoring (AQM) system, the results of pollutant concentration measurements obtained in this system can be used not only for current assessments of air pollution, but also for analyzes aimed at better identiﬁcation of factors inﬂuencing the air quality and for tracking trends in changes taking place in this regard. This can be achieved with the use of statistical methods that allow for the assessment of the variability of measurement data observed at stations of various types and for the determination of possible interdependencies between these data. In this article, an analysis of this type was carried out for trafﬁc, urban background and industrial AQM stations in Krakow (Southern Poland) operating in the years 2017–2018 with the use of, i.a., cluster analyzes, as well as dependent samples t -test and Wilcoxon signed-rank test, taking into account the concentrations of air pollutants such as ﬁne particulate matter (PM 10 ), nitrogen dioxide (NO 2 ), benzene (C 6 H 6 ) and sulfur dioxide (SO 2 ). On the basis of the conducted analyzes, similarities and differences were shown between the data observed at individual types of stations, and the possibilities of using them to identify the causes of the observed changes and the effects of remedial actions to improve air quality undertaken recently and planned in the future were indicated. It was found that the air concentrations of some substances measured at these stations can be used to assess the emission abatement effects in road transport (NO 2 , PM 10 or C 6 H 6 ), residential heating (PM 10 or SO 2 ), and selective industrial plants (SO 2 , NO 2 or C 6 H 6 ).


Introduction
Air quality in urban-industrial areas is determined primarily by the emission of pollutants from, residential, traffic and industrial sources as well as meteorological and topographic factors influencing transport, dispersion and conversions of pollutants in the atmospheric air. The assessment of the contribution of particular types of sources to the level of pollutant concentrations in the air is usually very labor intensive. It requires a detailed inventory of all significant emission sources influencing the air quality in a given region (along with the determination of the amount of pollutants emitted from individual sources and its temporal variability), as well as determination of the concentrations caused by individual emission sources, which is most commonly carried out using deterministic models that allow to simulate the transport and dispersion of pollutants [1][2][3]. It is also possible to apply receptor modeling allowing, among others, for the assessment of the contribution of natural and other emission sources located at greater distances, not usually included in modeling carried out with the use of deterministic models, as this would require balancing the emission over a larger area and the application of regional and mesoscale meteorological and dispersion models [4][5][6]. Receptor models are most often based on the results of measurements of meteorological parameters, concentrations of various air Sustainability 2021, 13, 5623 3 of 23 in Krakow (including the prohibition of using solid fuels with effect from September 2019) or lockdowns resulting from the COVID-19 pandemic.
Statistical analyzes and tests performed in this work (including cluster analysis, dependent samples t-test and Wilcoxon signed-rank test ) have not yet been applied to such an extent in relation to data from the public air quality monitoring system in Krakow. The results of air quality measurements in Krakow from the public monitoring system, possibly supplemented with additional measurement data and analyzes, were previously used mainly for the assessment of temporal and spatial variability of air pollution, the identification of emission sources and other factors influencing the level of concentrations of pollutants or their mutual correlation, the estimation of the impact of air pollutants on the health of residents, as well as the assessment or development of deterministic and statistical (stochastic) air quality models.
In works [56,57] the correlation analyses and the dependent samples t-test was used, with respect to PM 10 concentration data, in suitability analysis of established in 2016 and 2017 air quality monitoring stations in Krakow (after a significant expansion made in 2015-2017 in the field of PM 10 concentration measurements). Using statistical methods, the impact of meteorological factors on air quality in Krakow was also assessed [58][59][60][61][62][63][64] and source apportionment analyzes were performed based on the principal component analysis (PCA) [24,[65][66][67] or the positive matrix factorization (PMF) receptor model [68][69][70][71]. Previously, a comprehensive source apportionment analysis for measurements from January-February 2005 was performed by the JRC Institute for Environment and Sustainability using seven multivariate receptor models [72,73].
As it results from the previous publications and analyzes of available monitoring data, the air pollution in Krakow in the last 30 years fluctuated under the influence of various meteorological conditions and the activity of emission sources but is generally characterized by a downward trend [74,82]. A dozen or so years ago, the combustion of solid fuels (mainly coal) in small furnaces and boilers (related to residential heating) and industrial energy production had the dominant contribution to the high levels of PM 10 in the air in the winter months [72,73]. Thanks to the large-scale elimination of stoves and boilers fired with solid fuels carried out in recent years and replacing them with more ecological energy sources and reducing the emission of pollutants into the air from local industrial plants, emissions from road transport have started to play an increasingly important role for the air pollution in Krakow with regard to fine dust (including engine emission of soot), as well as dust drifted by wind from the road surface and during construction works in dry periods [63,84,[94][95][96].
Despite the relatively large number of studies focused on the identification of factors determining air quality in Krakow and the changes taking place in this area, a significant part of them was based on receptor modeling and the results of analyzes limited to single measurement points or a simple assessment of the trends taking place. There is no broader approach to the analyzes performed, which would enable the exploration of data from the air quality monitoring system for all types of stations in order to better recognize their variability and possible interdependencies. The following sections provide materials and methods used in this paper (Section 2), the results obtained (Section 3), discussion (Section 4) and conclusions (Section 5).  The location of these stations was presented on Figure 1.

Data
Measurement results derived from eight air quality monitoring (AQM) stations in Krakow, spanning the period 2017-2018, were applied in this study. The stations are operated by the Inspectorate for Environmental Protection (IEP) [102,103].
Three following types of measuring stations, according to the classification defined in the State Environmental Monitoring, were included in the analyses: The location of these stations was presented on Figure 1. The measurements with 1-h resolution of following substances concentrations were analyzed: particulate matter PM10; nitrogen dioxide (NO2), sulfur dioxide (SO2) and benzene (C6H6). The concentrations were measured automatically (continuous measurements) with resolution of 1 h using following methods (equipment): • PM10-β-ray attenuation (Bam 1020 fine dust monitor/Met One Instruments, Inc.) or light scattering (Grimm EDM 180 or Fidas ® 200 E fine dust monitor); • NO2-chemiluminescence (the model T200 NO/NO2/NOx analyzer/Teledyne API); • C6H6-gas chromatograph with a built-in Tenax GR pre-concentration system and PID detector (Syntech Spectras GC955 series 600 BTEX analyzer); • SO2-UV fluorescence (the model T100 SO2 analyzer/Teledyne API).
All the measurement methods used ensure appropriate data quality in accordance with [12] and the measuring ranges of the equipment used are adequate to the pollutant concentrations occurring in the air. The type of pollutants measured continuously in the analyzed period at individual stations is presented in Table 1. The measurements with 1-h resolution of following substances concentrations were analyzed: particulate matter PM 10 ; nitrogen dioxide (NO 2 ), sulfur dioxide (SO 2 ) and benzene (C 6 H 6 ). The concentrations were measured automatically (continuous measurements) with resolution of 1 h using following methods (equipment): • PM 10 -β-ray attenuation (Bam 1020 fine dust monitor/Met One Instruments, Inc.) or light scattering (Grimm EDM 180 or Fidas ® 200 E fine dust monitor); • NO 2 -chemiluminescence (the model T200 NO/NO 2 /NO x analyzer/Teledyne API); • C 6 H 6 -gas chromatograph with a built-in Tenax GR pre-concentration system and PID detector (Syntech Spectras GC955 series 600 BTEX analyzer); • SO 2 -UV fluorescence (the model T100 SO 2 analyzer/Teledyne API).
All the measurement methods used ensure appropriate data quality in accordance with [12] and the measuring ranges of the equipment used are adequate to the pollutant concentrations occurring in the air. The type of pollutants measured continuously in the analyzed period at individual stations is presented in Table 1.  1 Substances not included in this work. 2 The station operated at this location until May 2018.

Principles of the Methodology
In order to assess the variability of data series originating from specific type of stations, a range of statistical tools enabling to compare data series and analyze the degree of similarity between them was applied. There was an assumption that a particular type of station is characterized by time series of concentrations of each substance, averaged for all stations representing this type.
First of all, the mean values for each type of stations were calculated. This means that for every 1-h record there was calculated mean value from all stations belonging to particular type of AQM station. The obtained time series of "station type averages" (STA) for every substance were further analyzed. The calculated ratios between STA were considered as the basic measure of similarity. These data were then grouped by different time intervals (hours, months, seasons) and presented on plots.
In the purpose of better understanding the similarities and differences between the data, cluster analysis and statistical significance tests were also applied.
In summary, the assumptions of created methodology are as follows: 1. Time series of averaged concentrations for each type of AQM station (traffic, urban background, industrial) were taken into account in the analyses and called "station type averages" (STA). In the case when the concentrations of specific pollutant from only one station were available, these data were adopted and considered as the STA; 2.
Urban background stations are treated as reference stations, i.e., when analyzed records from traffic or industrial stations, they were related to urban background stations.

Statistical Techniques and Visualisations
Data series of PM 10 , NO 2 , C 6 H 6 and SO 2 concentrations in the air, averaged according to the above-mentioned principles, were presented on scatter plots generated via R programming environment [104][105][106][107]. The special mode of these plots (hexagonal binning), aggregating dense groups of data points into polygons, was applied. This allowed us to not only examine relationships between variables but also to analyze the density of data points in particular part of the graph.
In order to investigate the variability of station types the concentration ratios of analyzed substances were calculated. These ratios were expressed as STA of specific pollutant on traffic/industrial stations divided by STA of the same substance on urban background stations (according to the methodology). The time series of raw data and the time series of the calculated ratios were visualized on timeVariation plots, produced in R programming environment [105] (package openair [106]). Other graphs were prepared in ggplot2 package [107], also in R.
The agglomerative hierarchical clustering was conducted in this research. As a distance measure the Euclidean distance was applied and Ward linkage was selected for the purpose of determining the proximity between clusters. Ward linkage is considered the most effective method [41]. The station types for each substance were clustered in our Sustainability 2021, 13, 5623 6 of 23 research and the interpretation of possibly created clusters of different substances (e.g., PM 10 and C 6 H 6 ) was of less interest.
In this study, two different tests were performed and compared: the dependent samples t-test and the Wilcoxon signed-rank test. Before performing tests, data distribution was examined for normality. As it was proved that the distribution of the input data is not normal, the parametric tests properly should not be used. However, the data set consists of several thousands of observations. In such case, the combination of methods was proposed: t-test, Wilcoxon test and t-test after Box-Cox transformation, which ensured the normality of distribution. The results obtained from each of the methods were compared. Significance tests and Box-Cox transformation were carried out in Statistica 13.

Statistical Distribution of 1-Hour Concentrations
The statistical distribution of 1-h concentrations of the analyzed pollutants in the air for individual types of air quality monitoring stations in Krakow in 2017-2018 is presented in Figures 2 and 3 in the form of boxplot charts. A horizontal dash inside the box represents the median, and the border dashes of the box represent the 25% and 75% quantiles, respectively. Outliers are also shown in these figures (values greater than 150% of the interquartile range).
The agglomerative hierarchical clustering was conducted in this research. As a distance measure the Euclidean distance was applied and Ward linkage was selected for the purpose of determining the proximity between clusters. Ward linkage is considered the most effective method [41]. The station types for each substance were clustered in our research and the interpretation of possibly created clusters of different substances (e.g., PM10 and C6H6) was of less interest.
In this study, two different tests were performed and compared: the dependent samples t-test and the Wilcoxon signed-rank test. Before performing tests, data distribution was examined for normality. As it was proved that the distribution of the input data is not normal, the parametric tests properly should not be used. However, the data set consists of several thousands of observations. In such case, the combination of methods was proposed: t-test, Wilcoxon test and t-test after Box-Cox transformation, which ensured the normality of distribution. The results obtained from each of the methods were compared. Significance tests and Box-Cox transformation were carried out in Statistica 13.

Statistical Distribution of 1-Hour Concentrations
The statistical distribution of 1-h concentrations of the analyzed pollutants in the air for individual types of air quality monitoring stations in Krakow in 2017-2018 is presented in Figures 2 and 3 in the form of boxplot charts. A horizontal dash inside the box represents the median, and the border dashes of the box represent the 25% and 75% quantiles, respectively. Outliers are also shown in these figures (values greater than 150% of the interquartile range). Traffic stations (TS) are characterized by the highest median values of NO2 and PM10 concentrations, compared to other station types, which is reflected in the boxplots ( Figure  2). In the case of C6H6 and SO2 the median concentrations for individual types of monitoring stations are comparable to each other-there is no noticeable difference  Figure 3). It can be noted that there occur outlying points, which can be several times higher than the respective median value.

Interdependencies of 1-Hour Concentrations between AQM Station Types
There is an observed clear linearity between a certain type of station (traffic and industrial, respectively) in relation to the values measured at urban background stations (UBS) in case of PM10 concentrations ( Figure 4). The highest density of observations on the scatterplots falls on the values of 100 μg/m 3 or lower. In contrast to the PM10, the relationship between NO2 concentration for traffic and urban background stations is likely to be non-linear ( Figure 5). There is observed a similar pattern regarding C6H6 and SO2 concentrations (Figures 6 and 7). In order to better visualize the relationships between the 1-h concentrations observed at different types of AQM stations, the Pearson correlation coefficient (r) and Spearman's rank correlation coefficient (rs) are also presented in Traffic stations (TS) are characterized by the highest median values of NO 2 and PM 10 concentrations, compared to other station types, which is reflected in the boxplots (Figure 2). In the case of C 6 H 6 and SO 2 the median concentrations for individual types of monitoring stations are comparable to each other-there is no noticeable difference ( Figure 3). It can be noted that there occur outlying points, which can be several times higher than the respective median value.

Interdependencies of 1-Hour Concentrations between AQM Station Types
There is an observed clear linearity between a certain type of station (traffic and industrial, respectively) in relation to the values measured at urban background stations (UBS) in case of PM 10 concentrations ( Figure 4). The highest density of observations on the scatterplots falls on the values of 100 µg/m 3 or lower. In contrast to the PM 10 , the relationship between NO 2 concentration for traffic and urban background stations is likely to be non-linear ( Figure 5). There is observed a similar pattern regarding C 6 H 6 and SO 2 concentrations (Figures 6 and 7). In order to better visualize the relationships between the 1-h concentrations observed at different types of AQM stations, the Pearson correlation coefficient (r) and Spearman's rank correlation coefficient (r s ) are also presented in  Sustainability 2021, 13, x FOR PEER REVIEW 7 of 25 ( Figure 3). It can be noted that there occur outlying points, which can be several times higher than the respective median value.

Interdependencies of 1-Hour Concentrations between AQM Station Types
There is an observed clear linearity between a certain type of station (traffic and industrial, respectively) in relation to the values measured at urban background stations (UBS) in case of PM10 concentrations ( Figure 4). The highest density of observations on the scatterplots falls on the values of 100 μg/m 3 or lower. In contrast to the PM10, the relationship between NO2 concentration for traffic and urban background stations is likely to be non-linear ( Figure 5). There is observed a similar pattern regarding C6H6 and SO2 concentrations (Figures 6 and 7). In order to better visualize the relationships between the 1-h concentrations observed at different types of AQM stations, the Pearson correlation coefficient (r) and Spearman's rank correlation coefficient (rs) are also presented in Figures 4-7.
(a)      The results of performed cluster analysis are presented in Figure 8 and Tables 2 and  3. Two variants of analysis were conducted, with and without outliers, in order to assess the effect of outliers exclusion. In both cases Euclidean distance was applied as a measure of similarity. Despite the slight differences for PM10 and C6H6, the main tendency of clustering IS and UBS together and TS separately was still present. For all substances,  The results of performed cluster analysis are presented in Figure 8 and Tables 2 and  3. Two variants of analysis were conducted, with and without outliers, in order to assess the effect of outliers exclusion. In both cases Euclidean distance was applied as a measure of similarity. Despite the slight differences for PM10 and C6H6, the main tendency of clustering IS and UBS together and TS separately was still present. For all substances, The results of performed cluster analysis are presented in Figure 8 and Tables 2 and 3. Two variants of analysis were conducted, with and without outliers, in order to assess the effect of outliers exclusion. In both cases Euclidean distance was applied as a measure of similarity. Despite the slight differences for PM 10 and C 6 H 6 , the main tendency of clustering IS and UBS together and TS separately was still present. For all substances, except benzene, this tendency is present both in the example with and without outliers. In the case of C 6 H 6 including outliers, the TS and UBS types are clustered together and IS type is separated from them.
Sustainability 2021, 13, x FOR PEER REVIEW 10 of 25 except benzene, this tendency is present both in the example with and without outliers.
In the case of C6H6 including outliers, the TS and UBS types are clustered together and IS type is separated from them.      (Tables 4 and 5), all analyzed differences were recognized as statistically significant (p < 0.05). This means that all types of stations differ from each other in terms of the concentration of each of the pollutants under consideration.  Figures A1 and A2 (Appendix A) and Figure A3 (Appendix B). As it results from the data presented in Figure A1 the majority of points representing single 1-h concentrations of PM 10 , which exceed 100 µg/m 3 (the information threshold for PM 10 obligatory in Poland) was recorded in winter months. This suggests that maximal concentrations are more likely to occur in the middle of the heating season. The situation is similar for the other analyzed substances (Figures A1-A3); however, a certain exception in this case is NO 2 , for which the seasonal increase in air concentrations in winter months is relatively small.

Temporal variability of 1-h air concentrations of analyzed pollutants is shown in
The black squares in Figures A1 and A2 are representing mean value grouped by hour of a day. There is observed a clear tendency of maximal and minimal values depending on the hour of a day. The maximum values among the averages calculated for a specific hour occurred in the hours 7:00-9:00 and 19:00-23:00 for the industrial station and urban background station. The discussed dependence is slightly different in the case of traffic stations. In the morning hours there is no clear maximum, while in the second half of the day the maximum falls in the time period 18:00-20:00.
The concentrations of NO 2 and PM 10 at traffic stations in each season of the year are clearly higher than the concentrations of these substances at urban and industrial background stations in the respective periods ( Figure 9). For example, in the summer, 1-h NO 2 concentrations in the afternoon at traffic stations are even five times higher than the correspondingly determined concentrations of this substance at the urban background station, while in summer this exceedance is almost twice as high. In the non-heating season (late spring and summer months) clearly increased C 6 H 6 concentrations are also observed at the traffic and industrial stations compared to the urban background station (Figure 9a).

Discussion
The period of time analyzed in this study (2017-2018) reflects the air quality in Krakow in the final period of the gradual decommissioning of low-power boilers and stoves, fired with solid fuels, but before the introduction of a complete ban on their use, introduced from 1 September 2019 [48]. Previously, there were also significant reductions

Discussion
The period of time analyzed in this study (2017)(2018) reflects the air quality in Krakow in the final period of the gradual decommissioning of low-power boilers and stoves, fired with solid fuels, but before the introduction of a complete ban on their use, introduced from 1 September 2019 [48]. Previously, there were also significant reductions in air pollutant emissions (mainly SO 2 ) from local industrial plants.
Statistical distributions of 1-h concentrations of analyzed pollutants in the air visualized using boxplots (Figures 2 and 3) show that at least half of the 1-h measurement data is concentrated near the median (narrow interquartile ranges determined by the height of the boxes). Thus, the high variation in the values of 1-h concentrations during the year results mainly from the outliers that appear periodically during particularly unfavorable weather conditions with the increased impact of local and influx emission sources. Among the analyzed substances, the highest values of their concentrations in the air (also in relation to the relevant permissible levels) are observed for PM 10 and NO 2 , and the lowest for C 6 H 6 and SO 2 . Indirectly, this indicates a relatively small impact on the air quality in Krakow of local SO 2 emission sources (decommissioning of low-power coal-fired furnaces and boilers, commissioning of the flue gas desulphurization installation in the municipal heat and power plant). The generally low air concentrations of SO 2 previously observed at the traffic station (MpKrakAlKras) [102,103], were also the basis for resigning from the measurements of SO 2 at this station a few years ago.
High correlation coefficients between PM 10 concentrations for analyzed types of AQM stations (Figure 4) result from the impact of similar emission sources (PM 10 might originate from same sources in the analyzed period). On the other hand, the lack of a strong correlation between the series of data observed at individual types of monitoring stations observed in the case of other analyzed substances (NO 2 , C 6 H 6 and SO 2 ) (Figures 5-7) confirms their significant differentiation in terms of the dominant emission sources. Thus, it indicates their proper location enabling the implementation of the main measurement objectives assigned to a given type of station. There is a weaker correlation in the case of pair TS/UBS than IS/UBS, which confirms the impact of traffic emission on the dissimilarity of these two data series. It may suggest that traffic emission from busier and more congested roads has a more pronounced impact on the measurement results than a combination of other pollution sources, including industrial emission sources.
Slightly more detailed dependencies in this respect result from the cluster analysis. Cluster analysis is a technique allowing to create groups (clusters) of data (objects, statistical observations), when there is not known any "natural" set of classes for these data. The objects belonging to particular cluster are as similar as possible to each other in this cluster and as different as possible according to objects from another clusters. Therefore, it is possible to discover some unobvious information in input data, recognize patterns hidden for "classical" statistical methods, define the level of similarity between the data features and organize data into groups, which are easy to explain and interpret. The obtained results of the cluster analysis show much smaller similarities between traffic stations and other types of stations. It can be clearly explained by higher concentrations of examined substances (especially NO 2 and PM 10 ) recorded at traffic stations, stemming from the stronger impact of traffic emission and poorer ventilation conditions in street canyons compared to the other sites. The analysis of the distance matrix (Tables 2 and 3) shows that traffic and industrial stations are the least similar to each other, and the industrial and urban background stations are the most similar. The latter regularity indicates the possibility of overlapping, under certain conditions, the impact of the urban background on the results of measurements at industrial stations, which was demonstrated for Krakow in [24]. From the main tendency of clustering IS and UBS together and TS separately ( Figure 8) it can be concluded that the traffic stations located in the center of Krakow within the second ring road fulfill their role. They record the impact on the air quality of local and transit road transport related to routes with high car traffic intensity with periodically forming traffic jams. On the other hand, industrial sources have a minimal effect on the level of pollutant concentrations measured at these stations.
Statistical significance tests are intended, among others, for verification if observed differences between statistics calculated from compared samples (e.g., mean, median) are coincidental (insignificant) or they result from a certain effect and therefore they are not coincidental (significant). Some of the significance tests are applied in situations when there are compared two samples representing various conditions of measurements. This means that all types of stations differ from each other in terms of the concentration of each of the pollutants under consideration. The conducted statistical tests (Tables 4 and 5) show that air concentrations of analyzed pollutants at individual types of AQM stations differ significantly from each other. This may indicate the impact of different groups of pollution sources on considered stations. This conclusion is confirmed by the results of the cluster analysis, especially in relation to traffic stations that were in a separate cluster from the urban background and industrial stations (Figure 8).
In the case of PM 10 , NO 2 and C 6 H 6 the highest concentration values are observed at traffic stations and the lowest at urban and industrial background stations, while the concentrations of PM 10 and C 6 H 6 are definitely higher in the heating season than in the non-heating season, and NO 2 concentrations are usually not subject to large seasonal fluctuations, as they are mainly shaped by road transport (Figures 9 and A1-A3). Increased concentration values at traffic stations are most evident in the case of NO 2 . On this basis, it can be concluded that this type of stations differs most from the others in terms of NO 2 concentration, which, among the pollutants analyzed in the study, seems to be the most typical pollution for road transport. No significant differences between the winter and summer months or the transition period for NO 2 concentrations in the air at traffic stations is confirmed by the results obtained in previous studies [22,61,75]. Strong seasonal variations of the air concentrations observed for the rest of analyzed substances can be explained, on the one hand, by higher emissions (municipal and household sector), and on the other hand, it results from weaker ventilation and a lower mixing-layer height, which was already demonstrated for previous years, e.g., in the works [58,60,64]. There is also a clear seasonal trend in the case of SO 2 concentrations, but this is reflected in the periodic impact of industrial plants (combined heat and power plant and steel mill) on the measurement results, recorded especially at an industrial station located within the range of the impact of these emission sources [24,75].
Poor air quality in Krakow, often observed during the heating season, is usually associated with unfavorable synoptic-scale (anti-cyclonic circulation), low temperature, low mixing-layer height and low wind speed, which hinders the dispersion of pollutants in the air [58,60,61,64,65], and in the case of fine solid particles with increased formation of secondary inorganic (SIA) or organic (SOA) aerosols [62,[68][69][70][71] and a large inflow background of dust pollution from adjacent areas [22,24,65,67]. Despite a significant reduction of SIA precursor emissions from large combustion plants (SO 2 and NO x ) and residential heating (SO 2 ), the contribution of SIA formation in fine particulate matter is now much greater [68][69][70] than it has been reported for several years ago [72,73]; this indicates an increase in the role of NO x (derived from combustion processes, including to a large extent from the combustion of engine fuels) in the formation of SIA, and thus their greater impact on the levels of fine particulate matter in the air [91,92,97].
Increased C 6 H 6 concentrations in the non-heating season at the traffic station (Mp-KrakAlKras) compared to the urban background station (MpKrakBujaka) (Figure 9a) indicate that the main source of C 6 H 6 in this season is emission from road transport, which is also confirmed, among others, by in the results of earlier measurements carried out in Krakow in the summer at several points in the immediate vicinity of a busy street and petrol station [108] and also in other cities in various locations [109][110][111]. Benzene concentrations measured at the industrial station (MpKrakBulwar) in the non-heating season are also significantly higher than those measured at the urban background station (Figure 9b), which results from the relatively close location of the MpKrakBulwar station to the fuel station (approximately 60 m) and a large junction (approximately 100 m), as well as within the range of the periodic impact of a coking plant (operating as part of a steelworks) located approximately 3-4 km from this monitoring station [24]. This corresponds to the research results presented by Notario et al. [111]. A clear increase in C 6 H 6 concentrations in the vicinity of petrol stations was also demonstrated, among others, by Bogacki et al. [108] and Karakitsios et al. [112]. As it results from the work [24] at the MpKrakBulwar industrial station, depending on the wind direction, the impact of emissions not only from the nearby steel plant, but also from other nearby industrial plants (including heat and power plant and municipal waste incineration plant) is also recorded, which is reflected, i.a., in a clear periodic increase in SO 2 and NO 2 concentrations in relation to the urban background station located far from these plants (Figure 9b).
In the case of PM 10 concentrations at all AQM stations, including traffic stations, very similar daily trends are observed ( Figure A3). This indicates that the level of these concentrations in the whole of Krakow is greatly influenced by meteorological factors characterized by daily variability (including, in particular, the wind speed and the mixing-layer height [60,64]) and the dominant influence of emission sources and phenomena occurring to a large extent across the city (urban, suburban and regional background and local formation of secondary aerosols from gaseous pollutants [67,68,71,73,75]). On the other hand, the observed daily variability of the NO 2 and C 6 H 6 concentration ratios at TS and UBS monitoring stations (Figure 9b) in the summer and spring months reflects the mechanisms of NO 2 formation occurring during the day (oxidation of NO from engine emissions, e.g., with ozone and hydroxyl or hydrocarbon radicals), as well as the atmospheric destruction of benzene or its photo-oxidation towards the formation of SOA [113][114][115].
The diurnal variation of air pollutants is strongly controlled not only by temporal variation of emission levels and photochemical reactions but also by vertical turbulent mixing related to the vertical temperature gradient and the mixing-layer height. The sudden increase of vertical mixing in the morning leads to the drop of pollutant concentrations in the air. The strong mixing during the daytime also keeps low concentrations. In the late afternoon mixing becomes weak and concentrations start to increase. This so-called rectifier effect [116,117] can be seen in Figure A3, except for NO 2 at TS stations (due to strong temporal variation of emission sources and photochemical reactions). The rectifier effect can be employed to explain that the drop of concentration occurs later in the morning in winter than in summer as shown in Figure A3a,b (for UBS and IS) and Figure A3c. The different magnitudes of PM 10 , C 6 H 6 and SO 2 between summer and winter are due to different vertical mixing in the two seasons [60,64].

Conclusions
The automatic air quality monitoring system in Krakow focuses mainly on the observation of the spatial and temporal variability of the concentrations of particulate matter PM 10 at four urban background stations (UBS) and two industrial stations (IS) and the registration of road transport impact in this regard at two traffic stations (TS) including one station located in a deep street canyon with very heavy traffic. Additional substances measured at some stations (i.a. NO 2 , SO 2 and C 6 H 6 ) allow for the observation of additional impacts of emission sources characteristic for these substances (emissions from engine vehicles, domestic stoves, power boilers, petrol stations or a coking plant).
As shown by the analysis of measurement data available for 2017-2018, in Krakow, there is a pronounced impact of emissions from road transport on NO 2 concentrations at traffic stations located in the city center within streets with high traffic and from industrial plants on SO 2 concentrations recorded at the industrial station located in the north-eastern part of the city within the influence of the heat and power plant and steelworks. The influence of benzene on the results recorded at the traffic station is also noticeable, especially in the spring and summer seasons, when other emission sources of this substance functioning only during the heating season do not interfere with traffic emissions.
In the cold months associated with the heating season, the variability of the concentration ratios recorded at IS/UBS and TS/UBS stations for all analyzed pollutants is flattened, which results from the overlapping of the impact of many different emission sources in this period, and above all from the growing importance of the urban, suburban and regional background of this pollution.
Traffic stations differ from urban background and industrial stations. The last two types are more similar to each other. This was demonstrated by cluster analysis. This trend is confirmed by hourly averaged concentrations of PM 10 and NO 2 .
In the case of various types of AQM stations, for which no very strong linear correlation is observed for the measurement results recorded on them, one can conclude about different emission sources affecting individual types of stations and areas for which they are representative. When a strong linear correlation occurs, it can be suspected that the concentrations at the stations analyzed are influenced by the same emission sources at the same time. This issue would require further research using receptor models or at least analysis of the wind speed field and direction.
The highest concentration values observed during the year usually occur in the winter months. The clear diurnal and seasonal variation in PM 10 and NO 2 concentrations may be due to both the variable activity of emission sources and the natural mechanisms occurring in the atmosphere, the actions of which result, for example, in the variability of vertical turbulent mixing (the height of the mixing-layer) during the daytime and depending on the season and type of atmospheric circulation. In addition, the levels of PM 10 in the air are increasingly influenced by the formation of secondary inorganic and organic aerosols.
The ratio of NO 2 concentrations observed at TS/UBS stations is several times higher than the one in spring and summer (when there is no significant impact of NO x emission sources other than engine emissions), which indicates that the measurement of NO 2 concentration at traffic stations in Krakow is a good proxy parameter describing the impact of road transport on air quality. This substance can also be used as an indicator in the analysis of the effects of the changes introduced in the field of car traffic management and the effects of reducing vehicle traffic in connection with the COVID-19 pandemic or other large-scale incidents.
The concentrations of the remaining analyzed substances (PM 10 , C 6 H 6 and SO 2 ) are strongly dependent on the season (they significantly increase in the heating season), therefore their usefulness in air quality management should be focused mainly on the analysis of changes in emissions from the municipal and household sector (residential heating) and the industrial plants in the case of significant modernization or production stop. In the non-heating season, it is possible to identify individual impacts of other specific emission sources (road dust resuspension, fugitive particulate matter emissions from road repairs or construction works, benzene emissions from petrol stations or cooking plant, SO 2 emissions from the large industrial plants, etc.). The reduction of NO 2 concentrations in the air due to the commissioning of new flue gas denitrification (DeNOx) installations should also be observed within the range of impact of emissions from combined heat and power plants.
Further, more detailed analysis of monitoring data on concentrations of gaseous and particulate substances with the use of statistical tools for receptor modeling (e.g., PCA, PMF) is recommended in order to identify the main groups of sources affecting AQM stations belonging to particular types in order to better identify them and apply in activities in the field of the air quality improvement and the health risk reduction. There is a need for further research in this field, including works related to the better identification of factors influencing the air pollution in Krakow and the interdependencies between the measurement data observed at monitoring stations of various types, as well as the assessment of the effects of corrective actions taken so far and their further development. In more detailed analyses of trends in air quality changes, it is recommended to take into account the available meteorological data or to perform meteorological normalization of pollutant concentrations. Funding: The paper has been prepared as part of the research connected with the subsidy of Ministry of Education and Science in Poland to maintain scientific potential (agreement no. 16.16.150.545).

Acknowledgments:
The authors would like to express their sincere gratitude to the Chief Inspectorate for Environmental Protection and the Voivodeship Inspectorate of Environmental Protection in Krakow for sharing air quality monitoring data and description of the air quality monitoring system.

Conflicts of Interest:
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.