Aerosol Optical Properties and Contribution to Differentiate Haze and Haze-Free Weather in Wuhan City

: Haze is an atmospheric phenomenon in which different types of particulates obscure the sky, and hence affect almost all human activities. Over a couple of recent decades, China has witnessed increasingly worse air quality as well as atmospheric haziness in its cities. There are various haze contributing factors including the rapid industrialization, excessive biomass burning, and an increase in the number of vehicles. This study proposes a methodology based on the aerosols scattering and absorption properties, to predict the likelihood of an episode of hazy days. This case study employs the aerosol optical properties data from integrated nephelometer and aethalometer sensors from December 2009 to September 2014 over Wuhan. The role and contribution of each aerosol optical parameter (e.g., aerosol scattering and absorption coefficients, single scattering albedo, scattering, and absorption Ångström exponents, backscatter ratio, and asymmetry factor) in distinguishing haze and haze-free conditions has been quantitatively determined based on a machine learning approach. Each aerosol optical parameter was classified independently by the support vector machine (SVM) algorithm, and the aerosol scattering (85.37%) and absorption (74.53%) coefficients were found to be primary potential indicators. Through the Kolmogorov-Smirnov test and traditional statistical analysis, the aerosol scattering and absorption coefficients were then verified as important indicators in distinguishing haze and haze-free days. Finally, through a probability density diagram and frequency histogram, we propose a simple quantitative standard to distinguish between haze and haze-free conditions based on the aerosol scattering coefficient and absorption coefficient in Wuhan City. The accuracy of the standard was determined to be 81.49% after testing, which indicates an accurate result. An error in aerosol optical properties may lead to an error in the calculation of aerosol radiative forcing, the earth’s energy budget, and climate prediction. Therefore, understanding of the aerosol properties during haze-free and haze-days will help policymakers to make new policies to control urban pollution and their effects on human health.


Introduction
Aerosols are microscopic solid particles or droplets in the atmosphere [1][2][3][4][5][6], and the World Health Organization has defined that particles with diameters smaller than 2.5 microns (PM2.5) can enter human blood circulation systems, causing vascular inflammation and atherosclerosis [7,8]. The Intergovernmental Panel on Climate Change (IPCC) stated that aerosols are one of the leading uncertain factors that induce the climate change, as aerosols can directly affect the earth's radiation balance by scattering and absorbing solar short-wave radiation and thermal infrared long-wave radiation, and the changes in cloud nuclei aerosols can control cloud's physical properties and life cycle, which can indirectly affect the earth's climate system [9][10][11].
In China, which has experienced rapid economic development in recent years, increased industrial and urbanization activities have led to an increased number of aerosol particles, and, in turn, enhanced air pollution events [12]. At present, the definition of haze standards is not consistent around the world. For example, the Chinese Meteorological Administration (CMA) decided that if visibility value is less than 10 km and relative humidity (RH) is less than 95%, it is haze weather. However, in some studies, researchers reported that if visibility values are less than 5 km and RH values are less than 80%, it defines haze weather [13,14]. In recent past, impactful regional scale haze events have occurred in the Pearl River Delta (PRD) [15], Yangtze River Delta (YRD) [16], the Beijing-Tianjin-Hebei urban agglomeration (BTH) [17,18], and semi-arid northwestern areas of the country [19]. Different types of haze (dust, brown smoke, black carbon) have variable impacts on the regional and global climate [20]. Haze from soot particles can be an absorbent and act to heat the atmosphere, cooling the earth's surface [21]. Studies of haze have also attracted the attention of many researchers in recent years. These studies aimed to explore the causes, optical and physical properties, and chemical compositions of the haze-causing aerosols. Similar studies have also found that haze aerosol particles have positive heating effects on atmospheric fluxes [20,22], and serious haze events have been closely related to certain meteorological conditions and anthropogenic pollutant emissions [23,24].
Various researches have reported that aerosol optical parameters can quantitatively reflect aerosol radiation forcing, especially on haze days [10,11,25]. These optical parameters have been widely investigated on haze and haze-free days, but their ability to distinguish between weather types is still unknown and has not been studied previously [13]. Also, the classification of different aerosol categories has been the first step in retrieving aerosol optical properties in some projects, such as the Cloud-Aerosol LiDAR and Infrared Pathfinder Satellite Observation (CALIPSO) and Lidar In-Space Technology Experiment (LITE) programs [26,27]. However, no explicit criteria exist to distinguish between haze and haze-free aerosols yet. The differences in the aerosol optical properties between these two weather types (i.e., hazy and clear sky) can lead to a more detailed understanding of the haze radiative effect. For example, we can identify which parameter changes significantly from haze-free to hazy days, while providing a convenient and simple criterion to classify haze and haze-free aerosols and assist in the retrieval of aerosol optical properties. At present, visibility is the basic index to define haze-causing aerosols [14]; therefore, a comprehensive defining criterion based on optical parameters for haze is necessary.
The aim of this study, first, is to investigate the optical and physical properties of aerosols in Wuhan. Second, it further explores the contribution of each optical and physical parameter of haze and haze-free aerosols. Finally, according to the optical parameters with large contributions (weights), a convenient and simple optical criterion is proposed to distinguish haze from haze-free aerosols, to help us better understand the radiation forcing effect of haze aerosols. To our knowledge, there is no such study available before. The following section of this paper presents the experimental methodology, including data sources and study area. The third section details the results and discussion. Finally, the fourth part summarizes the conclusion.

Study Area, Equipment and Data
Our experimental site is in the Wuhan University (29.97 degrees north latitude, 113.91 degrees east longitude) ( Figure 1). Wuhan city is the capital of Hubei province and it is a mega inland city in central China, which is at the confluence of the Yangtze and Han rivers. The climate type of Wuhan is subtropical monsoon climate: winter is cold, summer is hot, humidity is high in summer and the annual precipitation is high. The aerosols mainly originate from traffic emission, industrial pollutants emission, fossil fuel combustion, and straw combustion, etc., [2,3]. Our research was conducted at the State Key Laboratory of Surveying, Mapping, and Remote Sensing Information Engineering (LISMARS), Wuhan University. The aerosol optical characteristics were principally obtained from nephelometer and aethalometer. The observation equipment was settled on the top of the laboratory [3]. The laboratory is close to a bustling highway, and the surrounding environment has industrial, commercial, and residential areas.
We collected datasets of aerosol optical parameters for 370 haze-free days and 662 haze days from December 2009 to September 2014. The working bands of the nephelometer were 450 nm, 550 nm, and 700 nm, and it can directly provide aerosol scattering coefficients and backscattering coefficients. The instrument operated automatically every day, and we use standard CO2 to collaborate it every three months [28,29]. The constituents of the outside environment and atmosphere were pumped in the equipment main cavity through a 3-meter stainless steel pipe. The flow rate of the nephelometer was 30 L·min −1 and the sampling frequency was 1 min. The aerosol scattering properties could be measured at 7°-170°angle [30]. At the same time, the truncation error increases when there are large aerosol particles (such as dust) and we also conducted the truncation error correction before further processing of the dataset. The detection limit values of the scattering coefficient at their band were 0.44, 0.17, and 0.26 Mm −1 , respectively. The detection limit values of backscattering coefficient were 0.29, 0.11, and 0.21 mm −1 , respectively [30]. Data values beyond the limit were excluded from the calculation. The aethalometer worked on seven wavelengths: 370, 470, 520, 590, 660, 880, and 950 nm. The 880 nm was the pure black carbon (BC) band. The principle of aethalometer was to measure the optical attenuation of BC and then retrieve the BC mass concentrations. The outside BC particles were inhaled into the primary chamber of aethalometer through stainless steel. Then the BC particles adhered to the quartz membrane. The absorption attenuation of BC, then, can be obtained according to light exposure [31]. Based on the attenuation of light values, the mass concentration of BC, then, can be calculated continuously [30,32]. The flow rate of the equipment is 5 L·min −1 and the sampling frequency is 2 min. We maintained the stability of the light source every day to ensure the accuracy of the data and conduct calibration every three months.

Methodology
The aerosol scattering coefficient σ and the backscattering coefficient β are the important aerosol optical parameters and manifest the particle scattering capacity. The σ and β can be obtained directly from the nephelometer. In this study, we used the 550 nm of σ and β [2,3]. At the same time, the scattering Ångström index α can be calculated based on the σ : α reveals the aerosol sizes characteristic according to the Mie theory. The lower the values of α , the higher the aerosol size values [23]. The change in α is governed by the aerosol size distribution. α has little relationship with the aerosol concentration. In this study, α can be calculated based on σ at 450 nm and 700 nm as Equation (1).
At the same time, the study of g and b is helpful to improve the understanding of aerosol properties during haze. Also, aerosol asymmetry factors b and g can usually be calculated using Equations (2) and (3) [33,34]: The aerosol asymmetry parameter (g) and the backscatter ratio (b) are also aerosol scattering optical parameters. g reflects the forward scattering capability of particles; b reflects the backscattering capability of aerosols. Investigating g and b can help improve the understanding of aerosol properties during haze periods. g and b can also be calculated based on σ and β [33,34]: In this equation, g denotes the aerosol asymmetry factor, and α is the total scattering coefficient obtained by the integrated nephelometer; β is the backscattering coefficient obtained by the integrated nephelometer. In theory, the range of g is between −1 and 1 [2]. On the other hand, the aerosol absorption coefficient σ is also the basic optical parameter and it shows the absorption capacity of the aerosol. σ can be calculated indirectly using the BC concentration [29,30,34]: Here [BC] is the black carbon mass concentration at 880 nm, which represents the pure black carbon concentrations and can be obtained directly from the aethalometer. α is the absorption efficiency factor, which can be calculated by 14625/λ [31,32]. λ represents the wavelength. In the study, we utilized the σ at 532 nm. Meanwhile, the aerosol single scattering albedo (SSA) is also an important optical parameter of aerosols. SSA can reflect the particles scattering proportion in the whole extinction effect of aerosols. The SSA can be calculated as the following formula: The aerosol's direct radiative effects can be sensitively revealed in the change of SSA, and SSA can also convey the relative contribution of scattering and absorption [3]. The SSA of pure BC is only 0.2, which indicates that the scattering capacity of BC is low and the absorption capacity of BC is high. SSA values are mainly related to aerosol ingredients and sizes [35]. In this study, we utilized SSA values at 532 nm. So the σ should be corrected to 532 nm based on α : σ (532 nm) = σ (550 nm) × 520 nm 550 nm (6) In all, the aerosol scattering (σ ) and absorption (σ ) coefficients, SSA (ω ), scattering Ångström exponents (αs), backscatter ratio (b), and asymmetry factor (g) are the defining parameters that characterize the primary aerosol optical properties, which can help to quantitatively gauge the radiation forcing of aerosols, especially in haze weather. The contribution of each optical parameter during haze and haze-free conditions was of our pertinent interest, which required long-term accumulation experiments. The different characteristics of each aerosol optical parameter in haze and haze-free periods can enable us to have a clearer and more detailed understanding of the aerosol radiation effect on haze (e.g., we can determine which optical parameters change dramatically). On the other hand, according to the varied contributions of aerosol optical properties, discrimination methods of haze and haze-free aerosols can also be developed.

Support Vector Machine Algorithm
To further explore the contribution of each optical characteristic parameter of haze and haze-free aerosol particles, we selected data from December 2009 to September 2014 (370 days of non-haze and 662 days of haze conditions). In this study, we also utilized the weather datasets provided by the Wuhan Meteorological Bureau [36]. At present, the definition of haze standards is not consistent around the world. For example, the Chinese Meteorological Administration (CMA) decided that if visibility values are less than 10 km and relative humidity (RH) values are less than 95%, it is haze weather. In some studies, researchers reported that if visibility values are less than 5 km and RH values are less than 80%, it defines haze weather [13,14]. Also, it is challenging to link haze-days with aerosol optical properties. Therefore, in this study, we utilized the weather datasets provided by Wuhan Meteorological Bureau, which utilized the criterion that visibility values are less than 5 km and RH values are less than 80% [36]. A support vector machine (SVM) algorithm was employed to calculate the classification accuracy of each optical parameter ( Figure 2). The Kolmogorov-Smirnov statistical method was used for testing, and statistical methods were also applied to verify the result. Then, the contribution of each optical parameter under haze and haze-free conditions was obtained. Finally, according to the optical parameters with large contributions (weights), a convenient and simple optical criterion has been defined to distinguish haze and haze-free aerosols, to help us better understand the radiation forcing effect of haze aerosols.
The SVM is a generalized linear classifier that carries out binary classification on data according to supervised learning [37]. The decision boundary of the SVM is the maximum margin hyperplane, which is solved for the learning samples to create a classification hyperplane as a decision surface to maximize the edge of the isolation between the different classes [38]. The SVM used the hinge loss function to calculate empirical risks and added a regularization term into the solving system to optimize the structural risk, which was a classifier with sparsity and robustness. The SVM can conduct non-linear classification through the kernel method, which is one of the common kernel learning methods. As shown in Figure 2, we first chose the training database to find the support vectors in the hyperplane and then used the trained model for classification. Finally, the classification accuracy of each training parameter was obtained. We can utilize the classification accuracy to decide which parameter has the most significant effect on the classification.   Table 1. The skewness measures the asymmetry degree of the optical parameters. When the skewness is > 0, the frequency distribution is skewed to the right, and vice versa. Normal distributions are symmetrical with skewness of 0. All parameters demonstrate a skewness higher than 1 except SSA (ω , −2.59), scattering Ångström exponents (αs, 0.11) and asymmetry factor (g, −0.93). Negative skewness indicates that values lower than the mean are more probable than values higher than the mean. Similar SSA skewness values have been reported by Pandolfi et al. at the MSY RB site as well as the MSC site [39]. Negative skewness for g is a consequence of the high positive skewness observed for b. As shown in Table 1, both σ and σ show skewness higher than 1, so we can see that the mean σ is 25% larger than its median and σ is 38% greater than its median.  [3]. During this haze event, the mean σ and σ were approximately 5.3, and 3.4 times higher than that of normal weather, respectively, which explains the observed values of the hourly mean maximum of σ and σ on 11 June 2012. Compared to other long-period measurements around the world, σ and σ values are similar to those measured in anthropogenically influenced stations in North China [40]. The mean SSA for the entire period was 0.89 (0.08), which was about 0.08 higher than the value reported by Bergin et al. (2001) [40] for Beijing urban area, and also higher than the value (0.85) in climate modelling simulations for China and India performed by Menon et al. (2002) [41].   [40,42]. The σ of Wuhan is slightly larger than that in Shanghai (293 Mm −1 ) while the σ is lower than that in Shanghai (65.81 Mm −1 ) [15]. Wuhan, an intensely industrialized megacity, hosts diverse sources of aerosols. Thus, σ and σ were reasonably larger than some rural zones, such as SDZ site (174.6 Mm −1 /17.5 Mm −1 ) [43]; coastal sites in western countries, such as BND site (57.0 Mm −1 / 4.62 Mm −1 ) in North America [44] and some high altitude sites in the top of the mountain, such as Melpitz site (53.37 Mm −1 / 5.64 Mm −1 ) in Germany [45]. On average, the absolute values of extensive optical properties and their amplitudes are higher at the LIESMARS site compared with the above-mentioned sites. The urban site is close to anthropogenic pollutant emissions compared to rural and mountain sites and make more effective the pollution potential of thermally driven atmospheric processes such as the PBL oscillations [46]. As a consequence, the Wuhan station showed marked optical properties due to the effectiveness of the emission of highly fine particles of anthropogenic origin in Wuhan. The mean SSA in LIESMARS station (0.89) is about 0.01 larger than the values in the SDZ site (0.88) and it is slightly lower than that in the BND site (0.91). The SSA in Wuhan was remarkably larger than 0.83 in Guangzhou, 0.81 in summer in Beijing, and 0.81 in Shanghai [40,42]. Dubovik et al. [47] calculated the range of SSA (0.85-0.95) for the northern hemisphere using the Aerosol Robotic Network (AERONET) inversion. The results of this study are close to the results of AERONET-based study by Dubovik et al. The mean values of b at 550 nm, scattering Ångström exponents (αs, 450-700 nm), and g at 550 nm measured at LIESMARS site were 0.12 ± 0.02, 1.52 ± 0.35, 0.61 ± 0.05, respectively. These values were very close to the values reported for the Jungfraujoch site and MSC site [28], indicating similarities in the mean aerosol characteristics observed at these stations. The changes in aerosol αs, b, and g are related to aerosol composition and age [35]. Figure 4 portrays the correlations between the frequency distribution of aerosol optical and physical properties. Similar relationships were also discussed in the researches of [28,40,44,48]. This kind of relationship can be a benefit to enhance the precision of algorithms for retrieving aerosol properties [44]. As depicted in Figure 4a, aerosol scattering coefficients (σ ) has an obvious linear positive correlation with aerosol backscattering coefficients (β ). This phenomenon can easily be understood. When aerosol concentration increases, the total scattering effect gets enhanced, and, hence, the increase in backscattering coefficients. The same results were also observed at MSC site [28], MSY RB site [40] and a similar phenomenon has also been reported at AND2011 [48]. Aerosol scattering coefficients ( σ ) have also a certain positive correlation with aerosol absorption coefficients (σ ) (Figure 4b), which can be due to the higher concentrations of aerosols, in turn, leading to increased total scattering as well as absorption effects. This observation can also be found in the research of MSC site [28], MSY RB site [40], and AND2011 [48]. Figure 4c reveals the slight rise in aerosol SSA with the increase in aerosol scattering coefficients (σ ) for Wuhan city. This indicates that the aerosol scattering effect proportion would slightly increase when aerosol concentrations get enhanced in some haze days in Wuhan [2,35]. In haze days, the aerosol contents were mostly sulfate, nitrate, and organic aerosols, which can also accentuate the proportions of aerosol scattering effects [2, 35,40,48,49]. Figure 4d shows that scattering Ångström exponents (αs) has a certain negative correlation with aerosol scattering coefficients. This indicates that when a haze episode occurs, the aerosol scattering coefficients become higher, and the aerosol sizes also get enhanced, simultaneously [50,51]. Because during haze episodes, aerosol contents are mostly sulfate, nitrate, organic, and secondary aerosol aerosols from a photochemical reaction, which can lead to bigger aerosol sizes [2, 3,35,48,49,52,53]. Aerosol asymmetry factor (g) has also an obvious positive correlation with aerosol scattering coefficients (σ ) (Figure 4e). The reason being, when aerosol concentration is increased, the total scattering effect gets enhanced, and so does the forescattering effects. The same results were observed at the MSC site [28], MSY RB site [40], and a similar phenomenon was reported by AND2011 [48]. According to formula 5, backscatter ratio, hence, has an obvious negative correlation with aerosol scattering coefficients (σ ) (Figure 4f). This indicates that the aerosol backscattering effects proportion can decrease when aerosol concentrations get enhanced in some haze-episode days in Wuhan. This observation is also in agreement with the MSC site [28].

Seasonal Variation of Aerosol Scattering and Absorption Properties
The average annual cycles of aerosol optical properties at the LISMARS site are shown in Figure  5. Table 2 lists the average aerosol optical properties in each month. It can be seen in Figure 5  August (18.73 Mm −1 ), respectively. Large variations in the monthly aerosol scattering and absorption levels existed in Wuhan city. The mean monthly values of σ and σ are always higher than the median values, especially in winter, indicating that heavier pollution events rather occur in winter than in summer. These findings were the same in the research of Wang et al. [35] and Zhang et al.
[2]. A similar phenomenon was also observed at the regional GAW observation site Melpitz in East Germany. The annual cycles of σ and σ are mainly determined by the annual variation of the boundary layer height [2]. In winter, the local emission is higher due to a larger amount of fossil fuel combustion. The wintertime shallow mixing layer favors the accumulation of aerosol pollutants, thus causing an increase in the level of σ and σ . In summer, the boundary layer is more unstable because of the surface heating and causes turbulence and vertical mixing, which results in a dilution effect for a pollutant. Therefore, σ and σ in summer are at a relatively low level and with small variations. The SSA only shows slight annual variations, with high values in summer. This annual pattern might be caused by the variation of regional emission and secondary aerosol formation. In summer, air pollution episodes (such as haze) occur frequently in Wuhan [3]. In these events, secondary aerosol productions via photochemistry processes are efficient and result in a large fraction of non-light-absorbing components such as organic matter and sulfate in a particulate matter [37], hence yielding a relatively higher level of SSA. The evident annual cycle can be found in the backscatter ratio (b), with higher values in winter. This annual pattern can be attributed to the variation of both the number size distribution and the mixing state of particles. A significant nucleation mode can be usually found in the daytime because of the formation of secondary aerosols during hazy weather. According to Collaud Coen et al. (2007) [54], such a decrease in aerosol mean size may increase the b. Also, it was found that the morphology of a black carbon core surrounded by a non-light-absorbing shell would result in a much higher b than the external mixture of the same amount of both components. In winter, because of the active photochemical aging processes, black carbon is close to the morphology of core-shell, thus causes a higher level of b. Scattering Ångström exponent (αs) is mainly determined by the shape of the aerosol number size distribution. αs smaller than 1 indicates that the size distribution is dominated by coarse mode, whereas αs larger than 2 indicates that fine mode particles dominate the size distribution [55,56]. The annual pattern of αs can be explained by the variation of aerosol number size distributions due to secondary particle formation [57,58]. In summer Wuhan, haze episodes often happened owing to straw burning incidents and the atmospheric temperature was also high (Table 2). During such periods, secondary particles can form due to photochemical processes. High concentrations of small particles cause a high level of αs, thus results in higher average αs in summer compared to other seasons.

SVM Results
SVM has been applied to independently classify the optical parameters of haze and haze-free aerosols, to determine the classification accuracy of each parameter. High precision indicates high importance, and low precision indicates low importance. To verify the results rendered by SVM, each parameter has been statistically tested by using the Kolmogorov-Smirnov test. This enabled to identify the important optical parameters to be used as haze aerosol indicators. Further, based on the probability density distribution of the indicator parameters, the classification standards of haze aerosol particles were defined.
We had a total of 370 samples of mean daily values in the haze-free period and 662 samples of mean daily values in haze aerosol conditions; 150 haze-free samples and 300 haze samples were randomly selected as the training datasets. However, the remaining 220 haze-free samples and 362 haze samples were selected as the testing dataset. The classification accuracy of each aerosol optical parameter is presented in Table 3. It can be seen that the aerosol scattering coefficient (σs) and absorption coefficient (σab) have the highest accuracy (85.37% and 74.53%, respectively), which attest that these are the most important indicators in distinguishing haze and haze-free weather.
To verify the preliminary results of the SVM, we first used statistical analysis. The mean value and standard deviation of each aerosol optical parameter were calculated for both the haze and haze-free days. As can be seen in Table 4, σs is 2.89 times higher for haze days (540.21 Mm −1 ) than for haze-free days (186.96 Mm −1 ); while the σab is 2.03 times higher for haze days (153.27 Mm −1 ) than for haze-free days (75.34 Mm −1 ). Other aerosol optical parameters (b, αs, αab, g, and ωo) show no significant differences between haze-free and haze days. Further, the Kolmogorov-Smirnov test was used to examine the distribution differences of each optical parameter in haze and haze-free conditions ( Table 5). Compared with other optical parameters, the aerosol scattering coefficient σs and absorption coefficient σab have very small p values with much lower magnitudes than do the other parameters, indicating significant differences in these values between haze and haze-free days. These results are aligned with the previous SVM results. Therefore, σs and σab are verified as the main factors to quantitatively distinguish between haze and haze-free conditions. Table 3. The SVM classification accuracies of every aerosol optical parameter. σ is the aerosol scattering coefficient at 550 nm (Mm −1 ); b is the aerosol backscattering ratio at 550 nm; σ is the aerosol absorption coefficient at 520 nm (Mm −1 ); αs is the aerosol scattering Ångström exponent at 450-700 nm; αab is the aerosol absorption Ångström exponent at 370-950 nm; g is the aerosol asymmetry factor at 550 nm; and ω is the aerosol single scattering albedo at 550 nm. Given these results, the next step was to quantitatively determine the discrimination criteria based on the two indicator factors in Wuhan City. Figure 6 depicts the relationship between σs and σab for haze-free ( Figure 6a) and haze (Figure 6b) aerosols. For haze-free aerosols, the main range of σs is 50-300 Mm −1 , while for haze days, there is a wider range of 250-1700 Mm −1 . The σs values have almost no overlapping ranges for haze and haze-free days, which indicated that σs has an obvious distinction between haze and haze-free aerosols, indicating that the σsSVM classification accuracy is the highest (Table 3). At the same time, the main range of σab is 30-100 Mm −1 for haze-free days and 50-150 Mm −1 for haze days. σab demonstrates some overlap between haze and haze-free days; therefore, the σabSVM classification accuracy is lower than that of σs (Table 3). This might be because the local absorbing aerosol concentrations (mainly black carbon aerosols) did not change much between haze and non-haze periods, while the scattering aerosol concentrations (mainly sulfate and nitrate aerosols) changed greatly. The σs and σab distributions of haze and haze-free aerosols exhibit a marked difference that can be used to distinguish between them. According to the distribution differences of σs and σab between haze and haze-free aerosols, a quantitative criterion can be established to distinguish haze and haze aerosols. To devise such quantitative criteria, the frequency histograms of the aerosol scattering (σs) and absorption coefficients (σab) are displayed in Figure 7. As shown, the peak of σs was approximately 150 and 450 Mm −1 for the haze-free and haze days, respectively, and the intersection of the σs frequency distribution of haze-free and haze was 250 Mm −1 (Figure 7a). Thus, a threshold of 250 Mm −1 for σs could be used to distinguish between haze and haze-free conditions. Similarly, the peak σab values are approximately 75 and 150 Mm −1 for haze-free and haze days, respectively, and the intersection of the σs frequency distribution of haze-free and haze conditions is 75 Mm −1 ( Figure  7b). Thus, a threshold of 75 Mm −1 for σab could be used to further distinguish between the two weather conditions.

Parameters
In conclusion, the σs and σab thresholds can be combined to identify haze aerosols.
(4) For σs < 250 Mm −1 and σab < 75 Mm −1 , the aerosols are identified as haze-free aerosols. We used this new standard to identify haze aerosols from our database, and the detection accuracy was 81.49%, indicating that the standard was suitable to distinguish between haze and haze-free aerosols in Wuhan City.

Conclusions
Based on the experimental datasets for four years, the weighted contributions of various aerosol optical parameters of haze and haze-free conditions have been quantitatively studied. The SVM algorithm was used to classify each optical parameter independently, and the indicator factor with the greatest influence was statistically determined by its classification precision. The results reveal that the aerosol scattering coefficient σs (85.37%) and absorption coefficient σab (74.53%) have the highest classification accuracy levels. To verify the SVM results, a Kolmogorov-Smirnov statistical test and traditional statistical analysis were utilized. Results demonstrate that both σs and σab have small p values, indicating that their frequency distributions differed greatly between haze and non-haze aerosols. Meanwhile, the σs values during haze are 2.89 times that during haze-free weather, and the σab values during haze are 2.03 times that during haze-free weather. Based on the differences in the σs and σab probability distributions between haze and haze-free aerosols, a quantitative criterion has been proposed to distinguish between haze and haze-free aerosols. After testing, the accuracy rate of this standard is 81.49%. This detailed study on the optical properties of haze aerosols can provide a scientific basis for local environmental departments to formulate atmospheric governance policies.
Overall, this study provides a better understanding of air pollution in the YERB region. In the future study, aerosol types and their microphysical properties over the region will be considered. Also, the vertical feature mask (VFM) products of CALIPSO will be considered to investigate the vertical structure and layers stratification characteristic of different aerosol species.