1. Introduction
With global warming, there has been an intensified production of airborne pollens, extension of the pollen pollination period, and changes in airborne allergenic components [
1]. Studies in North America showed a significant 20% increase in airborne pollen concentrations in 2018 compared to 1990, and a 20-day extension of pollen seasons [
2]. In Switzerland, the rising temperature has led to an earlier start for spring pollen species, with some species experiencing up to a 29-day advancement [
3]. In the United Kingdom, it was reported that Betula pollen seasons were becoming more severe due to warmer temperatures, while Quercus pollen seasons started earlier despite not becoming more severe [
4]. The impact of climate change on pollens and pollen seasons varies, depending on regions and the specific pollen species.
Airborne pollen allergens are the common cause of respiratory allergic diseases, in particular for allergic rhinitis (AR) or hay fever [
5]. The extension of the pollination period and increased allergenic pollens aggravate allergic respiratory diseases, and they further affect patients’ quality of life and respiratory health [
6]. A retrospective analysis in Japan found an increase in pollen counts was significantly associated with a rise in seasonal AR outpatients [
7]. The month of birth was correlated with sensitization to aeroallergens too. Children born in certain months showed a higher prevalence of specific IgE towards certain allergens and allergic diseases [
8]. In Sweden, birch pollen was the dominant cause of pollen sensitization in 4-year-old children, and those who were sensitized to birch also had a higher proportion of sensitization to other inhalant allergens [
9]. After thunderstorms, pollens emerged as the predominant cause of allergic rhinitis (AR) and asthma, resulting in a significant surge in demand for medication in both clinical emergency and outpatient settings [
10].
To accurately reflect variations in pollen counts in a timely and convenient manner, it is essential to implement automatic real-time monitoring for pollen concentrations. Such prompt and effective monitoring of pollen concentrations is necessary to provide data for precise forecasting and alerts. Currently, the most widely accepted method of pollen monitoring internationally is the volumetric pollen counting method, primarily using Hirst-type traps [
11]. It is recommended by the European committee as the standard pollen monitoring method. In the past few years, automated pollen monitoring devices have also been invented [
12], making real-time monitoring of airborne pollens feasible and more convenient. A series of studies have been carried out in Switzerland comparing the automatic pollen counting method with the manual Hirst-type trap sampling method [
13].
In Japan, real-time pollen counting methods using the Yamatronics KH3000 instrument (Yamatronics, Kanagawa, Japan) and the traditional Hirst-type trap sampling method showed a favorable correlation with each other [
14]. To our knowledge, no verified real-time pollen monitoring approach has been reported in China so far. Gravity sedimentation remains the primary technique, including the painstaking enumeration of pollen grains under microscopes. In China, most pollen sampling employs a canopy-type gravity sedimentation sampler equipped with microscope slides covered in glue. These samplers are positioned on the rooftops of buildings between two and six floors high, where they gather pollen that descends to the sampler. The slides are exchanged every 24 h, and the accumulated pollen on the slides is dyed, preserved, and subsequently categorized and enumerated under an optical microscope. Daily observations are conducted to track the fluctuations of various kinds of pollen over distinct dates or seasons [
15]. The pollen concentrations obtained from the gravity sedimentation method represent concentrations per area, rendering them incompatible with the internationally recognized volume concentration values.
In this study, we aimed to set up a validated, real-time, and convenient pollen monitoring method. By comparing the real-time monitoring of pollen concentrations and standard manual pollen counting using the volumetric Hirst samples [
16], we aimed to validate a model for real-time daily airborne pollen volumetric concentration against the standard method. The influencing meteorological factors and airborne particulate matter were considered and adjusted appropriately. This trial is the first validation study in China for real-time pollen monitoring. Such monitoring will help spread information on pollen in a timely manner for clinical doctors and community populations and allow for alerts to be provided in advance so that susceptible patients or populations can take preventive measures. It will also help in optimizing medical resources in advance to deal with potentially urgent demands for medication in susceptible subjects.
2. Materials and Methods
2.1. Pollen Monitoring and Sampling Settings
Automatic real-time pollen monitoring and manual sampling by the standard method were completed at the Meteorological Bureau of Pudong New Area, Shanghai, China (31°13′33″ N, 121°33′32″ E). The place was adjacent to Shanghai’s Century Park which had a rich variety of flowers and trees representing local plants and flowers. Specifically, automatic real-time pollen monitoring was performed by the Yamatronics KH3000 instrument on the roof of the 3rd building’s floor, 12 m above the ground within the height range recommended by Rojo [
17]. Pollen sampling by the standard Hirst-type trap method was performed on the same roof 3 m apart to reduce mutual turbulence interference.
The pollen monitoring and sampling were performed over a total of 534 days, from 4 April to 31 December 2023, and from 1 April to 30 November 2024. By removing the days with unavailable or ineffective pollen data, the effective days for pollen concentrations were 483 days (90.4%) and 482 days (90.3%). Out of these days, 437 days (81.8% and 90.6%) were matched between the two methods on the same days, consisting of 239 days in 2023 and 198 days in 2024, respectively.
2.2. Pollen Monitoring and Sampling Methods
2.2.1. Real-Time Monitoring by Yamatronics KH3000
The Yamatronics KH-3000 is a fully automated pollen monitoring device that counts pollen number per m
3 of air (grains/m
3) per minute. The daily concentration per volume of air (grains/m
3) was calculated by summing up all pollen grains in a whole day divided by the total volume pumped in (m
3). It utilized scattered laser beams from semiconductor materials to detect and calculate airborne pollens. Upon passing through the inlet, high-density particles were eliminated by the dust filtration apparatus. The air, comprising residual particles primarily consisting of pollen grains, was injected into the optical system and irradiated by a semiconductor laser beam. Two detecting elements (PIN-PD) captured the dispersed light, one for forward scattering and the other for side scattering. The disparity between the side and forward scattering intensity of the scattered light was mostly associated with the particle morphology and surface texture. The scatter plot of lateral scattering intensity versus forward scattering intensity defined a rectangular area called the “extract window”. Through this extraction window, particles could be distinguished, allowing for efficient and reliable identification of pollen and other particles (e.g., fiber dust). Details on the instrument’s system can be found in the previous article [
18].
The standard inlet flow rate of the Yamatronics KH3000 was 4.1 L/min. During the sampling period, the flow rate was checked regularly at least once every two weeks. The KH3000 pollen monitoring instrument was equipped with an airflow meter. When the center of the small ball in the flow meter was aligned with the marked scale, it indicated that the instrument’s intake airflow was normal. The Yamatronics KH3000 has been widely applied in Japan for its high reliability and low cost of monitoring and maintenance. It is used in the Hanako-san system managed by the Japanese Ministry of the Environment.
2.2.2. Standard Pollen Sampling by Hirst-Type Trap
The volumetric Hirst sampler [
16] was operated to collect pollen samples according to the European standard [
19]. This sampler contained a rotating drum that pumped air at a flow rate of 10 L/min. The air inlet flow rate was regularly checked by a non-resistance flowmeter at least once every two weeks. The pollen sample traps were collected once a week and carefully cut into 7 splits corresponding to 7 continuous days. The whole instrument was cleaned and maintained at least once a month to ensure it was in normal working condition.
2.3. Manual Counting of Pollens Under Microscopes
According to the European standards [
19], pollens in the Hirst-type trap samples were counted in four longitudinal lines with 400× magnification, accounting for about 15.8% of the total surface of the slide, meeting the requirement of at least 10% by the standard. Two lines were in the upper part and the other two lines were in the lower part, and they were symmetrical to the middle line. The daily pollen numbers were obtained by multiplying to the whole day’s sampling surface area.
All pollen samples collected by the Hirst-type trap method were stained with toluidine blue before counting under the fluorescence microscope.
2.4. Meteorological Data and Ambient Airborne Particles
Hourly averages of meteorological factors (air temperature (Temp, °C), relative humidity (RH, %), precipitation (Precip. mm/h), wind speed (WS, m/s), air pressure (AP, Pa), and ambient particles PM2.5 and PM10 (aerodynamic diameter equal to or less than 2.5 μm and 10 μm, μg/m3) were collected from the Shanghai Pudong Monitoring Station. Daily averages were calculated based on the hourly data across 2 years and in each year in 2023 and 2024, respectively. This station was 2.6 km away from the sampling site.
2.5. Data Quality and Statistical Analyses
Daily monitoring data with at least 16 h a day (the monitoring time length exceeding two-thirds of the entire day) were considered as effective and were then included in the analysis. The manual counting of pollen numbers under the microscope was carried out by professionally trained personnel independently. They were blind to the real-time data provided by the KH3000 instrument to ensure the reliability of validation.
Daily averages and the standard deviation (SD) of pollen concentrations (grains/m3) were calculated, as well as the median, minimum, and maximum values. Data on meteorological factors and PM were also described. Pearson correlation and Spearman’s rank correlation were applied to analyze the correlations between the two methods.
Both the simple and multiple linear regression models were constructed for the real-time pollution monitoring, in which the multiple linear regression model was optimized for the best set of variables with the highest goodness of fit. The coefficient of determination (R2) was calculated for each model. R2 represented the proportion of change in the dependent variable (the Hirst-type data) predicted from the independent variables (the real-time monitoring data) and/or other variables. The slope was the parameter that represented the rate of concentration change in the Hirst-type pollens relative to the rate of concentration change by the real-time monitoring instrument (KH3000). The p-value < 0.05 was considered statistically significant, and two-sided significance tests were applied. Meteorological factors (Temp, RH, and AP and so on), PM2.5, and PM10 were selected and optimized in the multiple linear regression model to find the best model with the highest goodness of fit.
To evaluate the performance of simple and multiple linear regression models, we employed K-fold cross-validation. The dataset was randomly split into K equal-sized subsets, known as folds. In each iteration, one of these subsets acted as the validation set, while the remaining K-1 subsets were applied to train the models. This cycle was repeated K times, guaranteeing that each subset was used for validation at least once. We selected K as 5, which aligned with standard practices in the literature and was a suitable size, given the size of the whole dataset. After completing the five iterations, we averaged the performance metrics, coefficient of determination (R2), root mean squared error (RMSE), and mean absolute error (MAE) to obtain a reliable assessment of the models and identify the best performance.
4. Discussion
In this study, the automatic real-time monitoring method for daily pollen concentration (KH3000 instrument) was validated in a typical Yangtze River Basin city in China. Through almost 2 years of continuous parallel sampling and comparisons against the international standard Hirst-type manual method, a simple linear regression model for real-time pollen monitoring was proven to be acceptable, reliable, and feasible, with the coefficient of determination R2 reaching 0.76 (p < 0.01). Specifically, the model performed even better in days with ≥10 grains/m3 and in the spring season. A multiple linear regression model was also constructed by optimizing the adjusted variables, despite no obvious improvement being observed compared with the simple linear regression model. Our study provided the first reference model for automatic real-time monitoring of daily pollens (grains/m3) in mainland China. We believed it would help promote the monitoring of daily pollens in a more convenient and efficient way.
Traditional gravity sampling combined with manual counting under a microscope is still the primary method in China for pollen collection and measurement [
5]. While it is widely used at a lower cost, it has significant limitations. First, manual counting is time-consuming and labor-intensive, requiring a high level of skill from the operator. This makes it challenging to meet the demands for high-throughput and long-term continuous monitoring. Second, the traditional method of pollen monitoring can only be calculated as area concentration data (grains/mm
2), which does not align with the volume concentration data (grains/m
3), the latter of which is more accepted internationally [
20]. In the context of a warming climate and increased prevalence of allergic respiratory diseases, including AR or hay fever [
21], a convenient and reliable method for pollen monitoring is in great demand. Furthermore, the high temporal resolution in real-time pollen monitoring is a necessity for timely forecasting and for providing alerts related to pollen exposure, needed by both clinical doctors and susceptible community populations.
The Hirst-type trap sampling of pollens is now recognized internationally as a reference method for pollen monitoring, which provides volumetric concentration data. This active collection method can accurately reflect the airborne pollen concentrations [
22].
However, the Hirst-type trap sampling still requires manual counting, making it difficult to realize the automation of data processing at a high temporal resolution.
To address the need for more convenient and higher temporal resolution pollen monitoring, several types of automated pollen monitoring instruments have been developed. These include optical counting methods [
23], image recognition techniques [
24], and laser particle size measurement techniques [
24]. The significant advantages of these automated monitoring approaches include a reduction in manual operation, the improvement of monitoring efficiency, and real-time data collection. However, existing automatic pollen monitoring instruments also face challenges, such as the accuracy of detection and classification, being affected by algorithms and environmental factors [
25].
The KH3000 automatic pollen monitor utilized in this study had high temporal resolution and detection accuracy [
26], underscoring its superiority in data processing and real-time monitoring of pollen concentration changes. It provided a vital reference for pollen monitoring efforts. However, no systematic comparative study on automatic monitoring instruments has been conducted in China against traditional manual methods, since the local plantation and ecological systems can vary across regions and countries. As far as we know, this study represented the first effort to verify the effectiveness and applicability of automated pollen monitoring by the KH3000 automatic monitor in China.
In our analysis, the coefficient of determination was similar to the pollen concentration results obtained from two instruments previously conducted in Japan (R
2 = 0.83) [
14]. It was also noteworthy that automatic measurements, particularly those obtained using the KH3000 instrument, tended to underestimate total pollen concentrations in comparison to the standard Hirst method. This discrepancy might be attributed to the fixed “extract window” setting of the KH3000, which might result in the omission of pollen particles outside of the designated size range. In contrast, the Hirst method allowed for the identification of all pollen particles that adhered to the adhesive tape after undergoing color staining for microscopic observation. The rare periods with a higher pollen level observed by the KH3000 instrument occurred during the peak pollen days in 2023. We considered that this was likely due to the underestimation of pollen numbers by the Hirst-type method, since on these days with an extremely high number of pollens, pollens on the tape overlapped with each other. As a result, the number counted under the microscope might be lower than the actual levels. Despite these limitations, it is important to note that we found a significant correlation between the measurements from both methods, indicating that they were fairly aligned with each other.
With an acceptable goodness of fit in the simple linear model across 2 years of parallel sampling, we noticed that in the subset of days with higher daily pollen concentrations ≥10 grains/m
3, the real-time monitoring was more in agreement with manual observations than in days with daily pollen concentrations <10 grains/m
3. Similarly, better performance was also observed in the spring season (average daily pollen ≥ 10grains/m
3) compared to non-spring seasons (average daily pollen < 10 grains/m
3). This discrepancy might be due to several reasons. First, at higher concentrations, the signal-to-noise ratio improved, allowing the optical sensors to more effectively distinguish pollen from abiotic particles [
25]. The second reason is related to pollen species and morphology. In spring, Shanghai has a dominant pollen species, primarily tree pollen, with a narrower range of particle sizes and similar morphological characteristics. This reduced the possibility of misidentification by the KH3000. In contrast, in other seasons, there is a mixture of pollen from trees, grasses, and weeds, which vary significantly in particle size and morphology. This variation led to more diverse optical scattering patterns, reducing the specificity of detection. A third reason is the potential for error in the Hirst-type method. Although the Hirst-type trap is widely used as a reference pollen monitoring instrument, it is not considered as the absolute “gold standard”. Additionally, manual counting involved counting the four long axes of the pollen sample bands, and this could lead to errors in the data obtained by the Hirst-type trap, particularly when pollen concentrations were low [
27]. A fourth reason is related to possible frequent rainfall and relative humidity in non-spring seasons. In non-spring seasons, increased precipitation and higher relative humidity can cause pollen to fragment, making it difficult for the KH3000 to accurately identify the particles [
25]. In several previous studies, the performance of KH3000 was only evaluated when the daily pollen was >10 grains/m
3 or in pollen peak seasons [
28].
Our study has several strengths. The validation was robust and supported by almost 2 years of parallel sampling and monitoring data, which covered both the seasonal and inter-annual variations. Second, it adopted a double-blind method to ensure the objectivity of comparison between manual counting and automatic real-time monitoring. Third, the manual counting under the microscope was completed by fixed professional personnel to reduce objective bias among different subjects. Fourth, both simple and multiple linear regression models were tested and compared to select the most effective and feasible model.
There were also some limitations. The monitoring site in this study was in Shanghai, a city at a closer latitude, and with a more similar climate, to Japanese cities than most other cities in China. Our results provided evidence of feasibility, which might be more suitable to apply in cities with similar latitudes or climate. However, it is uncertain whether results can be extrapolated to other cities with different vegetation and climate characteristics. Second, due to our inability to access the backend data of the KH3000, particularly the specific “pollen window” data, the further classification and counting of pollen types was not be possible for validating the pollen type-specific counting by the KH3000. Thus, there is still a need to verify the accuracy and reliability of specific pollen species concentrations between KH3000 and Hirst-type data. From the perspective of pollen prevention, there are significant differences in the allergenicity of different pollen species. So, future research should incorporate artificial intelligence technology to develop the automatic real-time identification of pollen species, not only the total pollen numbers, to enhance the precise monitoring and forecasting of pollen species.