1. Introduction
The
Ambrosia artemisiifolia L. or common ragweed is an invasive weed, recognised for its atopic properties. For both event analysis and operational forecast, numerous efforts have been devoted to model its emission and transport. A recent overview of ragweed studies [
1] showed that ragweed modelling is conducted using local statistical models (using observed meteorological data, for example), trajectories models, multi-parameter regression analyses, as well as regional models. While the physics of birch and ragweed pollen emissions is completely different, some models use the same physical basis to estimate these two different release terms, as in [
2], for example.
Assessing and predicting ragweed pollen emissions is challenging because they are sensitive to many different biological and meteorological factors. First, even though major efforts are currently made (see, e.g., refs. [
3,
4]), the location of ragweed plants is difficult to establish, as presented in ref. [
5] for France, in ref. [
6] for Serbia, refs. [
7,
8] in Western Europe. Second, for an area with clearly identified ragweed, seasonal weather conditions determine the phenology of the plant and its pollen production. Depending on these seasonal conditions, the yearly amount of pollen may be very different from one year to another [
9]. Finally, if the plants are present and climate conditions are favourable to the plant growth and flowering, hourly meteorological variability strongly influences the pollen release. A wealth of observational data were analysed to better understand the physical mechanisms underlying pollen emissions. One of the first foundational works is [
10] studying pollen release in the common ragweed. A few years later, Holmes et al. [
11] made measurements of hourly ragweed pollen concentrations during the summer of 1961 in Ottawa. For the first time, meteorological variables (relative humidity and air temperature) were measured together with ragweed pollen concentrations. It was shown that concentrations exhibited a diurnal peak during the morning when relative humidity suddenly decreased and temperature increased, except for the rainy days. No significant relationship with wind speed was found. The diurnal cycle of ragweed pollen was also studied by ref. [
12].
What are the main drivers for ragweed pollen emissions? To answer this question, several studies were conducted in the last few decades. Laaidi et al. [
13] analysed the ragweed pollen concentration data in Lyon (France) during the long-term period of 1987 to 1999. Using a statistical approach to relate meteorology and concentrations using a multi-parameter regression, they were able to predict the pollen season start with an error of 3 days at the maximum, and the duration of the pollen season with an error of 7 days at the maximum. The temperature range seems to be an important control factor, indicating the dehydration of pollens during the day, losing mass and thus more likely to be emitted under specific wind conditions. Makra et al. [
14] also analysed observational data and studied a possible relationship between ragweed pollen and meteorology for the period of 1997 to 2001 and in the city of Szeged (Hungary). They used 11 meteorological variables and proposed a complex relation to fit their data. A similar approach is presented in ref. [
15] over Rzeszow (Poland), quantifying the impact of temperature and wind speed on ragweed emissions. The strength of such studies is that they are close to the processes and are able to evaluate emissions taking into account the spatial representativeness of the measured concentrations. More recently, Matyasovszky et al. [
7] conducted a statistical study to identify the main drivers leading to ragweed pollen emissions, more precisely, the season start and end in Europe. They found a clear relationship between the date of the last frost day and the maximum amount of emitted pollen grains.
Regional modelling was initiated to better understand, quantify and predict the individual processes driving concentration variability (emission fluxes, long-range transport, deposition). It has motivated the development of pollen emission parameterizations. Over recent years, a few schemes have been proposed to estimate the pollen emission fluxes. Helbig et al. [
16] proposed a scheme for pollen emission and re-suspension with the KAMM/DRAIS model. In their model, pollen emissions are computed using a “characteristic” concentration (the sum of grains measured over one season), the leaf area index of the corresponding model grid cell and the friction velocity u
acting as a limiting factor. Pollen emissions are considered as a threshold process, similar to the saltation of mineral dust over arid areas. That is why resistances based on relative humidity and wind speed are included. Zink et al. [
17] used the Consortium for Small-scale Modelling (COSMO-ART) model to analyse a pollen episode observed over northern Germany in September 2006. They compared the local contribution and the import from Hungary (one of the European countries most exposed to ragweed spread) and found that at least 20% of the pollen counts could be attributed to transboundary inflow. Makra et al. [
18] found that the annual ragweed pollen amount transported by the atmospheric circulation is 27.8% of the annual total pollen at Szeged (Hungary). From this quantity, 7.5% is added to (due to transport), while 20.3% is subtracted from (e.g., because of wash-out by frontal rainfalls going towards Szeged) local sources. Sofiev et al. [
19] presented the first birch pollen forecast model, integrated into the SILAM model (System for Integrated Modelling of Atmospheric Composition) and applied it over the whole of Western Europe. The emissions were statistically prescribed. A few years later, Sofiev et al. [
20] presented a deterministic emission module, with the flowering season driven by the heat sum, including a probabilistic term. The module was limited to birch pollen emissions. Efstathiou et al. [
21] recently implemented a modified [
16] scheme in CMAQ and modelled the pollen period of 2002 over Newark (United States, NJ). Zink et al. [
2] presented a tunable scheme for different kinds of pollen (birch, ragweed). This scheme corresponds to the best fit between modelled emissions and recorded concentrations over several sites in Europe. However, correlations between observations and simulations were found to be insignificant. Prank et al. [
22] proposed a scheme for ragweed emissions dedicated to pollen forecasts. Chapman et al. [
4] modelled the ragweed spatial distribution and its evolution in the United States and in Europe. They showed that the phenology is able to correctly reproduce the propagation of this invasive plant. Liu et al. [
23] also implemented a modified version of the ref. [
16] scheme in the RegCM model with some surface characteristics determined using the CLM4.5 model. Regional models are now running together for daily forecast (the CAMS project) and for birch pollen, as explained in ref. [
24]: the models are CHIMERE, EMEP, EURAD-IM (EURopean Air Pollution Dispersion), LOTOS-EUROS, MATCH, MOCAGE (Modèle de Chimie Atmosphérique de Grande Echelle) and SILAM. Multi-model ensembles are also performed with numerous models as in ref. [
25] for olive pollen.
The main goal of this study is to quantify the impact of meteorological variables on ragweed pollen emission, more specifically the daily release. First, several meteorological variables are compared to local observations to identify possible correlations. Since collocated pollen-weather observations, including a consistent set of weather variables, are generally not available, we use here outputs from a mesoscale meteorological model. For representativeness and accuracy reasons, the low-resolution meteorological model can deviate from the actual meteorological context of the observation site. However, it remains the best available method to assess the capability of such regional models to calculate pollen concentrations over large areas, for past, present, and future studies (such as climate scenario studies). The observations from 2005 to 2011 and the model used are described in
Section 2. The meteorological fields used are described in
Section 3.1. Correlations between measured pollen counts and several modelled meteorological variables are presented in
Section 3.3. The main rationale for pollen emission models is presented in
Section 4. Among all processes for ragweed emissions, we focus on the daily release. In order to test an alternative formulation for this daily release, we implemented several schemes in the CHIMERE chemistry-transport model: (i) the SILAM model scheme [
20], (ii) the [
21] scheme and (iii) the new release term proposed in this study. The whole year of 2010 is modelled over Western Europe, and the modelled results are compared to measurements in
Section 5. Finally, conclusions and perspectives are proposed in
Section 6.
3. The Link between Pollen Concentration and Meteorology
In this section, we try to establish a link between the observed surface pollen concentration and the meteorology modelled for the corresponding location of the measurement. This link is searched by comparing daily data for those variables: it means that we want to quantify a possible relation between concentrations and meteorology mainly by studying the day-to-day variability.
3.1. The Modelled Meteorological Fields
3.1.1. The CORDEX Meteorological Simulations
In order to compare these daily pollen counts with meteorological variables, we could have used locally observed meteorological data or local meteorological model outputs (such as Large Eddy Simulation models), or meteorological fields after data assimilation. The choice was made to use regional meteorological fields modelled using the WRF model in the framework of the CORDEX experiment. Evaluation of this WRF configuration and of the simulations used was made in ref. [
41].
This choice to use modelled meteorological variables was driven by several constraints: first, some of the weather parameters tested are not available from observations at sites close to pollen monitoring sites. Second, our aim is to build a scheme able to model ragweed emission fluxes with a three-dimensional eulerian chemistry-transport model and over domains such as Europe (thus with a horizontal resolution of a few tens of kilometres, integrating various environments). However, to quantify the impact of this choice, we compared in the next section the observed and modelled the 2 m temperature and showed the differences being negligible in the context of this study.
3.1.2. The WRF Model Configuration
The meteorological variables were obtained from a simulation using the WRF regional model in its version 3.3.1. The model was used in its non-hydrostatic configuration, with a horizontal resolution of 0.44
× 0.44
and 32 vertical levels from the surface to 50 hPa, a similar configuration as that used in the EURO-CORDEX project [
42,
43]. The simulation uses ERA-Interim boundary conditions, as for the EURO-CORDEX simulations, but uses here a spectral nudging technique for the upper-air winds. This choice was made in order to allow the model to follow the large-scale circulation while leaving the model physics unconstrained. For the microphysics, the WRF Single Moment-5 class scheme was used, allowing for mixed-phase processes and super-cooled water [
44]. The radiation scheme is the RRTMG scheme with the MCICA method of random cloud overlap [
45]. The surface layer is based on the Monin–Obukhov scheme with Carlson–Boland viscous sub-layer. The surface physics was calculated using the Noah Land Surface Model scheme with four soil temperature and moisture layers [
46]. The planetary boundary layer physics was processed using the Yonsei University scheme [
47] and the cumulus parameterization uses the ensemble scheme of ref. [
48]. This model configuration is the same as that of ref. [
43]. The meteorological variables used in this study were extracted from the grid cell corresponding to the station location at a temporal frequency of three hours. These variables are listed in
Table 3.
3.2. Comparison between Observed and Modelled 2 m Temperature
Even when we have to use modelled meteorological variables to access all required parameters for release estimates, it is important to quantify if the use of the model will induce a bias in our results or not. One of the most important parameters for ragweed pollen daily release is the 2 m temperature. Fortunately, this variable is available over Europe with observations in the framework of the E-OBS database [
49,
50]. These only use raw station land data collected from about 3688 meteorological stations throughout Europe and the Mediterranean processed through a series of quality tests to remove errors and unrealistic values. A map of the network is presented in ref. [
41]. The three-step interpolation methodology is adapted to ensure that the daily data across the region is homogeneous. In the first step, the monthly mean is interpolated using three-dimensional thin-plate splines to define the spatial trend in the data. The next step is the kriging (minimizing the variance) of the monthly mean anomalies and, finally, applying the anomaly to the interpolated monthly mean to the daily estimate for the final result. The dataset provides the best estimate of the grid box average rather than point value. We, thus, compare time series for this variable and for the locations of the pollen counts measurements. The Pearson’s product moment correlation coefficient,
R, is calculated as:
This Pearson correlation coefficient is the ratio of the covariance between two data sets x and y and the product of their two standard deviations. A value of 1 is a complete positive correlation. Similarly, a value of −1 represents a complete negative correlation. An important additional piece of information is the significance of this correlation. It is estimated following the Student’s law probability . The closer is to zero, the more significant the correlation.
Comparisons between measured and modelled 2 m temperature (in °C) are presented in
Table 4. The comparison period is the whole year 2010, and the results show a very high correlation with values always higher than 0.97. In addition, the mean bias is lower than 1 °C. This clearly shows that the use of modelled meteorological variables is completely adapted to the calculation of daily pollen release variability, as presented in this study.
In order to better understand the differences between observations and model, and to discuss the daily variability of the 2 m temperature,
Figure 1 presents two time series, corresponding to the sites of BJELOVAR and ROUSSILLON. To better see the temperature values and daily variability, the time series are zoomed over the studied pollen period, from the 1st of June to the 30th of November. This example is for the year 2010. As the two sites are far from each other, the time series show different daily values and variability. However, in the two cases, the two modelled time series are very close to the observations. Clearly, the modelling of this variable is correct and, thus, is not the main reason if the pollen emissions are not realistic enough.
3.3. Statistics between Ragweed Pollen Concentrations and Meteorological Variables at Daily Time Scale
In order to calculate the correlation between the meteorological datasets, the parameterised release term and the observed pollen counts, the Pearson’s product moment correlation coefficient, R, is used. An important additional piece of information is the significance of this correlation. It is estimated following the Student’s law probability . The closer is to zero, the more significant the correlation.
Following previous studies, some meteorological variables are of particular importance for pollen emissions. For birch pollen, ref. [
16] showed that the most important parameter to take into account is the friction velocity. The surface temperature and relative humidity, as well as the wind speed, act as resistances to this emission flux. Sofiev et al. [
20] used the temperature heat-sum function to estimate the starting season, and the flux is moderated by meteorological factors, such as the wind speed, the relative humidity, and the precipitation rate.
Since the ragweed emission process is different from that of birch, we increase the number of potentially correlated parameters. In order to take into account the specific plant phenology, its height above the ground, its sensitivity to temperature and humidity, the examined meteorological variables are listed in
Table 3. For each parameter, the temporal averaging is also reported. Note that the ragweed plant being close to the ground, the 10 m wind speed is not taken into account, the near-surface dynamical processes being better represented by the friction velocity,
. The wind direction is not taken into account: even if this variable is important at a very local scale [
29,
51], it is not the case with our low horizontal resolution: we consider here that the emission is a column process and the wind direction is used in the model only for the horizontal transport. Results are presented in
Table 5 for the selected sites and for the year 2010. The largest two correlations are boldfaced. The results are dispersed and there is no meteorological parameter with a systematically high correlation value.
The highest correlations are for the 2 m temperature, with values ranging between 0.3 and 0.66. While experimental studies have shown that ragweed pollen emissions are more intense during the morning and thus depend on the morning temperature gradient, this does not clearly appear in our results. Recent studies have also shown that
is an important factor for ragweed pollen emissions. This was diagnosed under the form of the day length, and thus, the magnitude of daily received
, as presented in refs. [
1,
5], for example. This can be seen from our correlations, with values up to 0.38 for the daily mean (
) or the daily maximum (
). Furthermore, fair correlation values are found for
. This shows that the emissions are sensitive to the near-surface turbulent heat fluxes. For the four sites, the correlations range from 0.04 to 0.43, with mostly significant values.
Surprisingly, the correlations between concentrations and and are not systematically negative. For example, for ROUSSILLON and HUDEBR, the correlation is positive for . Here we used the specific humidity in place of the relative humidity, and this could be the reason. The specific humidity is selected here to represent the water vapour amount but also to be an independent variable from temperature. Finally, the highest correlations are found for thermal and radiative parameters, and , rather than dynamical parameters. The choice of a "time window" (e.g., daily mean or daily max) is not significant, except the morning for which the correlations are very low and not significant.
4. Modelling Ragweed Pollen Emission
In order to test the impact of a new emissions release term, we implemented three schemes into the CHIMERE chemistry-transport model. First, we implemented existing schemes already used in other models: (1)
P2013: the scheme implemented in the SILAM model as it follows the work of refs. [
20,
22], (2)
E2011: the scheme developed by [
21], a modified version of ref. [
16].
TS2021 (TS for this study): The third scheme corresponds to the [
21] scheme but with an alternative release function. For all emissions schemes, the flux (grains m
s
) may be expressed as:
where
is the ragweed density distribution in a number of individual plants per square meter.
is the annual production of grains per individual plant.
is the phenology factor in s
, considering its yearly integrated value is unity. This factor represents the knowledge of the start and end date of the pollen season as well as the shape of these potential emissions.
is the daily or sub-daily weather-dependent release of pollen grains in the atmosphere, which depends on the hourly (or daily) meteorological variables.
is unitless. These different terms correspond to two different temporal scales:
,
and
represent “annual” information, where
represents the “short-term” information for which we want to evaluate the correlation with the meteorological variables. In this work, we focus on the calculation of the
release factor.
4.1. The Ragweed Plant Fraction in Europe
For the three studied schemes, the terms
and
are the same. They are combined and available through the work of the CAMS pollen forecast group [
24]. The distribution of ragweed plants in Europe for the year 2010 is mapped. It is difficult to validate this map having no direct satellite information and the map itself is built using in-situ observations. However, since the goal of this study is to inter-compare emission schemes, and since the same map is used for the three schemes, its uncertainty will not impact the results. Note that new methods are under development and use, mixing in-situ measurements and satellite information on crops, as in ref. [
6]. The map used for this study is displayed in
Figure 2, where the value indicates the surface fraction occupied by the ragweed plant in a grid cell of resolution 1 × 1 km.
4.2. The Emissions Scheme
The emissions scheme of ref. [
22] is used in the paper, referred to as P2013 hereafter. It is the scheme of the SILAM model [
20]. Originally developed for birch pollen, it was adapted to ragweed by ref. [
22]. This SILAM version is directly implemented in CHIMERE and used without any changes. The goal of this paragraph is not to present this scheme again, which has been extensively described in these two publications. Only the main characteristics and differences with the other schemes are presented. Following P2013, the ragweed pollen emission flux is calculated as:
where
is the total number of pollen grains released from 1 m
during the whole season,
is the plant fraction in grid cell
,
is a relative emission intensity as a linear function of temperature.
H is the mean height of the plant. For ragweed,
H = 1 m.
is the emission flux fraction depending on the relative heat sum.
is the emission flux fraction depending on pollen quantity emitted since the start of the flowering season.
is the wind-dependent correction.
are the coefficients modulating emissions by humidity and precipitation (between 0 and 1), as:
For the relative humidity,
and
. For the precipitation,
,
= 0.5 mm h
. Wind speed
U and convective turbulence
impacts are described as:
with
the 10 m wind speed (m s
) and
w, the convective velocity scale (m s
).
= 0.5,
= 1 and
= 5 m s
. The gradual start of the flowering of the plants in a grid cell is described by the relative heat sum
, with the corresponding term
given by:
where
= 0.2 is a relative uncertainty of
. The parameter controlling the gradual flowering season termination is the number of grains remaining in catkins with respect to the initial total number of grains
:
. In the current model version,
is prescribed with
grains per year. An additional map containing a factor is delivered each year in the framework of the CAMS project to recalibrate
. The corresponding term
is given by:
where
= 0.2 is a relative uncertainty of
.
The heat sum, HS, concept follows the concept of ref. [
4], based on biological days. Some input information is necessary to update HS at each model time-step: the current 2 m temperature, the current day, its length in hours and the time step. The HS is updated only if the current day is after the first day, in our case, it is always equal to 79 (i.e., the 20 March). Two ramp functions are estimated, one depending on the 2 m temperature, the second one on the day length. The HS is then equal to:
with:
the
variable is equal to 1 except if the day length (
) is greater than the photo period parameter, here 14.5 and for the following values:
where the threshold temperature values are fixed. Note that this HS may be reset to zero depending on specific meteorological conditions as:
T TempThr (here TempThr = 273.15 K).
daily mean T DayTempThr (here DayTempThr = 280.65 K). Note that in this model version, the daily mean 2 m temperature is the running average for the last 24 h.
HS is lower than StartHSThr. This value is fixed here to StartHSThr = 25.0.
With this scheme, there are no emissions during the night. The calculation of sunrise and sunset is necessary.
4.3. The Emissions Scheme
The approach of ref. [
21], herefater called E2011, is a mix between the schemes of [
16,
19], adapting these formulations to ragweed. In the following, the specific notations of the publications are used to have a reference. The terminology is different from P2013 and [
16], but the principle is the same. The pollen emission flux,
E (grain m
s
), is calculated as:
The flux depending on the surface grid cell and time,
. The terms of the equation are:
a plant specific factor,
is the grain production factor. The release factor
is represented by (
).
is a time-varying factor depending on weather and
is the friction velocity. The plant specific factor,
, is calculated as:
where
is approximated to 10
.
S is the pollen season duration in days, with
S = 60.
d is the Julian day, varying during the simulation. In practice, the starting and ending dates of the ragweed period are fixed here to 210 and 270 Julian days, respectively. Compared to the values for 2010 in
Table 2, this period is slightly larger to enable very low emissions at the start and end of the period. The grain production factor
is calculated as:
where
is the total production of grains per year and is 10
grains m
. It corresponds to the maximum number of emitted grains. Day after day,
is reduced by the amount of the already emitted grains the day before. LAI is the leaf area index. LAI is a map in E2011, but in this study, we are using the value of LAI = 3.
= 1 m is the canopy height. The
variable is estimated as:
with the three meteorological limiting factors:
for humidity,
for 10 m wind speed and
for precipitation. For relative humidity, the limiting factor is expressed as Equation (
4) with
= 50% and
= 80%. For precipitation, we use the fit in E2011 as:
with
p the precipitation rate in mm h
. For the wind speed,
is based on P2013 wind correction previously described with Equation (
5).
4.4. This Study
The emissions scheme of this study, herefater called TS2021, has the same formulation as the E2011 scheme, except that the release term is reformulated. This release term formulation is inspired by the previously calculated statistical scores but has no direct link to the correlations values. The emissions flux is expressed as:
where
and
are the same functions as in the E2011 scheme.
(with TS for “this study”) is the new instantaneous release factor. Based on the correlation results of
Section 3.3, it appears that the main driving factors are those related to thermodynamical processes, namely the 2 m temperature,
, vertical velocity scale
w and shortwave radiation
. The pollen emissions may be moderated by precipitation rates
and 2 m specific humidity
.
The differences between birch and ragweed emissions could be explained by the plant typology itself: birch is a tree, with the pollen source up to 10 m above the ground. At this level, the wind may be considered as a dominant process for the emission of grains. Ragweed rarely exceeds 1 to 2 m above the ground, where the wind speed is moderate. In this case, the dominant factor could be the temperature, considering the grains are emitted under the highest temperature when they are sufficiently dry [
11]. The precipitation rate is a limiting factor but not the most important one: even if it rains during the night, the grains can dry out and can be pulled off the plant in the morning.
is thus estimated as:
where the values of
,
w and
correspond to the mean daily value. These values are normalised in order to keep the release term nondimensional. The normalization factors are
= 10 °C,
= 1 m s
and
= 200 W m
.
In order to moderate these fluxes when meteorological conditions are not favourable, resistance terms are added. These resistances are mainly due to the 2 m specific humidity
and the precipitation rate
. Each resistance is expressed as a sigmoid function ranging between 0 and 1, depending on minimal and maximal value of the
x parameter. The resistance has to reflect the fact that these parameters inhibit ragweed pollen emissions.
with
being a constant chosen here as
= 10, which determines the curve of the sigmoid function.
and
represent the range of the sigmoid and are here chosen as
= 0 and
= 1 in order to use a normalised function for each resistance. The critical issue here is to choose the minimum and maximum value for each
x meteorological parameter. These boundaries have to reflect the best possible range of variations of meteorological variables, for all locations over Europe and for the whole year. The maximum values must be moderate enough in order to provide a realistic resistance: a too low maximum value would give a resistance of 1 too often, while a too high maximum value would give too low resistances. Based on all meteorological values used in this study, the boundaries for the 2 m specific humidity are
= 0 and
= 5 × 10
g g
and for the precipitation rate are
= 0 and
= 1.5 mm h
.