1. Introduction
Air pollution in Europe has become a serious issue in recent years with the recorded concentrations of PM
(particulate matter of aerodynamical diameter smaller than 2.5
m) [
1,
2]. High concentrations of PM
can cause detrimental human health problems. Daily PM
concentrations are very useful for health studies and evaluations of regulations. Therefore, their accurate prediction and modeling is of paramount importance.
Chemistry-Transport Models (CTMs) are numerical tools to predict PM
concentrations on different scales. However, these models are uncertain. For example, Van Loon et al. [
3] confronted the simulated aerosol concentrations over Europe against surface observations and found that the Root Mean Square Error (RMSE) between both of them is about 10
g·m
and that the correlation rarely exceeds 50%. Additionally, Majdi et al. [
4] examined the uncertainties on air quality modeling, fire emissions parameters and PM
concentrations threshold exceedances over the Euro-Mediterranean region during two severe fire events in summer 2007 and revealed that the statistical dispersion for PM
concentrations can be as high as 75% depending on the chemical mechanisms, the injection heights of fire emissions and the model’s vertical resolution. He et al. [
5] used GEOS-Chem model to simulate Black Carbon (BC) over the Tibetan plateau and revealed that the model succeeded in capturing the seasonal variability of surface BC at rural sites, but the observed wintertime peaks were not reproduced. Moreover, Prank et al. [
6] evaluated four regions using the measurements of PM chemical composition by the European Monitoring and Evaluation Program (EMEP) network. This study showed that the four models underestimate PM concentrations by 10–60% depending on the model and the simulation period and stressed the necessity of improving models performances. These discrepancies are due to the uncertainties related not only to meteorology [
7,
8,
9,
10], chemistry [
4,
11,
12] and emissions [
4,
7,
13] but also initial conditions [
14,
15,
16,
17]. Different studies investigated the improvement of CTMs performance through the improvement of emissions [
4,
7], meteorological conditions [
11] and chemical mechanisms [
4,
7], but other studies focused on improving the accuracy of the initial conditions for forecast applications [
18,
19,
20,
21].
Data assimilation (DA) is a powerful tool that exploits the available observations including airborne, ground-based or remote sensing observations in order to reduce model uncertainties on initial conditions. Several data assimilation approaches are available and range from statistical methods, such as optimal interpolation [
22], to variational [
18,
20,
21] and sequential such as Kalman filters and Ensemble Kalman Filters (EnKF) [
23]. For example, Liang et al. [
20] showed by comparing the control experiment involving no DA and an experiment involving DA of lidar Aerosol Extinction Coefficient (AEC) data that the 3-DVAR DA system was effective at assimilating lidar AEC data. While there were only five lidars within the simulation region, assimilating AEC data alone was still found to effectively improve the accuracy of the initial field, hence improving the forecast performance for PM
for more than 24 h. The lidar AEC DA can reduce the RMSE of the surface PM
mass concentration in the initial field of the model by 17.6%. Although variational and sequential data assimilation techniques are advanced and complex, the Objective Analysis (OA), known also as optimal interpolation is very straightforward and computationally very portable [
24,
25,
26]. Agudelo et al. [
27] used both the objective analysis and EnKF techniques to improve PM
estimates of AURORA model with ground-based measurements provided by IRCEL (the Belgian Interregional Environment Agency) over Belgium, Luxembourg, Germany and the Netherlands. They found that the model performances were more improved using the Optimal Interpolation (OI) than EnKF, as the mean bias was reduced from
to 0.04
g·m
using OI and to 0.75
g·m
using EnKF. In addition, Kumar et al. [
26] also used the AURORA model to assimilate ground-level ozone O
and nitrogen dioxide NO
concentrations over Belgium. The evaluation over 70 airbase stations showed that the correlation improved from 40 to 80% for O
and from 30 to 60% for NO
, and the RMSE was reduced from 27.9 to 12.6
g·m
for O
and from 17.4 to 11.0
g·m
for NO
during the month of June. During December, both the spatial correlation and the index of agreement of monthly means of both species’ concentrations improved considerably.
In meteorology, OA has been surpassed by 4D-Var or the EnKF [
28], but it is still a commonly used DA method in CTMs, as OA is simple to implement and is computationally cheaper than other DA methods [
16]. By contrast, 4D-Var assimilates observations over a time window, which could yield better results [
29] when the model is reliable. However, it is more complex to implement because the adjoining of the model is required in the 4D-Var method [
30,
31]. Refs. [
23,
32,
33] compared two different DA methods, the OA and the EnKF for aerosol forecasts. They reported that the EnKF delivers slightly better results than the OA, but the cost of implementation of the EnKF is higher than that of the OA due to the high number of required model simulations. The OA is then employed in this paper to sequentially assimilate observations.
Therefore, by combining observations and model results, the OA can be used to improve initial conditions of CTMs and hence improve PM predictions either directly using surface concentrations or indirectly by comparing the Aerosol Optical Depth (AOD) with the modeled one and make the correspondent adjustments [
34,
35]. In fact, Kaufman et al. [
36] showed, based on 7 years comparison that AOD from MODIS is highly correlated with daily average AOD measured from Aerosol Robotic Network (AERONET) over more than 50 sites worldwide and even better correlated with hourly PM
measurements.
However, the indirect OI using observations from satellites is more relevant and useful because surface and airborne observations are not only sparse but also spatially and temporally limited. For example, Tombette et al. [
25] assimilated particulate matter with a diameter lower than 10
m (PM
) data with the BDQA (Base de Donnees sur la Qualité de lÁir: the French national data base for air quality that covers France) using optimal interpolation implemented in the chemistry-transport model Polair3D of the air quality modeling system Polyphemus and found that the statistics are not improved because the effects of DA are overshadowed and the concentrations quickly become close to the concentrations without DA.
Satellite data have proven to be effective in improving model-derived AOD over the United States, Asia and the Indian ocean using the OA technique [
34,
37,
38]. For example, Tang et al. [
34] assimilated the AOD in Community Multi-scale Air Quality (CMAQ) modeling system and found that surface PM
concentrations biases over the Contiguous United States were reduced from
to 0.77
g·m
. In addition, Tang et al. [
35] adjusted the initial conditions of the CMAQ model to assimilate MODIS AOD using OI, and the model showed a great improvement, as the bias of surface PM
and ozone was reduced from 7.14 to −0.11
g·m
and from 2.54 ppbV to 1.06 ppbC, respectively.
The aim of this paper is to show how satellite retrievals can be assimilated in the Chimère model using the OA technique. The model is then evaluated in terms of both modeled PM concentrations and AOD improvements in order to quantify the spatio-temporal impact of assimilating MODIS AOD data on model performance.
The paper is structured as follow:
Section 2 describes the setups of the model and the DA system.
Section 3 discusses the results of different numerical experiments and the findings of different sensitivity studies.
3. Simulations Experiments Evaluation
Two simulations are performed with the same input data and parameterizations during the month of March 2009, which correspond to record pollution concentrations over Europe: the first one is the reference simulation without DA (called “Simulation without D.A.”), and the second one is run with the developed DA (called “Simulation with D.A.”).
Figure 5 shows maps of PM
concentrations and AOD field over Europe averaged over March 2009 and the relative difference between the two simulations in order to quantify the impact of OA on the model predictions. The white areas in
Figure 5 correspond to areas of relative difference equal to zero.
The effect of OA during the one-month period is significant. The main differences between the modeled concentrations of PM and AOD with and without OA are located over France (∼50–60%) and the southern part of Europe (∼50–60%): the northern part of Italy, Balkans, south of Greece and over the Mediterranean.
The modeled PM
concentrations (resp. AOD) are compared against hourly surface observations from AirBase (resp. AERONET) network in
Table 3 (resp.
Table 4).
Using the default simulation without the DA, the modeled surface concentrations of PM
are significantly underestimated by a factor of about 47% (the measured and modeled means are 38.01 and 20.01
g·m
, respectively). The goal criteria are not respected as the mean fractional bias and error are
and 51%, respectively. This underestimation is mainly due to the fact that the model highly underestimates PM
concentrations over the urban background stations but modeled PM
concentrations agree better with the observed concentrations over the rural background. The reasons of this underestimation were explained by Terrenoire et al. [
59] who showed Chimère has more difficulty in reproducing the PM concentrations during winter, especially at urban background stations because the model is not able to correctly simulate the stable meteorological conditions that lead to high PM episodes [
60].
During this winter month, PM
are mainly composed of primary organic compounds mainly emitted by industrial, traffic and biomass-burning anthropogenic sectors. The highest concentrations are located over North Eastern Europe, which is consistent with results found by Terrenoire et al. [
59].
Because of the OA, PM concentrations increased by 59% as the simulated means moved from 20.89 to 33.21 g·m and became closer to the observed mean (38.01 g·m). The RMSE also decreased from 30.05 to 15.13 g·m. The bias and error, moreover, were reduced from to % and from 51 to 39%, respectively.
In addition, the model performance improved as the modeled PM
concentrations using OA respect the goal criterion in a time when the modeled concentrations without OA respects only the performance criterion. Tombette et al. [
25] evaluated the OA effect using Polyphemus model on both PM
and PM
over 156 European stations and found that PM
concentrations increased by 7% and the bias was reduced from 55% to 49%, and over 8 European stations, PM
concentrations increased by a factor of 7%. Tang et al. [
35] also improved hourly PM
and ozone concentrations using OA over AIRNOW measurements as the mean bias improved from
to
g·m
and from 2.54 to 1.06 ppbV.
Similarly, the modeled AOD field over AERONET stations increased by a factor of 43% from 0.21 to 0.30. The correlation also slightly improved from 73 to 78%. Both the bias and error decreased from −45 to −27% and from 55 to 42%. The error of 42% can be explained by the fact that AERONET can make a much wider range of measurements throughout the day while MODIS onboard Terra and Aqua pass at 10:30 and 1:30 local time, respectively.
Figure 6 shows the scatter plots of hourly PM
concentrations and AOD modeled with and without the developed DA over AirBase and AERONET stations, respectively.
The number of compared data in the AERONET network is lower than the number of compared AirBase data because numerous missing data were found in the AERONET data. The correlation moved from 40 to 58%, and the slope of the scatter plot increased from 0.23 to 0.52. The same result is found when it comes to the modeled AOD (as the correlation increased from 73 to 90% and the slope of the scatter plot moved from 0.34 to 0.59). The analysis fields show an overestimation of the PM
concentrations. This overestimation (false alarms) can be explained by the fact that concentrations of PM with an aerodynamic diameter larger than 2.5
m are also sensitive to AOD
and therefore are accounted for in Equation (
1). A better distribution of AOD by particles size should be added to avoid false alarms in the forecasting system in future studies. In addition, the model was evaluated against the terrestrial part of the simulated domain as shown in
Figure 5. Because of the diurnal change in the boundary layer, as well as humidity and other factors that are very different over the sea surface than the terrestrial surface, this evaluation of the model and the DA impact needs to be further validated over the part of the Atlantic Ocean of the domain (dashed area in
Figure 5). Over the Mediterranean sea, the AirBase and AERONET stations over islands such as Corsica, Malta, the Balearic Islands, etc., can be considered as a validation over the Mediterranean sea.
Temporal Impact of the OA
In order to evaluate the effectiveness of the MODIS AOD assimilation, the time scale for which the OA affects the concentrations is worth exploring because the OA operates on the initial conditions.
Figure 7 shows the hourly evolution of PM
concentrations averaged over all grid points where the relative difference is over 20% to better estimate the influence of the DA.
Figure 7 shows that the effect of OA lasts nearly 13 h, which is comparable to the temporal impact found by Tombette et al. [
25] using OA along with Polyphemus model over Europe. After 13 h, the impact of the MODIS data on the modeled concentrations is low because the impact of the regional transport and local emissions start overshadowing the assimilation impact.
4. Sensitivity to the Creesman Influence Radius
The performance of the OA is tested using two different influence radii. A first (resp. second) simulation using an influence radius R (R) equal to (is twice) the grid cell spacing, which corresponds to 0.5 (1).
Table 5 shows the temporal and spatial sum of relative differences between the simulations with OA computed with R
and R
, respectively, over the cells where the effect of OI exceeds 20%. The comparison shows that the model performance is slightly sensitive to the influence radius value: the relative difference between the two simulations do not exceed 10.5 (12.3)% for PM
concentrations (AOD). This may be due to the low satellite observation density during winter because of dense cloud coverage.
Figure 8 shows the cloud coverage (MODIS Terra-corrected reflectance with a spatial resolution of 250 m) over Europe on four different days in March as examples of where the majority of the European land is masked by clouds. The results are expected to be more sensitive to the influence radius if the satellite data were of better quality or obtained during summertime.
The comparison of the two simulations against AirBase and AERONET data in
Table 6 and
Table 7 show that simulation with a radius of influence equal to the grid cell predicts PM
and AOD that have a better agreement with observations. The simulated mean of PM
concentrations (AOD) 33.21
g·m
(0.30) are closer to the observed means 38.01
g·m
(0.37). In addition, the goal criteria are respected with the analysis of the PM
concentration field simulated using R
.
5. Conclusions
In this study, we tested the assimilation method of objective analysis to adjust Chimère’s initial conditions for the simulation of PM and AOD over Europe during March 2009. The base model underestimated surface PM concentrations and AOD field because of the low resolution and uncertainties on meteorology and emissions. The objective analysis assimilation was able to correct the aerosol concentrations and AOD fields. However, the impact of adjusting initial conditions can be overshadowed by the local emissions and local winds, as they lasted for 13 h. The proposed model can be used in operational air quality forecasting, data validation and the verification of regulations.
Assimilating data from elevated levels or airborne measurements, although occasionally available, can be used to sharpen the adjustment of initial conditions and would improve PM
concentration and AOD field as it represents the loading of the aerosol column. However, precise knowledge on the vertical distribution of PM
using lidars [
61,
62] is required for at least two reasons: (1) it is better for quantifying air quality and its variability since, for example, the different vertical distribution of PM
near the Earth surface has very different impact on public health; (2) it is likely to significantly enhance the PM
estimation and provide data for model evaluation, improvement and development for the daily air quality forecast.
Future works could focus on investigating the OA impact on finer-resolution simulations, describing the background and observation covariance matrices better, using more complex data assimilation techniques such as 4D-var and ensemble Kalman filter, describing the emissions using a top-down approach (inverse modeling) better, combining the use of satellite data with ground based or airborne observations of the aerosol chemical composition (organics, sulfate, nitrate, sea-salts, etc.). The use of lidar observations could furthermore be very beneficial to improve the vertical distribution of aerosols.