Next Article in Journal
Application of LiDAR Remote Sensing for Aerosol Monitoring: Case Studies in Cyprus and Greece
Previous Article in Journal
Analyzing the Relationship Between Vegetation and Temperature Changes in the Sylhet Region
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

Effect of the Form of the Error Correlation Functions on Uncertainty in the Estimation of Atmospheric Aerosol Distribution When Using Spatial-Temporal Optimal Interpolation †

Institute of Physics, National Academy of Sciences of Belarus, 220072 Minsk, Belarus
*
Author to whom correspondence should be addressed.
Presented at the 7th International Electronic Conference on Atmospheric Sciences (ECAS-7), 4–6 June 2025; Available online: https://sciforum.net/event/ECAS2025.
Environ. Earth Sci. Proc. 2025, 34(1), 11; https://doi.org/10.3390/eesp2025034011
Published: 22 September 2025

Abstract

Spatial-temporal optimal interpolation (STOI) is a data assimilation method based on minimizing the error in an estimate. Error correlations can be modeled with analytical functions. We investigated the effect of the form of the error correlation functions on the uncertainty in an estimate. We applied STOI to the estimation of aerosol distribution over Europe using the results of GEOS-Chem chemical transport model simulations and observations from a ground-based radiometric Aerosol Robotic Network. We used exponential functions to model correlation curves. The results show that a ±15–25% change in the argument of the constructed exponential functions has little effect on the mean square error in the estimate in regions where observations are dense. STOI estimates are sensitive to the form of the correlation curves in regions where observations are sparse.

1. Introduction

A common approach to estimate the spatial-temporal distribution of atmospheric species properties is data assimilation [1,2,3]. This comprises methods to combine information from different sources for obtaining the best estimate of a system state. Data assimilation is based on minimizing the error in the estimate (optimal interpolation and Kalman filtering methods [4,5,6]) or on minimizing the cost function (variational methods [7,8,9]). Under certain conditions, variational methods turn out to be equivalent to optimal interpolation or Kalman filtering. Data assimilation techniques require knowing data error statistics for the observations and background (appropriate prior information).
When performing assimilation in practice, some approximations have to be carried out. The observation error covariance and background error covariance are never exactly known. Numerous works are dedicated to the estimation of the observation and background error covariance matrices using differences between the observations and the background [3,10,11,12]. The common assumption is that the background error field is stationary in time and uniform in space. In this case, correlation curves can be constructed that describe the dependence of the correlation coefficient (or covariance) of the errors at different points on the distance between the points. The observation error covariance matrix is often assumed to be diagonal, and can be estimated empirically. Even under these simplifying assumptions, the estimation of error covariance is challenging [3]. Correlation curves are commonly approximated by analytical functions with tuned parameters [1,13,14,15,16]. The background error covariance matrix should be positive definite, so an exponential function is often chosen to model background error correlations.
The performance of data assimilation schemes can be sensitive to the specification of the observation and background errors [2,3,14]. The error covariances are subject to uncertainties. In [3], a simplified synthetic example showed the importance of the specification and tuning of the error covariance matrices. However, in real data assimilation problems the impact of error covariance matrices is hard to estimate [3]. The goal of the present work was to evaluate if slight differences in error covariance matrices can significantly affect the results of real atmospheric data assimilation.
In the present work, we investigated the effect of the form of the background error correlation functions on uncertainty in the estimate of aerosol optical depth (AOD) when using the spatial-temporal optimal interpolation (STOI) technique. The estimation of error covariance matrices is crucial in various data assimilation methods in application to the estimation of a broad range of atmospheric characteristics. We chose STOI because it is a relatively simple and computationally cheap technique. We chose AOD as it is one of the most important quantities characterizing columnar atmospheric aerosol.

2. Materials and Methods

STOI [17,18] is one of several data assimilation methods. It was developed as an extended optimal interpolation method that uses both spatial and temporal background error covariance. STOI is based on estimation theory equations. A general form of the data assimilation equation is:
x a = x b + K [ y H ( x b ) ]
K = B H T ( H B H T + R ) 1
xa is a vector containing estimates at a regular grid, xb is a vector containing background values at a regular grid, y is a vector containing values of observations, K is a matrix containing weight coefficients, H is an observation operator providing the link between observation space and background space, B is a covariance matrix of background errors, and R is a covariance matrix of observation errors. Weight coefficients are chosen so as to minimize the mean square error in the estimate.
We applied STOI to the estimation of AOD distribution over Europe. To perform STOI, we used an output from the GEOS-Chem chemical transport model [19,20] simulations as a background, and observations from a ground-based radiometric network, AERONET [21,22,23].
AERONET is a global network of ground-based sun–sky–lunar radiometers. We used version 3 AERONET Level 2.0 cloud-screened and quality-assured daily averaged total AOD data at wavelengths of 440, 675, and 870 nm at 86 AERONET sites in Europe.
We obtained the background field using version v12.1.1 of the GEOS-Chem chemical transport model in the classical configuration. We calculated daily mean AOD at wavelengths 440, 675, and 870 nm over the nested European domain (32.75° N–61.25° N, 15° W–40° E) at 0.25° latitude × 0.3125° longitude spatial resolution for 2015–2016.
AERONET provides AOD data with low uncertainty that is much smaller than the uncertainty of AOD data simulated by GEOS-Chem [24,25]. This allows treating the AERONET AOD observations as a “true” field. Under this assumption, we undertook a preliminary bias correction for the AOD background field. The observation error was taken to be zero. We derived background error statistics empirically from background-minus-observation AOD values. The obtained statistics showed a large range in values of background error correlation coefficient. To specify an error correlation curve, we selected areas with the highest graphic density of values of background error correlation coefficient on the scatter plots of spatial and temporal dependencies. Within the selected areas, we modeled spatial and temporal correlation coefficients by exponential, gaussian and polynomial functions using the least-squares method. We chose the functions such that the correlation coefficient is equal to one when distance (for spatial correlation curve) or time interval (for temporal correlation curve) is equal to zero, and the correlation coefficient tends to zero at the correlation length. The exponential function provided the best fit. Therefore, for further research we used the exponential function.
Exponential spatial and temporal correlation functions obtained using the least-squares method are shown in Figure 1. The shape of the spatial correlation curves is almost similar for all three wavelengths; temporal correlation curves lie a little higher with a decrease in wavelength.
Then, we changed the arguments of the exponential functions to make the correlation curves change their shape. The change in the arguments was chosen to be about ±15–25% so that the correlation curves would lie at the upper and lower boundaries of the selected areas on the scatter plots described above. We considered three cases: (1) exponential functions obtained using the least-squares method; (2) exponential functions with artificially decreased arguments (estimates are closer to the observations); (3) exponential functions with artificially increased arguments (estimates are closer to the model output). Correlation curves for these three cases are shown in Figure 2 and Figure 3.
We calculated root mean square errors (RMSE) in the AOD values obtained with the GEOS-Chem simulation and the AOD estimates obtained using STOI for the three cases described above. RMSE were calculated for AERONET sites Granada, Lille, and Minsk. We calculated the reduction in RMSE of the estimate after STOI in comparison with the GEOS-Chem output. We compared the reduction in RMSE for the three cases of different shapes of correlation curves.

3. Results

The results of the comparison of the reduction in RMSE for three cases of different shapes of correlation curves are presented in Table 1, Table 2 and Table 3. Table 1 refers to case 1, correlation curves modeled by exponential functions obtained using the least-squares method. Table 2 refers to case 2, correlation curves modeled by exponential functions with artificially decreased arguments. Table 3 refers to case 3, correlation curves modeled by exponential functions with artificially increased arguments. In the tables, d is distance in kilometers in the arguments of the constructed exponential spatial correlation functions; t is time interval in days in the arguments.

4. Discussion and Conclusions

Granada, Lille, and Minsk regions differ in the density of observational AERONET sites. Minsk is located in the eastern European region where observations are sparse. Observations are rarer in Lille than in Granada because Lille is located in the northern part of Europe with a smaller number of sunny days than in the southern part of Europe where Granada is located.
Comparison of the results presented in Table 1, Table 2 and Table 3 shows that a ±15–25% change in the argument of the constructed exponential correlation functions did not affect RMSE at the estimation points in regions where observations are dense (Granada and Lille). STOI estimations were sensitive to the shape of the correlation curve at points in the region where observations were sparse (Minsk). This can be because in regions where AERONET site density is high, the weight of observations is large enough for all three cases of correlation curve shapes. By contrast, in regions where observations are sparse, the increased weight given to observations is of significant importance. Therefore, the accuracy of STOI is slightly better in regions with sparse observations when estimates are closer to the observations than to the model output.
The practical importance of the results is that there is no need for fine-tuning of the correlation curves when using AOD spatial-temporal optimal interpolation. We assume that this conclusion may be extended to other data assimilation techniques and other atmospheric characteristics, but this requires further research.

Author Contributions

Conceptualization, N.M. and A.C.; methodology, N.M.; software, N.M. and A.B.; validation, N.M. and A.B.; investigation, N.M.; data curation, N.M., A.B. and A.C., writing—original draft preparation, N.M.; writing—review and editing, A.B. and A.C.; visualization, N.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

AERONET data are freely available from https://aeronet.gsfc.nasa.gov (accessed on 11 July 2025). GEOS-Chem simulation and the STOI data generated in this study are freely available from http://scat.bas-net.by/~assimilation/ (accessed on 11 July 2025).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kalnay, E. Atmospheric Modeling, Data Assimilation and Predictability; Cambridge University Press: Cambridge, UK, 2003. [Google Scholar]
  2. Bocquet, M.; Elbern, H.; Eskes, H.; Hirtl, M.; Žabkar, R.; Carmichael, G.R.; Flemming, J.; Inness, A.; Pagowski, M.; Pérez Camaño, J.L.; et al. Data Assimilation in Atmospheric Chemistry Models: Current Status and Future Prospects for Coupled Chemistry Meteorology Models. Atmos. Chem. Phys. 2015, 15, 5325–5358. [Google Scholar] [CrossRef]
  3. Tandeo, P.; Ailliot, P.; Bocquet, P.; Carrassi, A.; Miyoshi, T.; Pulido, M.; Zhen, Y. A Review of Innovation-Based Methods to Jointly Estimate Model and Observation Error Covariance Matrices in Ensemble Data Assimilation. Mon. Weather Rev. 2020, 148, 3973–3994. [Google Scholar] [CrossRef]
  4. Candiani, G.; Carnevale, C.; Finzi, G.; Pisoni, E.; Volta, M. A Comparison of Reanalysis Techniques: Applying Optimal Interpolation and Ensemble Kalman Filtering to Improve Air Quality Monitoring at Mesoscale. Sci. Total Environ. 2013, 458–460, 7–14. [Google Scholar] [CrossRef] [PubMed]
  5. Perrot, A.; Pannekoucke, O.; Guidard, V. Toward a Multivariate Formulation of the Parametric Kalman Filter Assimilation: Application to a Simplified Chemical Transport Model. Nonlinear Process. Geophys. 2023, 30, 139–166. [Google Scholar] [CrossRef]
  6. Li, K.; Bai, K.; Li, Z.; Guo, J.; Chang, N.-B. Synergistic Data Fusion of Multimodal AOD and Air Quality Data for Near Real-Time Full Coverage Air Pollution Assessment. J. Environ. Manag. 2022, 302, 114121. [Google Scholar] [CrossRef] [PubMed]
  7. Zang, Z.; You, W.; Ye, H.; Liang, Y.; Li, Y.; Wang, D.; Hu, Y.; Yan, P. 3DVAR Aerosol Data Assimilation and Evaluation Using Surface PM2.5, Himawari-8 AOD and CALIPSO Profile Observations in the North China. Remote Sens. 2022, 14, 4009. [Google Scholar] [CrossRef]
  8. Tian, X.; Zhang, H.; Feng, X.; Li, X. i4DVar: An Integral Correcting Four-Dimensional Variational Data Assimilation Method. Earth Space Sci. 2021, 8, e2021EA001767. [Google Scholar] [CrossRef]
  9. Huang, B.; Pagowski, M.; Trahan, S.; Martin, C.R.; Tangborn, A.; Kondragunta, S.; Kleist, D.T. JEDI-based Three-Dimensional Ensemble-Variational Data Assimilation System for Global Aerosol Forecasting at NCEP. J. Adv. Model. Earth Syst. 2023, 15, e2022MS003232. [Google Scholar] [CrossRef]
  10. Desroziers, G.; Berre, L.; Chapnik, B.; Poli, P. Diagnosis of Observation, Background and Analysis-Error Statistics in Observation Space. Q. J. R. Meteorol. Soc. 2005, 131, 3385–3396. [Google Scholar] [CrossRef]
  11. Bocquet, M.; Raanes, P.N.; Hannart, A. Expanding the Validity of the Ensemble Kalman Filter without the Intrinsic Need for Inflation. Nonlinear Process. Geophys. 2015, 22, 645–662. [Google Scholar] [CrossRef]
  12. Kotsuki, S.; Ota, Y.; Miyoshi, T. Adaptive Covariance Relaxation Methods for Ensemble Data Assimilation: Experiments in the Real Atmosphere. Q. J. R. Meteorol. Soc. 2017, 143, 2001–2015. [Google Scholar] [CrossRef]
  13. Collins, W.D.; Rasch, P.J.; Eaton, B.E.; Khattatov, B.V.; Lamarque, J.-F.; Zender, C.S. Simulating Aerosols Using a Chemical Transport Model with Assimilation of Satellite Aerosol Retrievals: Methodology for INDOEX. J. Geophys. Res. 2001, 106, 7313–7336. [Google Scholar] [CrossRef]
  14. Barth, A.; Azcárate, A.A.; Joassin, P.; Jean-Marie, B.; Troupin, C. Introduction to Optimal Interpolation and Variational Analysis. In Proceedings of the SESAME Summer School, Varna, Bulgaria, 26 November 2008. [Google Scholar]
  15. Xia, X.; Zhang, T.; Wang, L.; Gong, W.; Zhu, Z.; Wang, W.; Gu, Y.; Lin, Y.; Zhou, X.; Dong, J.; et al. Spatial–Temporal Fusion of 10-Min Aerosol Optical Depth Products with the GEO–LEO Satellite Joint Observations. Remote Sens. 2023, 15, 2038. [Google Scholar] [CrossRef]
  16. Yu, H.; Dickinson, R.E.; Chin, M.; Kaufman, Y.J.; Holben, B.N.; Geogdzhayev, I.V.; Mishchenko, M.I. Annual Cycle of Global Distributions of Aerosol Optical Depth from Integration of MODIS Retrievals and GOCART Model Simulations. J. Geophys. Res. 2003, 108, 4128. [Google Scholar] [CrossRef]
  17. Miatselskaya, N.; Milinevsky, G.; Bril, A.; Chaikovsky, A.; Miskevich, A.; Yukhymchuk, Y. Application of Optimal Interpolation to Spatially and Temporally Sparse Observations of Aerosol Optical Depth. Atmosphere 2023, 14, 32. [Google Scholar] [CrossRef]
  18. Miatselskaya, N.; Bril, A.; Chaikovsky, A. Using Chemical Transport Model and Climatology Data as Backgrounds for Aerosol Optical Depth Spatial–Temporal Optimal Interpolation. Atmosphere 2025, 16, 623. [Google Scholar] [CrossRef]
  19. GEOS-Chem. Available online: https://geoschem.github.io (accessed on 9 July 2025).
  20. Bey, I.; Jacob, D.J.; Yantosca, R.M.; Logan, J.A.; Field, B.D.; Fiore, A.M.; Li, Q.; Liu, H.Y.; Mickley, L.J.; Schultz, M.G. Global Modeling of Tropospheric Chemistry with Assimilated Meteorology: Model Description and Evaluation. J. Geophys. Res. 2001, 106, 23073–23096. [Google Scholar] [CrossRef]
  21. NASA. Goddard Spase Flight Center; AERONET.; Aerosol Robotic Network. Available online: https://aeronet.gsfc.nasa.gov/ (accessed on 9 July 2025).
  22. Holben, B.N.; Tanre, D.; Smirnov, A.; Eck, T.F.; Slutsker, I.; Abuhassan, N.; Newcomb, W.W.; Schafer, J.; Chatenet, B.; Lavenue, F.; et al. An Emerging Ground-Based Aerosol Climatology: Aerosol Optical Depth from AERONET. J. Geophys. Res. 2001, 106, 12067–12097. [Google Scholar] [CrossRef]
  23. Dubovik, O.; King, M.D. A Flexible Inversion Algorithm for Retrieval of Aerosol Optical Properties from Sun and Sky Radiance Measurements. J. Geophys. Res. 2000, 105, 20673–20696. [Google Scholar] [CrossRef]
  24. Li, S.; Garay, M.J.; Chen, L.; Rees, E.; Liu, Y. Comparison of GEOS-Chem Aerosol Optical Depth with AERONET and MISR Data over the Contiguous United States. J. Geophys. Res. 2013, 118, 11228–11241. [Google Scholar] [CrossRef]
  25. Curci, G.; Hogrefe, C.; Bianconi, R.; Im, U.; Balzarini, A.; Baró, R.; Brunner, D.; Forkel, R.; Giordano, L.; Hirtl, M.; et al. Uncertainties of Simulated Aerosol Optical Properties Induced by Assumptions on Aerosol Physical and Chemical Properties: An AQMEII-2 Perspective. Atmos. Environ. 2015, 115, 541–552. [Google Scholar] [CrossRef]
Figure 1. (a) Spatial and (b) temporal correlation coefficient at 440 (violet lines), 675 (green), and 870 (red) nm.
Figure 1. (a) Spatial and (b) temporal correlation coefficient at 440 (violet lines), 675 (green), and 870 (red) nm.
Eesp 34 00011 g001
Figure 2. Spatial correlation coefficient modeled by an exponential function obtained using the least-squares method (case 1); modeled by an exponential function with artificially decreased argument (case 2); modeled by an exponential function with artificially increased argument (case 3).
Figure 2. Spatial correlation coefficient modeled by an exponential function obtained using the least-squares method (case 1); modeled by an exponential function with artificially decreased argument (case 2); modeled by an exponential function with artificially increased argument (case 3).
Eesp 34 00011 g002
Figure 3. Spatial correlation coefficient at (a) 440 nm; (b) 675 nm; (c) 870 nm: modeled by an exponential function obtained using the least-squares method (case 1); modeled by an exponential function with artificially decreased argument (case 2); modeled by an exponential function with artificially increased argument (case 3).
Figure 3. Spatial correlation coefficient at (a) 440 nm; (b) 675 nm; (c) 870 nm: modeled by an exponential function obtained using the least-squares method (case 1); modeled by an exponential function with artificially decreased argument (case 2); modeled by an exponential function with artificially increased argument (case 3).
Eesp 34 00011 g003
Table 1. Constructed correlation functions and reduction in root mean square errors (RMSE) for case 1.
Table 1. Constructed correlation functions and reduction in root mean square errors (RMSE) for case 1.
440 nm675 nm870 nmAveraged
Spatialexp (−0.00207d)exp (−0.00215d)exp (−0.00204d)
Temporalexp (−0.298t)exp (−0.384t)exp (−0.424t)
Reduction in RMSE
Granada54%61%62%59%
Lille18%13%14%15%
Minsk24%17%16%19%
Table 2. Constructed correlation functions and reduction in RMSE for case 2.
Table 2. Constructed correlation functions and reduction in RMSE for case 2.
440 nm675 nm870 nmAveraged
Spatialexp (−0.0015d)exp (−0.0015d)exp (−0.0015d)
Temporalexp (−0.25t)exp (−0.335t)exp (−0.375t)
Reduction in RMSE
Granada53%61%62%59%
Lille18%13%14%15%
Minsk24%19%19% 21%
Table 3. Constructed correlation functions and reduction in RMSE for case 3.
Table 3. Constructed correlation functions and reduction in RMSE for case 3.
440 nm675 nm870 nmAveraged
Spatialexp (−0.0025d)exp (−0.0025d)exp (−0.0025d)
Temporalexp (−0.35t)exp (−0.435t)exp (−0.475t)
Reduction in RMSE
Granada54%61%63%59%
Lille18%13%14%15%
Minsk23%15%12%17%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Miatselskaya, N.; Bril, A.; Chaikovsky, A. Effect of the Form of the Error Correlation Functions on Uncertainty in the Estimation of Atmospheric Aerosol Distribution When Using Spatial-Temporal Optimal Interpolation. Environ. Earth Sci. Proc. 2025, 34, 11. https://doi.org/10.3390/eesp2025034011

AMA Style

Miatselskaya N, Bril A, Chaikovsky A. Effect of the Form of the Error Correlation Functions on Uncertainty in the Estimation of Atmospheric Aerosol Distribution When Using Spatial-Temporal Optimal Interpolation. Environmental and Earth Sciences Proceedings. 2025; 34(1):11. https://doi.org/10.3390/eesp2025034011

Chicago/Turabian Style

Miatselskaya, Natallia, Andrey Bril, and Anatoly Chaikovsky. 2025. "Effect of the Form of the Error Correlation Functions on Uncertainty in the Estimation of Atmospheric Aerosol Distribution When Using Spatial-Temporal Optimal Interpolation" Environmental and Earth Sciences Proceedings 34, no. 1: 11. https://doi.org/10.3390/eesp2025034011

APA Style

Miatselskaya, N., Bril, A., & Chaikovsky, A. (2025). Effect of the Form of the Error Correlation Functions on Uncertainty in the Estimation of Atmospheric Aerosol Distribution When Using Spatial-Temporal Optimal Interpolation. Environmental and Earth Sciences Proceedings, 34(1), 11. https://doi.org/10.3390/eesp2025034011

Article Metrics

Back to TopTop