Next Article in Journal
Three-Dimensional Calibration for Routine Analyses of Bromide and Nitrate Ions as Indicators of Groundwater Quality in Coastal Territories
Next Article in Special Issue
Reliability of Low-Cost, Sensor-Based Fine Dust Measurement Devices for Monitoring Atmospheric Particulate Matter Concentrations
Previous Article in Journal
Dietary Lead Exposure and Associated Health Risks in Guangzhou, China
Previous Article in Special Issue
Influence of Straw Burning on Urban Air Pollutant Concentrations in Northeast China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Application of the Orthogonal Polynomial Fitting Method in Estimating PM2.5 Concentrations in Central and Southern Regions of China

1
College of Ocean Science and Engineering, Shandong University of Science and Technology, Qingdao 266590, China
2
Physical Oceanography Laboratory/CIMST, Ocean University of China and Qingdao National Laboratory for Marine Science and Technology, Qingdao 266200, China
3
First Institute of Oceanography, Ministry of Natural Resources and Laboratory for Regional Oceanography and Numerical Modeling, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266061, China
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2019, 16(8), 1418; https://doi.org/10.3390/ijerph16081418
Submission received: 5 March 2019 / Revised: 13 April 2019 / Accepted: 14 April 2019 / Published: 19 April 2019
(This article belongs to the Special Issue Air Quality Monitoring and Assessment)

Abstract

:
Sufficient and accurate air pollutant data are essential to analyze and control air contamination problems. An orthogonal polynomial fitting (OPF) method using Chebyshev basis functions is introduced to produce spatial distributions of fine particle (PM2.5) concentrations in central and southern regions of China. Idealized twin experiments (IE1 and IE2) are designed to validate the feasibility of the OPF method. IE1 is designed in accordance with the most common distribution of PM2.5 concentrations in China, whereas IE2 represents a common distribution in spring and autumn. In both idealized experiments, prescribed distributions are successfully estimated by the OPF method with smaller errors than kriging or Cressman interpolations. In practical experiments, cross-validation is employed to assess the interpolation results. Distributions of PM2.5 concentrations are well improved when OPF is applied. This suggests that errors decrease when the fitting order increases and arrives at the minimum when both orders reach 6. Results calculated by the OPF method are more accurate than kriging and Cressman interpolations if appropriate fitting orders are selected in practical experiments.

1. Introduction

Fine particles (PM2.5) are particulate matters suspended in the air with diameters within 2.5 μm. PM2.5 is usually reported as a principal pollutant of air contamination. Exposure to highly concentrated PM2.5 affects the respiratory and cardiovascular systems and might cause heart and lung disease [1,2]. Moreover, PM2.5 is thought to be involved in some processes of neurodegeneration [3]. It was estimated that the global premature mortality rate associated with PM2.5 was at 3.15 million per year in 2010, with China being the leading country at about 1.33 million [4]. PM2.5 in 2015 contributed as much as 40.3% to total stroke deaths, 26.8% to ischemic heart disease deaths and 23.9% to lung cancer deaths [5]. The threat of PM2.5 pollution on public health has become increasingly remarkable. Therefore, there has been continued interest in computing and predicting the concentrations of PM2.5 and its effects on human health [6,7]. Spatial variation of pollutant concentration fields should be taken into account when computing air pollutant concentrations [8,9]. Estimations of accurate spatial variations in PM2.5 concentrations will provide valuable information for health risks and air quality control programs. Sufficient and accurate pollutant data are required to analyze and control air contamination problems. However, the observations of PM2.5 concentration are limited. Observational data can only be obtained from monitoring stations which are distributed sparsely in space [10,11]. In fact, comparatively accurate and sufficient PM2.5 data can be estimated with the help of interpolation for use in air pollution investigations and its effects on health.
Spatial interpolation techniques are essential for estimating PM2.5 variations. Many interpolation methods, such as kriging and Cressman interpolations, have been widely used in atmospheric subjects [8,9,12,13,14]. Lee et al. [12] developed a space-time geostatistical kriging model to predict PM2.5 fields over continental United States and found the kriging estimate was more accurate for locations near monitoring stations. Sampson et al. [13] introduced a regionalized national universal kriging model for estimating annual averaged PM2.5 concentrations across the U.S. Their universal kriging model provides the basis for prediction at arbitrary spatial locations. Physick et al. [8] estimated exposure to ambient concentrations based on Cressman interpolation.
With increasing application of spatial interpolation methods, there is a growing concern about accuracy and precision [15,16,17]. Kriging and Cressman interpolations are two kinds of interpolation methods widely used in various fields because of their high accuracy—but there are still defects in these methods. The kriging interpolation provides a description of data spatial structure and variance estimation, which depends on the expression of spatial variation of the property in terms of the variogram [10]. However, it is time-consuming and cumbersome because of the need to calculate inverse-matrices [18]. Cressman interpolation, in which the spatial influence of an observation is weighted as a function of distance, is flexible and easy to implement [19]. When Cressman interpolation is applied to datasets with large distances between grid points, it produces relatively large errors [15,17]. So, Cressman interpolation needs to be modified to ensure that the interpolated result is close to the observation [20,21]. Some interpolation methods, such as spline interpolation, radial basis function interpolation and orthogonal polynomial fitting (OPF), are widely used in the fields of artificial intelligence, image processing and oceanic subjects [16,17,22,23]. Those methods are more accurate and efficient and need to be introduced to applications with atmospheric subjects. Orthogonal polynomial fitting is one of the methods combining accuracy with efficiency. The application of orthogonal polynomial fitting has been studied for decades [24,25,26,27,28,29,30]. In our study, an orthogonal polynomial fitting method based on Chebyshev basis functions is introduced to estimate PM2.5 concentrations in central and southern regions of China. The details of this method are shown in part of Appendix A. The paper is organized as follows. In Section 2 the monitoring data of PM2.5 concentrations are introduced. In Section 3, twin idealized experiments in central and southern China are performed. Practical experiments are also carried out in this section. Section 4 presents the conclusions.

2. Monitoring Data

The monitoring data are from the China National Environmental Monitoring Center. The purpose of observation is to investigate the temporal/spatial distribution and variation of PM2.5, PM10 and various air pollution factors in China, to assess and evaluate the current situation and the variation trend of the atmospheric environment and to diagnose the air pollution problems. Daily averaged urban air quality observation data are provided by the China National Environmental Monitoring Center. There are more than 1400 stations monitoring the air quality of urban areas. Monitoring stations are well distributed in the districts of 337 cities. The PM2.5 concentration of a city is obtained by averaging the data from monitoring stations within that city. The number of cities located in the study zone is 225. Data of those 225 cities are used in this study. The observation period is a year (from January to December 2017). Data from 27 December 2017, 17 November 2017 and 3 October 2017 are analyzed in practical experiments. The distribution of the monitored cities is shown in Figure 1.

3. Results and Discussion

3.1. Idealized Twin Experiments

Two types of PM2.5 distributions are employed to evaluate the effectiveness of the OPF method. In idealized experiment 1 (IE1), PM2.5 concentrations are prescribed as one heavy-contamination center located in the northwest region and one slight-contamination center in the southeast region, which is the most common air pollution distribution in China according to observations. In idealized experiment 2 (IE2) the concentrations are prescribed according to a common distribution in spring and autumn. In IE2, the heavy-contamination center is located in the southeast region and the slight-contamination center is in the southwest region. The distributions of IE1 and IE2 are shown in Figure 2a,b, respectively.
For the Cressman interpolation, the influence radius is initially set to 4°. If there are at least five monitored cities within the influence radius of a grid, the PM2.5 concentration of that grid will be computed. Otherwise the influence radius is increased by 0.1° until there are enough cities within the influence area. For the kriging interpolation, the semivariogram model in our study is chosen as a spherical semivariogram model. The range and sill were set to 10° and 1°, respectively. For the OPF method, the results theoretically become more accurate with an increase in polynomial order, if there are enough observational data. In fact, the amount of data available for analysis is limited and therefore polynomial orders should be selected in a reasonable range according to observations. The range of orders in this paper is limited to 10. In the idealized twin experiments, the orders of x and y directions are set to 5. The relationship between the selection of polynomial orders and the accuracy of estimation is discussed in the practical experiments section.
In IE1, the interpolation results estimated by OPF, kriging and Cressman methods are shown in Figure 3. Spatial distributions of absolute errors of PM2.5 in IE1 are shown in Figure 4. For the OPF method, large deviations (>15 μg·m−3) only appear in the northwest region and there is no region with errors larger than 20 μg·m−3. For kriging interpolation, large deviations are mainly concentrated in Shandong peninsula, which is located in the northeast of the study zone. For Cressman interpolation, errors in Shandong peninsula as well as the southwest region of the study zone are relatively large. The mean absolute error (MAE) between estimated results and prescribed values at sampling points is often used as a precision indicator for interpolation methods [27]. The spatially averaged MAEs of OPF, kriging and Cressman interpolation results are 4.89, 5.91 and 7.87 μg·m−3, respectively. This suggests that the OPF method produces a better result than the kriging and Cressman interpolations, which estimates the PM2.5 concentrations in IE1 to be closest to the prescribed field.
In IE2, the estimated results calculated by those methods are shown in Figure 5. Distributions of absolute errors are displayed in Figure 6. In IE2, the spatially averaged MAEs of OPF, kriging and Cressman estimations are 5.32, 6.56 and 7.84 μg·m−3, respectively. This suggests that the result estimated by the OPF method, in which the MAE is the smallest among those methods, is the closest to the prescribed field. Figure 7b shows components of absolute errors in IE2 and the corresponding results in IE1 are shown in Figure 7a. With the OPF method, more than 50% of the errors are smaller than 5 μg·m−3 in IE2 and small errors (≤5 μg·m−3) explain nearly 70% of the errors in IE1. The amount of large deviations with the OPF method is apparently less than those with kriging or Cressman interpolation in both IE1 and IE2.
The root mean square error (RMSE) between the estimated results and prescribed values is another indicator evaluating the accuracy of interpolation results [31]. The MAE represents the average distance between estimations and observations. Using MAE as an indicator can avoid canceling out errors with opposite signs, making it relatively accurate in reflecting the magnitude of errors. However, MAE is not sensitive to large errors. RMSE, which is more sensitive to large errors, is based on the sum of squared errors. Small amounts of very large errors will result in an obvious increase in the RMSE. The MAE tends to be smaller than the RMSE because the RMSE penalizes large errors while the MAE gives the same weight to all errors, whereas the sensitivity of the RMSE to outliers may induce ambiguities [32]. On the other hand, MAE is a natural measure of average error magnitude and that means it is an unambiguous measure. By employing both MAE and RMSE as indicators, the accuracy of the estimated results can be validated comprehensively.
The RMSEs of the idealized twin experiments are shown in Figure 8. With the OPF method, the RMSE in IE1 is slightly larger than that of IE2, owing to the existence of larger errors (>15 μg·m−3 in MAE). With kriging or Cressman interpolation, the RMSEs in both idealized experiments are larger than those with the OPF method, indicating the accuracy of OPF, which agrees well with the result of the MAE analysis. The results of the idealized twin experiments reveal that the OPF method reconstructs PM2.5 distributions closer to the prescribed field than kriging or Cressman interpolation. This suggests that the estimation will apparently be more accurate if the OPF method is adopted.

3.2. Practical Experiments

As described in Section 3.1, OPF is an effective method to recover the prescribed PM2.5 concentrations, which encourages us to further apply this method to practical cases. Cross-validation is employed to assess the interpolation results [33]. In this study, the PM2.5 monitoring dataset is randomly split into eight subsets. In each subset, the data of 15 observed cities, which are well distributed in the study zone, are selected as set A, while the data of the other cities are referred to as set B. The data in set B are used to interpolate, meanwhile the accuracy of estimation is evaluated with the data in set A. Cross-validation is repeated eight times, represented by CV1 to CV8 in each practical experiment.
Practical experiment 1 (PE1) represents the experiment estimating distribution of PM2.5 concentrations on 2017 December 27. With the OPF method, PM2.5 distributions are calculated 100 times (polynomial orders of x and y directions each increase from 1 to 10) in the processes of cross-validation. Results and MAEs of the validation cities are averaged. The errors in PE1 are obtained by averaging the MAEs from all the cross-validations. Figure 9 demonstrates the MAE variation with orders. The MAEs decrease with the polynomial orders increasing and arrive at the minimum when s and k (polynomial orders of x and y directions) are both set to 6. The results suggest that errors are large when fitting orders are low and errors become extremely large when high polynomial orders are employed. Amplifying polynomial orders involves calculating much more numbers of polynomial coefficients, requiring a large amount of monitoring data. However, the amount of data is insufficient to support the accurate calculation of all the polynomial coefficients. The deviation of polynomial coefficients induces large errors in the results. It can be speculated that the estimations will become much closer to the observation when high-order OPF is applied, if enough monitoring data are available. In circumstances with limited numbers of observations, the polynomial orders should be selected within a reasonable range.
PE1 is continued with fitting orders (s and k) both set to be 6 with the aim of producing relatively accurate results. The averaged MAE is 9.48 μg·m−3 (close to that in CV5) of the OPF method. Figure 10a demonstrates the OPF estimations of PM2.5 concentrations in CV5. The averaged MAE of kriging interpolation is 11.60 μg·m−3, which is close to that in CV6. For Cressman interpolation, the averaged MAE is 13.02 μg·m−3, close to that in CV4. Figure 10b,c represent interpolation results computed by kriging and Cressman interpolations, respectively. The estimation with the OPF method presents lower errors than those with kriging and Cressman interpolations in PE1. Averaged MAEs of all the cross-validations estimated by those three methods are shown in Figure 10d. The results estimated by the OPF method are more accurate than those estimated by kriging interpolation in most of the cross-validation processes except for CV2, in which the difference between those two methods is relatively small. The MAEs of Cressman interpolation are always larger than those of OPF in all the eight cross-validation processes. The results of those methods show similar spatial patterns, yet differences also exist. PM2.5 concentrations calculated by Cressman interpolation lose the capability of estimating detail of distributions. The kriging interpolation presents an unsmooth pattern with a larger MAE than the OPF method.
The OPF method is applied to practical experiments PE2 and PE3 (using the results of 17 November 2017 and 3 October 2017). PE2 represents cases with heavy pollution, whereas PE3 represents cases with slight pollution. In both PE2 and PE3, fitting orders are the same as those in PE1 (s = 6, k = 6). Figure 11 demonstrates the results of PE2 and estimations of PE3 are shown in Figure 12. In PE2, spatially averaged MAEs of OPF, kriging and Cressman interpolations are 9.66 11.76 and 12.39 μg·m−3, respectively. In PE3, the corresponding MAEs are 2.90, 3.49 and 3.69 μg·m−3, respectively. In PE2, the errors of the OPF method are lower than those of kriging interpolation in most of the cross-validation processes except for CV4 and CV5. The errors of Cressman interpolation are larger than those of the OPF method in all of the cross-validation processes. In PE3, the averaged MAEs of the OPF method are also smaller than those of kriging and Cressman interpolations in most of the cross-validation processes, which is consistent with results of PE1 and PE2.
Figure 13 shows the comparison of estimated and observed PM2.5 concentrations for PE1, PE2 and PE3. The dotted, solid and dashed lines indicate the ratio of observed concentrations to estimated values equal to 2, 1, and ½, respectively. The results of the OPF method suggest that the estimated concentrations of PM2.5 agree well with the observations, with values concentrating near the solid line (the line indicating observations equal to estimations). Meanwhile, the interpolated results of kriging and Cressman methods are dispersedly distributed around that solid line, which involves relatively large deviations between the observed and estimated values.
Figure 14 demonstrates the absolute errors of cities in data set A for practical experiments. In PE1, errors from most of the cities are less than 20 μg·m−3 with the OPF method, whereas errors of kriging and Cressman interpolations are relatively larger, exceeding 30 μg·m−3 in a small number of cities. Similar to PE1, cities with errors larger than 30 μg·m−3 appear when kriging or Cressman interpolation is applied in PE2. In PE3, the large majority of errors with the OPF method are within 5 μg·m−3. Estimation by the OPF method is more accurate for the number of cities with errors less than 5 μg·m−3, which are apparently more than those with kriging or Cressman interpolation.
The RMSEs of practical experiments are shown in Figure 15. In PE1, the RMSEs of the OPF method are smaller than those of kriging and Cressman interpolations in all the cross-validation processes. In PE2, the results calculated by the OPF method are also more accurate in most of the cross-validation processes except for CV4 and CV7. The variation of RMSEs in PE3 is consistent with those in PE1 and PE2, indicating that the distribution of PM2.5 calculated by the OPF method is closer to the observation than those by calculated by kriging or Cressman interpolation.

4. Conclusions

In this paper, an orthogonal polynomial fitting method using Chebyshev basis functions is presented to estimate distributions of PM2.5 concentrations in the central and southern regions of China. Applying the OPF method requires three major steps. First, polynomial coefficients are computed based on monitoring data. Secondly, appropriate polynomial orders need to be selected. Finally, spatial distributions of PM2.5 fields are calculated. In the idealized twin experiments, IE1 is designed in accordance with the most common distribution of PM2.5 concentrations in China, whereas IE2 represents a common air contamination pattern in spring and autumn. Distributions of PE1 and PE2 are successfully recovered by the OPF method with accuracy higher than kriging and Cressman interpolations. Therefore, it is validated that the OPF method is reliable in estimating the spatial distribution of PM2.5 concentrations. In the practical experiments, PE1 suggests that errors are the smallest when fitting orders of x and y directions are both set to 6 for the OPF method. The amount of data is insufficient to support high-order OPF. The results of the practical experiments also indicate that the OPF method can reconstruct the PM2.5 fields accurately. The estimated PM2.5 concentrations computed by the OPF method are closer to the observations. The spatially averaged MAEs of the OPF method are lower than those of kriging or Cressman interpolations in all the practical experiments. The RMSEs of the OPF method in the practical experiments are the smallest in most of the cross-validation processes investigated among those methods.

Author Contributions

The idea for the study was provided by X.L. The design of the experiment method and the writing of the paper were accomplished by B.L. The analysis of the results was conducted by Y.L. The review and editing of the paper were carried out by X.W. The data were collected by Q.F.

Funding

This work is funded by the National Key Research and Development Plan of China (2016YFC1402304 and 2016YFC1402705), the Scientific Research Foundation of Shandong University of Science and Technology for Recruited Talents (2017RCJJ052) and the Open Fund of the Key Laboratory of Ocean Circulation and Waves, Chinese Academy of Sciences (KLOCW1807).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. OPF Method

In our study, the OPF method is based on Chebyshev basis functions. Let x i be the points on the x axis ( i = 1 , 2 , , n ). The Chebyshev polynomials are:
Φ 0 ( x i ) = 1 , Φ 1 ( x i ) = x i P 1 , 0 Φ 0 ( x i ) , Φ 2 ( x i ) = x i 2 P 2 , 1 Φ 1 ( x i ) P 2 , 0 Φ 0 ( x i ) , Φ k ( x i ) = x i k P k , k 1 Φ k 1 ( x i ) P k , k 2 Φ k 2 ( x i ) P k , 0 Φ 0 ( x i ) ,
where k is the order of polynomials in the x direction, ϕ 0 x i ,   ϕ 1 x i , ,   ϕ k x i are Chebyshev polynomials with orders from zero to k and P is the coefficient of Chebyshev polynomials, calculated as:
P k , l = i = 1 n x i k Φ l ( x i ) i = 1 n Φ l 2 ( x i ) ,
where P k , l is the l-th coefficient of the k-order polynomial. The distribution of PM2.5 concentrations Z x i , y j can be fitted as:
Z ˜ ( x i , y i ) = k = 0 K 0 s = 0 S 0 A k , s Φ k ( x i ) ς s ( y j ) ,
where i = 1,2,…,N, j = 1,2,…,M, k and s are the orders of polynomials for the x and y directions respectively, K 0 and S 0 are the corresponding cutoff orders. A k , s are expansion coefficients. ϕ k x i is the k-order Chebyshev orthogonal polynomial in the x direction, and ς s ( y j ) is the s-order Chebyshev orthogonal polynomial in the y direction. Z ˜ x i , y j is the fitted value for function Z . The sum of squares of the error ε is:
ε ( A 0 , 0 , A 1 , 0 , , A k 0 s 0 ) = i = 1 N j = 1 M [ Z k = 0 K 0 s = 0 S 0 A k , s Φ k ( x i ) ς s ( y j ) ] 2 .
According to the Rolle intermediate value theorem, we obtain:
δ ε δ A m , n = 2 i = 1 N j = 1 M [ Z i k = 0 K 0 s = 0 S 0 A k , s Φ k ( x i ) ς s ( y j ) ] Φ m ( x i ) ς n ( y j ) = 0 .
Owing to the orthogonality of Chebyshev polynomials:
i = 1 N Φ k ( x i ) Φ m ( x i ) = 0 ,   k m , j = 1 M ς s ( y j ) ς n ( y j ) = 0 , s n ,
where m and n are the orders of polynomials for the x and y directions, respectively. Combining Equation (A5) with Equation (A6), the expansion coefficients can be solved as:
A k , s = i = 1 N j = 1 M Z ( x i , y j ) Φ k ( x i ) ς s ( y j ) i = 1 N Φ k 2 ( x i ) j = 1 M ς s 2 ( y j ) .

References

  1. Mansfield, T.J.; Rodriguez, D.A.; Huegy, J.; Gibson, J.M. The Effects of Urban Form on Ambient Air Pollution and Public Health Risk: A Case Study in Raleigh, North Carolina. Risk Anal. 2015, 35, 901–918. [Google Scholar] [CrossRef] [PubMed]
  2. Yu, T.Y. Characterization of ambient PM2.5 concentrations. Atmos. Environ. 2010, 44, 2902–2912. [Google Scholar] [CrossRef]
  3. Kioumourtzoglou, M.A.; Schwartz, J.D.; Weisskopf, M.G.; Melly, S.J.; Wang, Y.; Dominici, F.; Zanobetti, A. Long-term PM2.5 exposure and neurological hospital admissions in the Northeastern United States. Environ. Health Perspect. 2016, 124, 23–29. [Google Scholar] [CrossRef] [PubMed]
  4. Giannadaki, D.; Lelieveld, J.; Pozzer, A. Implementing the U.S. air quality standard for PM2.5 worldwide can prevent millions of premature deaths per year. Environ. Health 2016, 15, 88–98. [Google Scholar] [CrossRef]
  5. Song, C.; He, J.; Wu, L.; Jin, T.; Chen, X.; Li, R.; Ren, P.; Zhang, L.; Mao, H. Heath burden attributable to ambient PM2.5 in China. Environ. Pollut. 2017, 223, 575–586. [Google Scholar] [CrossRef] [PubMed]
  6. Puett, R.C.; Hart, J.E.; Yanosky, J.D.; Paciorek, C.; Schwartz, J.; Suh, H.; Speizer, F.E.; Laden, F. Chronic Fine and Coarse Particulate Exposure, Mortality, and Coronary Heart Disease in the Nurses’ Health Study. Environ. Health Perspect. 2009, 117, 1697–1701. [Google Scholar] [CrossRef]
  7. Stafoggia, M.; Cesaroni, G.; Peters, A.; Andersen, Z.J.; Badaloni, C.; Beelen, R.; Caracciolo, B.; Cyrys, J.; de Faire, U.; de Hoogh, K.; et al. Long-term exposure to ambient air pollution and incidence of cerebrovascular events: Results from 11 European cohorts within the ESCAPE project. Environ. Health Perspect. 2014, 122, 919–925. [Google Scholar] [CrossRef]
  8. Physick, W.L.; Cope, M.E.; Lee, S.; Hurley, P.J. An approach for estimating exposure to ambient concentrations. J. Expo. Sci. Environ. Epidemiol. 2007, 17, 76–83. [Google Scholar] [CrossRef]
  9. Wang, D.S.; Li, N.; Shen, Y.L.; Lv, X.Q. The parameters estimation for a PM2.5 transport model with the adjoint method. Adv. Meteorol. 2016, 2016, 9873815. [Google Scholar] [CrossRef]
  10. Oliver, M.A.; Webster, R. Kriging: A method of interpolation for geographical information systems. Int. J. Geogr. Inform. Syst. 1990, 4, 313–332. [Google Scholar] [CrossRef]
  11. Hu, J.; Wang, Y.; Ying, Q.; Zhang, H. Spatial and temporal variability of PM2.5 and PM10 over the North China Plain and the Yangtze River Delta, China. Atmos. Environ. 2014, 95, 598–609. [Google Scholar] [CrossRef]
  12. Lee, S.J.; Serre, M.L.; Donkelaar, A.V.; Martin, R.V.; Burnett, R.T.; Jerrett, M. Comparison of geostatistical interpolation and remote sensing techniques for estimating long-term exposure to ambient PM2.5 concentrations across the continental United States. Environ. Health Perspect. 2012, 120, 1727–1732. [Google Scholar] [CrossRef]
  13. Sampson, P.D.; Richards, M.; Szpiro, A.A.; Bergen, S.; Sheppard, L.; Larson, T.V.; Kaufman, J.D.J.A.E. A regionalized national universal kriging model using Partial Least Squares regression for estimating annual PM2.5 concentrations in epidemiology. Atmos. Environ. 2013, 75, 383–392. [Google Scholar] [CrossRef] [PubMed]
  14. Xu, M.J.; Fu, K.; Lv, X.Q. Application of adjoint data assimilation method to atmospheric aerosol transport problems. Adv. Math. Phys. 2017, 2017, 1–14. [Google Scholar] [CrossRef]
  15. Pan, H.D.; Guo, Z.; Lv, X.Q. Inversion of Tidal Open Boundary Conditions of the M2 Constituent in the Bohai and Yellow Seas. J. Atmos. Ocean. Technol. 2017, 34, 1661–1672. [Google Scholar] [CrossRef]
  16. Guo, Z.; Pan, H.D.; Fan, W.; Lv, X.Q. Application of surface spline interpolation in inversion of bottom friction coefficients. J. Atmos. Ocean. Technol. 2017, 34, 2021–2028. [Google Scholar] [CrossRef]
  17. Jiang, D.; Chen, H.B.; Jin, G.Z.; Lv, X.Q. Estimating smoothly varying open boundary conditions for a 3D internal tidal model with an improved independent point scheme. J. Atmos. Ocean. Technol. 2018, 35, 1299–1311. [Google Scholar] [CrossRef]
  18. Ginsbourger, D. Fast Parallel Kriging-Based Stepwise Uncertainty Reduction with Application to the Identification of an Excursion Set. Technometrics 2014, 56, 455–465. [Google Scholar]
  19. Xie, Y.; Chen, T.B.; Lei, M.; Yang, J.; Guo, Q.; Song, B.; Zhou, X.Y. Spatial distribution of soil heavy metal pollution estimated by different interpolation methods: Accuracy and uncertainty analysis. Chemosphere 2011, 82, 468–476. [Google Scholar] [CrossRef]
  20. Huang, C.C.; Zheng, X.G.; Tait, A.; Dai, Y.J.; Chen, Z.Q.; Li, T.; Wang, Z.L. On using smoothing spline and residual correction to fuse rain gauge observations and remote sensing data. J. Hydrol. 2014, 508, 410–417. [Google Scholar] [CrossRef]
  21. Liu, Y.Z.; Yu, J.; Shen, Y.L.; Lv, X.Q. A modified interpolation method for surface total nitrogen in the Bohai Sea. J. Atmos. Ocean. Technol. 2016, 33, 1509–1517. [Google Scholar] [CrossRef]
  22. Gan, X.S.; Tang, X.Q.; Gao, H.L. Research on Nonlinear Approximation Model of Radial Basis Function Neural Network Trained Using Artificial Fish Swarm Algorithm with Adaptive Adjustment. Appl. Mech. Mater. 2015, 713, 1855–1858. [Google Scholar] [CrossRef]
  23. Mukundan, R.; Raveendran, P.; Jassim, W.A. New orthogonal polynomials for speech signal and image processing. IET Signal Process. 2012, 6, 713–723. [Google Scholar]
  24. Dumitriu, I.; Pal, S. Sparse regular random graphs: Spectral density and eigenvectors. Ann. Probab. 2012, 40, 2197–2235. [Google Scholar] [CrossRef]
  25. Ismail, M.E.H.; Kasraoui, A.; Zeng, J. Separation of variables and combinatorics of linearization coefficients of orthogonal polynomials. J. Comb. Theory 2013, 120, 561–599. [Google Scholar] [CrossRef]
  26. Junkins, J.L.; Younes, A.B.; Bai, X. Orthogonal polynomial approximation in higher dimensions: Applications in astrodynamics. Adv. Astronaut. Sci. 2013, 147, 531–594. [Google Scholar]
  27. Nolin, A.W. Recent advances in remote sensing of seasonal snow. J. Glaciol. 2017, 56, 1141–1150. [Google Scholar] [CrossRef]
  28. Kong, C.; Fang, Y.H.; Lan, T.G.; Xiong, W.; Dong, D.M.; Li, D.C. A boundary prolongation method based on orthogonal polynomial fitting in wavelet transform. Chin. J. Quantum Electron. 2008, 25, 25–28. [Google Scholar]
  29. Delvaux, S. Average characteristic polynomials for multiple orthogonal polynomial ensembles. J. Approx. Theory 2010, 162, 1033–1067. [Google Scholar] [CrossRef]
  30. Webster, M.A.; Rigor, I.G.; Nghiem, S.V.; Kurtz, N.T.; Farrell, S.L.; Perovich, D.K.; Sturm, M. Interdecadal changes in snow depth on Arctic sea ice. J. Geophys. Res. Ocean 2014, 119, 5395–5406. [Google Scholar] [CrossRef]
  31. Chai, T.; Draxler, R.R. Root mean square error (RMSE) or mean absolute error (MAE)?—Arguments against avoiding RMSE in the literature. Geosci. Model Dev. 2014, 7, 1247–1250. [Google Scholar] [CrossRef]
  32. Willmott, C.J.; Matsuura, K.; Robeson, S.M. Ambiguities inherent in sums-of-squares-based error statistics. Atmos. Environ. 2009, 43, 749–752. [Google Scholar] [CrossRef]
  33. Gramatica, P. Principles of QSAR models validation: Internal and external. QSAR Comb. Sci. 2010, 26, 694–701. [Google Scholar] [CrossRef]
Figure 1. Distribution of monitoring cities.
Figure 1. Distribution of monitoring cities.
Ijerph 16 01418 g001
Figure 2. Prescribed distributions of PM2.5 concentrations for (a) idealized experiment 1 (IE1) and (b) idealized experiment 2 (IE2).
Figure 2. Prescribed distributions of PM2.5 concentrations for (a) idealized experiment 1 (IE1) and (b) idealized experiment 2 (IE2).
Ijerph 16 01418 g002
Figure 3. Interpolation results estimate by (a) orthogonal polynomial fitting (OPF), (b) kriging and (c) Cressman methods in IE1.
Figure 3. Interpolation results estimate by (a) orthogonal polynomial fitting (OPF), (b) kriging and (c) Cressman methods in IE1.
Ijerph 16 01418 g003
Figure 4. Spatial distributions of absolute errors of PM2.5 in IE1 with (a) OPF, (b) kriging and (c) Cressman methods.
Figure 4. Spatial distributions of absolute errors of PM2.5 in IE1 with (a) OPF, (b) kriging and (c) Cressman methods.
Ijerph 16 01418 g004
Figure 5. Interpolation results estimated by (a) OPF, (b) kriging and (c) Cressman methods in IE2.
Figure 5. Interpolation results estimated by (a) OPF, (b) kriging and (c) Cressman methods in IE2.
Ijerph 16 01418 g005
Figure 6. Spatial distributions of absolute errors of PM2.5 in IE2 with (a) OPF, (b) kriging and (c) Cressman methods.
Figure 6. Spatial distributions of absolute errors of PM2.5 in IE2 with (a) OPF, (b) kriging and (c) Cressman methods.
Ijerph 16 01418 g006
Figure 7. Components of absolute errors in (a) IE1 and (b) IE2. Blue: the OPF method, black: kriging interpolation and red: Cressman interpolation.
Figure 7. Components of absolute errors in (a) IE1 and (b) IE2. Blue: the OPF method, black: kriging interpolation and red: Cressman interpolation.
Ijerph 16 01418 g007
Figure 8. The root mean square error (RMSE) in idealized twin experiments. Blue: the OPF method, black: kriging interpolation and red: Cressman interpolation.
Figure 8. The root mean square error (RMSE) in idealized twin experiments. Blue: the OPF method, black: kriging interpolation and red: Cressman interpolation.
Ijerph 16 01418 g008
Figure 9. The mean absolute error (MAE) variation (μg·m−3) with orders in practical experiment 1 (PE1) (the marker represents the minimum MAE when s and k are both 6).
Figure 9. The mean absolute error (MAE) variation (μg·m−3) with orders in practical experiment 1 (PE1) (the marker represents the minimum MAE when s and k are both 6).
Ijerph 16 01418 g009
Figure 10. PM2.5 concentrations of PE1 calculated by (a) OPF, (b) kriging, (c) Cressman methods and (d) averaged MAEs for the corresponding methods (OPF method: blue dashed line, kriging interpolation: black solid line and Cressman interpolation: red dash-dot line).
Figure 10. PM2.5 concentrations of PE1 calculated by (a) OPF, (b) kriging, (c) Cressman methods and (d) averaged MAEs for the corresponding methods (OPF method: blue dashed line, kriging interpolation: black solid line and Cressman interpolation: red dash-dot line).
Ijerph 16 01418 g010
Figure 11. PM2.5 concentrations of practical experiment 2 (PE2) calculated by (a) OPF, (b) kriging and (c) Cressman methods and (d) averaged MAEs for the corresponding methods (OPF method: blue dashed line, kriging interpolation: black solid line and Cressman interpolation: red dash-dot line).
Figure 11. PM2.5 concentrations of practical experiment 2 (PE2) calculated by (a) OPF, (b) kriging and (c) Cressman methods and (d) averaged MAEs for the corresponding methods (OPF method: blue dashed line, kriging interpolation: black solid line and Cressman interpolation: red dash-dot line).
Ijerph 16 01418 g011
Figure 12. PM2.5 concentrations of practical experiment 3 (PE3) calculated by (a) OPF, (b) kriging and (c) Cressman methods and (d) averaged MAEs for the corresponding methods (OPF method: blue dashed line, kriging interpolation: black solid line and Cressman interpolation: red dash-dot line).
Figure 12. PM2.5 concentrations of practical experiment 3 (PE3) calculated by (a) OPF, (b) kriging and (c) Cressman methods and (d) averaged MAEs for the corresponding methods (OPF method: blue dashed line, kriging interpolation: black solid line and Cressman interpolation: red dash-dot line).
Ijerph 16 01418 g012
Figure 13. Comparison of estimated and observed PM2.5 concentrations in practical experiments (PE1: (a) to (c), PE2: (d) to (f), PE3: (g) to (i). OPF method: blue dots, kriging interpolation: black dots and Cressman interpolation: red dots. The 1:2, 1:1 and 2:1 lines are shown for reference.).
Figure 13. Comparison of estimated and observed PM2.5 concentrations in practical experiments (PE1: (a) to (c), PE2: (d) to (f), PE3: (g) to (i). OPF method: blue dots, kriging interpolation: black dots and Cressman interpolation: red dots. The 1:2, 1:1 and 2:1 lines are shown for reference.).
Ijerph 16 01418 g013
Figure 14. Absolute errors of cities in data set A for (a) PE1, (b) PE2 and (c) PE3 (OPF method: blue dots, kriging interpolation: black dots and Cressman interpolation: red dots).
Figure 14. Absolute errors of cities in data set A for (a) PE1, (b) PE2 and (c) PE3 (OPF method: blue dots, kriging interpolation: black dots and Cressman interpolation: red dots).
Ijerph 16 01418 g014
Figure 15. RMSEs in (a) PE1, (b) PE2 and (c) PE3 (OPF method: blue dashed line, kriging interpolation: black solid line and Cressman interpolation: red dash-dot line).
Figure 15. RMSEs in (a) PE1, (b) PE2 and (c) PE3 (OPF method: blue dashed line, kriging interpolation: black solid line and Cressman interpolation: red dash-dot line).
Ijerph 16 01418 g015

Share and Cite

MDPI and ACS Style

Li, B.; Liu, Y.; Wang, X.; Fu, Q.; Lv, X. Application of the Orthogonal Polynomial Fitting Method in Estimating PM2.5 Concentrations in Central and Southern Regions of China. Int. J. Environ. Res. Public Health 2019, 16, 1418. https://doi.org/10.3390/ijerph16081418

AMA Style

Li B, Liu Y, Wang X, Fu Q, Lv X. Application of the Orthogonal Polynomial Fitting Method in Estimating PM2.5 Concentrations in Central and Southern Regions of China. International Journal of Environmental Research and Public Health. 2019; 16(8):1418. https://doi.org/10.3390/ijerph16081418

Chicago/Turabian Style

Li, Bingtian, Yongzhi Liu, Xinyi Wang, Qingjun Fu, and Xianqing Lv. 2019. "Application of the Orthogonal Polynomial Fitting Method in Estimating PM2.5 Concentrations in Central and Southern Regions of China" International Journal of Environmental Research and Public Health 16, no. 8: 1418. https://doi.org/10.3390/ijerph16081418

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop