Abstract
Geostatistical interpolation methods, sometimes referred to as kriging, have been proven effective and efficient for the estimation of target quantity at ungauged sites. The merit of the kriging approach relies heavily on the semivariograms in which the parametric functions are prevalently used. In this work, we explore the semiparametric semivariogram where no close-form semivariogram is required. By additionally enforcing the monotonicity condition in order to suppress the presence of spurious oscillation, a scaling of the nodes of the semiparametric kriging is proposed. To this end, the solar radiation estimates across extensive but unmeasured regions in Thailand using three different semivariogram models are undertaken. A cross validation analysis is carried out in order to justify the performance of each approach. The best results are achieved by the semiparametric model with an improvement of around 7–13% compared to those obtained from the parametric semivariograms.
1. Introduction
1.1. Motivation and Related Work
Spatial interpolation methods are essential tools for manipulating available but inadequate data to infill missing information. A wide spectrum of spatial interpolation approaches applied to several disciplines can be found in the literature [1,2,3]. Although most techniques are somewhat similar in the sense that estimations are calculated as the weighted average of known locations, they can be classified into two categories according to their mathematical underlying concepts; deterministic and stochastic methods. Prominent examples of deterministic approaches include spline functions [4], inverse distant weight [5], multiple linear regression [6]. Whilst those of stochastic-based methods which are usually regarded as kriging are ordinary kriging, universal kriging and regression kriging. Several studies attempted to compare between these two interpolation classes [7,8]. In general, it appears that if we take accuracy as a prime concern, kriging interpolation is a preferable option for reliable estimation. This stochastic-based process model has been applied to numerous types of climate variables including temperature [9], rainfall [10], soil properties [11] and solar radiation [12,13].
Kriging is a geostatistical method where an attribute value from each location is interpreted as a single realization of some random field [14,15]. Under this assumption, statistical properties such as unbiased estimator and minimum variance can be used for the estimation of the attribute at the non-observed locations. An estimate at an unknown location is a combination of trend and residual components. There have been various types of kriging approaches depending upon their treatment of both components. The trend, also called the drift term, presents a deterministic structure of the target variable which can be expressed in several ways; either simply an unknown constant or a function of explanatory variables. On the contrary, the residual term involves the weight determination which can be quantified via the semivariogram from input point data. It, therefore, becomes important to choose an appropriate model of semivariogram that correctly presents a spatial dependence structure of the data. The customary procedure is by initially constructing an empirical semivariogram and later imposing a prior assumption regarding semivariogram shape: for instance, exponential, Gaussian, and spherical models. The inferred kriging coefficients are subsequently found by means of any curve-fitting techniques where weighted least-squares method [16] and maximum likelihood estimation [17] are two commonly used. The effort has also been made in order to find other techniques to facilitate the identification of semivariogram parameters. This includes the utilization of heuristic optimization tools, particularly the genetic algorithm (GA). For example, Yeh et al. [18] applied factorial kriging along with GA for the groundwater monitoring network design. Zhang et al. [19] couples GA with ordinary kriging with the aim to minimize the objective function so that the optimal pole structure of switched reluctance motor can be gained.
Although the prescribed semivariogram is relatively convenient, one tends to employ a dissimilar parametric class of functions concerning semivariogram structures for different time and regions. Another complication of such a model is that if the empirical semivariogram cannot be described by any classes of parametric functions, this can ultimately lead to a severely biased prediction. The nonparametric semivariogram is an alternative approach introduced by Shapiro and Botha [20]. Based on the well known Bochner’s theorem [21,22], the spectral representation of any positive definite function can be adopted for covariance function leading to a nonparametric form of semivariogram. This method allows us to estimate the semivariogram flexibly because a fixed functional form of semivariogram needs not to be acquired. However, it is acknowledged that a choice of nodes should be carefully selected otherwise this can give rise to spurious oscillations. Gorsich and Genton [23,24] suggested that the nodes should be roots of the Bessel functions. The series expansion of the nonparametric covariogram estimator accordingly has a similar form to Fourier-Bessel expansion which ultimately guarantees covariogram estimator being positive thoroughly. Later, based on the nonparametric structure and some certain type of parametric models, Carmack et al. [25] developed a semiparametric estimator which is an extension of nonparametric semivariograms. This is principally done by introducing a tuning parameter in the scaling of lag distance. Nevertheless, some theoretical properties, for example, monotonicity or concavity, were not under consideration in which can potentially cause a bias in the estimate.
1.2. Contribution
The objectives of this paper are outlined as follows:
- The node selection criterion in the semiparametric semivariogram is proposed in accordance with the enforced monotonicity constraint so that the estimator fulfills the additional theoretical properties as well as eliminate spurious fluctuation due to node misspecification.
- A performance of semiparametric semivariogram is conducted and compared with parametric models in the case of the estimation of the spatial distribution of the monthly average daily solar radiation in Thailand.
- The GA is employed as a means to automate the search for optimal parameter values via the objective function.
1.3. Article Structure
The rest of the paper is organized as follows. Section 2 provides a brief description of seimvariogram estimators, both parametric and nonparametric models, and corresponding ordinary kriging method. A mathematical framework for the semiparametric semivariogram and our proposed node selection criterion are presented in Section 3. In Section 4, a case study of Thailand’s solar radiation is adopted to illustrate and compare the performance of three distinct semivariograms. The conclusion and discussion are given in Section 5.
2. Theoretical Background
2.1. Ordinary Kriging
Due to insufficient information about the variable of interest, kriging exploits the probabilistic concept where the uncertainty should be introduced in the description of the target quantity. Suppose that is a spatial random process where D is the spatial domain and . The random variable can be writte as
where is the trend component of , and is the residual component with zero mean and stationary covariance.
Let be a collection of samples at locations . The kriging estimate, , at any location can be expressed as a linear combination of available n observations
where indicates the kriging weight assigned to . Particularly, the ordinary kriging assumes the random process to be intrinsically stationary. This leads to the expected difference of the samples being zero and the variance of the estimation can be formulated in terms of the semivariogram
To evaluate the minimum variance of estimation (3) subject to the unbiasedness constraint [26], we apply the Lagrange multiplier method to this optimization problem which gives
where is the Lagrange multiplier. Our goal is to determine the kriging weights in the Equation (2) which can be accomplished by using the semivariogram model described in the following subsection.
2.2. Semivariogram
The semivariogram quantifies the covariance structures of measured sample points with distance. The widely used semivariogram estimate, , was put forth by [14]
where denotes the number of distinct pairs at a separation lag vector . Henceforward, we assume the isotropy thereby leading to the semivariogram estimator, being replaced by where h is the Euclidean norm of . Based on this resulting empirical semivariogram, the continuous spatial variability across the domain can be assumedly represented by smooth functions which is known as parametric semivariograms. Various prescribed functions including exponential, spherical and Gaussian functions have been exploited [27,28]. In this work, we employ two parametric krigings for the two dimensional space. One is the exponential model which is explicitly represented by
and the spherical model
where . The parameter represents the nugget value induced by the spatial error when the distance smaller than the shortest imposed lag distance. Whereas the variance of the process is denoted by and a combination between and is referred to as sill. The parameter is the range in which it indicates a distance from the origin to the point of sill achieved. A choice of these parameters undoubtedly has a considerable impact upon the accuracy of kriging estimators.
As opposed to the parametric semivariogram, a specification of the nonparametric semivariogram model is not required, this therefore provides a flexibility to compute estimate at any location. Under the isotropic and second-order stationary assumptions, the nonparametric semivariogram estimate can be written as a series from Yaglom’s representation of Bochner’s theorem [21,22]. In two-dimensional space, the approximated semivariogram can be expressed as
where is a vector of nonnegative coefficients, the scalars , for , are the jump points or nodes and is the Bessel function of the first kind of order zero. A goal is to find a vector so that it can minimize the weighted sum squared errors (WSSE) [29]
where L is the number of discrete lags. The weight is defined as
in which this weight manifestation suggests that the short-lag spatial correlations induces strong weights. The Equation (9) is subject the following constraint
where the nonnegative value b corresponds to the nugget effect.
2.3. Genetic Algorithm
Genetic algorithm primarily introduced by John Holland [30] is a class of stochastic optimization technique wherein its underlying principles are inspired by the biological evolution of living organisms. Besides its main contribution to the optimization and search problems, the method is often used as a tool for parameter identification. The process is carried out by a population of chromosomes which represents a collection of solutions to a problem. The chromosomes at each generation are assigned fitness values which indicate how likely they can survive. To produce a population of chromosomes for the subsequent generation with a higher probability of achieving optimal solutions, there are three genetic operations involved; selection, crossover and mutation. The process of chromosome evolution based on such operations is repeated until acceptable criteria are satisfied. The generic procedure of GA can be illustrated in Figure 1.
Figure 1.
Flowchart of genetic algorithm.
3. Semiparametric Semivariogram and Node Selection Criterion
On the other hand, by considering the semivariogram structures of both parametric and nonparametric models, Carmack et al. [25] purposed the semiparametric semivariogram where h in the nonparametric model is replaced by , for This gives rise to the semivariogram estimate
An imperative part of the nonparametric semivariogram is to choose the right choice of node selection, in the Equation (8). Shapiro and Botha [20] and Cherry [31] utilized improptu means by using regularly spaced nodes so that for where is the positive number which can be chosen. However, it appears that equispaced nodes can lead to spurious oscillations due to the fact that estimators are not guaranteed to be positive definite in continuum [24]. Another ubiquitous suggestion for the node selection was made by Gorsich and Genton ([23,24]) who proposed that the nodes should be the roots of Bessel functions. This node representation results in the coefficients in Equation (8) being nonnegative as well as the convergence being ensured. Whilst Carmack et al. adopted the concept of the first root of the Bessel function of the first kind of order zero, to define the nodes in the semiparametric kriging so that , for .
Despite the fact that, in practice, semivariogram structure can possess the hole effect whereby fitted semivariogram exhibits sinusoidal-wave form [32], we here restrict our study to a generally intuitive perception in which objects that are physically close to each other, should have a stronger correlation than those farther away. Given this assumption, the fitted semiparametric semivariogram might not be necessarily satisfied in some certain situations as illustrated in Figure 2. An alternative way for the node scaling is thus proposed by considering the monotonicity requirement for the fitted semivariogram. This precondition implies the first derivative of the estimated semivariogram (12) should be nonnegative, that is
for , where is the Bessel function of the first kind of order one. By assuming the node scaling in an adhoc manner
for , where is the first root of , and are the first and final separation lag distances respectively. Since and , it consequently gives
This leads to
which ultimately results in
for and . By defining as in Equation (14), the Equation (13) can be written in terms of a series expansion:
Figure 2.
An illustration when the monotonicity constraint is not imposed in the semivariogram.
By this node restriction, the monotonicity condition is proven to be satisfied and this can ultimately prevent noisy behavior in the estimation.
4. A Case Study: An Estimation of Solar Radiation Distribution in Thailand
Renewable energy deployment has grown rapidly in the power sector in the last decade. In an attempt to maintain energy security and environmental sustainability over the coming years, the Thai government sets up a blueprint for 20-year Thailand’s energy plan which is known as the Alternative Energy Development Plan-AEDP (2015–2036). By 2036, renewable energies are planned to account for 30% of net power generation capacity. In particular, giving its location in the equatorial regions, solar energy power has been an undoubtedly favorable renewable source in Thailand due to a large amount of incoming solar radiation all over the country throughout the year. Solar energy production is targeted to reach around 10 gigawatts or nearly 50% of the total alternative energy power capacity by 2036.
To attain the optimal benefits of such resource, one of the key factors for efficient planning and management is to examine the availability of spatially continuous data over the region of interest. Nonetheless, in practice, direct measurements are typically scarce and derived from point sources that are irregularly distributed. For these reasons, spatial interpolation techniques capable of quantitatively capturing the distribution of solar radiation are of great importance.
4.1. Data Description
The kriging technique with different types of semivariograms previously described has been applied to the estimation of solar radiation in Thailand. The study region is carried out across the whole country which extends over the area around 518,000 km. The latitude ranges from N to N, whereas longitude E to E. In this study, a variation of solar radiation characteristics in Thailand is classified according to seasonal periods, namely the summer season (March–June), rainy season (July–October) and winter season (November–February).
The source of solar radiation is obtained from the website of the department of alternative energy development and efficiency, www.dede.go.th. Instead of using a collection of hourly solar radiation data to justify our models, we use monthly averages of daily solar radiation which took place during the period from January 2015 to December 2015, hereby covering changes in annual radiation. In this dataset, the radiation intensity is within the spectral range 14 to 24 MJ/mday in summer. Whilst those in the rainy and winter seasons are around 8–20 MJ/mday and 11–23 MJ/mday respectively. The number of solar radiation monitoring stations is 37 sites that are sparsely situated across Thailand as shown in Figure 3 and details of station positions are displayed in Appendix A.
Figure 3.
A map of solar radiation monitoring stations in Thailand.
4.2. Accuracy Assessment
Quantitative evaluation of the estimation ability of each approach is performed by using the classical k-fold cross-validation method [33]. The dataset is chronologically divided into 9 folds. On each iteration, about 10% of the data is chosen for the validation, while the remaining data are used for the training. The process is repeated until all folds are picked as a validation set. This implies every individual data is utilized for both training and validating steps.
A combination of these two conventional statistics, the root mean squared error (RMSE) and the mean absolute percentage error (MAPE), are utilised for the evaluation of the quality of forecasting models. They are defined as follows:
where M is the number of validation sites and and are the actual and estimated (kriged) solar radiation values at site respectively. Despite its popularity for the assessment of model performance in climate and environmental studies, having to compute the RMSE value through the square of the discrepancies between observed and interpolated values can lead to significant weights being placed on outliers. On the contrary, the MAPE is a scale-independent measure resulting in an ease of interpretability. Some problems associated with the MAPE can be arisen due to a division by close-to-zero value. It is accordingly required to apply at least two metrics in order to guarantee consistency in evaluation.
In our numerical experiment, three different semivariogram models are compared; exponential model, spherical model and semiparametric model. Following the recommendation by [20], the number m in the Equation (8) is chosen to be equal to . For a fair comparison, all these distinct semivariograms are fitted based on the weighted sum squared errors in Equation (9) and GA is applied to identify the optimal set of parameters. The GA configuration is as follows: a population size of 30, a crossover probability of 0.8, a mutation probability of 0.2 and 5000 maximum generations. A section operator is chosen to be a stochastic uniform with a scattered crossover and Gaussian mutation mechanisms. The computational simulations of all approaches are performed in the MATLAB. Furthermore, when an optimal set of parameters of parametric models are found using GA, the nugget value, , always approaches zero. To simplify the time-intensive computation, we thus set the value of the nugget being zero for all semivariograms.
To evaluate the overall performances of different kriging approaches for each month, the MAPE and RMSE values obtained from nine folds are averaged and their corresponding standard deviation is also computed. Table 1 shows the mean and standard deviation of MAPE derived from three different semivariograms. According to the MAPE index, all approaches provide excellent performances as the mean values of MAPE in almost every month are less than 10% with standard deviation ranging from 1.7067% to 4.9115 %, except in July. Especially, the semiparametric semivariogram consistently outperforms both parametric models with a relative improvement in MAPE ranging from around 2% to 28 %. The maximum improvement can be marked in March with MAPE values of 6.0779% for the semiparametric model and 7.6585% and 8.4843% for exponential and spherical models respectively.
Table 1.
The mean and standard deviation of mean absolute percentage error (MAPE) of three semivariograms.
As presented in Table 2, similar patterns can be observed for the RMSE values where the semiparametric model exhibits lower RMSE values than other methods for almost every month. This can also be elucidated by the average values of the means in which both parametric semivariograms present higher average values of every month being above 2 MJ/mday whereas that of the semiparametric model is 1.8700 MJ/mday.
Table 2.
The mean and standard deviation of root mean squared error (RMSE) of three semivariograms.
Under different seasonal scenarios, relatively low values of the mean and standard deviation of both RMSE and MAPE for all models are produced throughout the summer months. In particular, the lowest mean values of MAPE ranging from 3.9667% to 4.2075% are acquired in April with corresponding standard deviation varying from 1.7067 % to 2.0254 %. This might due to the fact that considerable solar radiation together with high frequencies of the apparent sky are prominent during the dry season, giving rise to low variation in solar radiation across the country. In contrast with the wet season, especially in July when the maximum values of mean error and standard deviation are obtained, the variability between the wet and dry areas throughout the country becomes distinct due to the southwest monsoon together with Inter-Tropical Convergence Zone (ITCZ). This results in significant uncertainty in measurement as well as high fluctuation of solar radiation.
Figure 4 displays a visual comparison of the estimations of solar radiation distribution in March, July and November in Fold 4. The reason for selecting these three months is because each of them represents an individual season in Thailand. Overall, these three semivariograms produce fairly similar distribution patterns of solar radiation with no significant difference among them. A plausible explanation for this might be due to a small local variation of solar radiation, even with a collection of measured data itself. Nonetheless, in Figure 4d–f, little differences can be observed in central Thailand where a relatively higher intensity of solar radiation is predicted by both parametric semivariograms. This similar tendency is also exhibited in November in which lower values of the monthly average of daily solar radiation estimated by exponential and spherical models are found in northern Thailand.
Figure 4.
The maps of the predicted montly averages of daily solar radiation for March, July and November using exponential model, spherical model and semiparametric model.
Figure 5 illustrates a comparison regarding the fitted curves generated by each method. Broadly speaking, both exponential and semiparametric curves exhibit somewhat similar trends of behavior for these three selected months. Whereas the fitted curves derived from the spherical model tend to have a steep slope. The above-mentioned observation can be clearly seen in November where the estimated values of sill and range of the exponential semivariogram that are produced by GA are around 0.0260 and 10, while those of the spherical semivariogram are 0.0299 and 9.8739 respectively. On the other hand, the optimal value of the semiparametric semivariogram appears to be 0.3431.
Figure 5.
A comparison of three different semivariograms for March, July and November in Fold 4.
It can nevertheless be inconclusive just by examining merely the diagrams attained from each model. Additional investigation is required to assess the model quality. The weighted sum squared errors between empirical and estimated semivariograms are carried out and presented in Table 3. In March and July, the lowest WSSE values are achieved by the semiparametric model, followed by the exponential model while the highest WSSE is obtained from the spherical model.
Table 3.
The weighted sum squared errors of three different semivariograms for March, July and November in Fold 4.
To investigate the effects of judicious parameter on kriging estimates, we have varied the values of in the range of 0.1–0.9 with a spacing of 0.1 (not shown here). It is found that the semiparametric estimates with being greater than 0.5 result in performance deterioration. This agrees well with the best-tuned obtained from GA in which it tends to be smaller than 0.5 thoroughly. Figure 6 depicts the results of the empirical semivariogram and the associated semiparametric fits for being 0.1, 0.5, 0.8 and optimally tuned derived from the GA.
Figure 6.
Analysis of the effects of value in the semiparametric estimators for March, July and November in Fold 4.
In July, the curves obtained from and the best-tuned which is equal to 0.2227 almost coincide with the values of WSSE as shown in Table 4 being 20.0320 and 16.9381, respectively. On the contrary, in November, a suitable choice for is around 0.3–0.5. The values of WSSE obtained from is 25.9583 and that of is 27.4183.
Table 4.
The weighted sum squared error of semiparametric semivariogram with different for March, July and November in Fold 4.
5. Conclusions and Discussion
As having been stated earlier, the crucial factor of the traditional nonparametric kriging is to correctly specify the nodes otherwise the method can suffer from spurious oscillation due to the nature of the basis function. The primary objective of this study is to adopt the semiparametric semivariogram which is a variant of the nonparametric model. Regardless of the fact that non-monotonous semivariograms (hole effect) are applicable in some certain cases, the supplementary monotonicity constraint is imposed so that the node selection criterion is established with the aim to meet the theoretical characteristics of semivariograms as well as to prevent the spurious oscillation. This results in the appropriate node space being formulated in terms of the first root of the Bessel function of the first kind of order one, as shown in Equation (14).
The comparison of exponential, spherical and semiparametric models is demonstrated in the case of solar radiation estimation in Thailand. The GA technique is also coupled to all semivariograms for the purpose of identifying optimal parameters. The estimates derived from the semiparametric model are more accurate than those from the other two parametric semivariograms, with a relative improvement of around 7–13% according to the averages values of MAPE and RMSE. Regarding seasonal distribution, high errors are achieved during the monsoon months. This is associated with big changes in cloudiness in some particular areas which causes large fluctuations in solar radiation and ultimately becomes difficult to estimate. Although all three semivariograms produce almost identical maps of solar radiation for the selected months but, from the accuracy measures point of view, it is shown that the best performance is achieved by the semiparametric model. This might due to the fact that irregular patterns of the data arise under different seasonal climatic regimes, so this viable method allows the flexibility to modeling the semivariogram, rather than the predetermined function. Furthermore, following this study, the emphasis is also put on the effect of the tuning parameter to the semiparametric model. Based on our dataset, the range of fixed values of that results in sufficiently good estimates is restrained to relatively low values (≤0.5) which agree well with the optimally-tuned obtained from the GA.
Author Contributions
All authors designed the research and implemented the numerical experiments and analyzed the results. All authors also wrote the manuscript. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Acknowledgments
This research was supported by Chiang Mai University and the Centre of Excellence in Mathematics, the Commission on Higher Education, Thailand.
Conflicts of Interest
The authors declare no conflict of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| AEDP | Alternative Energy Development Plan |
| GA | Genetic Algorithm |
| WSSE | Weighted Sum Squared Errors |
| RMSE | Root Mean Squared Error |
| MAPE | Mean Absolute Percentage Error |
| ITCZ | Inter Tropical Convergence Zone |
Appendix A
Table A1.
The station coordinates.
Table A1.
The station coordinates.
| Symbol | Station Name | Latitude ( N) | Longitude ( E) |
|---|---|---|---|
| S1 | Bangkok | 13.75 | 100.50 |
| S2 | Kanchanaburi (Meteorological Station) | 14.02 | 99.53 |
| S3 | Kanchanaburi (Thongphaphum) | 14.73 | 98.63 |
| S4 | Lopburi | 14.83 | 100.62 |
| S5 | Nakhon Sawan | 15.67 | 100.12 |
| S6 | Phetchabun | 16.43 | 101.15 |
| S7 | Phitsanulok | 16.78 | 100.27 |
| S8 | Tak | 16.80 | 98.90 |
| S9 | Phrae | 18.06 | 100.06 |
| S10 | Nan | 18.72 | 100.75 |
| S11 | Prachuap Khiri Khan (Nong Phlap) | 12.588 | 99.731 |
| S12 | Doi Inthanon | 18.54 | 98.52 |
| S13 | Doi Inthanon (rada station) | 18.59 | 98.49 |
| S14 | Chiang Mai | 18.83 | 98.88 |
| S15 | Mae Hong Son | 19.48 | 97.95 |
| S16 | Mae Sariang | 18.17 | 97.93 |
| S17 | Chiang Rai | 20.08 | 99.88 |
| S18 | Nakhon Ratchasima | 14.97 | 102.08 |
| S19 | Surin | 14.88 | 103.50 |
| S20 | Khon Kaen | 16.19 | 102.80 |
| S21 | Roi Et | 16.07 | 103.00 |
| S22 | Nong Khai | 17.87 | 102.72 |
| S23 | Nakhon Phanom | 16.97 | 104.73 |
| S24 | Loei | 17.40 | 101.00 |
| S25 | Trat | 11.77 | 102.88 |
| S26 | Chonburi | 13.37 | 100.97 |
| S27 | Prachuap Khiri Khan | 11.83 | 99.83 |
| S28 | Chumphon | 10.40 | 99.18 |
| S29 | Ranong | 9.98 | 98.62 |
| S30 | Surat Thani (Phunphin) | 9.13 | 99.15 |
| S31 | Koh Samui | 9.47 | 100.05 |
| S32 | Phuket | 8.13 | 98.30 |
| S33 | Trang | 7.52 | 99.62 |
| S34 | Songkhla | 6.92 | 100.43 |
| S35 | Narathiwat | 6.40 | 101.82 |
| S36 | Sa Kaeo (Aranyaprathet) | 13.70 | 102.59 |
| S37 | Ubon Ratchathani | 15.28 | 105.14 |
References
- Ding, Q.; Wang, Y.; Zhuang, D. Comparison of the common spatial interpolation methods used to analyze potentially toxic elements surrounding mining regions. J. Environ. Manag. 2018, 212, 23–31. [Google Scholar] [CrossRef]
- Hadi, S.J.; Tombul, M. Comparison of Spatial Interpolation Methods of Precipitation and Temperature Using Multiple Integration Periods. J. Indian Soc. Remote Sens. 2018, 46, 1187–1199. [Google Scholar] [CrossRef]
- Qiao, P.; Li, P.; Cheng, Y.; Wei, W.; Yang, S.; Lei, M.; Chen, T. Comparison of common spatial interpolation methods for analyzing pollutant spatial distributions at contaminated sites. Environ. Geochem. Health 2019, 41, 2709–2730. [Google Scholar] [CrossRef] [PubMed]
- Hutchinson, M.; Booth, T.; McMahon, J.; Nix, H. Estimating monthly mean valuesof daily total solar radiation for Australia. Sol. Energy 1984, 32, 277–290. [Google Scholar] [CrossRef]
- Loghmari, I.; Timoumi, Y.; Messadi, A. Performance comparison of two global solar radiation models for spatial interpolation purposes. Renew. Sustain. Energy Rev. 2018, 82, 837–844. [Google Scholar] [CrossRef]
- Wu, W.; Tang, X.P.; Yang, C.; Guo, N.J.; Liu, H.B. Spatial estimation of monthly mean daily sunshine hours and solar radiation across mainland China. Renew. Energy 2013, 57, 546–553. [Google Scholar] [CrossRef]
- Gong, G.; Mattevada, S.; O’Bryant, S.E. Comparison of the accuracy of kriging and IDW interpolations in estimating groundwater arsenic concentrations in Texas. Environ. Res. 2014, 130, 59–69. [Google Scholar] [CrossRef] [PubMed]
- Coulibaly, M.; Becker, S. Spatial Interpolation of Annual Precipitation in South Africa-Comparison and Evaluation of Methods. Water Int. 2007, 32, 494–502. [Google Scholar] [CrossRef]
- Li, S.; Griffith, D.A.; Shu, H. Temperature prediction based on a space–time regression-kriging model. J. Appl. Stat. 2020, 47, 1168–1190. [Google Scholar] [CrossRef]
- Adhikary, S.K.; Muttil, N.; Yilmaz, A.G. Cokriging for enhanced spatial interpolation of rainfall in two Australian catchments. Hydrol. Process. 2017, 31, 2143–2161. [Google Scholar] [CrossRef]
- Pham, T.G.; Kappas, M.; Huynh, C.V.; Nguyen, L.H.K. Application of Ordinary Kriging and Regression Kriging Method for Soil Properties Mapping in Hilly Region of Central Vietnam. ISPRS Int. J. Geo-Inf. 2019, 8, 147. [Google Scholar] [CrossRef]
- Alsamamra, H.; Ruiz-Arias, J.A.; Pozo-Vázquez, D.; Tovar-Pescador, J. A comparative study of ordinary and residual kriging techniques for mapping global solar radiation over southern Spain. Agric. For. Meteorol. 2009, 149, 1343–1357. [Google Scholar] [CrossRef]
- Ertekin, C.; Evrendilek, F. Spatio-temporal modeling of global solar radiation dynamics as a function of sunshine duration for Turkey. Agric. For. Meteorol. 2007, 145, 36–47. [Google Scholar] [CrossRef]
- Matheron, G. Principles of geostatistics. Econ. Geol. 1963, 58, 1246–1266. [Google Scholar] [CrossRef]
- Montero, J.M.; Fernández-Avilés, G.; Mateu, J. Spatial and Spatio-Temporal Geostatistical Modeling and Kriging; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2015. [Google Scholar] [CrossRef]
- Gotway, C.A.; Ferguson, R.B.; Hergert, G.W.; Peterson, T.A. Comparison of Kriging and Inverse-Distance Methods for Mapping Soil Parameters. Soil Sci. Soc. Am. J. 1996, 60, 1237–1247. [Google Scholar] [CrossRef]
- Lv, Z.; Lu, Z.; Wang, P. A new learning function for Kriging and its applications to solve reliability problems in engineering. Comput. Math. Appl. 2015, 70, 1182–1197. [Google Scholar] [CrossRef]
- Yeh, M.S.; Lin, Y.P.; Chang, L.C. Designing an optimal multivariate geostatistical groundwater quality monitoring network using factorial kriging and genetic algorithms. Environ. Geol. 2006, 50, 101–121. [Google Scholar] [CrossRef]
- Zhang, Y.; Xia, B.; Xie, D.; Koh, C.S. Optimum design of switched reluctance motor to minimize torque ripple using ordinary Kriging model and genetic algorithm. In Proceedings of the 2011 International Conference on Electrical Machines and Systems (ICEMS 2011), Beijing, China, 20–23 August 2011; pp. 1–4. [Google Scholar] [CrossRef]
- Shapiro, A.; Botha, J. Variogram fitting with a general class of conditionally nonnegative definite functions. Comput. Stat. Data Anal. 1991, 11, 87–96. [Google Scholar] [CrossRef]
- Bochner, S. Harmonic Analysis and the Theory of Probability; Mathematical Reviews (MathSciNet): MR17: 273d Zentralblatt MATH; MathSciNet: Berkeley, CA, USA; Los Angeles, CA, USA, 1955; p. 68. [Google Scholar]
- Yaglom, A.M. Some Classes of Random Fields in n-Dimensional Space, Related to Stationary Random Processes. Theory Probab. Appl. 1957, 2, 273–320. [Google Scholar] [CrossRef]
- Genton, M.G.; Gorsich, D.J. Nonparametric variogram and covariogram estimation with Fourier–Bessel matrices. Comput. Stat. Data Anal. 2002, 41, 47–57. [Google Scholar] [CrossRef]
- Gorsich, D.; Genton, M. On the discretization of nonparametric isotropic covariogram estimators. Stat. Comput. 2004, 14, 99–108. [Google Scholar] [CrossRef]
- Carmack, P.S.; Spence, J.S.; Schucany, W.R.; Gunst, R.F.; Lin, Q.; Haley, R.W. A new class of semiparametric semivariogram and nugget estimators. Comput. Stat. Data Anal. 2012, 56, 1737–1747. [Google Scholar] [CrossRef]
- Oliver, M.; Webster, R. A tutorial guide to geostatistics: Computing and modelling variograms and kriging. CATENA 2014, 113, 56–69. [Google Scholar] [CrossRef]
- Cressie, N. Statistics for Spatial Data; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 1993. [Google Scholar]
- Young, L.J.; Young, J.H. Statistical Ecology; Springer: New York, NY, USA, 1998. [Google Scholar] [CrossRef]
- Cressie, N. Fitting variogram models by weighted least squares. J. Int. Assoc. Math. 1985, 17, 563–586. [Google Scholar] [CrossRef]
- Holland, J.H. Genetic algorithms. Sci. Am. 1992, 267, 66–73. [Google Scholar] [CrossRef]
- Cherry, S. Non-parametric estimation of the sill in geostatistics. Environmetrics 1997, 8, 13–27. [Google Scholar] [CrossRef]
- Pyrcz, M.; Deutsch, C. The whole story on the hole effect. Geostat. Assoc. Australas. Newsl. 2003, 18, 3–5. [Google Scholar]
- Alpaydin, E. Introduction to Machine Learning; The MIT Press: Cambridge, UK, 2010. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).