2.1. Simulation Study
Four trials, each with 100 simulations, from the Monte Carlo experiment were performed, where each simulated data set represents a set of stochastic process realizations {Z(
si),
si ∈
S}, where
si = (x
i, y
i)
T is a vector that represents the i-th location in the study area (i = 1, …,
n), such that
S ⊂ ℝ
2 and ℝ
2 is the two-dimensional Euclidean space [
10]. This simulation study reproduces possible real data sets and improves the theoretical and practical knowledge about the anisotropy.
For each simulated data set, an irregular sampling design with 100 points (n = 100) was considered, with ordinates ranging from 0 to 1 on the X and Y and where each Z(si), com i = 1, …, n, represents a georeferenced variable.
This georeferenced variable was expressed by the isotropic linear Gaussian spatial model Z(
si) = μ(
si) + ϵ(
si) [
11], where the deterministic term (μ(
si) = μ), ϵ(
si) is the stochastic term for i = 1,...,
n, such that both depend on the spatial location in which is observed Z(
si), such that E[ϵ(
si)] = 0 and the variation between points in space is determined by a covariance function C(
si,
sj) = cov[ϵ(
si), ϵ(
sj)]. Also, we have assumed that
Z = (Z(
s1), …, Z(
sn))
T has a Gaussian distribution
n-variate, with vector mean equal to μ1,
n × 1, and the covariance matrix
∑ = [(σ
ij)],
n ×
n, with σ
ij = C(
si,
sj), i, j = 1, …,
n, with particular parametric form given by
∑ = φ
1In + φ
2R, where
In is an identity matrix,
n ×
n,
R =
R(φ
3) = [(r
ij)] is a symmetric matrix,
n ×
n, with diagonal elements r
ii = 1 and r
ij = φ
2−1σ
ij for i ≠ j = 1, …,
n. Thus, the covariance function is the function associated with semivariance by γ(h
ij) = C(0) – C(h
ij) for many isotropic and stationary Gaussian processes, where h
ij = ‖
si –
sj‖ is the value of Euclidian distance between the points
si and
sj [
4]. Moreover, this covariance matrix has the following parameters: φ
3 is a function of the range (
a > 0), φ
1 is the nugget effect and φ
2 is the partial sill.
In the first trial, we are assuming that μ = 200, and an exponential function correlation [
12] composed of the following parameters that define the structure of spatial dependence: φ
1 = 0, φ
2 = 1, and
a = 0.6.
In the order trials (2°, 3°, and 4°), this georeferenced variable was expressed by the anisotropic linear Gaussian spatial model [
9], with the following technical characteristics: the same correlation function and one change in the linear Gaussian spatial model described above, where ‖
Ahij‖ expresses the Euclidean distance between pairs of points in
n sampled locations in the semivariance function, i.e.,
γ(‖
Ahij‖) =
C(‖0‖) −
C(‖
Ahij‖), considering a linear transformation at those locations expressed by the product matricial
Ahij, with
hij =
si −
sj, where the transformation matrix is equal to
, with
β as the highest spatial continuity angle in
π radians (0 ≤
β ≤
π) defined in the azimuth system and
is the anisotropic ratio (
Fa > 1), where
αβ is the spatial dependency distance (range) in the direction of higher spatial continuity (β) and
is the spatial dependency distance (range) in the direction of lower spatial continuity (
) [
5,
7,
9]. In the order trials (2°, 3° e 4°), we are assuming that the angle of greater spatial continuity is equal to 90° and the anisotropic ratio is (F
a = 2, 3, and 4). Moreover, the first trial (isotropic) is a particular case of the anisotropic model (F
a = 1) [
4].
For each simulation, the estimation of the anisotropic Gaussian space linear model was carried out using the maximum likelihood method obtained by the following parameters: mean (μ), nugget effect (φ
1), partial sill (φ
2), practical range (
a) and anisotropic ratio (F
a) [
9,
11].
Then, ordinary kriging was used in the spatial estimation of each simulated data set at unsampled locations. In order to compare the differences in the spatial estimation when considering the presence of geometric anisotropy in the application of the linear Gaussian spatial model, kriging was used for each simulated data set with an estimated isotropic linear Gaussian spatial model.
For each simulated data set, the directional comparison between the two maps using the predicted values (one considering the isotropic model and the other considering the anisotropic model) was made. The directional comparison for I(d) (Equation (1)), proposed by Rosenberg [
13] was measured. For this calculation, five distance classes (0.15, 0.30, 0.45, 0.60, and 0.75) and two directions (90° and 0°) were considered. Each distance was chosen to guarantee a relevant number of points pairs, greater than or equal to 40, for the calculation of I(d) [
13].
where
n is the number of unsampled points that were considered in the kriging;
and
are the predicted values in the point
and
(
);
is the mean of predicted values for all points;
is the sum of the elements of a spatial weights matrix
W′
elaborated with the distance class d; and
is the element at the
-th row and
-th column of the weights matrix
W′, expressed by:
where
, for which
is the Euclidean distance between points
and
(
and
);
is the direction (in radians) of interest for the calculation of the Moran directional index; and
is the angular direction (in radians) between points
and
(i.e., between a line parallel to the X axis and the line formed by the points
and
), such that
and
, calculated by:
, if
;
, if
;
, if
; and
, if
, where
and
are the coordinates of the points
and
(
and
), respectively.
Equation (2) demonstrates that Rosenberg [
13] defined the Moran directional considering, in this index, the inclusion of a weight (or penalty) factor weights matrix
, expressed by
, with
and
. In this way,
, if the direction between points
and
is equal to the direction of interest (i.e.,
). However, the greater the distance between the direction of interest (
) and the direction between points
and
(
), the smaller the value of
(relative to the value of
).
High values of
indicate that there is a spatial autocorrelation of the georeferenced variable [
14]. Thus, the influence of distance class on the magnitude of the index can be explained by the basic geostatistical concept that closer data look more like more distant data [
1]. Thus, the analysis of the variation of
as a function of the distance class
serves as an indication of the range of spatial dependence of the georeferenced variable for a specific direction.
Also, the significance of this index was evaluated by the pseudo-significance test, with a 5% probability [
14].
2.2. Soil Chemical Property Study
The agricultural data were collected in a commercial area of grain production in Cascavel, Paraná, Brazil (
Figure 1), with a total area of 167.35 ha. The area is located at approximately 24.95° of South and 53.57° of West, with a mean altitude of 650 m above sea level. The soil is classified as typical Dystroferric Red Latosol, with a clayey texture. The region’s climate is classified as mesothermal and super-humid temperate, climate type Cfa (Köeppen classification system), and the mean annual temperature is 21 °C. The sampling points were georeferenced and localized using a GNSS receiver (GeoExplorer, Trimble Navigation Limited, Sunnyvale, CA, USA) in a Datum WGS84 coordinate reference system, UTM (Universal Transverse Mercator) projection. The lattice plus close pairs sampling with 102 points was sampled [
1,
15]. This sampling consists of a regular grid, with a minimum distance between points of 141 m; 19 points were randomly added in this regular grid, such that the smallest distance between the points added and the point of this regular grid is 75 and 50 m (
Figure 1).
The sample data were obtained through a routine chemical analysis in the soil analysis laboratory of COODETEC (Cooperativa Central de Pesquisa Agrícola) of representative samples of each plot weighing approximately 500 g and collected at each demarcated point (
Figure 1). The following soil chemical properties with geometric anisotropy were considered: carbon content (C) (g dm
−3), calcium content (Ca) (cmolc dm
−3) and potassium content (K) (cmolc dm
−3). The chemical analyses were performed using the Walkley–Black method [
16].
Descriptive and geostatistical analyses were performed for each soil chemical property to verify the existence of directional trends and spatial dependence. Directional trends represent a linear association between the respective values of the soil chemical properties with the coordinates of the X or Y axis, and were assessed by Pearson’s linear correlation coefficient (
), in which values above 0.30 in a module indicate a directional trend [
17]. Spatial dependence was assessed by the spatial dependence index (SDI), being classified as weak when SDI ≤ 9%, moderate when 9%
SDI ≤ 20% and strong when SDI ≥ 20% [
4,
18]. Still, the directional comparison analyses described above were also performed. For the calculation of (Equation (1)), five distance classes (150, 300, 450, 600 and 750 m) and two directions (the direction of greater spatial continuity (θ) and the direction orthogonal to it) were considered.
Moreover, the global comparison between the two maps using the predicted values (one considering the isotropic model and the other considering the anisotropic model) were made. The measurements for global comparison were performed by the Global Accuracy and the Kappa and Tau concordance indices [
19,
20].