1. Introduction
For climate change, a quantitative assessment of rainwater infiltration into the soil is fundamental for evaluating the slope collapse caused by heavy rainfall in mountainous areas. The rainwater infiltration process was theoretically modeled and numerically simulated using the hydraulic properties of unsaturated soil, which were represented by the water retention curve (WRC) (the relationship between the volumetric soil water content, θ, and matric pressure head, ψ) and by unsaturated hydraulic conductivity. The WRC parameters are a part of the function of unsaturated hydraulic conductivity and are crucially important. These WRCs in the drying path are formed by pore size distribution [
1] and show a large spatial variability within a natural forested slope [
2,
3].
However, it is usually not feasible to directly determine the spatial variability in the WRC from the sampling of soil on the entire forest slope and measurements in the laboratory because these studies are time-, labor-, and cost-intensive. Therefore, the hydraulic properties of unsaturated soils have been assumed to be homogeneous in many studies on the numerical simulation of rainwater infiltration [
4,
5,
6]. However, simulations assuming homogeneous soil cannot fully predict the actual water flow in the field [
7,
8].
To date, many studies have developed a pedo-transfer function to predict WRC [
9,
10,
11,
12]. The pedo-transfer function could determine the WRCs easily. These methods require texture data, bulk density, and organic matter content, which require tiresome measurement and time.
A scaling technique was developed as an effective tool for simplifying the description of spatial variability in WRCs. The original scaling theory was introduced by Miller and Miller (1956) [
13], who derived the scaling theory using similar media theory. Scaling simplifications related WRCs at different spatial locations to a parameter called a scaling factor using the WRC of a set of reference parameters. Based on this concept, the microscopic structures of soils are assumed to be identical, and the soils differ only in their microscopic length scale, which is characterized by a scaling factor for each soil sample. This scaling technique is widely used in agricultural fields and grasslands [
14,
15]. In contrast to soils in agricultural fields and grasslands, few studies have applied this technique to forest soils.
Kosugi and Hopmans (1998) [
16] indicated that ψ
m represents the microscopic length of the soil using the lognormal model, the LN model [
1], which contains three parameters (effective porosity, θ
e; matric pressure head corresponding to the median pore radius of pore-size distribution, ψ
m; and standard deviation of pore-size distribution, σ). Based on this finding, Hayashi et al. (2009) [
17] examined three parameters of the LN model to determine the parameters that characterize the spatial variability of WRCs on a natural forested slope. The results indicated that θ
e, instead of ψ
m, mostly characterizes it. They proposed a new scaling approach that evaluates the spatial variability in WRCs using the spatial variability in θ
e. In this approach, θ
e was set as a scaling parameter that exhibits high spatial variability. At this time, the remaining two parameters showing small spatial variability (ψ
m and σ) were set as reference parameters and were identical within the forested slope.
For the practical application of the scaling technique, a simple method for determining the scaling and reference parameters is needed. For this purpose, Nasta et al. (2009) [
18] proposed a simple application of a scaling technique based on a similar media theory. In their methods, WRCs were estimated from soil-particle-size distribution data, whose measurement was much simpler than that of the WRC. They demonstrated that the spatial variability of WRCs can be characterized using the proposed method. To easily apply the scaling approach to forest soils, Hayashi et al. (2009) [
17] calculated the scaling parameter θ
e as a difference between the water content observed at saturation and the water content observed at ψ = −1000 cm, which is assumed equal to the residual water content. This method explained approximately 90% of the spatial variability in WRCs on the studied natural forested slope. Thus, the WRCs for the entire matric pressure range are not necessary for determining the scaling parameter.
However, little attention has been paid to methods for determining reference parameters. Kosugi and Hopmans (1998) [
16] proposed a method in which the reference parameters are optimized by minimizing the residuals of volumetric water content, θ. Several studies have used this method [
5,
19,
20,
21,
22]. Deurer et al. (2001) [
23] derived the reference parameters from geometric means of their parameters. Chari et al. (2020) [
24] used an infiltration curve to determine the reference parameters, and Fred Zhang et al. (2004) [
25] estimated the reference parameters at the field scale through the inverse modeling of field experiments. Vishkaee et al. (2014) [
26] used reference parameters, assuming that the reference soil is the one that consists of uniform-sized spherical particles.
Most previous studies used a large number of WRCs and needed labor approaches to determine the reference parameters. They did not try to reduce the number of WRCs. However, because the values of the reference parameters are common within a site, it may be possible to determine them with high confidence from a small number of WRCs [
25]. In addition, in natural forested slopes, the soil structure characterizing the WRC is affected by forest biological activities, weathering, and geomorphological processes in Japan. Thus, there may be a spatial pattern in the spatial variability of the soil structure. Consequently, by recognizing the spatial pattern of the soil structure, we need to select the optimal sampling location to determine the reference parameters. This optimal sampling design for reference parameters, in combination with a simple determination of the scaling parameter proposed by Hayashi et al. (2009) [
17], reduces the required number of WRCs and allows us to save effort in measuring the enormous number of WRCs for describing the spatial variability in WRCs in a forested slope. Few studies have focused on the number of samples required to determine the hydraulic properties. Salemi et al. (2020) [
27] estimated an adequate number using statistical analysis. However, they analyzed the saturated hydraulic conductivity and not the water retention curves.
Therefore, the aim of this study is to establish the optimal sampling design for WRCs to identify reference parameters and facilitate the application of the scaling technique based on Hayashi et al. (2009) [
17] to forest soil. For this purpose, we statistically examined the accuracy of estimation of the WRCs to clarify how many samples are required to obtain these parameters and to suggest a sampling location that accounts for the effects of spatial distribution characteristics of WRCs in the forest slope.
2. Materials and Methods
2.1. Soil Sampling Location
The soil samples were taken from a natural forested slope (main and middle lower panels of
Figure 1) underlain by the granitic bedrock (left upper panel of
Figure 1) in the Tanakami Mountain range, Shiga Prefecture, Japan, situated at 34°55′00″ N and 135°58′58″ E. Detailed information on this site is provided by Hayashi et al. (2009) [
17].
We collected soil samples from vertically distributed profiles, as shown by
Nh in
Figure 1.
Nh was correlated with the scaling parameter θ
e (Hayashi et al., 2009 [
17]), and reference parameters were not so correlated.
There was a small dip (indicated by an arrow in
Figure 1) on the soil surface of the middle slope. There were slope regions with a thin soil layer below this dip and a thick soil layer above this dip. Thus, the dip may correspond to the head of an ancient landslide, and it was presumed that the thick soil layer consisted of debris brought by the landslide from the middle slope region. This was also supported by the vertical distribution of penetration resistance,
Nh (
Figure 1).
Nh is the number of blows required for a 10 cm soil depth penetration, which was measured using a cone penetrometer with a 60° bit, cone diameter of 2 cm, weight of 2 kg, and fall distance of 50 cm. For points b–d, the soil in the soil layer remained loose (
Nh in the range of 0–20 blows/10 cm); however, the soil hardness on the bedrock increased rapidly and reached 100 blows/10 cm, which was the value observed at the interface between the soil layer and the bedrock. Such vertical distributions of
Nh values are characteristic of a soil layer consisting of debris [
28]. Points e–h, which had a thin soil layer, were thought to be the residual soil remaining after the landslide. For the soils at points i–o,
Nh gradually increased with depth. The soil layer indicating such a vertical distribution of the
Nh value was considered unaffected by the landslide [
28].
Undisturbed soil samples were collected from 15 points distributed from the downslope (point a) to the upslope (point o) segments to assess the spatial variability in the WRC on this forested slope (
Figure 1). The sampling depths were set from the surface layer to the layer above the bedrock. The sampling locations are indicated by white dots in the panels in
Figure 1. Soil samples were collected in triplicate at each point and depth. But, only at point j, we collected six samples at the depths of 15, 25, 35, 45, and 55 cm, and three samples at the depths of 2.5 and 75 cm.
2.2. Soil Sampling and Laboratory Experiments
Undisturbed soil samples were collected using thin-walled steel core samplers with a volume of 100 cm
3 (5 cm inner diameter and 5.1 cm height). A sampler with a sharp edge was inserted vertically into the soil. Impact energy was applied to the cylinder using a hammer-driven device (DIK-1630; Dike Rika Kogyo, Tokyo, Japan). We followed the method described by Grossman and Reinsch (2002) [
29] to collect undisturbed soil samples and ensure sampling with minimum disturbance. The roots and organic material were carefully cut around the sampler during the insertion process.
Soil samples were transferred to aluminum trays in the laboratory. These were slowly saturated by wetting from the bottom over a 24 h period, and then weighed to determine the saturated water content, θ
s. Soil WRCs were then measured using the pressure plate method [
30] for a matric pressure head, ψ, of −5, −10, −20, −30, −50, −70, −100, −200, −500, and −1000 cm. After measuring the water content at ψ = −1000 cm, each sample was oven-dried to determine the weight of soil in dry condition. The analyses described below were carried out using the average WRCs of three or six replicate samples (each with a volume of 100 cm
3) collected at each point and depth. We analyzed the water retention characteristics of 77 soil samples.
There is a problem of hysteresis. The numerical simulation without hysteresis makes the matric potential head more widely fluctuated, compared with the actual soil. However, because analysis including hysteresis is complicated, it is valid that in this study, water retention curves of drying processes are analyzed using the scaling method.
2.3. Water Retention Model
The LN model [
1] is based on the assumption that the soil pore radius distribution obeys a lognormal distribution and expresses the WRC as
where θ
r and θ
s (cm
3 cm
−3) are the residual and saturated water contents, respectively; ψ
m (cm) is the matric pressure head corresponding to the median pore radius; σ represents the width of the pore-size distribution; and
Q denotes the complementary normal distribution function, defined as
The difference between θs and θr (i.e., θs − θr) is referred to as the effective porosity, θe.
A plot of Equation (1) shown in
Figure 2 demonstrates that ψ
m is the matric pressure head at the inflection point. The parameter σ is defined as follows.
Equation (3) shows that σ controls the magnitude of the change in θ around the inflection point. The parameter θe is defined by θs − θr, and represents the volumetric pore ratio effective for water storage.
2.4. Fitting of the LN Model
Prior to testing the proposed scaling methods, the observed WRC for each soil sample was fitted using the LN model (Fitting Approach IF; Individually Fitting). For the
ith soil sample, the
jth calculated water content,
, was calculated by applying the following equation using a nonlinear least squares optimization procedure (Wraith & Or, 1998) [
31]:
where θ
r,i, θ
s,i, ψ
m,i, and σ
i are the parameters for each soil sample
i, and ψ
j is the
jth matric pressure head. To apply Equation (4), for each soil sample,
i, θ
r,i was fixed at the water content observed at ψ = −1000 cm, and θ
s,i was fixed at the observed saturated water content. As shown in
Figure 3, the slope of water retention curves at ψ = −1000 cm was almost 0. Therefore, we assumed θ at ψ = −1000 cm was θ
r.
The parameters ψ
m,i and σ
i were optimized by minimizing the residual sum of squares (
RSS), computed from
where
J is the total number of data points for WRC of each soil sample (=9),
is the
jth observed water content for a sample
i, and
is the calculated water content corresponding to
. The sum of
RSSi of all soil samples (
RSSIF) was calculated as follows:
where
I is the total number of soil samples (=77). This method provides the best description of the spatial variability in soil WRCs under the conditions assumed in the LN model.
In the Fitting Approach UF (Universal Fitting), a single curve was applied to the entire data set with the LN model to exclude the spatial variability in all the parameters (ψ
m, σ, and θ
e) of the pore-size distribution function, and a single set of parameters (i.e.,
and
) was optimized using
where
is the
jth water content calculated for sample
i using the Fitting Approach UF. The
is shown in
Figure 3.
and
were fixed to the arithmetic mean of all samples to apply Equation (7). The optimization procedure was conducted by minimizing
RSSUF computed from
2.5. Scaling Method
Hayashi et al. (2009) [
17] indicated that, among the three parameters expressing the soil pore-size distribution (ψ
m, σ, and θ
e), θ
e mostly characterizes the spatial variability in WRCs in natural forested slopes. Because θ
e is defined by the difference between θ
s and θ
r, they proposed a scaling approach in which θ
s and θ
r are set as scaling parameters differing for each soil, and the remaining two parameters (ψ
m and σ) are set as reference parameters identical within a study site. According to this scaling technique, WRC can be expressed as
where
is the
jth water content calculated for sample
i using Equation (9). θ
r,i and θ
s,i are the residual and saturated water contents of the soil sample
i, respectively.
and
are the reference parameters, and are identical for the entire dataset.
Hayashi et al. (2009) [
17] suggested a scaling method that determines the scaling parameters θ
r,i and θ
s,i for each sample without using a dataset of WRCs, whose measurement requires enormous effort. In this method, θ
s,i is set to be observed θ
s for each soil, and θ
r,i is set to be the water content observed at ψ = −1000 cm for each soil, which was assumed to be equal to the residual water content θ
r. The reference parameters
and
were determined using optimization so that the parameters produced the best description of the entire dataset of WRCs for the entire slope. That is, Equation (9) was applied to the whole dataset, and
and
were optimized using minimizing the residual sum of squares computed from
Using the whole dataset, I and J were 77 and 9, respectively.
2.6. Determination of Reference Parameters
If the universal reference parameters could be determined from a small number of WRCs without using the dataset of WRCs on the entire slope with acceptable confidence, the effort in describing the spatial variability in WRCs on a forested slope would be reduced tremendously. We conducted a numerical experiment involving random sampling of WRCs by varying the sample size, n, using a Monte Carlo approach to examine the efficiency of the reference parameters derived from a limited number of WRCs. The procedure was as follows.
(1) The first step was randomly selecting non-overlapping n WRC sets of WRCs among the 77 WRC datasets collected from the entire slope.
(2) Reference parameters ( and ) were determined using n datasets of the selected WRCs. For this purpose, Equation (9) was applied to n WRC datasets, and and were optimized by minimizing the RSS computed from Equation (10). θs,i was set to be observed θs for each soil, and θr,i was set to be the water content observed at ψ = −1000 cm for each soil to apply Equation (9). The I and J in Equation (10) were n and 9, respectively.
(3) WRCs were described using the determined reference parameters. That is, the entire datasets of WRCs for the entire slope were computed by substituting one set of reference parameters ( and ) determined in Procedure 2 into Equation (9). The scaling parameters (difference between θs,i and θr,i) were set to the observed values for each sample i to apply Equation (9). The RSS of the estimated WRCs was calculated using Equation (10). In this case, I and J as expressed in Equation (10) are n and 9, respectively. The accuracy of the scaling method was evaluated using the RSS values.
Procedures 1–3 were repeated 100,000 times for each assumed n value.
The scaling approach demonstrated in Hayashi et al. (2009) [
17] corresponds to the case for a sample size of 77 (i.e.,
n = 77).
2.7. Efficiency of Scaling Method
Using
RSS by Equation (10), the following two indices were computed to evaluate the accuracy of the scaling method. The coefficient of determination,
R2, was calculated as
where
is the arithmetic mean of the observed volumetric water content chosen in random sampling.
R2 can be used to evaluate the performance of each scaling method by considering the variation in the observed θ values. In the proposed scaling method,
I and
J were equal to
n and 9, respectively.
In addition to
R2, considering the possible
RSS range derived using the scaling method, the effect of scaling,
EOS, was computed according to Hayashi et al. (2009) [
17]:
Here, RSSIF represents the residual error when the spatial variability is most accurately expressed under the conditions in which the LN model is assumed. In contrast, RSSUF represents the error when the spatial variability in the pore-size distribution is not considered. Thus, EOS evaluates the effect of each scaling method by considering the possible range of RSS derived from the proposed scaling method. When the scaling method perfectly explains the spatial variation under the conditions in which the LN model is assumed, EOS is equal to one. In contrast, when the scaling method cannot explain spatial variability, EOS is equal to zero.
2.8. Semivariogram of Reference Parameters
We analyzed the spatial distribution of reference parameters using a semivariogram. The definition of semivariogram goes back to Matheron. The experimental semivariograms of log(−ψ
m) and σ were calculated for the slope and vertical direction to assess the spatial pattern of log(−ψ
m) and σ in detail. The semivariogram function was computed as follows (Goovaerts, 1997) [
32]:
where γ(
h) is the semivariance,
N(
h) is the number of pairs of variables,
z(
xi) is the measured variable at spatial location
xi,
z(
xi +
h) is the measured variable at spatial location
xi +
h.
In general, the interval of the lag distance, h, is set to be constant; however, in the present study, N(h) is set to be constant to eliminate the bias of the number of data points (N(h) was fixed to 20 and 15 for the slope and vertical directions, respectively). In order to plot the γ(h), the arithmetic mean of the lag distance in each class was used to plot (h).
4. Conclusions
We estimated the number of WRCs required to determine the reference parameters (ψ
m and σ) to effectively apply the scaling technique proposed for forest soil [
17]. For this purpose, we implemented random sampling of the WRCs collected from an entire forested slope by varying the sample size using the Monte Carlo method. As a result, in the figure of the relationship between log(−ψ
m) and σ, isolines of
EOS were distributed concentrically. The highest
EOS calculated from the whole 77 data sets of reference parameters was plotted at the center of the figure. In addition, as the sample number of WRCs required to determine reference parameters was larger, the probability of deriving the highest
EOS was greater. The effect of scaling increased with increasing sample size. Its rate of increase decreased and became about 0 at a sample size of approximately 20. In total, 78% of the spatial variability of the WRCs was explained at a 95% confidence level by using the reference parameters derived from eight samples.
Stratified sampling was performed to reduce the number of WRCs required with a high EOS. Three sampling schemes were tested by considering the different spatial patterns to investigate the effective stratified sampling. The sampling scheme that considered the variability in the slope direction but not in the vertical direction was the most advantageous. The reference parameters derived from eight samples using this scheme performed the accuracy of ten samples derived from complete random sampling.
This result suggested that the spatial variability in the reference parameters is affected by geomorphological processes. Here, geomorphological processes mean an ancient landslide on the studied slope caused a thick soil layer in the downslope region consisting of debris brought from the middle slope region, and the middle and upper slope regions contained the remaining soils. We found that such geomorphological processes had a governing effect on the determination of reference parameters.
We should describe the unsaturated hydraulic conductivity. LN model functionalized unsaturated hydraulic conductivity using parameters of saturated hydraulic conductivity and two parameters (ψ
m and σ). Here, the two parameters correspond to the reference parameters used in this study. Hayashi et al. (2009) [
17] insisted that reference parameters can be set to be homogeneous based on soil pore structure. Therefore, it is highly possible to assume that two parameters are homogeneous when we express unsaturated hydraulic conductivity.
In this study, we investigated datasets obtained from a steep forested slope underlain by weathered granite. Future studies should examine the applicability of the proposed scaling technique to slopes under different geomorphological, vegetative, and geological conditions.