A Deterministic Combinatorial Approach to Investigate Interactions of Soil Hydraulic Parameters on River Flow Modelling

Sales, Dhiego da Silva; Costa, David de Andrade; Lugon Junior, Jader; Neves, Ramiro Joaquim; Silva Neto, Antônio José da

doi:10.3390/w17172627

Open AccessArticle

A Deterministic Combinatorial Approach to Investigate Interactions of Soil Hydraulic Parameters on River Flow Modelling

by

Dhiego da Silva Sales

^1,2,*

,

David de Andrade Costa

¹

,

Jader Lugon Junior

¹

,

Ramiro Joaquim Neves

²

and

Antônio José da Silva Neto

³

¹

Department of Modeling and Technology for the Environment Applied to Water Resources (AMBHIDRO), Federal Fluminense Institute (IFF), St. Coronel Walter Kramer, 363, Pq Santo Antônio, Campos dos Goytacazes, Rio de Janeiro 28080-565, Brazil

²

Center for Environmental and Marine Science and Technology (MARETEC), Instituto Superior Técnico (IST), University of Lisbon, Av. Rovisco Pais, 1, 1049-001 Lisbon, Portugal

³

Mechanical Testing and Metrology Laboratory, Polytechnic Institute, Rio de Janeiro State University (UERJ), St. Bonfim, 25, Nova Friburgo, Rio de Janeiro 28625-570, Brazil

^*

Author to whom correspondence should be addressed.

Water 2025, 17(17), 2627; https://doi.org/10.3390/w17172627

Submission received: 11 July 2025 / Revised: 25 August 2025 / Accepted: 3 September 2025 / Published: 5 September 2025

(This article belongs to the Special Issue Soil–Water Interaction and Management)

Download

Browse Figures

Versions Notes

Abstract

Hydrological modeling is essential for the sustainable management of watershed systems. Physically based models like MOHID-Land simulate soil water dynamics using Richards’ equation, parameterized through the van Genuchten–Mualem (VGM) model. Although the sensitivity of individual VGM parameters—residual water content (

θ_{r}

), saturated water content (

θ_{s}

), pore size distribution (

n

), inverse of air entry pressure (

α

), and saturated hydraulic conductivity (

K_{s a t}

)—is well documented, their combined effects remain underexplored. This study assessed both isolated and joint impacts of these parameters through a deterministic ±10% perturbation scheme, resulting in 31 unique parameter combinations. Model performance was evaluated using the Nash–Sutcliffe Efficiency (NSE) and Percent Bias (PBIAS). Full-parameter interaction achieved the best results (NSE = 0.50, PBIAS = 25.32), compared to the uncalibrated baseline (NSE = 0.01, PBIAS = 34.06). The pair

θ_{s}

and

n

emerged as the most influential. Adding secondary parameters to this core pair yielded only marginal performance gains, while removing them from the full set caused similarly marginal declines. These findings reveal a hierarchical sensitivity structure, emphasizing

θ_{s}

and

n

as key targets for calibration. Prioritizing this pair enables a more efficient soil calibration process, preserving model accuracy while reducing computational cost by limiting parameter space exploration.

Keywords:

hydrological modelling; sensitivity analysis; soil hydraulic properties; model calibration strategies; watershed modelling; soil physics

1. Introduction

Hydrological modeling of watershed systems is a fundamental tool for understanding the movement and storage of water in soils, playing a pivotal role in the sustainable planning and management of water resources [1,2]. These models differ significantly in their approaches, ranging from simplified conceptual frameworks to highly detailed physically based systems.

Among these, physically based models, such as MOHID-Land, distinguish themselves by providing a mechanistic representation of hydrological processes through the direct numerical solution of partial differential equations [3]. This capability is essential for simulating complex watershed responses under dynamic environmental conditions, offering a distinct advantage over models that rely on more simplified conceptual or empirical formulations [4]. For instance, widely used and extensively validated hydrological models like the Soil and Water Assessment Tool (SWAT) are particularly effective for large-scale applications but often approximate soil infiltration and moisture dynamics using empirical methods, such as the Curve Number (CN) approach for surface runoff generation and layered water balance schemes for soil water movement [5].

Crucially, an accurate representation of soil water dynamics is one of the most important components of physically based models. This process is typically governed by Richards’ equation [6], whose performance strongly depends on accurately defined soil hydraulic parameters. The van Genuchten–Mualem (VGM) model remains the most widely adopted framework for describing these parameters. It requires five key variables: residual water content (

θ_{r}

), saturated water content (

θ_{s}

), a shape parameter related to pore size distribution (

n

), an air entry pressure parameter (

α

), and saturated hydraulic conductivity (

K_{s a t}

) [7,8]. Numerous studies have calibrated one or more of these parameters, demonstrating that such variations can substantially alter model predictions in the MOHID-Land system, where the redistribution of soil moisture is central to the hydrological balance [9,10,11].

Within this context, sensitivity analysis is essential for identifying the most influential parameters and informing calibration strategies. Oliveira et al. [12] pioneered sensitivity assessments in MOHID-Land by analyzing the impact of individual parameter perturbations on streamflow duration curves. Their approach applied one-at-a-time, positive-only variations—focused primarily on

K_{s a t}

among the VGM parameters—and employed a sensitivity index based on the effect of these variations on flow class along the flow duration curve. However, the analysis did not incorporate negative variations, symmetric perturbations, or explore interactions between parameters. More recently, Sales et al. [11] expanded this investigation by perturbing all five VGM parameters using a centered-difference scheme. Their method estimated local sensitivities from the derivative of residuals, based on symmetric ±10% perturbations around baseline values for each parameter. While more comprehensive, this approach combined individually optimized values under the implicit assumption of linear system behavior—an assumption often violated in nonlinear hydrological models [13]. Consequently, potential interactive effects among parameters remained unaddressed. This limitation is intrinsic to local sensitivity approaches, which, while computationally convenient, do not capture the complexity of nonlinear interactions—particularly in models that integrate Richards equation with time-varying boundary conditions or that simulate surface flows through Saint-Venant equations [14]. Considering this gap, the present study explicitly addresses how parameter interactions influence model performance, thereby extending the line of research line initiated in Sales et al. [11].

Ideally, capturing these interactions would require variance-based global sensitivity methods; however, their application in MOHID-Land is constrained by the high computational demand of executing thousands of simulations [15]. Dai et al. [16] recommend variance-based global sensitivity methods, such as Sobol indices and the Absolute Moment Approximation (AMA), which decompose model output variance to assess both main and higher-order parameter effects. Although such techniques are methodologically rigorous, their practical implementation remains challenging in this context [16]. This gap between the limitations of local analyses and the computational challenges of global approaches motivates the search for alternative frameworks tailored to complex hydrological models.

As a feasible alternative, this study adopts a deterministic and exhaustive combinatorial approach, explicitly designed to quantify both individual and combined effects of the five VGM parameters. By maintaining the same ±10% perturbation range established in Sales et al. [11], we constructed and evaluated all 31 non-empty combinations of the five VGM parameters, performing one simulation per scenario. This deterministic strategy enabled a systematic assessment of parameter interactions and their influence on model performance, using streamflow as the response variable. Unlike stochastic algorithms or high-dimensional sampling techniques, this approach offers a computationally viable alternative for complex models. The fixed perturbation range allowed for a standardized comparison of parameter effects, reducing potential biases associated with scale-dependent variances. Notably, the method revealed how specific combinations of parameters influence simulation accuracy—insights that would likely be overlooked by traditional local sensitivity methods.

Ultimately, this work aims not only to improve calibration protocols for MOHID-Land but also to contribute methodologically to the ongoing discourse on practical sensitivity assessment in distributed hydrological modeling. Systematic approaches of this nature have shown promise in other domains [17], albeit primarily through automated techniques. Here, we present a fully deterministic and interpretable alternative, expanding the toolkit available for the physically based calibration of soil hydraulic properties in complex watershed systems.

2. Materials and Methods

2.1. Study Area

The Pedro do Rio watershed, encompassing approximately 420 km², accounts for roughly 55% of the municipality of Petrópolis, located in the mountainous area of Rio de Janeiro State. It lies within the Atlantic Forest biome, globally recognized as a biodiversity hotspot, and supports diverse ecosystems rich in endemic species. Hydrologically, the watershed belongs to the Piabanha River basin, itself a significant tributary of the Paraíba do Sul basin—a critical freshwater source for domestic, agricultural, and industrial demands across the states of São Paulo, Minas Gerais, and Rio de Janeiro (Figure 1).

Owing to its ecological and hydrological significance, the Pedro do Rio watershed was designated as one of three pilot study areas under the Integrated Studies in Experimental and Representative Basins (ISERB, or EIBEX in Portuguese) Project, a national program coordinated by the Brazilian Geological Survey (BGS, or SGB in Portuguese). EIBEX aims to implement long-term monitoring of hydrological and environmental variables in basins that typify the complex physical, environmental, socioeconomic, and water-resource dynamics prevalent throughout Brazil [18]. This watershed is extensively monitored, hosting multiple hydrometeorological stations that provide high-resolution data essential for diverse scientific studies in hydrology, ecology, and environmental management. The watershed’s outlet contains a streamflow gauging station operated by the National Water and Sanitation Agency (NWSA, or ANA in Portuguese), providing crucial observational data for model calibration and validation.

As of 2022, Petrópolis had approximately 278,881 inhabitants, with a population density of 352.5 residents per km² [19]. This demographic concentration places substantial pressure on local water resources, intensifying both demand and the potential for environmental degradation. The watershed’s economy is strongly influenced by agriculture, particularly the cultivation of cereal, legume, and oilseed cultivation, which, while economically vital, exacerbate water withdrawal and contribute to soil erosion and sediment transport through agrochemical runoff [20].

Topographically, the basin is characterized by pronounced relief, with elevations ranging from 645 m at the outlet to 2200 m in the headwaters, resulting in a vertical drop of approximately 1450 m. Steep slopes dominate the landscape, with 44.6% classified as strongly undulating (20–35%) and an additional 36.4% categorized as mountainous terrain (45–75%). The region is subject to severe landslide risks, partly due to irregular settlement in high-slope areas, which comprise a large portion of the watershed.

Climatic gradients closely follow elevation changes: upland areas in the watershed experience higher precipitation, exceeding 2000 mm annually, while lower-elevation areas receive around 1300 mm per year. This pattern reflects a clear spatial distribution of rainfall, strongly influenced by the mountainous topography, with precipitation decreasing progressively from the headwaters to the watershed outlet. Seasonal variability is also pronounced, with heavy rainfall concentrated in the summer months, which can lead to flash floods and erosion, particularly in areas with sparse vegetation cover [21]. The spatial distribution of precipitation and isohyets is illustrated in Costa et al. [22].

Land use within the watershed is markedly heterogeneous, reflecting a complex mosaic of urban expansion, agricultural activity, and relatively well-preserved Atlantic Forest fragments. Approximately 62% of the basin remains under dense forest cover, including sections of the Serra dos Órgãos National Park, highlighting the region’s ecological significance. Agricultural and pastoral lands, primarily situated along riparian corridors and slopes, occupy around 26% of the area, while urban development comprises roughly 7%, with ongoing expansion linked to the proximity of the metropolitan Rio de Janeiro area. This urban growth exacerbates risks associated with flooding and landslides. The remaining land cover consists of rocky outcrops on steep gradients, further complicating hydrological responses and increasing erosion potential [18,21].

The soil map used in this study was produced under the Rio de Janeiro Project, a series of multidisciplinary studies of the physical environment conducted by the SGB, in partnership with other institutions, at a scale of 1:250,000 for the entire state of Rio de Janeiro [23]. Soil classification follows the Brazilian Soil Classification System (BSCS, or SiBCS in Portuguese) proposed by EMBRAPA [24], in which soils are grouped according to pedogenetic development, mineralogical composition, and their chemical and physical properties.

In the representative watershed, the predominant soil class is the Allic Cambisol, covering 66.74% of the basin area (Figure 2). Cambisols are soils at an intermediate stage of weathering, retaining several characteristics of their parent material. They typically exhibit low permeability and variable depth, ranging from shallow to moderately deep profiles. The qualifier Allic indicates high aluminum saturation, conferring natural acidity and potential aluminum toxicity, which limits agricultural use without proper soil amendment [24]. Within this soil class, multiple mapped units exist, represented by specific codes such as Ca1, Ca2, Ca6, and Ca7. These units differentiate the same soil class based on criteria such as depth, texture, drainage, and topographic position, enabling more precise mapping of soil variability across the landscape.

The second most representative soil class is the Allic Red-Yellow Latosol, which accounts for 22.83% of the basin area. Latosols are highly weathered soils, generally very deep and uniform, with high porosity and good drainage. These characteristics provide favorable conditions for root development and water percolation. Like Cambisols, this soil class also comprises multiple mapped units (e.g., LVa10, LVa14), which distinguish variations in depth, texture, and landscape position.

Additional minor soil units include Allic Litholic Soils (Ra), which are very shallow soils strongly influenced by the underlying bedrock, commonly occurring on steep slopes, and Rock Outcrops (e.g., AR2, AR3), where vegetation is limited due to the absence of soil cover. Urbanized areas were also mapped within the basin, reflecting anthropogenic land use rather than natural soil classes. In total, ten different soil classes are recognized within the watershed, each represented by mapped units in accordance with the classification system.

Taken together, these demographic, physiographic, climatic, and land use characteristics establish the Pedro do Rio watershed as a hydrologically complex and environmentally sensitive system, necessitating comprehensive monitoring and modeling efforts.

2.2. MOHID-Land Model Overview

The MOHID-Land model is a physically based hydrological model designed for the multifaceted simulation of hydrological processes intrinsic to watershed systems. Its computational architecture is inherently capable of assimilating and faithfully representing with fidelity the spatial heterogeneities associated with land cover, soil typology diversity, and complex subsurface hydrodynamics. This capability grants MOHID-Land remarkable versatility, enabling its broad applicability across diverse hydrological research contexts, ranging from detailed analyses of small micro-basins to modeling at the scale of large macro-basins of considerable geographic extent. The code can be accessed from an online repository (https://github.com/Mohid-Water-Modelling-System/Mohid, accessed on 1 December 2024). It employs the Finite Volume Method (FVM) to solve conservation equations for mass and momentum over a structured grid, combining horizontal and vertical discretization. The horizontal grid follows a user-defined resolution, while vertical discretization defines soil layer thicknesses down to a user-specified depth, beyond which no flow is simulated. This configuration allows for high-resolution representation of terrain, land cover, and soil properties, enabling the model to capture spatial heterogeneity across various hydrological domains [25].

Surface flow is routed through a drainage network derived from digital elevation data, in which cells are linked downslope to form channel segments. Overland and channel flows are computed using the Saint-Venant equation (Equation (1)), allowing for 2D simulation of surface runoff and 1D simulation of channel flow. Exchange fluxes between soil layers, the surface, and river networks are determined based on pressure gradients, accurately capturing interactions between compartments such as infiltration, percolation, exfiltration, and baseflow [12].

\frac{\partial Q_{i}}{\partial t} + v_{j} \frac{\partial Q_{i}}{\partial x_{j}} = - g A (\frac{\partial H}{\partial x_{i}} + S_{f i})

(1)

where

Q_{i}

is discharge [m³/s],

t

is time [s],

A

is flow area [m²], v is velocity [m/s],

x_{i}

is the flow direction,

x_{j}

are spatial directions,

g

is gravitational acceleration [m/s²],

H

is hydraulic head [m], and

S_{f i}

is the friction slope [m/m], which is computed using the empirical Manning’s equation.

Vegetation dynamics are simulated using a modified version of the Environmental Policy Integrated Climate (EPIC) model [26], which drives crop development through the accumulation of heat units (degree-days). This approach quantifies thermal time required for plant growth, enabling stage-specific simulation of phenology. The model also represents root growth, leaf area index (LAI), canopy height, and total biomass production. Water uptake from the root zone is modeled as a function of soil matric potential, following the approach proposed by Feddes et al. [27], which identifies four critical suction thresholds governing plant water stress. Under optimal soil moisture (h₂ < h < h₃), stress is minimal, and water uptake is maximized. Stress increases linearly when conditions deviate from this optimal range, reaching maximum inhibition at h < h₄ or h > h₁.

Evapotranspiration is simulated using the dual crop coefficient method proposed by Allen et al. [28], which distinguishes between basal crop water use and evaporation from exposed soil. Crop coefficients are assigned dynamically according to phenological stages, ensuring consistency between water demand and plant development. While MOHID-Land does not simulate atmospheric processes internally, it relies on externally supplied meteorological data (precipitation, temperature, solar radiation, cloud cover, relative humidity, wind speed, and wind velocity) as boundary inputs, enabling flexible coupling with observational data or climate model outputs.

2.2.1. MOHID-Land Soil Water Dynamics

MOHID-Land employs a fully three-dimensional representation of unsaturated flow processes, enabling detailed simulation of vertical and lateral water redistribution within the soil profile in response to pressure and moisture gradients. The transient movement of water within the soil in MOHID-Land is governed by the Richards equation (Equation (2)), a highly nonlinear partial differential equation due to the strong dependence of unsaturated hydraulic conductivity

K (θ)

on soil water content [6].

\frac{\partial θ}{\partial t} = - \frac{\partial Q_{i}}{x_{d}} - S (h) = \frac{\partial}{x_{d}} (K (θ) \frac{\partial H}{\partial x_{d}}) A - S (h)

(2)

where

K (θ)

is the unsaturated hydraulic conductivity [m/s],

Q

is the flux [m³/s],

A

is the area [m²],

θ

is the water content [m³/m³],

H

is the hydraulic gradient (topography + hydrostatic pressure + suction pressure) [m],

x_{d}

is the flow direction, and

S (h)

is the term for water uptake from the soil by plant roots [m³/s].

It is formulated under the continuum assumption with a representative elementary volume, meaning that water content and hydraulic properties are considered continuous over a defined length scale [29]. The classical Richards equation combines mass conservation and Darcy–Buckingham flux relationships [30] and assumes that air in the pores remains at atmospheric pressure, so that air does not impede water movement. These assumptions introduce important limitations. In soils with very low air permeability or highly compacted layers, air can become trapped, generating positive pore pressures that locally restrict water movement. This creates preferential flow paths that the classical Richards equation cannot represent, often leading to an underestimation of localized infiltration [31]. Additionally, the parabolic nature of the Richards equation imposes monotonic fluxes—meaning that water content changes smoothly and continuously without overshooting or oscillating—which prevents the standard formulation from capturing saturation overshoot, where water content temporarily exceeds equilibrium saturation at the wetting front [32,33].

Despite these limitations, the Richards equation remains the most widely used model for unsaturated flow due to its general applicability and computational efficiency. Extensions have been proposed specifically to address the inability of the classical formulation to capture saturation overshoot and preferential flow caused by air entrapment in compacted soils [31,32,33]. While these modifications improve the representation of such phenomena, the classical Richards equation still forms the backbone of most soil water modeling frameworks [30].

In MOHID-Land, the soil domain is bounded by two primary interfaces: the lower boundary, corresponding to impermeable bedrock, redirects infiltrated water as lateral subsurface flow along the terrain slope, while the upper boundary, the soil–atmosphere interface, applies precipitation, evapotranspiration, and other climatic fluxes. Constant air pressure is assumed at this interface, consistent with the classical Richards assumptions. Soil water retention is represented through VGM functional relationships (Equations (3) and (4)) [11,12], while vertical and lateral hydraulic anisotropy is accounted for through a

K F

multiplier (Equation (5)), where a value of 1 indicates isotropy and other values reflect degrees of directional variation [34]. These boundary conditions and hydraulic parameterizations allow MOHID-Land to simulate realistic water redistribution, with anisotropy playing a critical role in sloped or stratified soils by enhancing lateral flow and influencing streamflow and groundwater recharge.

θ (h) = θ_{r} + \frac{θ_{s} - θ_{r}}{{[{1 + (α |h|)}^{n}]}^{m}}

(3)

K (θ) = K_{s a t} S_{e}^{L} {(1 - {(1 - S_{e}^{1 / m})}^{m})}^{2}

(4)

K F = \frac{K_{s a t, h o r}}{K_{s a t}}

(5)

where

θ_{s}

is the saturated water content [m³/m³],

θ_{r}

is the residual water content [m³/m³],

h

is the suction pressure [m],

K_{s a t}

is the saturated hydraulic conductivity [m/s],

α

is the curve adjustment parameter, related to the inverse of the air entry [m⁻¹],

n

is the curve adjustment parameter, related to the pore size distribution [dimensionless],

m

is obtained from the relation

1 - 1 / n

,

L

is the empirical pore connectivity [m], equal to 0.5 [8],

K F

is the hydraulic conductive multiplying factor [dimensionless], and

K_{s a t, h o r}

is the horizontal saturated conductivity [m/s].

2.2.2. Model Set-Up (Baseline Simulation—S1)

The model domain was discretized using a regular grid with a spatial resolution of 200 m, encompassing 160 rows by 200 columns. The lower-left coordinate of the grid is anchored at 43.36° W and 22.59° S. Elevation data were obtained from the 30 m resolution Topodata digital elevation model [35] and subsequently interpolated to match the model’s spatial resolution.

River channel geometry was represented by trapezoidal cross-sections, parameterized based on upstream drainage areas and derived from field surveys conducted between 2019 and 2021 under the direction of the Piabanha Watershed Committee (Table 1). Each row in Table 1 represents the geometric parameters (heights, top width, and bottom width) of the trapezoidal cross-sections associated with a specific upstream drainage area. The smallest drainage area corresponds to the most upstream node of the drainage network, while the largest area represents the watershed outlet. Intermediate drainage areas are control points, and cross-sections for all other nodes are interpolated between these values to ensure that each node in the network is assigned a representative channel geometry.

Land use and land cover data were sourced from the 30 m resolution MapBiomas project [36] and used to assign vegetation classes and Manning’s roughness coefficients, which ranged from 0.03 to 0.16 s·m^(−1/3). Three predominant land cover types—forest, pasture, and agriculture—guided the assignment of crop coefficients (

K c

), varying from 0.6 to 1.0, in accordance with Food and Agriculture Organization (FAO) guidelines [28], as detailed in Table 2.

Soil texture [40] and bulk density [41] data from the Brazilian Agricultural Research Corporation (EMBRAPA), headquartered in Brasília, Brazil, were used as input. These datasets are provided as raster layers (90 m resolution), derived from national soil surveys and interpolations. The data were processed using the MOHID Soil Tool (https://github.com/dhiegosales/MOHID-SOIL-TOOL, accessed on 12 December 2024), which extracts soil texture fractions (sand, silt, and clay) and bulk density for each soil type polygon and depth layer. In the study area, 10 distinct soil types were identified. The EMBRAPA dataset contains six vertical layers, which were used to discretize the soil profile into seven computational layers in MOHID-Land. Since the last two layers share the same parameters, the model preserves six unique layers as shown in Table 3.

The soil texture and density values for each soil type and layer were then used as input for the Rosetta pedo-transfer model, generating the hydraulic parameters of the van Genuchten–Mualem formulation (

θ_{r}

,

θ_{s}

,

α

,

n

,

K_{s a t}

). In total, 60 parameter sets (10 soil types × 6 layers) were produced and implemented in MOHID-Land. Therefore, the soil profile is not uniform across layers: each depth interval is characterized by distinct hydraulic properties derived from texture and bulk density, ensuring a realistic vertical heterogeneity consistent with the EMBRAPA dataset. The full set of hydraulic parameters is presented in Appendix A (Table A1).

In the model, hydraulic conductivity anisotropy is represented by the

K F

multiplier, which was set to 10—the default value in MOHID-Land—indicating that horizontal conductivity is ten times higher than vertical conductivity.

The meteorological data required for the calculation of reference evapotranspiration were obtained from the ERA5 global reanalysis model [42]. This model, which provides hourly data on a 0.25° × 0.25° grid, was selected for its ability to provide long and continuous time series. The reliability and accuracy of ERA5 data for hydrological and climatological studies have been extensively documented in the scientific literature, with numerous studies validating its suitability for different regions, including Brazil [43,44,45,46]. Given this extensive use by the scientific community, we considered the ERA5 dataset to be an appropriate source for the meteorological parameters used in this study.

Precipitation inputs were compiled from 39 local rain gauges and consolidated into 15 representative stations using a clustering based on median rainfall characteristics [22]. Missing rainfall data were interpolated using UNESCO-IHE’s HyKit toolbox [47], which integrates station proximity and elevation to account for spatial heterogeneity—particularly important given the region’s pronounced orographic variability.

2.3. Van Genuchten Parameters Interactions and Scenarios

To rigorously evaluate the individual and combined influences of soil hydraulic parameters within the VGM framework—namely

θ_{s}

,

θ_{r}

,

α

,

n

, and

K_{s a t}

, hereafter referred to as

P_{1}

through

P_{5}

—a deterministic combinatorial strategy was adopted. Each parameter was perturbed by a fixed ±10% multiplicative factor, as specified by Sales et al. [11], relative to its baseline value. Specifically, the factors applied were 1.1 for

θ_{s}

and

n

, and 0.9 for

θ_{r}

,

α

, and

K_{s a t}

.

The methodological foundation rests on the mathematical construction of the power set of the parameter space. Let

P = {P_{1}, P_{2}, \dots, P_{j}}

denote the set of VGM parameters under analysis, where

j = 5

. The power set

P (P)

includes all

2^{j}

subsets of

P

. Excluding the null set (which corresponds to the unperturbed baseline simulation), the total number of unique simulations required to evaluate all non-empty parameter combinations is denoted by

C

and given by:

C = 2^{j} - 1 = 2^{5} - 1 = 31

(6)

The use of base 2 in this formulation is justified by the binary nature of each perturbation decision—a parameter is either included in each simulation (perturbed) or not (kept at its default value). Such inclusion/exclusion logic naturally follows a binary combinatorial structure, which is classically modeled using powers of two. Each subset

S_{k} \subseteq P

corresponds to a distinct simulation scenario in which exactly

k

parameters are perturbed (

1 \leq k \leq 5)

. For example, all subsets of cardinality

k = 2

correspond to pairwise interactions, such as

{P 1, P 2}

,

\{P 1, P 3\}

, …. For

k = 5

, there is only one subset, containing all parameters:

{P 1, P 2, P 3, P 4, P 5}

.

Simulation scenarios were generated using MOHID Soil Tool (MST), a dedicated Python-based software (version 4.0.3) with a graphical interface optimized for Windows 10/11 × 64 systems. MST was specifically developed to transform soil texture and bulk density data into hydraulic parameters through the following computational pipeline: (i) data ingestion, (ii) soil texture preprocessing, (iii) compaction analysis, (iv) derivation of VGM parameters via the Rosetta API, (v) application of multiplicative perturbation factors, and (vi) export of formatted input files for MOHID-Land. The executable version of the software along with its user manual is available at: https://github.com/dhiegosales/MOHID-SOIL-TOOL (accessed on 12 December 2024). Additionally, the complete set of input files can be accessed at: https://zenodo.org/records/14914611 (accessed on 23 February 2025). These resources enable replication of the results and verification of the methodology. Comprehensive technical documentation is provided in Sales et al. [11] to support correct application and interpretation of the tool.

Table 4 summarizes the baseline scenario (S1) and the 31 combinatorial simulations (S2–S32), detailing the specific VGM parameters perturbed in each scenario. A value of 1.0 in this table indicates that the parameter’s baseline value, which corresponds to the specific soil type and depth, remains unchanged; these values are detailed in the appendix (Table A1). Factors of 1.1 and 0.9 correspond to +10% and −10% perturbations, respectively. For instance, S2 tests

K_{s a t}

in isolation; S7 evaluates the joint influence of

K_{s a t}

and

θ_{s}

; and S32 includes all five parameters simultaneously.

Simulations were independently executed on the High-Performance Computing (HPC) cluster of the Escola de Sagres, affiliated with the Instituto Politécnico da Universidade do Estado do Rio de Janeiro (UERJ). This infrastructure enabled efficient and reproducible parallel execution computationally intensive model runs.

By maintaining the same ±10% perturbation range established in Sales et al. [11], we constructed and evaluated all 31 non-empty combinations of the five VGM parameters, performing one simulation for each scenario. This deterministic strategy enabled a systematic assessment of parameter interactions and their influence on model performance, using streamflow as the response variable. Unlike stochastic algorithms or high-dimensional sampling techniques, this approach offers a computationally efficient alternative for complex models. The fixed perturbation range allowed for a standardized comparison of parameter effects, reducing potential biases from scale-dependent variances. Notably, the method revealed how specific combinations of parameters affect simulation accuracy—insights that would likely be missed by traditional local sensitivity methods.

2.4. Model Evaluation

The combinatorial simulation experiments were conducted over the 2006–2008 period, with 2006 designated as a model warm-up year and therefore excluded from the performance evaluation. The observed streamflow dataset was provided at a daily time step, yielding a total of 731 observations from 1 January 2007 to 31 December 2008. This specific period was selected because it coincides with the calibration window adopted by Sales et al. [11], providing a solid comparative basis and methodological consistency between studies. Aligning the time frame with that calibration benchmark allows for direct assessment of model performance under equivalent boundary and input conditions. Additionally, limiting the simulation to this shorter period reduced the computational demands inherent to the MOHID-Land model, thereby enabling a computationally efficient yet scientifically robust sensitivity analysis. It is important to note that observational errors were not explicitly quantified in this deterministic setup, as the primary objective was to isolate parameter-driven variability in model performance.

Model performance was quantitatively assessed using two widely recognized hydrological metrics: the Nash–Sutcliffe Efficiency (NSE) and the Percentage Bias (PBIAS), defined, respectively, by Equations (7) and (8). Both indices were computed using the open-access application ErrUncSeriesAnalyzer: Error and Uncertainty Analysis Tool, version 2.0.0, available at https://github.com/dhiegosales/ErrUncSeriesAnalyzer (accessed on 28 August 2024). This approach is standard practice in the field and provides a robust framework for evaluating both the model’s predictive skill and its long-term water balance:

N S E = 1 - [\frac{\sum_{i = 1}^{p} {(Q_{i}^{o b s} - Q_{i}^{s i m})}^{2}}{\sum_{i = 1}^{p} {(Q_{i}^{o b s} - Q_{m e a n}^{o b s})}^{2}}]

(7)

P B I A S = \frac{\sum_{i = 1}^{p} (Q_{i}^{s i m} - Q_{i}^{o b s})}{\sum_{i = 1}^{p} Q_{i}^{o b s}} 100

(8)

where

Q_{i}^{s i m}

is the simulated flow for day

i

[m³/s];

Q_{i}^{o b s}

is the observed flow on day

i

[m³/s];

Q_{m e a n}^{o b s}

is the observed mean flow for the period [m³/s];

Q_{m e a n}^{s i m}

is the simulated mean flow for the period under consideration [m³/s]; and

p

is the total number of days in that same period.

The NSE measures the model’s predictive skill by comparing variance of residuals to the variance in the observed data. NSE values range from −∞ to 1, where values closer to 1 indicate higher model accuracy. According to Moriasi et al. [48], NSE > 0.80 is classified as “Very Good,” 0.70–0.80 as “Good,” 0.50–0.70 as “Satisfactory,” and values below 0.50 as “Unsatisfactory.” PBIAS quantifies the average deviation between simulated and observed volumes, with an ideal value of 0. Positive values reflect overestimation, while negative values indicate underestimation. Performance thresholds for PBIAS are as follows: within ±5% is “Very Good,” ±5–10% is “Good,” ±10–15% is “Satisfactory,” and greater than ±15% is “Unsatisfactory.”

To identify patterns and groupings among the simulation outcomes, a hierarchical cluster analysis was applied to the NSE and PBIAS scores. Prior to clustering, the performance metrics were normalized using the StandardScaler class from the Python scikit-learn library to ensure comparability in the distance computations. The clustering procedure employed Ward’s linkage method [49,50], which minimizes the total within-cluster variance during agglomeration. The resulting dendrogram visually represented the hierarchical structure of simulation performance, allowing the identification of groups with similar behavior in terms of model accuracy and bias.

2.5. Computational Infrastructure

The exhaustive interaction experiment, necessitated by the physically based and spatially distributed complexity of the MOHID-Land model, required substantial computational resources. A total of 32 simulations were executed on the HPC Escola de Sagres. Its infrastructure operates under a Linux environment and is accessed remotely via SSH protocol, requiring prior compilation of the MOHID-Land source code for Unix-based systems.

Each simulation was configured to utilize ten computational threads. Up to six independent model instances were executed concurrently, effectively leveraging the cluster’s parallel processing capabilities. Despite the high computational demand of the model, the simulations were successfully completed with a relatively modest memory allocation of only 1 GB per process, underscoring the computational efficiency achieved through optimized thread management and parallel scheduling. Further details regarding runtime and overall computational cost are provided in Section Computational Cost.

3. Results

The simultaneous perturbation of all five VGM parameters—in simulation S32 resulted in the highest performance among all evaluated scenarios. This configuration yielded an NSE of 0.50 and the lowest PBIAS of 25.32 (Table 5), thereby establishing S32 as the most effective configuration identified within the proposed experimental framework. This result empirically reinforces the assumption advanced by Sales et al. [11], namely that the simultaneous perturbation of all VGM parameters—even with uniform, fixed-magnitude adjustments—is more effective in capturing the coupled, nonlinear interactions that govern unsaturated flow and soil–water retention processes than isolated or partial combinations of parameter modifications. In contrast, the baseline simulation S1, which employed default parameter values without any perturbation, resulted in substantially lower model performance, with an NSE of 0.01 and a PBIAS of 34.06.

Beyond the assessment of absolute performance for individual simulations (S2–S6), this analysis also aimed to elucidate the nature of interactions among VGM parameters—specifically, to identify potential synergistic, complementary, or antagonistic behaviors. In this context, complementarity is defined as a scenario in which the joint perturbation of two or more parameters yields a performance gain that results from interdependence, even if not strictly amplifying in nature.

A notable complementary interaction emerged between

θ_{s}

and

n

. When perturbed individually,

θ_{s}

(S2: NSE = 0.13; PBIAS = 33.10) and

n

(S5: NSE = 0.33; PBIAS = 30.73) yielded modest improvements in model performance. However, their combined perturbation in simulation S9 resulted in a substantially higher NSE of 0.44 and a reduced PBIAS of 28.66. Although this joint effect exceeded the performance of each parameter individually, it remained slightly below the theoretical sum of their separate contributions (0.13 + 0.33 = 0.46), suggesting a complementary yet sub-additive interaction. This finding underscores a functional coupling between

θ_{s}

and

n

, likely reflecting their joint control over the unsaturated flow regime, particularly through their influence on the soil moisture retention curve and hydraulic conductivity.

Expanding upon the

θ_{s}

–

n

complementarity, the inclusion of a third parameter in selected simulations offered further insights into higher-order interactions. Specifically, configurations involving three-parameter combinations—S20 (

θ_{s}

,

α

,

n

), S22 (

θ_{s}

,

n

,

K_{s a t}

), and S18 (

θ_{s}

,

θ_{r}

,

n

)—exhibited a marginal increase in performance, with NSE values rising to 0.46 and PBIAS narrowing to the range of 27.66–28.07. Though these gains over S9 (NSE = 0.44) are incremental, they suggest that

α

,

K_{s a t}

, and

θ_{r}

provide supportive secondary effects when integrated with the dominant

θ_{s}

–

n

configuration.

Importantly, these parameters—when tested in isolation (e.g., S3 for

θ_{r}

, S4 for

α

, and S6 for

K_{s a t}

)—produced only limited improvements in performance. Their enhanced effectiveness in multi-parameter contexts highlights the nonlinear and interactive nature of soil hydraulic processes. These findings reinforce the notion that hydrological model calibration benefits not only from optimizing dominant parameters, but also from considering emergent interactions that arise in combinatorial configurations.

To further elucidate the relative sensitivity and potential redundancy among the VGM parameters, exclusion tests were performed by systematically omitting one parameter from the full five-parameter high-performing configuration. These four-parameter configurations—S27 (excluding

K_{s a t}

; NSE = 0.49), S30 (excluding

θ_{r}

; NSE = 0.48), and S29 (excluding

α

; NSE = 0.48)—exhibited only marginal reductions in model performance (≤0.02 decrease in NSE) alongside negligible changes in PBIAS relative to the full five-parameter best-performing scenario (S32). Such results indicate that while

K_{s a t}

,

θ_{r}

, and

α

contribute modest incremental gains when individually added to the remaining four-parameter set, their marginal utility diminishes within a near-optimal multi-parameter framework. Hence, the removal of any one of these parameters from the comprehensive high-performing combination exerts a minimal detrimental effect on model fidelity.

Complementing this insight is the pronounced and pivotal influence of the parameter

n

. This is clearly demonstrated by comparing the isolated perturbation of

n

in simulation S5 (NSE = 0.33) with the four-parameter scenario excluding

n

in S28 (NSE = 0.22). The substantial decline in model performance upon removing

n

from the parameter set (S28) underscores its critical role in driving calibration success. Moreover, the fact that

n

alone (S5) outperforms both the four-parameter combination without it (S28) and each of the other parameters individually highlights its uniquely strong and amplifying contribution to the model’s predictive capability. The significant performance gap between S28 and the full five-parameter high-performing scenario S32 further confirms that accurate representation and perturbation of

n

are essential to capturing the complex hydrological processes simulated by MOHID-Land. Collectively, these results emphasize that

n

is a cornerstone parameter, whose inclusion substantially enhances model fidelity and robustness.

Hence, while the full parameter set in simulation S32 delivers the best overall performance, highly effective results can be attained with reduced parameter combinations, especially those including the core parameters

θ_{s}

and

n

. The exclusion of

θ_{s}

,

α

, or

K_{s a t}

results in only marginal performance losses, reinforcing a clear sensitivity hierarchy among the VGM parameters. These findings provide valuable guidance for the development of efficient and computationally cost-effective calibration strategies within distributed hydrological modeling frameworks.

Figure 3 visually reinforces these conclusions by illustrating the distribution of NSE values across simulation scenarios. The height of the NSE bars show a clear pattern of declining model efficiency as parameter combinations become sparser or less optimal. Scenarios approaching the completeness of S32 consistently demonstrate higher NSE values. Notably, the exclusion scenarios S27, S29, and S30 feature NSE bars nearly indistinguishable from that of S32, corroborating the inference that omitting any of these parameters results in minimal impact on model performance.

The behavior of the PBIAS line is notably more erratic, indicating that improvements in NSE are not always accompanied by proportional reductions in volumetric bias. This decoupling underscores the multidimensional nature of model evaluation: whereas NSE captures fidelity in temporal dynamics, PBIAS reflects systematic over or underestimation of flow volumes. Consequently, performance assessments based on a single metric may obscure critical aspects of model behavior. The joint interpretation of NSE and PBIAS—especially when visualized through clustering and graphical diagnostics—therefore provides a more robust and comprehensive framework for evaluating the sensitivity and interactions of hydrological parameters.

In addition to the individual evaluation of NSE and PBIAS metrics, the visual representation provided by the dendrogram (Figure 4) introduces a complementary analytical layer by clustering simulation scenarios based on their joint performance profiles. Constructed from standardized NSE and PBIAS values, this hierarchical clustering simultaneously integrates both metrics, offering a multidimensional assessment of model behavior. The dendrogram distinguishes clusters using colors (orange, green, and blue), highlighting groups of simulations with similar efficiency–bias characteristics. By grouping simulations with comparable efficiency–bias characteristics, the dendrogram uncovers performance patterns that are not always apparent through univariate analysis. As such, it reinforces—and in some instances refines—the insights derived from the direct metric evaluation, providing a more integrative diagnostic perspective on parameter configuration effectiveness.

A clear separation is evident in the dendrogram between the simulations with lower performance (located further to the left, with low NSE values, such as S28, S21, S24) and those that achieved higher efficiency (towards the right, including S32 and associated simulations). This initial clustering structure reflects the model’s sensitivity to the calibration of key parameters, with the baseline simulation (S1) distinctly separated distant from the best-performing group, reinforcing the significance of parameter optimization.

Within the cluster of relatively high-performing simulations, the proximity of S32 (full calibration) to S27 (excluding

K_{s a t}

), S30 (excluding

θ_{r}

), and S29 (excluding

α

) in the dendrogram indicates that, in terms of the combined profile of NSE and PBIAS, the individual exclusion of these parameters results in a comparable overall outcome. This is consistent with marginal performance losses quantified in Table 5 and visualized by the similar NSE bars in Figure 3.

Interestingly, the position of S9 (calibration of

θ_{s}

and

n

) in a distinct yet relatively proximate subgroup to the high-performing configurations may indicate that while this synergistic combination is crucial for good performance, its specific trade-off profile between efficiency and bias (PBIAS of 28.66%) slightly differentiates it from the full calibration profile (PBIAS of 25.32%). The dendrogram suggests that other combinations of three or four parameters may display alternative balances between these metrics, resulting in an overall performance similarity that clusters them closer to S32.

The dispersion of simulations with isolated parameter calibration (S2, S3, S4, S5, S6) along the dendrogram, and their tendency to cluster in branches further from the high-performing group, reinforces the notion that individual parameter calibration, with the notable exception of

n

(S5), is insufficient to attain high model efficiency. The relatively closer position of S5 to the best-performing group in the dendrogram underscores the pronounced influence of the

n

parameter, as previously highlighted in the analysis of Table 5 and Figure 3.

4. Discussion

The results of this study underscore the critical importance of soil parameter calibration in enhancing the performance of the MOHID-Land model. This is clearly evidenced by the stark contrast between the poorly performing baseline simulation (S1), which yielded an NSE of 0.01 and a PBIAS of 34.06, and the significantly improved high-performing configuration (S32). The achievement of an NSE of 0.50 and a PBIAS of 25.32% through the simultaneous perturbation of all five VGM parameters highlights the model’s potential for delivering robust hydrological simulations when appropriately parameterized. These findings reinforce the broader consensus in the literature, highlighting that site-specific parameter optimization is essential for minimizing predictive uncertainty and improving model realism [51,52].

A distinctive contribution of this study lies in its methodological approach: a fully structured, deterministic framework that allowed for the systematic analysis of all five VGM parameters. By exhaustively evaluating every non-empty subset of parameter combinations (31 in total), this design provided a clear, objective, and reproducible assessment of both individual parameter effects and their complex interactions. Such a strategy is especially valuable for computationally demanding models like MOHID-Land, where simulation budgets are constrained and interpretability is critical. Unlike stochastic approaches, such as Monte Carlo or Latin Hypercube Sampling, which often obfuscate interpretation due to their inherent sampling variability, the deterministic, fixed-magnitude perturbation approach (±10%) employed here ensures consistent, scenario-by-scenario comparability, minimizing potential biases from extreme values and enhancing the interpretability of results.

At this stage it is important to acknowledge that alternative approaches—such as ensemble-based data assimilation methods (e.g., Ensemble Kalman Filter or Ensemble Smoother)—could in principle provide additional robustness by explicitly incorporating observational uncertainties into the calibration process [53]. However, due to the high computational cost of MOHID-Land (each simulation requiring more than one day), such methods were unfeasible for this study, highlighting a practical limitation that restricts the exploration of ensemble-based uncertainty analyses. Therefore, the deterministic perturbation framework adopted here represents a pragmatic balance between scientific rigor and computational feasibility, while still enabling a structured and interpretable assessment of parameter interactions.

One of the central insights from this comprehensive analysis is the dominant and functionally complementary role of the parameters

θ_{s}

and

n

, which govern key aspects of soil water dynamics. This finding aligns well with established hydrological theory and previous empirical studies emphasizing the critical influence of these parameters in governing unsaturated flow processes. Specifically,

θ_{s}

primarily increases the total soil water storage capacity by regulating gravitational water retention and baseflow contributions [54,55], effectively determining how much water the soil can hold. In contrast,

n

influences the steepness of the soil water retention curve, modulating the rate of infiltration and accelerating drainage efficiency [56,57]. The functional complementarity observed between

θ_{s}

and

n

thus reflects a crucial balance between enhancing soil water storage and ensuring its timely release, a dynamic essential for accurate representation of unsaturated flow and reliable hydrological simulations.

These findings not only corroborate but also extend the conclusions of Sales et al. [11], who previously identified

n

as the most sensitive parameter in the MOHID framework. Moreover, they complement the results of Verbist et al. [58], whose global sensitivity analysis using Sobol indices highlighted the importance of

n

through total-order effects, particularly in runoff generation. Although Verbist et al. [58] found limited first-order sensitivity for

n

, the present study—applying a deterministic perturbation scheme—demonstrates that

n

exerts substantial influence even when perturbed individually (S5, NSE = 0.33) and, notably, exhibits amplified effects when combined with other functionally synergistic parameters such as

θ_{s}

.

This interpretation provides a nuanced counterpoint to the findings of Pan et al. [59], who reported that the sensitivity of

n

is amplified when its correlation with

α

is considered. However, their analysis employed large and asymmetric parameter ranges, which may have inflated interaction effects. In contrast, the present study’s use of symmetric ±10%, minimizing bias and ensuring that observed interactions reflect intrinsic parameter dynamics rather than sampling artifacts. Nevertheless, this fixed perturbation range may constrain the exploration of extreme sensitivity scenarios, representing a methodological limitation of the chosen approach. This methodological refinement not only corroborates the isolated sensitivity of

n

noted by Sales et al. [11] but also reinforces the practical relevance of

n

in controlled calibration schemes. For instance, in simulation S14, where both

n

and α were perturbed, an NSE of 0.35 was achieved—exceeding the isolated performance of either parameter (

n

: 0.33;

α

: 0.03), but still below the core

θ_{s}

–

n

configuration (S9: NSE = 0.44). This indicates that while

α

has marginal influence on its own (S4: NSE = 0.03), it can offer incremental gains when combined with more dominant parameters. This context-dependent interaction further highlights the utility of structured perturbation frameworks in isolating parameter effects with clarity, something often obscured in stochastic methodologies.

Combinatorial design also allowed for a structured assessment of the relative influence of each VGM parameter under controlled calibration settings. While the omission of

θ_{r}

,

α

, or

K_{s a t}

from the high-performance configuration led to only marginal losses in model performance, this result should not be interpreted as evidence of functional irrelevance.

The hydrological functions of

θ_{r}

,

α

, and

K_{s a t}

are well established in the literature.

θ_{r}

governs residual water content and contributes to water availability under dry conditions [60,61].

α

regulates the inverse of the air-entry pressure and is particularly relevant during post-dry infiltration events where steep wetting fronts interact with the capillary fringe [11].

K_{s a t}

controls the infiltration–runoff partitioning during high-intensity rainfall [60,61].

Their relatively subdued influence on isolation, as observed here, may be attributed to two plausible, non-exclusive factors. First, the ±10% perturbation range—while ensuring physical plausibility and cross-scenario comparability—may not be sufficient to fully activate the dynamic effects of these parameters, especially in the hydrological regime studied. Second, the baseline parameter estimates, derived from pedotransfer functions (e.g., Rosetta) and soil datasets (e.g., EMBRAPA), may carry non-negligible uncertainty. If the initial values are not well aligned with local soil characteristics, small symmetric perturbations around such biased estimates may fail to explore regions of greater sensitivity—not because the parameters are functionally inert, but because the reference point is suboptimal.

In addition, it is important to recognize that observational streamflow data also carry non-negligible measurement uncertainty, which was not explicitly accounted for in this deterministic framework. While the purpose here was to isolate parameter-driven variability, unmodeled observational errors may partly influence performance scores such as NSE and PBIAS. This highlights a limitation regarding the potential impact of data uncertainty on quantitative evaluation of model performance.

This interpretation aligns with foundational insights from sensitivity theory, which emphasizes the role of parameter interaction, identifiability, and structural uncertainty in complex hydrological models [62,63]. Rather than undermining the findings, these considerations underscore their methodological transparency and practical relevance. The deterministic framework adopted here provides a conservative yet robust baseline, and future investigations may benefit from expanding perturbation ranges within physically realistic bounds, or from incorporating site-specific measurements to refine initial parameterization.

Importantly, while the inclusion of

θ_{r}

, α, and

K_{s a t}

in specific triadic combinations with the dominant pair

θ_{s}

–

n

(e.g.,

θ_{s}

–α–

n

,

θ_{s}

–

n

–

K_{s a t}

, and

θ_{s}

–

θ_{r}

–

n

) yielded slight improvements in model performance, these gains were marginal—typically increasing NSE from 0.44 to 0.46. Such incremental enhancement suggests that these secondary parameters exert a limited additive influence under the hydrological and structural conditions assessed. Their contribution appears to fine-tune, rather than transform, the model’s behavior, indicating that their calibration may be most impactful when core hydrophysical processes—primarily governed by

θ_{s}

and

n

—have already been well resolved.

This restrained response further reinforces the centrality of

θ_{s}

and

n

as the primary levers of model fidelity in the MOHID-Land framework, at least under the parameter ranges and boundary conditions tested. Rather than undermining the importance of

θ_{r}

,

α

, and

K_{s a t}

, these findings indicate that their influence operates primarily through hydrological accommodation—refining model behavior under transitional or boundary conditions rather than driving core flow dynamics. Their calibration priority, therefore, should be context sensitive.

These quantitative results are further corroborated by the general hydrological behavior associated with VGM parameter perturbations, as synthesized by Sales et al. [11] and summarized in Table 6. This framework classifies parameter sensitivity and elucidates how specific perturbations propagate to streamflow variations under contrasting wet and dry hydrological conditions, where we consider the wet period as corresponding to October–March and the dry period as April–September, which is typical of the tropical regime [11,21].

A key pattern that emerges is that the effect of perturbation is consistently inverse between wet and dry periods. Increases in

n

,

θ_{s}

, and

K_{s a t}

reduce streamflow under wet conditions (attenuating peak flows) while enhancing streamflow under dry conditions (supporting baseflow). Conversely, decreases in

θ_{r}

and

α

produce the same dual outcome: lower streamflow under wet conditions and higher streamflow under dry conditions.

This consistency across parameters is particularly relevant for calibration. It means that, despite their differing sensitivities, the parameters converge toward the same management implication: increasing

n

,

θ_{s}

, and

K_{s a t}

while reducing

θ_{r}

and

α

simultaneously contributes to dampening wet-season peaks and strengthening dry-season baseflows. Such adjustments reflect the mechanistic roles of these parameters in governing soil water storage, retention, and release dynamics.

It is worth noting that, if the calibration goal were the opposite—namely, to reduce baseflow during dry periods while increasing streamflow in wet periods—the direction of parameter adjustments identified in Table 6 could simply be inverted. In this case, decreasing

n

,

θ_{s}

, and

K_{s a t}

while increasing

θ_{r}

and

α

would achieve the desired hydrological effect.

By linking quantitative sensitivity results to the hydrological logic summarized in Table 6, this study not only reinforces the validity of the findings but also provides a clear guideline for prioritizing parameter adjustments in calibration efforts aimed at reconciling peak and baseflow dynamics.

Taking these findings together, these findings substantially advance prior research by integrating a computationally efficient and methodologically transparent approach to sensitivity analysis in hydrological modeling. The adoption of uniform, controlled perturbation magnitudes proved instrumental in minimizing distortions from extreme values while enhancing comparability of results. It is, however, important to acknowledge that the ±10% perturbation range, although physically plausible, may underrepresent the full behavioral spectrum of parameters such as

θ_{s}

and

n

—particularly given their persistent dominance in both this and previous studies [11], and that initial parameter uncertainties and observational errors could influence the quantified sensitivities. Consequently, expanding the perturbation range in future studies—within physically defensible bounds—could provide additional insights into model behavior and sensitivity gradients. Nevertheless, the present findings provide a practical and actionable framework for prioritizing parameter calibration under constrained computational resources. In particular, the consistent and substantial influence of

θ_{s}

and

n

across all interaction levels underscores their status as core calibration targets. By isolating and quantifying these effects with methodological rigor, this study contributes not only to improved simulation performance but also to a deeper mechanistic understanding of unsaturated flow processes in complex hydrological systems like MOHID-Land.

Computational Cost

The average runtime per simulation was approximately 49.65 h, resulting in a cumulative execution time of 1588.83 h across the 32 model runs. Conducted over approximately 13 consecutive days, this timeline demonstrates the practical viability of exhaustive combinatorial calibration when supported by high-performance computing (HPC) infrastructure.

A clear benchmark for evaluating this computational strategy is provided by Sales et al. [11], who performed a similar simulation-based experiment using four conventional personal computers with heterogeneous hardware configurations. Despite conducting only 11 simulations over the same hydrological period, their experiment required a total of 1998 h of processing time—averaging 182 h per machine—and spanned approximately 45 uninterrupted days.

This stark contrast underscores the strategic advantage of centralized HPC environments, which significantly reduces runtime, improves throughput, and enables deeper model exploration. Notably, the accelerated computation afforded by parallel processing was crucial for executing the full combinatorial design—an approach that would likely be impractical under standard multi-PC configurations.

Beyond efficiency gains, the HPC-based methodology enhances transparency, reproducibility, and scientific rigor. It enables other researchers to replicate or extend the experimental design with minimal variation, thereby aligning with emerging standards in robust and open hydrological modeling. This underscores the relevance of dedicated computational infrastructure as a catalyst for advancing methodological innovation in hydrological calibration.

5. Conclusions

The findings of this study underscore the critical importance of proper parameterization in the MOHID-Land model for achieving robust and reliable hydrological simulations. “Among the five VGM parameters evaluated,

θ_{s}

and

n

consistently emerged as the most influential. They exert dominant control across all tested scenarios, highlighting them as foundational targets for calibration. This allows a favorable balance between predictive accuracy and computational cost, which is particularly relevant for large-scale or operational applications of distributed hydrological models.

The inclusion of

θ_{r}

,

α

, or

K_{s a t}

into the core

θ_{s}

–

n

combination produced only marginal improvements, indicating additive rather than synergistic effects, and even in four-parameter configurations, the incremental gains remained limited, suggesting that targeted three-parameter combinations may provide a practical compromise between model fidelity and computational efficiency. While the full five-parameter calibration yielded the highest overall performance, the incremental gain over the best triadic configurations was relatively small, reinforcing the central role of

θ_{s}

and

n

.

Importantly, some limitations should be acknowledged: the ±10% perturbation range may underrepresent the full sensitivity of

θ_{s}

and

n

, initial parameter estimates derived from pedotransfer functions or soil datasets may carry biases, observational uncertainties in streamflow were not explicitly modeled, and computational constraints restricted the use of ensemble-based or stochastic calibration approaches. Future studies could expand perturbation ranges, incorporate site-specific measurements, and account for observational uncertainties to refine sensitivity analyses and improve model robustness.

Based on these findings, we recommend a two-step soil calibration strategy: first, an initial multi-parameter exploration within physically plausible perturbation ranges to broadly sample the response surface; second, a focused refinement stage targeting

θ_{s}

and

n

to optimize model performance efficiently. This structured approach balances hydrological realism with operational feasibility while providing a transparent and replicable framework.

In summary, this study delivers empirical evidence and methodological guidance for deterministic soil calibration in distributed hydrological models, highlights calibration priorities, clarifies parameter interactions, and acknowledges methodological limitations, providing actionable insights for improving predictive accuracy in complex watershed systems. We also encourage future work to integrate recent advancements in sensitivity analysis and data assimilation methods to further enhance calibration practices.

Author Contributions

D.d.S.S.: conceptualization, data curation, investigation, methodology, software, validation, visualization, and writing—original draft. D.d.A.C.: supervision, validation, visualization, and writing—review and editing. J.L.J.: conceptualization, methodology, supervision, validation, visualization, and writing—review and editing. R.J.N.: conceptualization, validation, and writing—review and editing. A.J.d.S.N.: project administration, supervision, and writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

The authors gratefully acknowledge the financial support provided in the form of grants by the following Brazilian agencies: FAPERJ, Carlos Chagas Filho Foundation for Research Support of the State of Rio de Janeiro; CNPq, National Council for Scientific and Technological Development; CAPES, Coordination for the Improvement of Higher Education Personnel (Finance Code 001).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

The authors would like to thank the High-Performance Computer at the Escola de Sagres, Poly-technic Institute, Rio de Janeiro State University (RJSU, or UERJ in Portuguese), linked to the Atlantic International Research Centre (AIR Centre). We also acknowledge our colleagues at the Center for Environmental and Marine Science and Technology (MARETEC), Instituto Superior Técnico (IST), University of Lisbon, for hosting the first author as a visiting researcher. Additionally, we thank the Federal Fluminense Institute (FFI or IFF in Portuguese), where the first author completed his PhD.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AMA	Absolute Moment Approximation
ANA	National Water and Sanitation Agency
CN	Curve Number
EIBEX	Integrated Studies in Experimental and Representative Basins
EMBRAPA	Brazilian Agricultural Research Corporation
EPIC	Environmental Policy Integrated Climate
FAO	Food and Agriculture Organization
FVM	Finite Volume Method
HPC	High-Performance Computing
LAI	Leaf area index
NSE	Nash–Sutcliffe Efficiency
PBIAS	Percentage Bias
SGB	Brazilian Geological Survey
SiBCS	Brazilian Soil Classification System
SWAT	Soil and Water Assessment Tool
UERJ	Universidade do Estado do Rio de Janeiro
VGM	van Genuchten–Mualem

Appendix A

Table A1. Soil properties (EMBRAPA) and corresponding hydraulic parameters generated by the Rosetta pedotransfer model for each soil type and layer—Generated in MOHID Soil Tool.

Layer	Soil Type (Polygons)	EMBRAPA Input				Rosetta Output
Layer	Soil Type (Polygons)	Sand (%)	Silt (%)	Clay (%)	Density (g/cm³)	$θ_{r}$ (m³/m³)	$θ_{s}$ (m³/m³)	$n$ (-)	$α$ (m⁻¹)	$K_{s a t}$ (m/s)
0–5 cm	AR2—Rock outcrops	51.82	19.34	28.84	1.08	0.106	0.512	1.364	1.033	8.407 × 10⁻⁶
0–5 cm	AR3—Rock outcrops	51.02	18.47	30.5	1.06	0.109	0.521	1.356	1.042	8.805 × 10⁻⁶
0–5 cm	Ca1—Allic Cambisol	53.1	17.19	29.71	1.07	0.108	0.518	1.357	1.088	8.930 × 10⁻⁶
0–5 cm	Ca2—Allic Cambisol	50.18	16.9	32.92	1.11	0.111	0.514	1.344	1.094	7.218 × 10⁻⁶
0–5 cm	Ca6—Allic Cambisol	48.37	17.2	34.43	1.15	0.112	0.506	1.338	1.092	5.989 × 10⁻⁶
0–5 cm	Ca7—Allic Cambisol	46.27	16.9	36.83	1.13	0.116	0.516	1.330	1.086	6.199 × 10⁻⁶
0–5 cm	LVa10—Allic Red-Yellow Latosol	46.37	18.12	35.52	1.18	0.113	0.500	1.334	1.072	5.102 × 10⁻⁶
0–5 cm	LVa14—Allic Red-Yellow Latosol	46.9	16.36	36.74	1.18	0.115	0.503	1.328	1.116	5.209 × 10⁻⁶
0–5 cm	Ra—Allic Litholic Soils	52.79	18.22	29	1.07	0.107	0.516	1.362	1.061	8.894 × 10⁻⁶
0–5 cm	Urban area	45.89	18.8	35.31	1.19	0.113	0.496	1.335	1.057	4.841 × 10⁻⁶
5–15 cm	AR2—Rock outcrops	50.58	20.44	28.98	1.04	0.107	0.522	1.365	0.985	9.429 × 10⁻⁶
5–15 cm	AR3—Rock outcrops	49.53	19.61	30.86	1.04	0.110	0.526	1.356	1.001	9.163 × 10⁻⁶
5–15 cm	Ca1—Allic Cambisol	51.96	19.3	28.74	1.05	0.107	0.520	1.364	1.023	9.372 × 10⁻⁶
5–15 cm	Ca2—Allic Cambisol	49.08	17.86	33.06	1.08	0.112	0.521	1.345	1.055	7.841 × 10⁻⁶
5–15 cm	Ca6—Allic Cambisol	47.49	18.07	34.43	1.12	0.113	0.513	1.340	1.058	6.563 × 10⁻⁶
5–15 cm	Ca7—Allic Cambisol	48.37	17.17	34.46	1.12	0.113	0.514	1.339	1.083	6.690 × 10⁻⁶
5–15 cm	LVa10—Allic Red-Yellow Latosol	45.12	19.11	35.77	1.16	0.114	0.505	1.336	1.039	5.365 × 10⁻⁶
5–15 cm	LVa14—Allic Red-Yellow Latosol	48.01	16.12	35.87	1.17	0.114	0.504	1.331	1.123	5.539 × 10⁻⁶
5–15 cm	Ra—Allic Litholic Soils	50.96	19.39	29.66	1.04	0.108	0.524	1.360	1.012	9.458 × 10⁻⁶
5–15 cm	Urban area	45.34	19.03	35.63	1.17	0.113	0.502	1.336	1.044	5.180 × 10⁻⁶
15–30 cm	AR2—Rock outcrops	49.56	17.8	32.64	1.09	0.111	0.518	1.347	1.062	7.644 × 10⁻⁶
15–30 cm	AR3—Rock outcrops	48.57	16.71	34.72	1.09	0.114	0.522	1.338	1.085	7.487 × 10⁻⁶
15–30 cm	Ca1—Allic Cambisol	52.48	15.66	31.86	1.07	0.111	0.523	1.347	1.121	8.736 × 10⁻⁶
15–30 cm	Ca2—Allic Cambisol	49.25	15.04	35.71	1.12	0.115	0.517	1.332	1.138	6.840 × 10⁻⁶
15–30 cm	Ca6—Allic Cambisol	47.33	15.81	36.86	1.16	0.115	0.509	1.328	1.124	5.679 × 10⁻⁶
15–30 cm	Ca7—Allic Cambisol	46.09	15.24	38.67	1.17	0.117	0.509	1.320	1.136	5.360 × 10⁻⁶
15–30 cm	LVa10—Allic Red-Yellow Latosol	44.28	17.04	38.68	1.19	0.117	0.503	1.322	1.094	4.756 × 10⁻⁶
15–30 cm	LVa14—Allic Red-Yellow Latosol	47.62	14.66	37.72	1.2	0.116	0.501	1.322	1.165	4.949 × 10⁻⁶
15–30 cm	Ra—Allic Litholic Soils	50.52	16.43	33.06	1.08	0.112	0.522	1.343	1.096	8.075 × 10⁻⁶
15–30 cm	Urban area	45.53	17.1	37.37	1.2	0.115	0.498	1.326	1.099	4.674 × 10⁻⁶
30–60 cm	AR2—Rock outcrops	46.36	19.1	34.54	1.15	0.113	0.505	1.340	1.040	5.708 × 10⁻⁶
30–60 cm	AR3—Rock outcrops	46.36	17.9	35.74	1.16	0.114	0.505	1.334	1.071	5.520 × 10⁻⁶
30–60 cm	Ca1—Allic Cambisol	51.05	17.73	31.23	1.15	0.108	0.500	1.351	1.093	6.378 × 10⁻⁶
30–60 cm	Ca2—Allic Cambisol	45.8	16.59	37.61	1.18	0.116	0.504	1.325	1.106	5.094 × 10⁻⁶
30–60 cm	Ca6—Allic Cambisol	43.76	15.85	40.39	1.21	0.118	0.502	1.314	1.125	4.395 × 10⁻⁶
30–60 cm	Ca7—Allic Cambisol	43.52	15.35	41.13	1.22	0.119	0.501	1.310	1.138	4.224 × 10⁻⁶
30–60 cm	LVa10—Allic Red-Yellow Latosol	40.53	16.02	43.45	1.21	0.122	0.507	1.305	1.112	4.170 × 10⁻⁶
30–60 cm	LVa14—Allic Red-Yellow Latosol	43	14.09	42.91	1.25	0.121	0.497	1.301	1.171	3.756 × 10⁻⁶
30–60 cm	Ra—Allic Litholic Soils	47.52	17.76	34.72	1.15	0.113	0.506	1.338	1.075	5.871 × 10⁻⁶
30–60 cm	Urban area	40.26	15.9	43.84	1.23	0.122	0.502	1.302	1.118	3.836 × 10⁻⁶
60–100 cm	AR2—Rock outcrops	55.45	16.01	28.53	1.25	0.102	0.472	1.359	1.211	5.097 × 10⁻⁶
60–100 cm	AR3—Rock outcrops	55.25	15.24	29.51	1.26	0.104	0.471	1.353	1.230	4.877 × 10⁻⁶
60–100 cm	Ca1—Allic Cambisol	59.44	15.43	25.13	1.25	0.097	0.466	1.379	1.272	5.973 × 10⁻⁶
60–100 cm	Ca2—Allic Cambisol	55.29	13.92	30.79	1.26	0.105	0.475	1.346	1.259	4.898 × 10⁻⁶
60–100 cm	Ca6—Allic Cambisol	51.07	13.16	35.77	1.26	0.112	0.483	1.324	1.240	4.328 × 10⁻⁶
60–100 cm	Ca7—Allic Cambisol	50.91	11.99	37.1	1.25	0.114	0.489	1.319	1.262	4.511 × 10⁻⁶
60–100 cm	LVa10—Allic Red-Yellow Latosol	44.7	13.56	41.74	1.23	0.120	0.500	1.306	1.185	4.205 × 10⁻⁶
60–100 cm	LVa14—Allic Red-Yellow Latosol	48.77	12.54	38.69	1.27	0.116	0.486	1.312	1.241	3.953 × 10⁻⁶
60–100 cm	Ra—Allic Litholic Soils	56.81	15.55	27.64	1.25	0.101	0.470	1.364	1.237	5.363 × 10⁻⁶
60–100 cm	Urban area	45.44	14.08	40.48	1.25	0.118	0.493	1.309	1.182	3.927 × 10⁻⁶
100–200 cm	AR2—Rock outcrops	48.54	13.81	37.65	1.25	0.115	0.488	1.318	1.204	4.201 × 10⁻⁶
100–200 cm	AR3—Rock outcrops	49.81	12.53	37.66	1.25	0.115	0.489	1.317	1.242	4.372 × 10⁻⁶
100–200 cm	Ca1—Allic Cambisol	52.47	12.71	34.82	1.24	0.111	0.487	1.329	1.254	4.858 × 10⁻⁶
100–200 cm	Ca2—Allic Cambisol	52.38	11.66	35.97	1.25	0.113	0.487	1.323	1.281	4.698 × 10⁻⁶
100–200 cm	Ca6—Allic Cambisol	48.93	12.36	38.72	1.25	0.116	0.491	1.313	1.240	4.287 × 10⁻⁶
100–200 cm	Ca7—Allic Cambisol	49.45	10.98	39.57	1.25	0.117	0.493	1.309	1.275	4.385 × 10⁻⁶
100–200 cm	LVa10—Allic Red-Yellow Latosol	45.55	13.22	41.23	1.24	0.119	0.497	1.306	1.199	4.122 × 10⁻⁶
100–200 cm	LVa14—Allic Red-Yellow Latosol	50.4	11.49	38.11	1.27	0.115	0.486	1.313	1.276	4.149 × 10⁻⁶
100–200 cm	Ra—Allic Litholic Soils	50.51	12.57	36.92	1.24	0.114	0.491	1.320	1.243	4.616 × 10⁻⁶
100–200 cm	Urban area	47.28	12.84	39.88	1.26	0.117	0.490	1.309	1.222	3.964 × 10⁻⁶

References

Keller, A.A.; Garner, K.; Rao, N.; Knipping, E.; Thomas, J. Hydrological models for climate-based assessments at the watershed scale: A critical review of existing hydrologic and water quality models. Sci. Total Environ. 2023, 867, 161209. [Google Scholar] [CrossRef] [PubMed]
Liu, D.; Liu, H.; Meng, X. Advanced Hydrologic Modeling in Watershed Scale. Water 2023, 15, 691. [Google Scholar] [CrossRef]
Simionesei, L.; Ramos, T.B.; Oliveira, A.R.; Jongen, M.; Darouich, H.; Weber, K.; Neves, R. Modeling soil water dynamics and pasture growth in the Montado ecosystem using MOHID land. Water 2018, 10, 489. [Google Scholar] [CrossRef]
Sarker, S.; Leta, O.T. Review of Watershed Hydrology and Mathematical Models. Eng 2025, 6, 129. [Google Scholar] [CrossRef]
Brighenti, T.M.; Bonumá, N.B.; Srinivasan, R.; Chaffe, P.L.B. Simulating sub-daily hydrological process with SWAT: A review. Hydrol. Sci. J. 2019, 64, 1415–1423. [Google Scholar] [CrossRef]
Richard, L.A. Capillary conduction of liquids through porous mediums. Physics 1931, 1, 318–333. [Google Scholar] [CrossRef]
van Genuchten, M.T. A closed-form equation for predicting the hydraulic conductivity of unsaturated soils. Soil Sci. Soc. Am. J. 1980, 44, 892–898. [Google Scholar] [CrossRef]
Mualem, Y. A new model for predicting the hydraulic conductivity of unsaturated porous media. Water Resour. Res. 1976, 12, 513–522. [Google Scholar] [CrossRef]
Simionesei, L.; Ramos, T.B.; Brito, D.; Jauch, E.; Leitão, P.C.; Almeida, C.; Neves, R. Numerical Simulation of Soil Water Dynamics Under Stationary Sprinkler Irrigation with Mohid-Land. Irrig. Drain. 2016, 65, 98–111. [Google Scholar] [CrossRef]
Chambel-Leitão, P.; Ramos, T.; Domingos, T.; Neves, R. MOHID Land-Porous Media, a tool for modeling soil hydrology at plot scale and watershed scale. Open Hydrol. J. 2015, 9, 1–12. [Google Scholar] [CrossRef]
Sales, D.S.; Lugon Junior, J.; Costa, D.A.; Sales, R.S.B.; Neves, R.J.; Silva Neto, A.J. Sensitivity Analysis of Soil Hydraulic Parameters for Improved Flow Predictions in an Atlantic Forest Watershed Using the MOHID-Land Platform. Eng 2025, 6, 65. [Google Scholar] [CrossRef]
Oliveira, A.R.; Ramos, T.B.; Simionesei, L.; Pinto, L.; Neves, R. Sensitivity analysis of the MOHID-Land hydrological model: A case study of the Ulla river basin. Water 2020, 12, 3258. [Google Scholar] [CrossRef]
Wu, Q.; Liu, S.; Cai, Y.; Li, X.; Jiang, Y. Improvement of hydrological model calibration by selecting multiple parameter ranges. Hydrol. Earth Syst. Sci. 2017, 21, 393–407. [Google Scholar] [CrossRef]
Sysoev, A. Sensitivity analysis of mathematical models. Computation 2023, 11, 159. [Google Scholar] [CrossRef]
Semiromi, M.T.; Omidvar, S.; Kamali, B. Reducing computational costs of automatic calibration of rainfall-runoff models: Meta-models or high-performance computers? Water 2018, 10, 1440. [Google Scholar] [CrossRef]
Dai, H.; Liu, Y.; Guadagnini, A.; Yuan, S.; Yang, J.; Ye, M. Comparative assessment of two global sensitivity approaches considering model and parameter uncertainty. Water Resour. Res. 2024, 60, e2023WR036096. [Google Scholar] [CrossRef]
Peng, F.; Sun, G. Identifying sensitive model parameter combinations for uncertainties in land surface process simulations over the Tibetan Plateau. Water 2019, 11, 1724. [Google Scholar] [CrossRef]
Villas-Boas, M.D.; Olivera, F.; de Azevedo, J.P.S. Assessment of the water quality monitoring network of the Piabanha River experimental watersheds in Rio de Janeiro, Brazil, using autoassociative neural networks. Environ. Monit. Assess. 2017, 189, 439. [Google Scholar] [CrossRef]
IBGE—Instituto Brasileiro de Geografia e Estatística Cidades. Petropolis. 2022. Available online: https://cidades.ibge.gov.br/brasil/rj/petropolis/panorama (accessed on 3 March 2025).
Nuruzzaman, M.; Bahar, M.M.; Naidu, R. Diffuse soil pollution from agriculture: Impacts and remediation. Sci. Total Environ. 2025, 962, 178398. [Google Scholar] [CrossRef]
Costa, D.; Bayissa, Y.; Villas-Boas, M.D.; Maskey, S.; Junior, J.L.; da Silva Neto, A.J.; Srinivasan, R. Water availability and extreme events under climate change scenarios in an experimental watershed of the Brazilian Atlantic Forest. Sci. Total Environ. 2024, 946, 174417. [Google Scholar] [CrossRef]
Costa, D.; Bayissa, Y.; Sales, D.; Dias, R.M.M.S.; Lugon Junior, J.; Silva Neto, A.J.; Srinivasan, R. Spatial and temporal variability of precipitation in a mountainous watershed using weighted interpolation by distance and elevation. In Proceedings of the ENSUS 2024-XII Encontro de Sustentabilidade em Projeto, Belo Horizonte, Brazil, 7–9 June 2024; Volume 12, pp. 602–610. [Google Scholar]
Carvalho-Filho, A.; Lumbreras, J.F.; Santos, D.S. Os Solos do Estado do Rio de Janeiro; Estudo Geoambiental do Estado do Rio de Janeiro; CPRM/MMA/EMBRAPA/CNPS: Brasília, Brazil, 2000; Available online: https://www.alice.cnptia.embrapa.br/alice/handle/doc/1090208 (accessed on 3 March 2025).
Embrapa. Sistema Brasileiro de Classificação de Solos; Centro Nacional de Pesquisa de Solos: Rio de Janeiro, Brazil, 2013; Volume 3. [Google Scholar]
Sales, D.S.; Lugon Junior, J.; Oliveira, V.P.; Silva Neto, A.J. Rainfall input from WRF-ARW atmospheric model coupled with MOHID land hydrological model for flow simulation in the Paraíba do Sul River-Brazil. J. Urban Environ. Eng. 2021, 15, 188–203. [Google Scholar]
Williams, J.R.; Jones, C.A.; Kiniry, J.R.; Spanel, D.A. The EPIC crop growth model. Trans. ASAE 1989, 32, 497–511. [Google Scholar] [CrossRef]
Feddes, R.A.; Kowalik, P.J.; Zaradny, H. Simulation of Field Water Use and Crop Yield; Wiley: Hoboken, NJ, USA, 1978. [Google Scholar]
Allen, R.G.; Pereira, L.S.; Raes, D.; Smith, M. Crop Evapotranspiration-Guidelines for Computing Crop Water Requirements-FAO Irrigation and Drainage Paper 56; Food and Agriculture Organization: Rome, Italy, 1998; Volume 300. [Google Scholar]
Bear, J. Dynamics of Fluids in Porous Media; Courier Corporation: North Chelmsford, MA, USA, 2013. [Google Scholar]
Hillel, D. Environmental Soil Physics: Fundamentals, Applications, and Environmental Considerations; Elsevier Science: Amsterdam, The Netherlands, 2014. [Google Scholar]
Eliassi, M.; Glass, R.J. On the continuum-scale modeling of gravity-driven fingers in unsaturated porous media: The inadequacy of the Richards equation with standard monotonic constitutive relations and hysteretic equations of state. Water Resour. Res. 2001, 37, 2019–2035. [Google Scholar] [CrossRef]
Fürst, T.; Vodák, R.; Šír, M.; Bíl, M. On the incompatibility of Richards’ equation and finger-like infiltration in unsaturated homogeneous porous media. Water Resour. Res. 2009, 45, 3. [Google Scholar] [CrossRef]
Cueto-Felgueroso, L.; Juanes, R. Stability analysis of a phase-field model of gravity-driven unsaturated flow through porous media. Phys. Rev. E Stat. Nonlinear Soft Matter Phys. 2009, 79, 036301. [Google Scholar] [CrossRef]
Sales, D.S.; Lugon Junior, J.; Costa, D.A.; Oliveira, A.R.; Pereira, D.R.; Neves, R.; Silva Neto, A.J. Assessing the Impact of Anisotropic Hydraulic Conductivity on Lateral Flow for Streamflow Predictions in a Representative Atlantic Forest Watershed. Rev. Cereus 2025, 17, 395–410. [Google Scholar] [CrossRef]
Valeriano, M.M.; Rossetti, D.F. Topodata: Brazilian full coverage refinement of SRTM data. Appl. Geogr. 2012, 32, 300–309. [Google Scholar] [CrossRef]
Souza, C.M.; Shimbo, J.Z.; Rosa, M.R.; Parente, L.L.; Alencar, A.A.; Rudorff, B.F.T.; Azevedo, T. Reconstructing three decades of land use and land cover changes in Brazilian biomes with landsat archive and earth engine. Remote Sens. 2020, 12, 2735. [Google Scholar] [CrossRef]
Chow, V.T. Open-Channel Hydraulics; Elsevier Science: Amsterdam, The Netherlands, 1959. [Google Scholar]
Šimunek, J.; Šejna, M.; Van Genuchten, M.T. The HYDRUS-1D Software Package for Simulating the One-Dimensional Movement of Water, Heat, and Multiple Solutes in Variably-Saturated Media; Version 2.0; Rep. IGWMC-TPS; CSIRO Land and Water: Clayton, Australia, 1998; Volume 70, p. 202. [Google Scholar]
Grinevskii, S.O. Modeling root water uptake when calculating unsaturated flow in the vadose zone and groundwater recharge. Mosc. Univ. Geol. Bull. 2011, 66, 189–201. [Google Scholar] [CrossRef]
Vasques, G.M.; Coelho, M.R.; Dart, R.O.; Cintra, L.C.; Baca, J.F.M. Soil Clay, Silt and Sand Content Maps for Brazil at 0–5, 5–15, 15–30, 30–60, 60–100 and 100–200 cm Depth Intervals with 90 m Spatial Resolution; Embrapa Solos: Rio de Janeiro, Brazil, 2021. [Google Scholar]
Vasques, G.M.; Coelho, M.R.; Dart, R.O.; Cintra, L.C.; Baca, J.F.M. Soil Bulk Density Maps for Brazil at 0–5, 5–15, 15–30, 30–60, 60–100 and 100–200 cm Depth Intervals with 90 m Spatial Resolution; Embrapa Solos: Rio de Janeiro, Brazil, 2021. [Google Scholar]
Hersbach, H.; Bell, B.; Berrisford, P.; Hirahara, S.; Horányi, A.; Muñoz-Sabater, J.; Thépaut, J.N. The era5 global reanalysis. Q. J. R. Meteorol. Soc. 2020, 146, 1999–2049. [Google Scholar] [CrossRef]
Liu, J.; Hagan, D.F.T.; Liu, Y. Global land surface temperature change (2003–2017) and its relationship with climate drivers: AIRS, MODIS, and ERA5-land based analysis. Remote Sens. 2020, 13, 44. [Google Scholar] [CrossRef]
Braga, R.A.H.W.; Santos, E.B.; Barros, M.F.D. Validação de dados de vento da reanálise ERA5-LAND Para estimativa de potencial eólico no Estado do Rio de Janeiro. Rev. Bras. Energ. 2021, 27, 142–166. [Google Scholar] [CrossRef]
de Araújo, C.S.P.; Silva, I.A.C.; Ippolito, M.; de Almeida, C.D.G.C. Evaluation of air temperature estimated by ERA5-land reanalysis using surface data in Pernambuco, Brazil. Environ. Monit. Assess. 2022, 194, 381. [Google Scholar] [CrossRef]
Matsunaga, W.K.; Sales, E.S.G.; Júnior, G.C.A.; Silva, M.T.; Lacerda, F.F.; de Paiva Lima, E.; dos Santos, C.A.C.; de Brito, J.I.B. Application of ERA5-land reanalysis data in zoning of climate risk for corn in the state of Bahia—Brazil. Theor. Appl. Climatol. 2023, 155, 945–963. [Google Scholar] [CrossRef]
Maskey, S. HyKit: A Tool for Grid-Based Interpolation of Hydrological Variables; User’s Guide (Version 1.3); IHE Delft Institute for Water Education: Delft, The Netherlands, 2013; pp. 1–6. [Google Scholar]
Moriasi, D.N.; Gitau, M.W.; Pai, N.; Daggupati, P. Hydrologic and water quality models: Performance measures and evaluation criteria. Trans. ASABE 2015, 58, 1763–1785. [Google Scholar] [CrossRef]
Ward, J.H., Jr. Hierarchical grouping to optimize an objective function. J. Am. Stat. Assoc. 1963, 58, 236–244. [Google Scholar] [CrossRef]
Modiri, E.; Bárdossy, A. Clustering simultaneous occurrences of the extreme floods in the Neckar catchment. Water 2021, 13, 399. [Google Scholar] [CrossRef]
Andraos, C. Breaking Uncertainty Barriers: Approximate Bayesian Computation Advances in Rainfall–Runoff Modeling. Water 2024, 16, 3499. [Google Scholar] [CrossRef]
Ziarh, G.F.; Kim, J.H.; Song, J.Y.; Chung, E.S. Quantifying Uncertainty in Runoff Simulation According to Multiple Evaluation Metrics and Varying Calibration Data Length. Water 2024, 16, 517. [Google Scholar] [CrossRef]
Zhang, J.; Cao, C.; Nan, T.; Ju, L.; Zhou, H.; Zeng, L. A Novel Deep Learning Approach for Data Assimilation of Complex Hydrological Systems. Water Resour. Res. 2024, 60, e2023WR035389. [Google Scholar] [CrossRef]
Trejo-Alonso, J.; Fuentes, S.; Morales-Durán, N.; Chávez, C. Evaluation and Development of Pedotransfer Functions and Artificial Neural Networks to Saturation Moisture Content Estimation. Water 2023, 15, 220. [Google Scholar] [CrossRef]
Dietrich, O.; Fahle, M.; Steidl, J. The Role of the Unsaturated Zone for Rainwater Retention and Runoff at a Drained Wetland Site. Water 2019, 11, 1404. [Google Scholar] [CrossRef]
Rattan, B.; Garg, A.; Sekharan, S.; Sahoo, L. Developing an environmental friendly approach for enhancing water retention with the amendment of water-absorbing polymer and fertilizers. Cent. Asian J. Water Res. 2023, 9, 113–129. [Google Scholar] [CrossRef]
Zhang, H.; Bian, J.; Wan, H.; Wei, N.; Ma, Y. Soil–water characteristic curves of extracellular polymeric substances-affected soils and sensitivity analyses of correlated parameters. Water Supply 2021, 21, 1323–1333. [Google Scholar] [CrossRef]
Verbist, K.M.J.; Pierreux, S.; Cornelis, W.M.; Mclaren, R.; Gabriëls, D. Parameterizing a coupled surface–subsurface three-dimensional soil hydrological model to evaluate the efficiency of a runoff water harvesting technique. Vadose Zone J. 2012, 11, vzj2011-0141. [Google Scholar] [CrossRef]
Pan, F.; Zhu, J.; Ye, M.; Pachepsky, Y.A.; Wu, Y.S. Sensitivity analysis of unsaturated flow and contaminant transport with correlated parameters. J. Hydrol. 2011, 397, 238–249. [Google Scholar] [CrossRef]
Bear, J.; Rubinstein, B.; Fel, L. Capillary pressure curve for liquid menisci in a cubic assembly of spherical particles below irreducible saturation. Transp. Porous Media 2011, 89, 63–73. [Google Scholar] [CrossRef]
Du, H.; Fok, H.S.; Chen, Y.; MA, Z. Characterization of the recharge-storage-runoff process of the Yangtze River source region under climate change. Water 2020, 12, 1940. [Google Scholar] [CrossRef]
Beven, K. How far can we go in distributed hydrological modelling? Hydrol. Earth Syst. Sci. 2001, 5, 1–12. [Google Scholar] [CrossRef]
Gupta, H.V.; Beven, K.J.; Wagener, T. Model calibration and uncertainty estimation. In Encyclopedia of Hydrological Sciences; Wiley: Hoboken, NJ, USA, 2006. [Google Scholar]

Figure 1. Location of the Pedro do Rio watershed, monitoring station and elevation. Adapted from Sales et al. [11].

Figure 2. Soil map of the representative watershed showing soil types and mapped units.

Figure 3. Simulation performance: NSE (bars) and PBIAS (red line).

Figure 4. Hierarchical clustering dendrogram (standardized).

Table 1. Dimensions of cross-sections based on field measurements.

Drainage Area (km²)	Heights (m)	Top Width (m)	Bottom Width (m)
2.00	1.50	3.00	1.00
6.32	2.00	4.60	1.00
12.60	2.00	5.10	2.00
34.40	2.00	8.70	2.00
49.18	5.00	11.20	6.00
103.45	5.00	14.00	6.00
419.33	5.00	19.00	6.00

Note: Adapted from Sales et al. [11].

Table 2. Surface and vegetation coefficients.

Land Use Classes	Manning Coefficient	$K_{c}$			Feddes Coefficients
Land Use Classes	Manning Coefficient	Initial	Mid-Season	End Season	$h_{1}$	$h_{2}$	$h_{3}$	$h_{4}$
Dense Forest	0.160	0.95	1.00	1.00	0	−1	−3.3	−150
Pasture	0.038	0.40	1.05	0.85	−0.1	−0.25	−8	−80
Agriculture	0.045	0.60	1.15	0.90	−0.1	−0.25	−15	−80
Urban	0.040	-	-	-	-	-	-	-
Rocky Outcrop	0.030	-	-	-	-	-	-	-

Note: Manning coefficients estimated according to Chow [37]; pasture and agriculture Feddes coefficients obtained from HYDRUS 1D model [38]; and forest Feddes coefficients defined according to Grinevskii [39]. Adapted from Sales et al. [11].

Table 3. Vertical discretization of the 3D soil domain.

Model Layers		EMBRAPA Layers
ID	Thickness	EMBRAPA Layers
1	5 cm	0–5 cm
2	10 cm	5–15 cm
3	15 cm	15–30 cm
4	30 cm	30–60 cm
5	40 cm	60–100 cm
6	300 cm	100–200 cm
7	300 cm	100–200 cm

Note: Adapted from Sales et al. [11].

Table 4. Simulation scenarios for evaluating interactions among VGM parameters.

SIM	MST Multiplying Factor					Parameter Perturbated
SIM	$θ_{s}$	$θ_{r}$	$α$	$n$	$K_{s a t}$	Parameter Perturbated
S1	1	1	1	1	1	-
S2	1.1	1	1	1	1	$θ_{s}$
S3	1	0.9	1	1	1	$θ_{r}$
S4	1	1	0.9	1	1	$α$
S5	1	1	1	1.1	1	$n$
S6	1	1	1	1	1.1	$K_{s a t}$
S7	1.1	0.9	1	1	1	$θ_{s}$ , $θ_{r}$
S8	1.1	1	0.9	1	1	$θ_{s}$ , $α$
S9	1.1	1	1	1.1	1	$θ_{s}$ , $n$
S10	1.1	1	1	1	1.1	$θ_{s}$ , $K_{s a t}$
S11	1	0.9	0.9	1	1	$θ_{r}, α$
S12	1	0.9	1	1.1	1	$θ_{r}, n$
S13	1	0.9	1	1	1.1	$θ_{r}, K_{s a t}$
S14	1	1	0.9	1.1	1	$α, n$
S15	1	1	0.9	1	1.1	$α, K_{s a t}$
S16	1	1	1	1.1	1.1	$n, K_{s a t}$
S17	1.1	0.9	0.9	1	1	$θ_{s}$ , $θ_{r}, α$
S18	1.1	0.9	1	1.1	1	$θ_{s}$ , $θ_{r}, n$
S19	1.1	0.9	1	1	1.1	$θ_{s}$ , $θ_{r}, K_{s a t}$
S20	1.1	1	0.9	1.1	1	$θ_{s}, α, n$
S21	1.1	1	0.9	1	1.1	$θ_{s}, α, K_{s a t}$
S22	1.1	1	1	1.1	1.1	$θ_{s}, n, K_{s a t}$
S23	1	0.9	0.9	1.1	1	$θ_{r}, α, n$
S24	1	0.9	0.9	1	1.1	$θ_{r}, α, K_{s a t}$
S25	1	0.9	1	1.1	1.1	$θ_{r}, n, K_{s a t}$
S26	1	1	0.9	1.1	1.1	$α, n, K_{s a t}$
S27	1.1	0.9	0.9	1.1	1	$θ_{s}$ , $θ_{r}, α, n$
S28	1.1	0.9	0.9	1	1.1	$θ_{s}$ , $θ_{r}, α, K_{s a t}$
S29	1.1	0.9	1	1.1	1.1	$θ_{s}$ , $θ_{r}, n, K_{s a t}$
S30	1.1	1	0.9	1.1	1.1	$θ_{s}$ , $α, n, K_{s a t}$
S31	1	0.9	0.9	1.1	1.1	$θ_{r}$ , $α, n, K_{s a t}$
S32	1.1	0.9	0.9	1.1	1.1	${θ_{s}, θ}_{r}$ , $α, n, K_{s a t}$

Table 5. Performance metrics for combinatorial simulation scenarios (ranked by NSE).

SIM	PBIAS	NSE	SIM	PBIAS	NSE
S32	25.32	0.50	S28	30.24	0.22
S27	26.30	0.49	S19	31.76	0.20
S30	25.96	0.48	S21	30.56	0.19
S29	26.56	0.48	S17	31.29	0.18
S18	28.07	0.46	S10	32.07	0.17
S20	27.93	0.46	S7	32.80	0.16
S22	27.66	0.46	S8	31.58	0.15
S9	28.66	0.44	S2	33.10	0.13
S31	27.69	0.40	S24	31.34	0.11
S25	29.31	0.38	S13	32.83	0.09
S23	28.62	0.38	S15	31.60	0.08
S26	28.17	0.37	S11	32.33	0.06
S12	30.26	0.36	S6	33.08	0.06
S16	29.78	0.35	S3	33.84	0.04
S14	29.09	0.35	S4	32.57	0.03
S5	30.73	0.33	S1	34.06	0.01

Table 6. Functional response of VGM parameters to ±10% perturbation: Implications for streamflow under wet and dry conditions.

Parameter	Sensitivity	Parameter Perturbation	Streamflow Effect (Wet)	Streamflow Effect (Dry)
$n$	High	Increase	Decrease	Increase
$θ_{s}$	Middle-high	Increase	Decrease	Increase
$K_{s a t}$	Middle	Increase	Decrease	Increase
$θ_{r}$	Middle-low	Decrease	Decrease	Increase
$α$	Low	Decrease	Decrease	Increase

Note: Adapted from Sales et al. [11].

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Sales, D.d.S.; Costa, D.d.A.; Lugon Junior, J.; Neves, R.J.; Silva Neto, A.J.d. A Deterministic Combinatorial Approach to Investigate Interactions of Soil Hydraulic Parameters on River Flow Modelling. Water 2025, 17, 2627. https://doi.org/10.3390/w17172627

AMA Style

Sales DdS, Costa DdA, Lugon Junior J, Neves RJ, Silva Neto AJd. A Deterministic Combinatorial Approach to Investigate Interactions of Soil Hydraulic Parameters on River Flow Modelling. Water. 2025; 17(17):2627. https://doi.org/10.3390/w17172627

Chicago/Turabian Style

Sales, Dhiego da Silva, David de Andrade Costa, Jader Lugon Junior, Ramiro Joaquim Neves, and Antônio José da Silva Neto. 2025. "A Deterministic Combinatorial Approach to Investigate Interactions of Soil Hydraulic Parameters on River Flow Modelling" Water 17, no. 17: 2627. https://doi.org/10.3390/w17172627

APA Style

Sales, D. d. S., Costa, D. d. A., Lugon Junior, J., Neves, R. J., & Silva Neto, A. J. d. (2025). A Deterministic Combinatorial Approach to Investigate Interactions of Soil Hydraulic Parameters on River Flow Modelling. Water, 17(17), 2627. https://doi.org/10.3390/w17172627

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Deterministic Combinatorial Approach to Investigate Interactions of Soil Hydraulic Parameters on River Flow Modelling

Abstract

1. Introduction

2. Materials and Methods

2.1. Study Area

2.2. MOHID-Land Model Overview

2.2.1. MOHID-Land Soil Water Dynamics

2.2.2. Model Set-Up (Baseline Simulation—S1)

2.3. Van Genuchten Parameters Interactions and Scenarios

2.4. Model Evaluation

2.5. Computational Infrastructure

3. Results

4. Discussion

Computational Cost

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI