Analysis of Water Balance Changes and Parameterization Reﬂecting Soil Characteristics in a Hydrological Simulation Program—FORTRAN Model

: Efficient water resource management requires accurate analyses of hydrological components and water balance. The Hydrological Simulation Program—FORTRAN (HSPF) model serves this purpose at the watershed scale. It has limited accuracy in calculating runoff and infiltration because the model simulates hydrological processes using one representative parameter for each land use in the watershed. Accuracy requires field-scale analysis of hydrological components. We calculated the lower zone storage nominal parameter, which markedly affects runoff in HSPF, from effective moisture content and depth of each soil layer. Analysis of hydrological components suggested re-calculating the parameters reflecting soil characteristics. We investigated two scenarios through simulations: Scenario 1 used the existing method. Scenario 2 used parameters that reflected soil properties. Total flows for each sub-catchment were identical, but proportions of direct and intermediate runoff were larger in Scenario 1. Ratios of baseflow, evapotranspiration, and infiltration were larger in Scenario 2, reflecting soil characteristics. Comparing the baseflow ratio to total flow, Scenario 2 values were similar to observed values. Comparisons of R 2 and Nash–Sutcliffe Efficiency (NSE) at the end of the watershed were well matched ( R 2 and NSE are higher than 0.9) in both scenarios, but proportions of each hydrological component differed. It is important to consider soil characteristics when applying water quantity and quality analyses in an HSPF simulation.


Introduction
Globally, rising temperatures with climate change can cause changes in soil moisture content, which affects the water cycle process and water resources [1,2]. Associated changes in hydrological components (such as precipitation, evapotranspiration, surface runoff, baseflow, and soil water content) have a significant influence on water resource management [3]. To prepare for adverse impacts of climate change on hydrology cycles, water quality, and aquatic ecosystems, it must be a priority to conduct accurate water balance analyses [4][5][6][7]. Quantitative evaluation of water resources by identifying the hydrological cycle and reflecting watershed characteristics is a fundamental analysis for establishing a water resource management plan [8].
Various hydrological models, such as the Stanford Watershed Model (SWM-IV) [9], Système Hydrologique Europeén (SHE) [10], TOPMODEL [11], Precipitation Runoff Modeling System (PRMS) [12], National Weather Service River Forecasting System (NWSRFS) [13], The novelty of this study is (1) the importance of accurate parameter estimation is shown through analysis of water balance for each hydrological component, even if the simulated flow at the end of the watershed is numerically well matched; (2) the importance of parameter estimation to reflect watershed properties is demonstrated by calculating parameters using soil properties and comparing the simulation results to the previous model that used average parameters for each watershed.

Study Area
The study area is the Uiam Dam basin located in Chuncheon-si, Gangwon-do, South Korea, which comprises downstream from Chuncheon Dam, downstream from Soyanggang Dam, the Uiam Dam, and the Gongjicheon sub-watershed ( Figure 1). The main water source for the basin is the effluent water from the Chuncheon and Soyanggang Dams. It is possible to quantitatively analyze the factors that affect the water body through simulation of the dam basin using a hydrological model [37]. The soil properties of the Uiam Dam basin differ depending on aspects of the soil profile, namely the number of soil layers (2 to 6), depth of each layer (0.06 to 0.27 m), and effective moisture content (0.02 to 0.48%) ( Figure 2). In Figure 2, (a) is a soil map of the target area and (b) is different depths for each 3D soil type. Even within the same watershed, soil type and depth properties are different, as shown on the right side of Figure 2, so it is necessary to reflect the soil characteristics in the model and calibrate the related parameters. The results of the mean temperature precipitation analysis and flow data of the target area using the input data of the model are shown in Figure 3. The total area of the Uiam Dam basin is 7709 km 2 , and the effective storage volume is 5,750,000 m 3 . The discharges from the Soyang River Dam and Chuncheon Dam are approximately 55% and 40%, respectively, of the inflow of the Uiam Dam basin. Analysis of land use in the basin revealed that the urbanized dry area was 4.74%, agricultural area 17.72%, and forested area 49.66%. Forest occupied the largest area, followed by water and agricultural areas.
x FOR PEER REVIEW 3 of 18 The novelty of this study is (1) the importance of accurate parameter estimation is shown through analysis of water balance for each hydrological component, even if the simulated flow at the end of the watershed is numerically well matched; (2) the importance of parameter estimation to reflect watershed properties is demonstrated by calculating parameters using soil properties and comparing the simulation results to the previous model that used average parameters for each watershed.

Study Area
The study area is the Uiam Dam basin located in Chuncheon-si, Gangwon-do, South Korea, which comprises downstream from Chuncheon Dam, downstream from Soyanggang Dam, the Uiam Dam, and the Gongjicheon sub-watershed ( Figure 1). The main water source for the basin is the effluent water from the Chuncheon and Soyanggang Dams. It is possible to quantitatively analyze the factors that affect the water body through simulation of the dam basin using a hydrological model [37]. The soil properties of the Uiam Dam basin differ depending on aspects of the soil profile, namely the number of soil layers (2 to 6), depth of each layer (0.06 to 0.27 m), and effective moisture content (0.02 to 0.48%) ( Figure 2). In Figure 2, (a) is a soil map of the target area and (b) is different depths for each 3D soil type. Even within the same watershed, soil type and depth properties are different, as shown on the right side of Figure 2, so it is necessary to reflect the soil characteristics in the model and calibrate the related parameters. The results of the mean temperature precipitation analysis and flow data of the target area using the input data of the model are shown in Figure 3. The total area of the Uiam Dam basin is 7709 km 2 , and the effective storage volume is 5,750,000 m 3 . The discharges from the Soyang River Dam and Chuncheon Dam are approximately 55% and 40%, respectively, of the inflow of the Uiam Dam basin. Analysis of land use in the basin revealed that the urbanized dry area was 4.74%, agricultural area 17.72%, and forested area 49.66%. Forest occupied the largest area, followed by water and agricultural areas.

Description of HSPF Model
The HSPF model was developed by the United States Environmental Protection Agency (U.S. EPA) and has been widely used for simulation of hydrology and water quality in various watersheds [38]. The HSPF model can be applied to runoff simulations in urban and rural watersheds based on the water balance equation. In HSPF, Watershed Data Management (WDM) input/output data are managed through separate software, WDMUtil, and long-term simulations of extensive hydrology and water quality are possible [38]. The HSPF model comprises application and utility modules that support the

Description of HSPF Model
The HSPF model was developed by the United States Environmental Protection Agency (U.S. EPA) and has been widely used for simulation of hydrology and water quality in various watersheds [38]. The HSPF model can be applied to runoff simulations in urban and rural watersheds based on the water balance equation. In HSPF, Watershed Data Management (WDM) input/output data are managed through separate software, WDMUtil, and long-term simulations of extensive hydrology and water quality are possible [38]. The HSPF model comprises application and utility modules that support the

Description of HSPF Model
The HSPF model was developed by the United States Environmental Protection Agency (U.S. EPA) and has been widely used for simulation of hydrology and water quality in various watersheds [38]. The HSPF model can be applied to runoff simulations in urban and rural watersheds based on the water balance equation. In HSPF, Watershed Data Management (WDM) input/output data are managed through separate software, WDMUtil, and long-term simulations of extensive hydrology and water quality are possible [38]. The HSPF model comprises application and utility modules that support the application module [39]. The application module can simulate runoff in urban and rural areas and comprises the PERLND module (which simulates the water quantity and water quality in the previous land), the IMPLND module (which simulates the impervious land), and the RCHRES module (which simulates the water body). Each module is independent of the others, and simultaneously interacts and performs simulations [40]. Time-series data, such as rainfall and temperature, are required as input data to simulate the water quantity and quality in the watershed (Table 1). In this study, to apply the HSPF model, we used the 10 m × 10 m Digital Elevation Model (DEM) provided by the National Geographic Information Institute [41], and the 2017 land use map and Korean Reach File (KRF) v3.0 use map provided by the Environment Geospatial Information Service [42,43]. The sub-classified 2017 land cover map provided by the Environmental Geospatial Information Service [42] was used for land use (Figure 4). Data constructed by the Water Resources Management Information System [44] were used for the daily dam discharge. Weather data (2006)(2007)(2008)(2009)(2010)(2011)(2012)(2013)(2014)(2015)(2016)(2017) such as precipitation (mm), temperature ( • C), wind speed (km/h), solar radiation (MJ/m 2 ), dew point temperature ( • C), and total cloud amount (tenth conversion) were provided by the Meteorological Agency and constructed as a WDM file (.wdm format) using WDMUtil for input to the HSPF model simulation. The weather data time series are stored in the WDM file, and with the watershed data constructed from BASINS, they become inputs to the HSPF model. Table 1 shows the data necessary for HSPF modeling.
Water 2022, 14, x FOR PEER REVIEW 5 of 18 application module [39]. The application module can simulate runoff in urban and rural areas and comprises the PERLND module (which simulates the water quantity and water quality in the previous land), the IMPLND module (which simulates the impervious land), and the RCHRES module (which simulates the water body). Each module is independent of the others, and simultaneously interacts and performs simulations [40]. Timeseries data, such as rainfall and temperature, are required as input data to simulate the water quantity and quality in the watershed (Table 1). In this study, to apply the HSPF model, we used the 10 m × 10 m Digital Elevation Model (DEM) provided by the National Geographic Information Institute [41], and the 2017 land use map and Korean Reach File (KRF) v3.0 use map provided by the Environment Geospatial Information Service [42,43]. The sub-classified 2017 land cover map provided by the Environmental Geospatial Information Service [42] was used for land use (Figure 4). Data constructed by the Water Resources Management Information System [44] were used for the daily dam discharge. Weather data (2006-2017) such as precipitation (mm), temperature (°C), wind speed (km/h), solar radiation (MJ/m 2 ), dew point temperature (°C), and total cloud amount (tenth conversion) were provided by the Meteorological Agency and constructed as a WDM file (.wdm format) using WDMUtil for input to the HSPF model simulation. The weather data time series are stored in the WDM file, and with the watershed data constructed from BASINS, they become inputs to the HSPF model. Table 1 shows the data necessary for HSPF modeling.

Estimation of the Effective Parameters Based on Soil Properties and Scenario Analysis
Among the parameters used in the HSPF model, Upper Zone Storage Nominal (UZSN), Lower Zone Storage Nominal (LZSN), Infiltration (INFILT), and Ground Recession Rate (AGWRC) have influence on the flow simulation [45]. The LZSN parameter was found to have the most influence on runoff [36]. Therefore, for accurate runoff and infiltration analysis, it is important to estimate the LZSN parameter well to reflect the soil characteristics across the watershed.
There are various practical difficulties in estimating parameters directly from measured information in hydrological models. Therefore, it is essential to calibrate and appropriately calculate parameter values using available information [46]. Many studies have been conducted to investigate the uncertainty of the flow simulated in the hydrologic model as well as the calibrations of the simulation results and to evaluate the reliability of the model [47,48]. Additionally, in hydrological models, various factors and methods of estimating parameters can be considered. In this study, the LZSN parameter was calculated using the soil layer depth and effective moisture content data based on characteristics of the soil profiles. To reflect the characteristics of the soil, information on soil type and its properties was used based on the soil map for each sub-watershed. As the LZSN parameter means the storage nominal, so AWC and depth information for each soil type were important factors. To consider the depth of the soil and the soil properties for each layer, the value of the LZSN parameter was calculated by adding the values obtained by multiplying the layer depth and the AWC for the soil types. The Available Water Content (AWC) is the effective moisture content of each soil layer and refers to the value obtained by subtracting the moisture content at the permanent wilting point from the field capacity. The LZSN parameter is related to the amount that can be stored in the lower layer of the soil. A smaller value means less water storage, and a larger value means more water storage. In this study, total storage was calculated based on the soil type data, and the upper 10 cm was assumed to be UZSN.
Two scenarios were constructed to compare and analyze the proportions of the hydrological components. Through the simulation and results analysis of the two scenarios, it is possible to numerically determine how the parameters reflecting the soil characteristics represent the differences in the flow and water balance analysis results. Scenario 1 applies the soil parameters (LZSN) calculated by the existing method, and Scenario 2 applies the effective soil parameters reflecting the soil characteristics of each layer as the newly calculated LZSN parameters. Water balance analysis was performed for the two scenarios to analyze the difference between hydrological components with and without soil characteristic consideration with the LZSN parameter in HSPF. In addition, to evaluate the impact of the effective LZSN parameter on the baseflow, the Baseflow Index (BFI) was calculated using BFLOW [14,24] and WHAT [49,50], which are the baseflow separation models. The BFI was compared with the baseflow ratio calculated using the measured flow for the two scenarios. Figure 5 presents the overall outline of this study.
The hydrological components were divided into direct runoff and baseflow, intermediate outflow, infiltration, and evapotranspiration. The analysis was conducted by dividing the total flow obtained from the simulation into Surface Outflow (SURO), Active Groundwater Outflow (AGWO), Intermediate Outflow (IFWO), Total Simulated Evapotranspiration (TAET), and Inflow to Inactive Groundwater (IGWI). Water balance analysis was performed per land use by dividing flow into SURO, TAET, and IGWI. The infiltration was calculated as the sum of the baseflow, intermediate outflow, and infiltration.

Baseflow Analysis Using Baseflow Separation Programs (WHAT, BFLOW)
The WHAT system is a web-based tool developed to estimate the baseflow separately from the streamflow data, and has been used in various studies for baseflow simulation and analysis in rivers [26,48,51]. A baseflow filter program (BFLOW) is one method that has been widely used to separate the baseflow from streamflow data. BFLOW separates the baseflow using digital filtering and provides the amount of baseflow to the total flow as a ratio.

Separation Methods (WHAT, BFLOW)
The WHAT can separate the baseflow by three methods. First, the Local Minimum Method (LMM) separates the baseflow by connecting the points where the river flow comprises only the baseflow, by decreasing the direct flow at the recession part of the flow hydrograph. The second method is the one-parameter digital filter method, which separates the baseflow ratio based on the method used in signal processing research to separate high and low frequencies. Third, the Eckhardt filter method uses a variable called the maximum value of BFI, which is the ratio of the baseflow to the total river flow [52]. The method is like the BFLOW filter method but has the advantage of considering the characteristics of the aquifer in the watershed. Therefore, in this study, the Eckhardt filter method in Equation (1) was used, which considers the maximum value of the ratio of baseflow to total flow:

Baseflow Analysis Using Baseflow Separation Programs (WHAT, BFLOW)
The WHAT system is a web-based tool developed to estimate the baseflow separately from the streamflow data, and has been used in various studies for baseflow simulation and analysis in rivers [26,48,51]. A baseflow filter program (BFLOW) is one method that has been widely used to separate the baseflow from streamflow data. BFLOW separates the baseflow using digital filtering and provides the amount of baseflow to the total flow as a ratio.

Separation Methods (WHAT, BFLOW)
The WHAT can separate the baseflow by three methods. First, the Local Minimum Method (LMM) separates the baseflow by connecting the points where the river flow comprises only the baseflow, by decreasing the direct flow at the recession part of the flow hydrograph. The second method is the one-parameter digital filter method, which separates the baseflow ratio based on the method used in signal processing research to separate high and low frequencies. Third, the Eckhardt filter method uses a variable called the maximum value of BFI, which is the ratio of the baseflow to the total river flow [52]. The method is like the BFLOW filter method but has the advantage of considering the characteristics of the aquifer in the watershed. Therefore, in this study, the Eckhardt filter method in Equation (1) was used, which considers the maximum value of the ratio of baseflow to total flow: where b t is the baseflow separated at time t (m 3 /s), b t−1 is the baseflow separated at time t − 1 (m 3 /s), Q t is the total stream flow during time t (m 3 /s), α is the filter parameter constant, and BFI max refers to the maximum value of the baseflow ratio to total flow. In this study, the value of BFI was calculated using the WHAT system. To accurately calculate the baseflow ratio using WHAT, the BFI max value suitable for the aquifer characteristics of the target watershed must be calculated. Eckhardt [52] proposed maximum values for each representative aquifer: 0.8 for the porous aquifer of perennial streams, 0.5 for ephemeral streams, and 0.25 for a stream that temporarily flows.
With In BFLOW, the digital filtering method was used to separate high and low frequencies in signal processing research. The baseflow separation method was applied to separate the high-frequency signals. The digital filter method does not consider the physical characteristics of direct runoff and baseflow, but it can be calculated quickly and reproduced consistently [53]. The method of separating the baseflow from the hydrograph is shown in Equation (2) where y t is average daily flow, f t is direct flow, b t is baseflow, and a is filter parameter. The BFLOW program uses daily flow data and a one-parameter filter [54] to calculate the baseflow. The user calculates the baseflow as the amount passing the filter at the first, second, and third passes. The user can select from the filtered results and use those suitable for the characteristics of the target watershed [43,53]. The first to third passes can calculate the baseflow ratio as the amount passing the filter for stream flow data. In particular, with the third pass, the contribution to the baseflow is relatively small, and the result of separation of the first and second passes is generally used [55]. Besides improving the accessibility of the existing BFLOW system, it also evaluated the contribution to the river by quantifying the direct and baseflow through the BFI and Coefficient of Variation (CV). The results demonstrated that BFLOW would be helpful for evaluating river flow characteristics for water resource and flood risk management.
The minimum and maximum number of days of rainfall duration for calculating the recession coefficient must be entered as the input data to BFLOW. However, with the Uiam Dam basin, it is difficult to calculate such specific values. Thus, in this study, the BFI value was calculated by using the minimum value of 2 and the maximum value of 100, which are the default values provided by BFLOW. Therefore, the results of the baseflow analysis using the BFLOW model can differ from the actual baseflow ratio of the streamflow. In addition, the results of the BFLOW model are presented as first, second, and third passes. It can be seen that the base outflow ratio decreases from the first pass to the second and third passes. Therefore, because Arnold and Allen [14,24] suggested that the first pass result and the baseflow ratio classified manually are similar, this study calculated the average baseflow ratio for five years using the first pass result and compared it with the baseflow ratio calculated using the measured flow ratio.

Evaluation Method
In the hydrological model, to remove uncertainty, reliable results are derived by calibrating the characteristics of the watershed and the dynamics of the runoff process based on the measured data [23,56]. The HSPF model also needs to be corrected for the target watershed based on the measured values. The accuracy of each element of the hydrological model must be evaluated to analyze the water balance according to the parameter changes. Heo et al. [57] suggested the importance of evaluating each hydrological element when calibrating hydrological models. The inflow into the Uiam Dam collected through the National Water Resources Management Information System was used as the measured data to evaluate the accuracy of the model [37,44]. After comparing the model-predicted daily flow with the measured value, the simulation results were evaluated by the coefficients of determination R 2 , Nash-Sutcliffe Efficiency (NSE), and % difference, respectively, as shown in Equations (3)- (5). The accuracy of the model was evaluated based on the results of Morioasi et al. [58]. According to the presented criteria, the result was classified as very good, good, satisfactory, or unsatisfactory ( Table 2).
where P i is the simulated value, O i is the observed value, n is the total number of data, P i is the mean of the simulated value, and O i is mean of the observed value. The closer R 2 and NSE are to 1 and % difference is to 0, the better the model's simulated value predicts the measured value, increasing the model efficiency [17,40].

Parameter (LZSN) Calculation Considering Soil Characteristics
The LZSN parameter for each soil profile in the study watershed was calculated using the AWC and depth of each soil layer. In Table 3, the default parameter is the value estimated by the built-in functions in the HSPF model, and the estimated parameter is the value calculated using soil properties. The estimated parameter varied according to the characteristics of the sub-watershed and soil types. It might be difficult to compare the parameter values due to regional variability of each study area. However, in this study the validity of the parameter estimation through comparison with the simulated and In this study, Scenario 1 showed high NSE and R 2 without additional calibration, so it might not represent the physical properties of the watershed well. Scenario 2, reflecting the watershed characteristics, showed better results (e.g., water balance) than Scenario 1. Therefore, it is possible to consider the effect of hydrological components for each subwatershed in Scenario 1 using default values and in Scenario 2 using the parameters calculated by considering soil characteristics. In addition, although the simulated flow at the end of the watershed numerically matched the actual flow well, this illustrates the importance of a reasonable prediction of the hydrological component for each land use while considering the watershed characteristics. In Table 3, the default parameter was the same value calculated for four sub-watersheds and land use by the built-in functions in the HSPF model. The estimated parameter was calculated by considering the soil properties. In the case of the forest, it was calculated as 3.9 for all sub-watersheds. However, with the estimated parameter, different values were calculated for each sub-watershed and land use. With urban areas, the default parameters were given as 2, but estimated parameters 0.35, 7.81, 12.06, and 10.50 were calculated in each sub-watershed. (In this study, the difference is minimum 82% and maximum 700% across the entire land use of sub-watersheds.) As the estimated parameters were calculated by considering the properties of each soil layer, the values are different even for the same land use. The LZSN parameter shows that, the lower the zone nominal storage and larger the value, the larger the water storage space in the soil layer is. The proportion of soil properties is different for each sub-watershed, 'Mn' soil properties took the largest proportion in the sub-watersheds of Chuncheon Dam-downstream, Soyanggang Dam-downstream, and Uiam Dam. In the Gongjicheon sub-watershed, 'Re' soil properties took the largest proportion.

Water Balance Analysis for Scenarios 1 and 2
The runoffs according to the application of Scenarios 1 and 2 and the hydrological components of each land use in the sub-watersheds were compared. R 2 , NSE, and % difference indicators were used to evaluate the accuracy of the simulation by comparing the observed Uiam Dam inflow. Even though R 2 and NSE were very good or better, there may be differences in hydrological components by land use in the sub-watersheds. Therefore, in this study, even if the simulation matched well with the observation at the end of the watershed, the parameter calculation reflecting the soil characteristics and the difference in the proportion occupied by hydrological components were analyzed. Data from 2013 to 2017 were used for the model-scenario calibration period. The validation period of the model is 2008-2012. In Figure 6, the observed values from 2013 to 2017 and the simulated results for each scenario are plotted. The data used for the calibration are the inflow of the Uiam Dam located downstream of the target area, and the data were provided with daily data observed by the Water Resources Management Information System (WAMIS) [44]. In the simulation for Scenario 1, R 2 was 0.99, NSE was 0.98, and the % difference was 1.56. In Scenario 2, R 2 was 0.99, NSE was 0.98, and the % difference was 1.39 ( Figure 6). The closer R 2 and NSE are to 1, the better the measured value that is simulated, and the closer to zero the % difference, the better the simulation [17]. Even though the modeling results from Scenario 1 showed high NSE and R 2 without additional calibration, it might not represent the physical properties of the watershed well. Scenario 2, reflecting the watershed characteristics, showed better results (e.g., water balance) than Scenario 1. Additionally, with the study watershed, the discharge of the two upstream dams accounted for 55% and 40% of the inflow of the Uiam Dam, so it was not significantly affected by the hydrological parameters in the study watershed.
from Scenario 1 showed high NSE and R 2 without additional calibration, it might not represent the physical properties of the watershed well. Scenario 2, reflecting the watershed characteristics, showed better results (e.g., water balance) than Scenario 1. Additionally, with the study watershed, the discharge of the two upstream dams accounted for 55% and 40% of the inflow of the Uiam Dam, so it was not significantly affected by the hydrological parameters in the study watershed. This study does not discuss the correctness of the model but presents the differences in hydrological components for each sub-watershed when the soil characteristics within the watershed were considered. The aim is to demonstrate why it is necessary to consider soil characteristics when calibrating the HSPF model. Thus, the simulated value at the end of the basin reflected the measured value well, but this was not a problem even if the soil characteristics were not reflected in the model. However, to evaluate the effect of water pollution on land use in sub-watersheds, the spatial characteristics must be reflected in the model. When applying a Best Management Practices (BMPs) scenario within a watershed, the effect of reducing pollution by sub-watersheds will appear differently, so it is important to simulate the spatial characteristics of each land use/soil within the target area. Tables 4 and 5 show the results of the water balance analysis by sub-watershed for each scenario. Looking at the results of Scenario 1, the proportion of direct runoff by each sub-watershed ranged from 14% to 27%, with an average of 19%; with baseflow, from 21% to 25%, with an average of 24%; and with intermediate outflow, from 12% to 14%, with an average of 13%. The percentage of evapotranspiration was 41% on average, ranging from 36% to 43%, and penetration was found to account for 3% on average, ranging from 2% to 3%. Looking at the results of Scenario 2, the percentage of direct outflow by sub-watershed ranged from 13% to 27%, with an average of 17%; baseflow averaged 26%, ranging from This study does not discuss the correctness of the model but presents the differences in hydrological components for each sub-watershed when the soil characteristics within the watershed were considered. The aim is to demonstrate why it is necessary to consider soil characteristics when calibrating the HSPF model. Thus, the simulated value at the end of the basin reflected the measured value well, but this was not a problem even if the soil characteristics were not reflected in the model. However, to evaluate the effect of water pollution on land use in sub-watersheds, the spatial characteristics must be reflected in the model. When applying a Best Management Practices (BMPs) scenario within a watershed, the effect of reducing pollution by sub-watersheds will appear differently, so it is important to simulate the spatial characteristics of each land use/soil within the target area. Tables 4 and 5 show the results of the water balance analysis by sub-watershed for each scenario. Looking at the results of Scenario 1, the proportion of direct runoff by each sub-watershed ranged from 14% to 27%, with an average of 19%; with baseflow, from 21% to 25%, with an average of 24%; and with intermediate outflow, from 12% to 14%, with an average of 13%. The percentage of evapotranspiration was 41% on average, ranging from 36% to 43%, and penetration was found to account for 3% on average, ranging from 2% to 3%. Looking at the results of Scenario 2, the percentage of direct outflow by sub-watershed ranged from 13% to 27%, with an average of 17%; baseflow averaged 26%, ranging from 23% to 27% with median outflow ranging from 12% to 14%; and intermediate outflow ranged from 12% to 14%, with an average of 13%. The percentage of evapotranspiration averaged 41%, ranging from 36% to 43%, and the penetration was 3% on average. Figure 7 shows the results for each scenario. The proportion of direct runoff was larger in Scenario 1, and the proportion of baseflow was larger in Scenario 2. The proportion of intermediate runoff was greater in Scenario 1, and the proportions of evapotranspiration and infiltration were greater in Scenario 2. Therefore, although the total flow for each subwatershed was similar in the two scenarios, the proportion of each hydrological component could be different. This means that even if the total flow was the same at the outlet of the watershed, the ratios of each hydrological component differ when the parameters reflecting the soil characteristics are applied.  In this study, the water balance analysis (Tables 6 and 7) was performed by land use, by dividing the total flow obtained from the simulation into SURO, TAET, and IGWI. The parameter values can be entered in the HSPF model according to the characteristics of the sub-watershed and land use, but the LZSN value in Scenario 1 was applied equally as 0.051 m for other land uses, excluding forest areas. The results of the water balance analysis for Scenario 1 confirmed that the same results were obtained for land uses with the same LZSN value. In Scenario 2, the effective parameter values (recalculated LZSN, which reflected the depth of each layer and effective moisture) were entered.
In Scenario 1, the results of the water balance analysis were the same according to the parameter values for each land use. In Scenario 2, the result of the water balance analysis also differed according to land use and soil characteristics. The method of performing water balance analysis differs depending on the purpose of the study.
In previous studies, a method of water balance analysis using MODSIM and a water balance analysis for agricultural return flow analysis were performed [59,60]. A study model using the linked HSPF-MASA-CREAMS-PADDY model and a study suggesting the circulation rate in agricultural watershed were conducted [61,62]. In this study, to analyze differences according to the input parameters, water balance analysis was performed by land use and sub-watersheds. Water infiltration is the most important hydrological parameter for the evaluation and diagnosis of the soil and water balance [63]. In this study, the differences between hydrological components appear according to parameters. However, a more in-depth analysis will be possible through the parameter calculation method, analysis of differences that occur depending on the soil factor being considered, and linkage with other models in future studies.      In this study, WHAT was used to separate the direct and base outflows, and the average annual flow and BFI were estimated. Average flow refers to the annual average flow for five years, from 2013 to 2017. Due to separating the baseflow from the observed flow using WHAT, the BFI was 0.61, the baseflow ratio in Scenario 1 was 0.65, and that in Scenario 2 was 0.61 (Table 8).

SURO TAET IGWI SUM
Due to the separation of the baseflow from the observed flow using BFLOW, the results of the first pass, second pass, and third pass were analyzed, and the BFI value calculated using the first pass was 0.65. The BFI analysis result of Scenario 1 was 0.69 and that of Scenario 2 was 0.63 (Table 9). Therefore, the result of Scenario 2 was more similar to the result of the baseflow separated by using the measured flow ratio, compared to the result of Scenario 1. The difference between hydrological components was analyzed through the water balance analysis, and comparison with the measured and simulated values was performed through the baseflow separation result. Accurate hydrological analysis and comparison are possible through baseflow separation.

Conclusions
The LZSN parameter was estimated for the Uiam Dam basin based on characteristics of each soil layer, including depth and AWC. Through the HSPF simulation, the water balance and baseflow were analyzed using the LZSN parameter and the baseflow separation program. The results and discussion were as follows: (1) With the forest, which occupies the largest area in all sub-watersheds, in Scenario 1 an LZSN value of 0.1 m was applied throughout all sub-watersheds. In Scenario 2, the values of LZSN were 0.16-0.26 m (for each sub-watersheds). For all other land uses, in Scenario 1 an LZSN value of 2 m was applied throughout all sub-watersheds; in Scenario 2 the depth and water capacity of the soil were calculated and applied differently to obtain LZSN for each sub-watershed. In this study, only soil depth and AWC were used as watershed characteristics in calculating the LZSN parameter. However, future studies related to LZSN parameter calculation that reflect more soil properties such as soil adsorption and detachment should be conducted. (2) Water balance analyses by sub-watershed and land use were performed for two scenarios before and after the application of the parameters (LZSN) reflecting soil characteristics. In Scenario 1, the same LZSN value was applied to all land uses except forests, and the water balance analysis showed similar results. In Scenario 2, different LZSN values (reflecting the characteristics of the soil) were applied, and the water balance results differed according to the land use in the sub-watershed. Therefore, even if the total flow was the same, the ratio of each hydrological component varied according to the LZSN value reflecting the soil characteristics. (3) Due to separating the baseflow using WHAT, the BFI for the measured value was 0.61. BFIs for Scenario 1 and Scenario 2 were 0.65 and 0.61, respectively; the results of Scenario 2 were like the actual measurements. The first pass of BFI by year using BFLOW gave a BFI of 0.65, and the BFIs for Scenarios 1 and 2 were 0.69 and 0.63, respectively. This could be because the BFI was also calculated (using the WHAT system) through analysis of the flow during the dry season because the default value of the model was used, not the value reflecting the runoff characteristics in the BFLOW simulation.
In this study, the parameters reflecting soil characteristics were calculated to overcome limitations of previous hydrological simulations in HSPF, which does not consider varying soil characteristics. This study confirmed that even though the simulated and measured flows were similar at the outlet of the watershed, the ratio of each hydrological component could be significantly different. In addition, the simulation accuracy of the direct runoff and baseflow runoff was improved in comparison with the measured values. The LZSN parameter values based on the soil characteristics can be applied in the HSPF according to the land use and area of the target watershed.
The HSPF model was also used to apply a BMP scenario and analyze the reduction in water pollution. In both scenarios, R 2 and NSE at the watershed outlet were very good, but there were differences in the LZSN parameter estimates, reflecting the soil characteristics and the proportion of each hydrological component. In this study, the simulation and analysis were performed on the target area where the observation values and simulated values were well matched, which is a possible limitation. Therefore, additional research on parameters that reflect watershed characteristics in various target areas are needed.
Author Contributions: S.K., J.K., H.K., W.S.J. and K.J.L. designed the research approach for this manuscript. S.K. led the analysis and modeling work with support from J.K., H.K., W.S.J. and K.J.L. and S.K. wrote the manuscript with feedback from J.K., H.K., W.S.J. and K.J.L. All authors have read and agreed to the published version of the manuscript.